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This book is dedicated to Jack Michael, whose extraordinary contributions to 
behavior analysis will continue to benefit those who teach and practice the science 
and, most importantly, those whose learning is enhanced by its application. 
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Jack L. Michael 


“One might ask why it is of any value to be able to recognize and correctly name 
these various effects. | would answer that | have found, for myself at least, that 
| cannot understand some things unless | can talk about them clearly.” 


“From “What Every Student of Behavior Analysis Ought to Learn: A System for Classifying the Multiple Effects of Behavioral Variables,” by J. Michael, 1995, 
The Behavior Analyst, 18, p. 284. 
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PREFACE 


As it was 17 years ago, when we began writing the previous 
edition of this book, our overriding goal for the third edition 
was to produce an accurate, comprehensive, and contemporary 
description of applied behavior analysis. The result is a text that 
demands concentrated and serious study. 

Despite its size, scope, and in-depth treatment of concepts, 
principles, procedures, and issues, Applied Behavior Analy- 
sts, Third Edition, should be viewed as an introductory text for 
two reasons. First, the reader need not possess any specialized 
prerequisite knowledge to understand the content. Second, attain- 
ing a full understanding of applied behavior analysis requires 
considerable study and guided experience beyond this text. There 
is no topic presented within these pages that has not been treated 
in greater depth elsewhere. Students of applied behavior analy- 
sis should build upon what they learn from this book by reading 
other sources. How much reading is needed to fully grasp and 
appreciate applied behavior analysis? Don Baer (2005), one of the 
co-founders of applied behavior analysis, estimated that 


[T]he cost of knowing well the basic principles and 
paradigms of the theoretical and experimental aspects of 
behavior analysis would require about 2,000 pages and 
some laboratory experience. ABA shares the same basic 
principles with the theoretical and experimental branches 
of behavior analysis and adds to them an even larger 
number of secondary principles, strategies, and tactics for 
making those basic principles work in the real world as 
they do in the laboratory. ABA also adds a set of principles 
about ethical and humane practice, prominent among which 
is the need to be certain, through constant and extensive 
measurement and experimentation, that the particular case 
in hand is going well and will continue to go well—because 
it will change as it progresses. The cost of knowing all that 
is, I estimate, about 3,000 pages of reading and several 
years of supervised practical experience. (pp. 27-28) 


The field has grown markedly since its formal inception 
in 1968, so much so that Baer’s 3000-page reading assignment 
might now exceed 4000 pages, or more. We trust that this 
book’s 800-plus pages will contribute to Baer’s reading assign- 
ment for many future behavior analysts. Specific suggestions 
for additional readings in applied behavior analysis and in the 
conceptual and basic research branches of behavior analysis are 
cited throughout this text. 

Again, while our objective is to provide a complete 
description of the principles and procedures for changing and 
analyzing socially important behavior, mastery of this book’s 
content represents the beginning, not the end, of one’s study of 
applied behavior analysis. If our efforts as textbook writers and 
chapter contributors, combined with those of instructors who 
assign this book, are successful, the dedicated student will come 
away with a sound repertoire of fundamental knowledge about 
applied behavior analysis. In turn, this knowledge will serve as 
the foundation for more advanced study and supervised prac- 
tica that ultimately will lead to independent efforts to change 


and understand behavior that are scientifically sound, socially 
significant, and ethically appropriate. 


TERMINOLOGY 


Meaningful description of any scientific activity necessitates a 
standard set of technical terms. Effectively communicating the 
design, implementation, outcomes, and/or theoretical bases of 
an applied behavior analysis requires the accurate use of the 
discipline’s terminology. Throughout this text, we have made 
every effort to define and use behavior analytic terminology 
in a conceptually systematic and consistent manner. Mastering 
the specialized vocabulary of applied behavior analysis is an 
important initial step in embracing the science and participat- 
ing effectively as a researcher, practitioner, or consumer. We 
encourage students to study the field’s terminology with dili- 
gence. Toward that end, the third edition includes a glossary of 
more than 500 technical terms and concepts. 


REFERENCES, EXTRACTS, NOTES, 
AND GRAPHS 


An important function of any introductory text to a scientific dis- 
cipline is to expose students to the empirical and conceptual liter- 
ature of that field. This edition contains more than 2700 citations 
to primary-source publications, including historically important 
experiments (e.g., the first graph presented by B. F. Skinner in his 
1938 book The Behavior of Organisms), and classic and contem- 
porary examples of applied behavior analysis research—most of 
which were published in the field’s flagship journal, the Journal 
of Applied Behavior Analysis. We also make extensive use of 
quotations and extracts from key publications representing the 
conceptual literature. We have done this not only for the historical 
and/or technical authority these authors provide, but also because 
their inclusion increases students’ exposure to and appreciation 
for the field’s rich primary-source literature. 

The third edition includes more than 150 graphs of data 
from peer-reviewed research, many of which are accompanied 
by detailed descriptions of the study’s methodology. We have 
a fourfold purpose for providing many procedures, graphs, and 
references. First, we want to illustrate behavior analysis prin- 
ciples and procedures with actual applications and real data, 
not hypothetical examples. Second, reading the procedural 
descriptions will help students appreciate the high degree of 
technical precision and control of complex environments that 
researchers and practitioners must achieve to solve problems 
and show functional relations between variables. Third, the 
references provide students whose interests are piqued by the 
descriptions or graphs with directions to the original studies 
for more in-depth study. Finally, the graphs provide multi- 
ple opportunities for students to develop and refine—through 
practice and discussion with their instructors, mentors, and 
fellow students—higher-level visual analysis skills. 
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THIRD EDITION CONTENT ENHANCEMENTS 
AND FEATURES 


Applied behavior analysis has become more mature and sophisti- 
cated since the second edition was published. Although the basic 
principles of behavior remain unchanged, advances in all three 
interrelated domains of the science of behavior—theoretical, 
basic research, and applied research—have improved our under- 
standing of those principles and led to increased effectiveness in 
developing and applying effective and humane behavior change 
interventions. These developments are reflected in the more than 
1,000 new references to the conceptual, basic, and applied liter- 
atures of behavior analysis added to this edition. 


Chapters by Outstanding Authors 


The third edition includes seven chapters authored by prominent 
scholars in applied behavior analysis. This group of contributors 
includes the current and two former editors of the Journal of 
Applied Behavior Analysis, two previous editors of The Analy- 
sis of Verbal Behavior, and associate editors of the Journal of 
the Experimental Analysis of Behavior. Members of this well- 
known and prolific group of behavior analysts first reported 
some of the most significant advances in behavior analysis in 
publications. 


Negative Reinforcement 


In Chapter 12, Negative Reinforcement, Rick Smith and Brian 
Iwata present an authoritative account of this commonly mis- 
understood and misapplied form of reinforcement. In addition 
to precisely defining this principle, dispelling misconceptions 
about it, and illustrating applications across a broad spec- 
trum of cases, Smith and Iwata provide specific guidelines for 
incorporating negative reinforcement into behavior change 
interventions. 


Motivation 


Until recently, motivation, a major topic in psychological 
theories and everyday explanations of behavior, has been an 
assumed, but inadequately understood, topic in behavior analy- 
sis. Due largely to the work of Jack Michael, behavior analysts 
now have a much better understanding of motivation and its 
role within applied behavior analysis. In Chapter 16, Motivating 
Operations, Jack Michael and Caio Miguel explain how certain 
antecedent events have dual motivating effects: a behavior- 
altering effect, which makes certain behaviors more (or less) 
likely; and a value-altering effect, which makes certain events 
more (or less) effective as reinforcement. 


Verbal Behavior 


In Chapter 18, Verbal Behavior, Mark Sundberg contrasts 
B. F. Skinner’s functional analysis of verbal behavior with tra- 
ditional approaches to language, defines and gives examples of 
basic types of elementary verbal operants (e.g., mands, tacts, 
intraverbals), and describes implications and applications for 
these concepts in designing and implementing language inter- 
vention programs. 


Equivalence-based Instruction 


In Chapter 19, Equivalence-based Instruction, Carol Pilgrim 
builds upon Sidman’s groundbreaking research on stimulus 
equivalence to explain the conditions in which learners acquire 
new skills and verbal relations without direct instruction on 
those skills. Pilgrim defines equivalence-based instruction, 
describes its key outcomes—class formation, delayed emer- 
gence, class expansion and class merger, transfer of function, 
and contextual control—and shows how lessons can be designed 
to promote those outcomes. 


Nonequivalence Relations 


In Chapter 20, Engineering Emergent Learning with Nonequiv- 
alence Relations, Tom Critchfield and Ruth Anne Rehfeldt 
explain how people make sense of and function effectively in 
a world of arbitrary relations in which stimuli “go together,” 
not because they share physical properties, but rather because 
social-verbal reinforcement contingencies teach people to relate 
them in a certain way. Relational frame theory (RFT) and accep- 
tance and commitment therapy (ACT), a therapeutic approach 
grounded in RFT, are described. 


Functional Behavior Assessment 


In Chapter 27, Functional Behavior Assessment, Stephanie 
Peterson and Nancy Neef describe one of the more significant 
developments in applied behavior analysis. Functional behavior 
assessment has become a well-established method for discov- 
ering the function that a problem behavior serves for a person 
(e.g., to obtain social attention, to avoid an assigned task, to 
provide sensory stimulation), information that enables practi- 
tioners to design interventions that teach adaptive replacement 
behaviors that serve the same function. 


Ethics 


In Chapter 31, Ethical and Professional Responsibilities of 
Applied Behavior Analysts, Tom Freeman, Linda LeBlanc, and 
Jose Martinez-Diaz-clarify what ethical behavior is, explain 
why ethical behavior is a necessary part of the applied behavior 
analyst’s repertoire, review ethical codes of conduct for behav- 
ior analysts, and describe specific procedures for ensuring and 
assessing ethical practice. New content regarding client services 
(e.g., informed consent, conflict of interest) and, importantly, 
ethical implications of new technologies, social media, and pro- 
fessional networking to support ethical behavior is presented. 


TEXT ORGANIZATION AND STRUCTURE 


The book’s 31 chapters are organized into 13 parts. The two 
chapters in Part 1 describe some tenets that are fundamental to 
all scientific endeavors, outline a history of behavior analysis 
as a natural science approach to understanding behavior, define 
applied behavior analysis, and describe principles and concepts 
of that science. Parts 2 and 3 examine the elements necessary 
for an applied behavior analysis. Part 2 presents considerations, 
criteria, and procedures for selecting, defining, and measuring 
applied behavior. The five chapters in Part 3 examine the logic 


and operation of specific tactics for the experimental analysis of 
behavior—environment relations, and some issues in planning, 
replicating, and evaluating analyses of behavior. 

The seven chapters in Parts 4 through 6 explore the two 
most significant principles of behavior—reinforcement and pun- 
ishment; how antecedent events alter one’s motivation to respond; 
and how behavior comes under the discriminative control of 
environmental conditions. Part 7 is a detailed examination of 
B. F. Skinner’s analysis of verbal behavior and its implications and 
applications for language development. The five chapters in Part 8 
describe how applied behavior analysts use equivalence-based 
instruction, nonequivalence relations, imitation, modeling, obser- 
vational learning, shaping, and chaining to develop new skills and 
patterns of behavior from simple to complex. 

Part 9 details how problem behaviors can be decreased 
with nonpunishment interventions: extinction, differential 
reinforcement, and antecedent interventions. Part 10 describes 
functional behavioral assessment, sophisticated methods for 
determining the purpose that problem behavior serves for a 
person, and important information that leads to the design of 
treatments that replace the problem behavior with adaptive alter- 
native behavior serving the same function. 

Part 11 describes four special applications of behavior 
change technology: token economy, contingency contracting, 
group contingencies, and self-management. Part 12 outlines 
strategies and tactics for increasing the likelihood that efforts to 
change behavior yield generalized outcomes: behavior changes 
that maintain across time, occur in appropriate settings and sit- 
uations beyond the training setting, and spread to other useful 
behaviors. The book’s final part describes ethical and profes- 
sional responsibilities of behavior analysts, ethical implications 
of new technologies, social media, and professional networking. 
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Powerpoint® Slides (ISBN 1-292-32466-X) 


The PowerPoint slides include key concept summarizations, 
diagrams, and other graphic aids to enhance learning. They are 
designed to help students understand, organize, and remember 
core concepts and theories. 
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The Behavior Analyst Certification Board® (BACB®) is a 
nonprofit corporation established in 1998 to meet professional 
credentialing needs identified by behavior analysts, govern- 
ments, and consumers of behavior analysis services. To be 
certified as a Board Certified Behavior Analyst® (BCBA®) 
or a Board Certified Assistant Behavior Analyst® (BCaBA®), 
a person must meet academic-degree, educational, and 
practical-experience eligibility requirements and then pass a 
psychometrically sound examination. The BCBA and BCaBA 
examinations are based on the BCBA/BCaBA Task List (Sth 
ed.; BACB, 2017a), which was developed by 16 subject matter 
experts and subsequently content validated through a survey 
of more than 6000 BACB certificants (BACB, 2017b). The 
complete BCBA/BCaBA Task List (5th ed.) is available on the 
companion website (www.pearsonglobaleditions.com). 

We have connected the content of this text to the tasks 
that the BACB determined are necessary to function as an entry- 
level behavior analyst. A chart identifying which Task List 
items are covered in each chapter is also provided on the com- 
panion website. Due to the complex nature of applied behavior 
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in a linear fashion, some Task List items are covered in more 
than one chapter. Students studying for BCBA and BCaBA 
examinations can look up key words from Task List items in 
the Subject Index to identify the page numbers where relevant 
information about each item in the BCBA/BCaBA Task List 
(5th ed.) can be found. 

This text presents the basic knowledge that a qualified 
behavior analyst must possess. Although mastering this con- 
tent will help you obtain a passing score on the BCBA or 
BCaBA examinations, two important qualifiers must be rec- 
ognized. First, the BCBA and BCaBA examinations require 
knowledge beyond that included in this, or any, single text- 
book. Therefore, to further prepare for the examinations 
we encourage students to study original sources, engage in 
supervised practica, and discuss areas of personal interest 
with trusted and competent mentors. Second, no matter how 
accurate, extensive, and current this textbook may be, and no 
matter how thoroughly a student masters its content, he or she 
will not be fully qualified to function as a behavior analyst. 
Successful completion of the required coursework in behavior 
analysis is but one step in the preparation to become a BCBA 
or a BCaBA. For the most recent information on the BACB 
requirements, visit the Behavior Analyst Certification Board’s 
website at www.BACB.com. 
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PART 


Introduction and Basic Concepts 


We believe that prior to learning specific principles and procedures for analyzing and changing 
behavior, the student of applied behavior analysis should be introduced to the historical and 
conceptual foundations of the science. Basic knowledge and appreciation of the scientific and 
philosophical underpinnings of behavior analysis are requisites to a thorough understanding of the 
discipline’s nature, scope, and potential. We also believe a preliminary overview of basic concepts, 
principles, and terminology makes the in-depth study of behavior analysis to follow more effective. 
The two chapters in Part | support these two beliefs. Chapter | describes the scientific, conceptual, 
and philosophical roots of applied behavior analysis and identifies the discipline’s defining 
dimensions, characteristics, and overall goals. Chapter 2 defines the field’s fundamental elements— 
behavior and the environmental antecedent and consequential events that influence it—and 
introduces key terms and principles that describe relationships among these elements. 
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CHAP T EAR 1 


Definition and Characteristics of Applied 


Behavior Analysis 


LEARNING OBJECTIVES 


e Describe the basic characteristics and goals of science. 


e Explain behavior in accordance with the philosophical assumptions of behavior analysis. 


e Explain determinism as it relates to behavior analysis. 


¢ State distinguishing features of mentalistic and environmental explanations of behavior. 


e Describe and explain behavior in behavior analytic terms. 


[SJince I was a child I always found my biggest reinforcer 
was something called understanding. I liked to know how 
things worked. And of all of the things in the world there 
are to understand, it became clear to me that the most 
fascinating was what people do. I started with the usual 
physical science stuff, and it was intriguing to me to under- 
stand how radios work, and how electricity works, and 
how clocks work, etcetera. But when it became clear to me 
that we could also learn how people work—not just bio- 
logically, but behaviorally—I thought that’s the best of all. 
Surely, everyone must agree that that’s the most fascinating 
subject matter. That there could be a science of behavior, 
of what we do, of who we are? How could you resist that? 
—Donald M. Baer in Heward & Wood, 
(2003, p. 302) 


pplied behavior analysis is a science devoted to under- 
standing and improving human behavior. Other disci- 
plines have similar intents. What sets applied behavior 
analysis apart? The answer lies in its focus, goals, and meth- 
ods. Applied behavior analysts focus on behaviors of social 
importance, they intervene with research-based strategies and 
tactics to improve the targeted behaviors, and they use sci- 
entific methods—objective description, measurement, and 
experimentation—to demonstrate reliable relations between 
their interventions and the behavioral improvements. In short, 
applied behavior analysis, or ABA, is a scientific approach for 
discovering environmental variables that reliably influence 
socially significant behavior and for developing a technology 
of behavior change that takes practical advantage of those 
discoveries. 
This chapter briefly outlines the history and development 
of behavior analysis, discusses the philosophy that underlies the 
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State and describe each of the dimensions of applied behavior analysis. 


science, and identifies defining dimensions and characteristics 
of applied behavior analysis. Because applied behavior analysis 
is first and foremost a science, we begin with an overview of 
precepts shared by scientists in all disciplines. 


SCIENCE: BASIC CHARACTERISTICS 
AND A DEFINITION 


Science is a systematic approach for seeking and organizing 
knowledge about the natural world. Before offering a defini- 
tion of science, we discuss the purpose of science and the basic 
assumptions and attitudes that guide the work of all scientists, 
irrespective of their fields of study. 


Purpose of Science 


The overall goal of science is to achieve a thorough understand- 
ing of the phenomena under study—socially important behavior 
change, in the case of applied behavior analysis. Science differs 
from other sources of knowledge or ways we obtain knowledge 
about the world around us (e.g., contemplation, common sense, 
logic, authority figures, religious or spiritual beliefs, political 
campaigns, advertisements, testimonials). Science seeks to dis- 
cover nature’s truths: facts and universal laws that exist and 
operate independent of the opinions and beliefs of any person or 
group, including the scientist. Therefore, scientific knowledge 
must be separated from any personal, political, economic, or 
other reasons for which it was sought. Although it is frequently 
misused, science is not a tool for validating the cherished or 
preferred versions of “the truth” held by any group, corporation, 
government, or institution. 

Different types of scientific investigations yield knowledge 
enabling one or more of three levels of understanding: descrip- 
tion, prediction, and control. Each level of understanding contrib- 
utes to the scientific knowledge base of a given field of inquiry. 
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Description 


Systematic observation enhances the understanding of a given 
phenomenon by enabling scientists to describe it accurately. 
Descriptive knowledge consists of a collection of facts about 
the observed events that can be quantified, classified, and exam- 
ined for possible relations with other known facts—a necessary 
and important activity for any scientific discipline. The know]- 
edge obtained from descriptive studies often suggests possible 
hypotheses or questions for additional research. 

The work of John James Audubon, a naturalist and painter 
in the early 19th century, provides a classic example of descrip- 
tive science. While observing birds in their natural habitat, 
Audubon documented their habits with extensive field notes and 
made detailed drawings. He identified 25 new species of birds. 
His major work, The Birds of America (Audubon, 1827-1838), 
contains 435 hand-colored life-sized prints of birds in their nat- 
ural habitat and is considered one of the finest ornithological 
works ever completed. 

White’s (1975) study of classroom teachers’ “natural 
rates” of approval (verbal praise or encouragement) and dis- 
approval (criticisms, reproach) is an example of descriptive 
research in applied behavior analysis. Observations of 104 class- 
room teachers in grades | to 12 yielded two major findings: 
(a) Rates of teacher praise dropped with each grade level, 
and (b) in every grade after second, teachers delivered state- 
ments of disapproval to students at rates exceeding their rates 
of praise. The results of this descriptive study led to dozens of 
subsequent studies aimed at discovering factors responsible 
for the disappointing findings, analyzing the effects of dispro- 
portionate rates of disapproval and praise on student behavior, 
and increasing teachers’ effective use of praise (e.g., Alber, 
Heward, & Hippler, 1999; Duchaine, Jolivette, & Fredrick, 
2011; Fullerton, Conroy, & Correa, 2009; Mrachko, Kostewicz, 
& Martin, 2017; Niwayama & Tanaka-Matsumi, 2016; 
Sutherland, Wehby, & Yoder, 2002). 


Prediction 


A second level of scientific understanding occurs when repeated 
observations reveal that two events consistently covary with each 
other. That is, in the presence of one event (e.g., approaching 
winter) another event occurs (or fails to occur) with some spec- 
ified probability (e.g., certain birds fly south). When systematic 
covariation between two events is found, this relationship— 
termed a correlation—can be used to predict the relative prob- 
ability that one event will occur, based on the presence of the 
other event. “We obviously cannot intervene or manipulate the 
movement of the stars or planets, but by studying their move- 
ments we can gauge the seasons and when we can plant crops to 
produce a bountiful harvest” (Moore, 2010, p. 48). 

Because no variables are manipulated or controlled by the 
researcher, a correlational study cannot demonstrate whether 
one of the observed variables is responsible for the changes in 
the other variable, and no such relations should be inferred. A 
strong correlation exists between hot weather and an increased 
incidence of drowning deaths, but we should not assume that a 
hot and humid day causes anyone to drown. Hot weather also 


correlates with other factors, such as an increased number of 
people (both swimmers and nonswimmers) seeking relief in the 
water, and many instances of drowning have been found to be 
a function of factors such as the use of alcohol or drugs, the 
relative swimming skills of the victims, strong riptides, and the 
absence of supervision by lifeguards. ! 

In addition to their usefulness in aiding prediction, the 
findings of correlational studies can suggest the possibility of 
causal relations, which can then be explored with experimental 
studies. The most common type of correlational study reported 
in the applied behavior analysis literature compares the relative 
rates or conditional probabilities of two or more observed (but not 
manipulated) variables (e.g., Atwater & Morris, 1988; Symons, 
Hoch, Dahl, & McComas, 2003; Thompson & Iwata, 2001). 
For example, McKerchar and Thompson (2004) found cor- 
relations between problem behavior exhibited by 14 preschool 
children and the following consequent events: teacher attention 
(100% of the children), presentation of some material or item to 
the child (79% of the children), and escape from instructional 
tasks (33% of the children). The results of this study not only 
provide empirical validation for the social consequences typi- 
cally used in clinical settings to analyze the variables maintain- 
ing children’s problem behavior, but also increase confidence in 
the prediction that interventions based on the findings from such 
assessments will be relevant to the conditions that occur natu- 
rally in preschool classrooms (see Chapter 27). In addition, by 
revealing the high probabilities with which teachers responded 
to problem behavior in ways that are likely to maintain and 
strengthen it, McKerchar and Thompson’s findings also point 
to the need to train teachers in more effective ways to respond 
to problem behavior. 


Control! 


The ability to predict with a certain degree of confidence 
is a valuable and useful result of science; prediction enables 
preparation. However, the greatest potential benefits from 
science are derived from the third, and highest, level of scien- 
tific understanding—control. Evidence of the kinds of control 
that can be derived from scientific findings in the physical and 
biological sciences surrounds us in the everyday technologies 
we take for granted: pasteurized milk and the refrigerators we 
store it in; flu shots and the automobiles we drive to go get 
them; pain relievers and the televisions that bombard us with 
advertisements and news stories about the drugs. 


The scientific “system,” like the law, is designed to enable 
us to handle a subject matter more efficiently . .. When we 
have discovered the laws which govern a part of the world 
about us, we are then ready to deal effectively with that part 
of the world. By predicting the occurrence of an event we 
are able to prepare for it. By arranging conditions in ways 
specified by the laws of a system, we not only predict, we 
control: we “cause” an event to occur or to assume certain 
characteristics. (Skinner, 1953, pp. 13-14) 


Functional relations, the primary products of basic 
and applied research in behavior analysis, provide the kind 
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of scientific understanding that is most valuable and useful 
to the development of a technology for changing behavior. 
A functional relation exists when a well-controlled experiment 
demonstrates that a specific change in one event (the depen- 
dent variable) is reliably produced by specific manipulations of 
another event (the independent variable), and that the change 
in the dependent variable was unlikely to be the result of other 
extraneous factors (confounding variables). 

Johnston and Pennypacker (1980) described functional 
relations as “the ultimate product of a natural scientific inves- 
tigation of the relation between behavior and its determining 
variables” (p. 16). 


Such a “co-relation” is expressed as y = f(x), where x 

is the independent variable or argument of the function, 
and y is the dependent variable. In order to determine if 

an observed relation is truly functional, it is necessary to 
demonstrate the operation of the values of x in isolation and 
show that they are sufficient for the production of y. .. . 
[H]owever, a more powerful relation exists if necessity 
can be shown (that y occurs only if x occurs). The most 
complete and elegant form of empirical inquiry involves 
applying the experimental method to identifying functional 
relations. (Johnston & Pennypacker, 1993a, p. 239) 


The understanding gained by the scientific discovery 
of functional relations is the basis of applied technologies in 
all fields. 


Assumptions and Attitudes of Science 


Science is first of all a set of attitudes. 
—B. F. Skinner, (1953, p. 12) 


The definition of science lies not in test tubes, spectrometers, or 
electron accelerators, but in the behavior of scientists. To 
begin to understand any science, we need to look past the appa- 
ratus and instrumentation that are most readily apparent and 
examine what scientists do.” The pursuit of knowledge is prop- 
erly called science when it is carried out according to general 
methodological precepts and expectations that define science. 
All scientists share a fundamental assumption about the nature 
of events that are amenable to investigation by science, general 
notions about basic strategy, and perspectives on how to view 
their findings. These attitudes of science—determinism, empir- 
icism, experimentation, replication, parsimony, and philosophic 
doubt—constitute a set of overriding assumptions and values 
that guide the work of all scientists (Whaley & Surratt, 1968). 


Determinism 


Science is predicated on the assumption of determinism. All 
scientists presume that the universe is a lawful and orderly place 
in which all phenomena occur as the result of other events. In 
other words, events do not just happen willy-nilly; they are 
related in systematic ways to other factors, which are themselves 
physical phenomena amenable to scientific investigation. 

If the universe were governed by accidentalism, a phil- 
osophical position antithetical to determinism that holds that 
events occur by accident or without cause, or by fatalism, the 


belief that events are predetermined, the scientific discovery 
of functional relations and use of those discoveries to improve 
things would be impossible. 


If we are to use the methods of science in the field of 
human affairs, we must assume behavior is lawful and 
determined. We must expect to discover what a man does 
is the result of specifiable conditions and that once these 
conditions have been discovered, we can anticipate and to 
some extent determine his actions. (Skinner, 1953, p. 6) 


Determinism plays a pivotal dual role in the conduct of 
scientific practice: It is at once a philosophical stance that does 
not lend itself to proof and the confirmation that is sought by 
each experiment. In other words, the scientist first assumes law- 
fulness and then proceeds to look for lawful relations (Delprato 
& Midgley, 1992). 


Empiricism 
When you can measure what you are speaking about, 
and express it in numbers, you know something about 
it; but when you cannot measure it, when you cannot 
express it in numbers, your knowledge is of a meagre and 


unsatisfactory kind 
— Lord Kelvin, (1824—1907) 


Scientific knowledge is built on, above all, empiricism—the 
practice of objective observation and measurement of the phe- 
nomena of interest. Objectivity in this sense means “‘indepen- 
dent of the individual prejudices, tastes, and private opinions of 
the scientist. Results of empirical methods are objective in that 
they are open to anyone’s observation and do not depend on the 
subjective belief of the individual scientist” (Zuriff, 1985, p. 9). 

In the prescientific era (and in nonscientific and pseudo- 
scientific activities today) (Nichols, 2017), knowledge was (and 
is) the product of contemplation, speculation, personal opinion, 
authority, and the “obvious” logic of common sense. The 
scientist’s empirical attitude, however, demands objective obser- 
vation based on thorough description, systematic and repeated 
measurement, and precise quantification of the phenomena of 
interest. 

As it is in every scientific field, empiricism is the foremost 
rule in behavior analysis. Every effort to understand, predict, 
and improve behavior hinges on the behavior analyst’s ability 
to completely define, systematically observe, and accurately and 
reliably measure occurrences and nonoccurrences of the behav- 
ior of interest. 


Experimentation 


Experimentation is the basic strategy of most sciences. Whaley 
and Surratt (1968) used the following anecdote to introduce the 
need for experimentation. 


A man who lived in a suburban dwelling area was surprised 
one evening to see his neighbor bow to the four winds, 
chant a strange melody, and dance around his front lawn 
beating a small drum. After witnessing the same ritual for 
over a month, the man became overwhelmed with curiosity 
and decided to iook into the matter. 
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“Why do you go through this same ritual each 
evening?” the man asked his neighbor. 

“It keeps my house safe from tigers,” the neighbor 
replied. 

“Good grief!” the man said. “Don’t you know there 
isn’t a tiger within a thousand miles of here?” 

“Yeah,” the neighbor smiled. “Sure works, doesn’t 
it!” (pp. 23-2 to 23-3) 


When events are observed to covary or occur in close 


temporal sequence, a functional relation may exist, but other 


factors may be responsible for the observed values of the 
dependent variable. To investigate the possible existence of a 
functional relation, an experiment (or better, a series of exper- 


iments) must be performed in which the factor(s) suspected of 


having causal status are systematically controlled and manip- 
ulated while the effects on the event under study are carefully 
observed. 

Reliably predicting and controlling any phenomena, 
including the presence of tigers in one’s backyard, requires iden- 
tifying and manipulating the factors that influence those phe- 
nomena. One way that the individual described previously could 
use the experimental method to evaluate the effectiveness of his 
ritual would be to first move to a neighborhood in which tigers 
are regularly observed and then systematically manipulate the 
use of his anti-tiger ritual (e.g., 1 week off, | week on, | week 


off, | week on) while observing and recording the presence of 


tigers under the no-ritual and ritual conditions. 


The experimental method is a method for isolating the 
relevant variables within a pattern of events... . 

[W Jhen the experimental method is employed, it is 
possible to change one factor at a time (independent 
variable) while leaving all other aspects of the situation 
the same, and then to observe what effect this change 
has on the target behavior (dependent variable). Ideally, 
a functional relation may be obtained. Formal techniques 
of experimental control are designed to make sure that 
the conditions being compared are otherwise the same. 
Use of the experimental method serves as a necessary 
condition (sine qua non) to distinguish the experimental 
analysis of behavior from other methods of investiga- 
tion. (Dinsmoor, 2003, p. 152) 


Thus, an experiment is a controlled comparison of some 
measure of the phenomenon of interest (the dependent vari- 
able) under two or more different conditions in which only 
one factor at a time (the independent variable) differs from 
one condition to another. Strategies and tactics for conduct- 
ing experiments in applied behavior analysis are described in 
Chapters 7 through 10. 

Most of the studies cited in this text are experiments that 
have demonstrated or discovered a functional relation between 
a target behavior and one or more environmental variables. Such 
studies are said to have achieved a functional analysis. The 
term functional analysis has two meanings in contemporary 
behavior analysis literature. In its original and most fundamen- 
tal usage, functional analysis denotes demonstrations of func- 
tional relations between environmental variables and behavior. 


Schlinger and Normand (2013) reported that Skinner used the 
term 36 times in Science and Human Behavior and cited this 
example: 


The external variables of which behavior is a function pro- 
vide for what may be called a causal or functional analysis. 
We undertake to predict and control the behavior of the 
individual organism. This is our “dependent variable”—the 
effect for which we are to find the cause. Our “indepen- 
dent variables”—the causes of behavior—are the external 
conditions of which behavior is a function. Relations 
between the two—the “cause-and-effect relationships” in 
behavior—are the laws of a science. (Skinner, 1953, 

p. 35, italics added) 


Iwata, Dorsey, Slifer, Bauman, and Richman (1982) intro- 
duced the second and today most widely recognized usage of 


functional analysis in their groundbreaking article describing 


an experimental methodology for determining environmental 
variables and contingencies maintaining problem behavior (see 
Chapter 27). In its original meaning, functional analysis pro- 
vides the very foundation for an experimental science of 
behavior; as a method for assessing the controlling variables 
for problem behavior, functional analysis informs the design of 
effective treatments. 


Replication 


The results of a single experiment—no matter how well it was 
designed and conducted, no matter how clear and impressive the 
findings—are never sufficient to earn an accepted place among 
the scientific knowledge base of any field. Although the data 
from a single experiment have value in their own right and can- 
not be discounted, only after an experiment has been replicated 
a number of times with the same basic pattern of results are 
scientists convinced of the findings. 

Replication—repeating of experiments (as well as repeat- 
ing independent variable conditions within experiments )— 
“pervades every nook and cranny of the experimental method” 
(Johnston & Pennypacker, 1993a, p. 244). Replication is the 
primary method with which scientists determine the reli- 
ability and usefulness of their findings and discover their 
mistakes (Johnston & Pennypacker, 1980; 1993a; Sidman, 
1960). Replication—not the infallibility or inherent honesty 
of scientists—is the primary reason science is a self-correcting 
enterprise that ultimately gets it right (Skinner, 1953). 

How many times must an experiment be repeated with 
the same results before the scientific community accepts the 
findings? There is no required number of replications, but the 
greater the importance of the findings to theory or practice, the 
greater the number of replications to be conducted. Chapters 7 
through 10 explain the role of replication in behavioral research 
and describe replication strategies used by applied behavior 
analysts. 


Parsimony 

One dictionary definition of parsimony is great frugality, and in 
a special way this connotation accurately describes the behav- 
ior of scientists. As an attitude of science, parsimony requires 
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that all simple, logical explanations for the phenomenon under 
investigation be ruled out, experimentally or conceptually, 
before more complex or abstract explanations are considered. 
Parsimonious interpretations help scientists assess and fit new 
findings within the field’s existing knowledge base. A fully par- 
simonious interpretation consists only of those elements that are 
necessary and sufficient to explain the phenomenon at hand. The 
attitude of parsimony is so critical to scientific explanations that 
it is sometimes referred to as the Law of Parsimony (Whaley & 
Surratt, 1968), a “law” derived from Occam’s Razor, credited 
to William of Occam (c. 1285—1349), who stated: “One should 
not increase, beyond what is necessary, the number of entities 
required to explain anything.” In other words, given a choice 
between two competing and compelling explanations for the 
same phenomenon, one should shave off extraneous variables 
and choose the simplest explanation, the one that requires the 
fewest assumptions. 


Philosophic Doubt 


The attitude of philosophic doubt requires the scientist to con- 
tinually question the truthfulness of what is regarded as fact. 
Scientific knowledge must always be viewed as tentative. 
Scientists must be willing to set aside their most cherished 
beliefs and findings and replace them with the knowledge 
derived from new discoveries. 

Good scientists maintain a healthy level of skepticism. 
Although being skeptical of others’ research may be easy, 
a more difficult but critical characteristic of scientists is 
that they remain open to the possibility—as well as look 
for evidence—that their own findings or interpretations are 
wrong. “Science is a willingness to accept facts even when 
they are opposed to wishes” (Skinner, 1953, p. 12). As Oliver 
Cromwell (1650) stated in another context: “I beseech you... 
think it possible you may be mistaken.” For the true scientist, 
“new findings are not problems; they are opportunities for 
further investigation and expanded understanding” (Todd & 
Morris, 1993, p. 1159). 

Practitioners should be as skeptical as researchers. The 
skeptical practitioner not only requires scientific evidence before 
implementing a new practice, but also evaluates continually its 
effectiveness once the practice has been implemented. Practi- 
tioners must be particularly skeptical of extraordinary claims 
made for the effectiveness of new theories, therapies, or treat- 
ments (Foxx & Mulick, 2016; Maurice, 2017). 


Claims that sound too good to be true usually are. 
Extraordinary claims require extraordinary evidence 
(Sagan, 1996; Shermer, 2002). What constitutes 
extraordinary evidence? In the strictest sense, and 

the sense that should be employed when evaluating 
claims of educational effectiveness, evidence is the 
outcome of the application of the scientific method to 
test the effectiveness of a claim, a theory, or a practice. 
The more rigorously the test is conducted, the more 
often the test is replicated, the more extensively the test 
is corroborated, the more extraordinary the evidence. 
Evidence becomes extraordinary when it is extraordi- 
narily well tested. (Silvestri & Heward, 2016, p. 149) 


We end our discussion of philosophic doubt with two 
pieces of advice, one from Carl Sagan and one from Bik. 
Skinner: “The question is not whether we like the conclusion 
that emerges out of a train of reasoning, but whether the 
conclusion follows from the premise or starting point 
and whether that premise is true” (Sagan, 1996, p. 210). 
“Regard no practice as immutable. Change and be ready 
to change again. Accept no eternal verity. Experiment” 
(Skinner, 1979, p. 346). 


Other Important Attitudes and Values 


The six attitudes of science that we have examined are necessary 
features of science and provide an important context for under- 
standing applied behavior analysis. 

However, the behavior of most productive and successful 
scientists is also characterized by qualities such as thoroughness, 
curiosity, perseverance, diligence, ethics, and honesty. Scientists 
acquire these traits because behaving in such ways has proven 
beneficial to the progress of science. 


A Definition of Science 


Science has no universally accepted, standard definition. 
We offer the following definition as one that encompasses 
the previously discussed purposes and attitudes of science, 
irrespective of the subject matter. Science is a systematic 
approach to understanding natural phenomena—as evidenced 
by description, prediction, and control—that relies on deter- 
minism as its fundamental assumption, empiricism as its prime 
directive, experimentation as its basic strategy, replication 
as its necessary requirement for believability, parsimony as 
its conservative value, and philosophic doubt as its guiding 
conscience. 


A BRIEF HISTORY OF BEHAVIOR ANALYSIS 


The science of behavior analysis entails three interrelated 
domains: philosophy, basic research, and applied research. 
Behaviorism is the philosophy of the science of behavior, basic 
research is the province of the experimental analysis of behav- 
ior (EAB), and developing a technology for improving behavior 
is the concern of applied behavior analysis (ABA). To be fully 
understood, applied behavior analysis must be considered in 
the context of the philosophy and basic research traditions and 
findings from which it evolved and remains connected today. 
This section provides an elementary description of the basic 
tenets of behaviorism and outlines some of the major events 
that have marked the development of behavior analysis.° 
Table 1.1 lists major books, journals, and professional organi- 
zations that have contributed to the advancement of behavior 
analysis since the 1930s. 


Watson’s Stimulus—Response Behaviorism 


Psychology in the early 1900s was dominated by the study of 
states of consciousness, images, and other mental processes. 
Introspection, the act of carefully observing one’s own conscious 
thoughts and feelings, was a primary method of investigation. 


TABLE 1.1 


1930s 
1940s 
1950s 


1960s 


1970s 


1980s 
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Books, Journals, and Organizations That Have Played a Major Role in the Development 
and Dissemination of Behavior Analysis 
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Books 


The Behavior of Organisms—Skinner (1938) 
Walden Two—Skinner (1948) 


Principles of Psychology—Keller and 
Schoenfeld (1950) 


Science and Human Behavior— 
Skinner (1953) 


Schedules of Reinforcement—Ferster and 
Skinner (1957) 


Verbal Behavior—Skinner (1957) 


Tactics of Scientific Research— 
Sidman (1960) 


Child Development, Vols. | & Bijou and 
Baer (1961, 1965) 


The Analysis of Behavior—Holland and 
Skinner (1961) 


Research in Behavior Moditication—Krasner 
and Ullmann (1965) 


Operant Behavior: Areas of Research and 
Application—Honig (1966) 

The Analysis of Human Operant Behavior— 
Reese (1966) 


Principles of Behavioral Analysis— 
Millenson (1967) 


Behavior Principles—Ferster and Perrott 
(1968) 


Contingencies of Reinforcement: 
A Theoretical Analysis—Skinner (1969) 


Beyond Freedom and Dignity— 
Skinner (1971) 


Elementary Principles of Behavior— 
Whaley and Malott (1971) 


About Behaviorism—Skinner (1974) 


Single Case Experimental Designs—Hersen 
and Barlow (1976) 

Applying Behavior-Analysis Procedures with 
Children and Youth—Sulzer-Azaroff and 
Mayer (1977) 


Learning—Catania (1979) 


Strategies and Tactics of Human Behavioral 
Research—Johnston and Pennypacker 
(1980) 


Behaviorism: A Conceptual Reconstruction— 
Zuriff (1985) 

Recent Issues in the Analysis of Behavior— 
Skinner (1989) 


Journals 
The Psychological Record (1937) 


Organizations 


Journal of the Experimental 
Analysis of Behavior (1958) 


Society for the Experimental Analysis 
of Behavior (SEAB) (1957) 


Journal of Applied Behavior 
Analysis (1968) 


American Psychological Association’s 
Division 25 Experimental Analysis of 
Behavior (1964) 


Experimental Analysis of Behaviour 
Group (UK) (1965) 


Norwegian Association for Behavior 
Analysis (1973) 


Behaviorism (1972) (became 
Behavior and Philosophy 
in 1990) 


Revista Mexicana de Analisis 
de la Conducta (1975) 


Behavioural Processes (1976) 


Midwestern Association for Behavior 
Analysis (MABA) (1974) 


Mexican Society of Behavior Analysis 
(1975) 


Association for Behavior Analysis 
(formerly, MABA) (1978) 


Behavior Modification (1977) 


Journal of Organizational 
Behavior Management (1977) 


Education & Treatment of 
Children (1977) 


The Behavior Analyst (1978) 


Journal of Precision Teaching 
and Celeration (formerly, Journal 
of Precision Teaching) (1980) 


Society for the Advancement of 
Behavior Analysis (1980) 


Analysis of Verbal Behavior 
(1982) 
Behavioral Interventions (1986) | Cambridge Center for Behavioral 


Studies (1981) 
(continued) 
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TABLE 1.1 (continued) 


Decade 


1990s 


2000s 


2010s 


Books 


Concepts and Principles of Behavior 
Analysis—Michael (1993) 


Understanding Behaviorism: Science, 
Behavior, and Culture—Baum (1994) 


Radical Behaviorism: The Philosophy and the 
Science—Chiesa (1994) 


Equivalence Relations and Behavior— 
Sidman (1994) 


Behavior Analysis and Learning—Pierce and 
Epling (1995) 


Functional Analysis of Problem Behavior— 
Repp and Horner (1999) 


Relational Frame Theory: A Post-Skinnerian 
Account of Human Language and 
Cognition—Hayes, Barnes-Holmes, and 
Roche (2001) 


Conceptual Foundations of Radical 
Behaviorism—Moore (2008) 


Handbook of Applied Behavior Analysis— 
Fisher, Piazza, and Roane (2011) 


The Science of Consequences—Schneider 
(2012) 


APA Handbook of Behavior Analysis— 
Madden (2013) 


Radical Behaviorism for ABA Practitioners— 
Johnston (2013) 


The Wiley-Blackwell Handbook of Operant 
and Classical Conditioning—McSweeney 
and Murphy (2014) 


The Nurture Effect: How the Science of 
Human Behavior Can Improve Our Lives & 
Our World—Biglan (2015) 


Journals 

Japanese Journal of Behavior 
Analysis (1986) 

Behavior Analysis Digest (1989) 


Behavioural Pharmacology 
(1989) 


Behavior and Social Issues 
(1991) 


Journal of Behavioral Education 
(1991) 


Journal of Positive Behavior 
Interventions (1999) 


The Behavior Analyst Today 
(1999) 


European Journal of Behavior 
Analysis (2000) 


Behavioral Development Bulletin 
(2002) 


Journal of Early and Intensive 
Behavior Intervention (2004) 


Brazilian Journal of Behavior 
Analysis (2005) 


International Journal of 
Behavioral Consultation and 
Therapy (2005) 


Behavior Analysis in Practice 
(2011) 


Journal of Contextual Behavioral 
Science (2012) 


Operants (2014) 


Behavior Analysis: Research and 
Practice (formerly, The Behavior 
Analyst Today (2015) 


Perspectives on Behavior Science 
(formerly, The Behavior Analyst) 
(2018) 


Note: Books are listed by initial year of publication. Some titles are available in more recent editions 


Organizations 


Japanese Association for Behavior 
Analysis (1983) 


Accreditation of Training Programs 
in Behavior Analysis (Association for 
Behavior Analysis) (1993) 


Behavior Analyst Certification Board 
(BACB) (1998) 


Council of Directors of Graduate 
Programs in Behavior Analysis 
(Association for Behavior Analysis) 
(1999) 


First Board Certified Behavior 
Analysts (BCBA) credentialed by the 
BACB (1999) 


European Association for Behaviour 
Analysis (2002) 


Association for Professional Behavior 
Analysts (APBA) (2007) 


Association for Behavior Analysis 
International (ABAI) (formerly, ABA) 
(2008) 


First Registered Behavior Technician 
(RBT) credentialed by the BACB 
(2014) 


BACB credentials the 30,000th 
behavior analyst (2018) 


Membership in ABAI and affiliate 
chapters surpasses 26,000 in 63 
countries (2018) 
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Although the authors of several texts in the first decade of the 
20th century defined psychology as the science of behavior 
(see Kazdin, 1978), John B. Watson is widely recognized as 
the spokesman for a new direction in the field of psychology. In 
his influential article “Psychology as the Behaviorist Views It,” 
Watson (1913) wrote: 


Psychology as the behaviorist views it is a purely objective 
experimental branch of natural science. Its theoretical goal 
is the prediction and control of behavior. Introspection 
forms no essential part of its methods, nor is the scientific 
value of its data dependent upon the readiness with which 
they lend themselves to interpretation in terms of con- 
sciousness. (p. 158) 


Watson argued that the proper subject matter for 
psychology was not states of mind or mental processes but 
observable behavior. Further, the objective study of behavior 
as a natural science should consist of direct observation of 
the relationships between environmental stimuli (S) and the 
responses (R) they evoke. Watsonian behaviorism became 
known as stimulus—response (S—R) psychology. Although 
scientific evidence was insufficient to support S—R psychol- 
ogy as a workable explanation for most behavior, Watson was 
confident that his new behaviorism would lead to the prediction 
and control of human behavior and that it would allow prac- 
titioners to improve performance in areas such as education, 
business, and law. Watson (1924) made bold claims concerning 
human behavior, as illustrated in this famous quotation: 


Give me a dozen healthy infants, well-formed, and my 
own specified world to bring them up in and I'll guarantee 
to take any one at random and train him to become any 
type of specialist I might select—doctor, lawyer, artist, 
merchant-chief and, yes, even beggar-man and thief, 
regardless of his talents, penchants, tendencies, abilities, 
vocations, and race of his ancestors. I am going beyond 
my facts and I admit it, but so have the advocates of the 
contrary and they have been doing it for many thousands 
of years. (p. 104) 


B. F. Skinner (left) in his Indiana University lab circa 1945 and (right) circa 1967. 


It is unfortunate that such extraordinary claims were made, 
exaggerating the ability to predict and control human behavior 
beyond the scientific knowledge available. The quotation just 
cited has been used to discredit Watson and continues to be used 
to discredit behaviorism in general, even though the behaviorism 
that underlies contemporary behavior analysis is fundamentally 
different from the S-R paradigm. Nevertheless, Watson’s conttri- 
butions were of great significance: He made a strong case for the 
study of behavior as a natural science on a par with the physical 
and biological sciences.* 


Experimental Analysis of Behavior 


[Science] is a search for order. It begins, as we all begin, 
by observing single episodes, but it quickly passes on to the 
general rule, to scientific law. 

—B. F. Skinner, (1953, pp. 13-14) 


I had the clue from Pavlov: Control your conditions and 
you will see order. 
—B. F. Skinner, (1956, p. 223) 


The experimental branch of behavior analysis formally began 
with the publication of B. F. Skinner’s the Behavior of Organ- 
isms (1938). The book summarized Skinner’s laboratory 
research conducted from 1930 to 1937 and brought into per- 
spective two kinds of behavior: respondent and operant. 

Respondent behavior is reflexive behavior as in the tradi- 
tion of Ivan Pavlov (1927). Respondents are elicited, or “brought 
out,” by stimuli that immediately precede them. The antecedent 
stimulus (e.g., bright light) and the response it elicits (e.g., pupil 
constriction) form a functional unit called a reflex. Respondent 
behaviors are essentially involuntary and occur whenever the 
eliciting stimulus is presented. 

Skinner was “interested in giving a scientific account 
of all behavior, including that which Descartes had set aside 
as ‘willed’ and outside the reach of science” (Glenn, Ellis, & 
Greenspoon, 1992, p. 1330). But, like other psychologists of the 
time, Skinner found that the S-R paradigm could not explain 
a great deal of behavior, particularly behaviors that had no 
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All responses to the lever were reinforced. The first three reinforcements were 
apparently ineffective. The fourth is followed by a rapid increase in rate. 


apparent antecedent causes in the environment. Compared to 
reflexive behavior with its clear eliciting events, much of the 
behavior of organisms appeared spontaneous or “voluntary.” In 
an attempt to explain the mechanisms responsible for “volun- 
tary” behavior, other psychologists postulated mediating vari- 
ables inside the organism in the form of hypothetical constructs 
such as cognitive processes, drives, and free will. Skinner took 
a different tack. Instead of creating hypothetical constructs, 
presumed but unobserved entities that could not be manipulated 
in an experiment, Skinner continued to look in the environment 
for the determinants of behavior that did not have apparent 
antecedent causes. 


He did not deny that physiological variables played a role 
in determining behavior. He merely felt that this was the 
domain of other disciplines, and for his part, remained 
committed to assessing the causal role of the environment. 
This decision meant looking elsewhere in time. Through 
painstaking research, Skinner accumulated significant, if 
counterintuitive, evidence that behavior is changed less by 
the stimuli that precede it (though context is important) and 
more by the consequences that immediately follow it 

(i.e., consequences that are contingent upon it). The essen- 
tial formulation for this notion is S-R—-S, otherwise known 
as the three—term contingency. It did not replace the S-R 
model—we still salivate, for instance, if we smell food 
cooking when we are hungry. It did, however, account for 
how the environment “selects” the great part of learned 
behavior. 

With the three-term contingency Skinner gave us a 
new paradigm. He achieved something no less profound for 
the study of behavior and learning than Bohr’s model of the 
atom or Mendel’s model of the gene. (Kimball, 2002, p. 71) 


Skinner called the second type of behavior operant 
behavior Operant behaviors are not elicited by preceding stimuli 
but instead are influenced by stimulus changes that have followed 
the behavior in the past. Skinner’s most powerful and fundamen- 
tal contribution to our understanding of behavior was his discov- 
ery and experimental analyses of the effects of consequences on 
behavior. The operant three-term contingency as the primary unit 
of analysis was a revolutionary conceptual breakthrough. 

Skinner (1938) argued that the analysis of operant behav- 
ior “with its unique relation to the environment presents a 
separate important field of investigation” (p. 438). He named 
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Figure 1.1 The first data set in B. F. Skinner's 
The Behavior of Organisms: An Experimental 
Analysis (1938). 

Based on the Behavior of Organisms: An Experimental 
Analysis by B. F. Skinner, p. 67. Original copyright 1938 
by Appleton-Century. Copyright 1991 by B. F. Skinner 
Foundation, Cambridge, MA. Used by permission. 


this new science the experimental analysis of behavior and 
outlined the methodology for its practice. Simply put, Skinner 
recorded the rate at which a single subject (he initially used rats 
and later, pigeons) emitted a given behavior in a controlled and 
standardized experimental chamber. 

The first set of data Skinner presented in The Behavior 
of Organisms was a graph that “gives a record of the resulting 
change in behavior” (p. 67) when a food pellet was delivered 
immediately after a rat pressed a lever (see Figure 1.1). Skinner 
noted that the first three times that food followed a response 
“had no observable effect” but that “the fourth response was 
followed by an appreciable increase in rate showing a swift 
acceleration to a maximum” (pp. 67-68). 

Skinner’s investigative procedures evolved into an ele- 
gant experimental approach that enabled clear and powerful 
demonstrations of orderly and reliable functional relations 
between behavior and various types of environmental events.° 
By systematically manipulating the arrangement and schedul- 
ing of stimuli that preceded and followed behavior in literally 
thousands of laboratory experiments from the 1930s through 
the 1950s, Skinner and his colleagues and students discov- 
ered and verified the basic principles of operant behavior that 
continue to provide the empirical foundation for behavior 
analysis today. Description of these principles of behavior— 
general statements of functional relations between behavior 
and environmental events—and tactics for changing behavior 
derived from those principles constitute a major portion of 
this text. 


Skinner’s Radical Behaviorism 


Behavior analysts dispense with the myth of the inner 
person as creator of behavior. Both philosophically and 
empirically to the behavior analyst, we are what we do, and 
when and where we do it. 

—Murray Sidman, (2013, p. xvi) 


In addition to being the founder of the experimental analysis of 
behavior, B. F. Skinner wrote extensively on the philosophy of that 
science. Without question, Skinner’s writings have been the most 
influential both in guiding the practice of the science of behavior 
and in proposing the application of the principles of behavior to 
new areas.’ In 1948 Skinner published Walden Two, a fictional 
account of how the philosophy and principles of behavior might be 
used in a utopian community (see Altus & Morris, 2009). This was 


Chapter 1 * Definition and Characteristics of Applied Behavior Analysis 27 


followed by his classic text, Science and Human Behavior (1953); 
in which he speculated on how the principles of behavior might 
be applied to complex human behavior in areas such as education, 
religion, government, law, and psychotherapy. 

Much of Skinner’s writing was devoted to the develop- 
ment and explanation of his philosophy of behaviorism. Skinner 
began his book About Behaviorism (1974) with these words: 


Behaviorism is not the science of human behavior: it is the 
philosophy of that science. Some of the questions it asks 
are these: Is such a science really possible? Can it account 
for every aspect of human behavior? What methods can it 
use? Are its laws as valid as those of physics and biology? 
Will it lead to a technology, and if so, what role will it play 
in human affairs? (p. 1) 


The behaviorism Skinner pioneered differed significantly 
(indeed, radically) from other psychological theories, includ- 
ing other forms of behaviorism. Although there were, and 
remain today, many psychological models and approaches to 
the study of behavior, mentalism is the common denominator 
among most. 


In general terms, mentalism may be defined as an 
approach to the study of behavior which assumes that 
a mental or “inner” dimension exists that differs from 
a behavioral dimension. This dimension is ordinarily 
referred to in terms of its neural, psychic, spiritual, 
subjective, conceptual, or hypothetical properties. 
Mentalism further assumes that phenomena in this 
dimension either directly cause or at least mediate 
some forms of behavior, if not all. These phenomena 
are typically designated as some sort of act, state, 
mechanism, process, or entity that is causal in 

the sense of initiating or originating. Mentalism 
regards concerns about the origin of these phenom- 
ena as incidental at best. Finally, mentalism holds 
that an adequate causal explanation of behavior 

must appeal directly to the efficacy of these 

mental phenomena. (Moore, 2003, pp. 181-182) 


Hypothetical constructs and explanatory fictions are the 
stock and trade of mentalism, which has dominated Western 
intellectual thought and most psychological theories (Descartes, 
Freud, Piaget), and it continues to do so into the 21st century. 
Freud, for example, created a complex mental world of hypo- 
thetical constructs—the id, ego, and superego—that he con- 
tended were key to understanding a person’s actions. 

Hypothetical constructs—‘“theoretical terms that refer 
to a possibly existing, but at the moment unobserved process 
or entity” (Moore, 1995, p. 36)—can be neither observed nor 
experimentally manipulated (MacCorquodale & Meehl, 1948; 
Zuriff, 1985). Free will, readiness, innate releasers, language 
acquisition devices, storage and retrieval mechanisms for 
memory, and information processing are all examples of hypo- 
thetical constructs that are inferred from behavior. Although 
Skinner (1953, 1974) clearly indicated that it is a mistake to 
rule out events that influence our behavior because they are 
not accessible to others, he believed that using presumed but 


unobserved mentalistic fictions (i.e., hypothetical constructs) 
to explain the causes of behavior contributed nothing to a func- 
tional account. 

Consider a typical laboratory situation. A food-deprived 
rat pushes a lever each time a light comes on and receives 
food, but the rat seldom pushes the lever when the light is off 
(and if it does, no food is delivered). When asked to explain 
why the rat pushes the lever only when the light is on, most 
will say that the rat has “made the association” between the 
light being on and food being delivered when the lever is 
pressed. As a result of making that association, the animal 
now “knows” to press the lever only when the light is on. 
Attributing the rat’s behavior to a hypothetical cognitive pro- 
cess such as associating or to something called “knowledge” 
adds nothing to a functional account of the situation. First, the 
environment (in this case, the experimenter) paired the light 
and food availability for lever presses, not the rat. Second, the 
knowledge or other cognitive process that is said to explain 
the observed behavior is itself unexplained, which begs for 
still more conjecture. 

The “knowledge” that is said to account for the rat’s perfor- 
mance is an example of an explanatory fiction, a fictitious vari- 
able that often is simply another name for the observed behavior 
that contributes nothing to an understanding of the variables 
responsible for developing or maintaining the behavior. Explan- 
atory fictions are the key ingredient in “a circular way of viewing 
the cause and effect of a situation” (Heron, Tincani, Peterson, & 
Miller, 2005, p. 274) that lead to a false sense of understanding. 


Turning from observed behavior to a fanciful inner world 
continues unabated. Sometimes it is little more than a linguistic 
practice. We tend to make nouns of adjectives and verbs and 
must then find a place for the things the nouns are said to 
represent. We say that a rope 1s strong and before long we are 
speaking of its strength. We call a particular kind of strength 
tensile, and then explain that the rope is strong because it 
possesses tensile strength. The mistake is less obvious but 
more troublesome when matters are more complex. 

Consider now a behavioral parallel. When a person has 
been subject to mildly punishing consequences in walking 
on a slippery surface, he may walk in a manner we describe 
as cautious. It is then easy to say that he walks with caution 
or that he shows caution. There is no harm in this until we 
begin to say that he walks carefully because of his caution. 
(Skinner, 1974, pp. 165-166, emphasis added) 


It is widely believed that Skinner rejected all events 
that could not be independently verified by agreement among 
observers. However, Skinner was explicit early on that he valued 
effective action over agreement among observers. 


The ultimate criterion for the goodness of a concept 

is not whether two people are brought into agreement 
but whether the scientist who uses the concept can 
operate successfully upon his material—all by himself 
if need be. What matters to Robinson Crusoe is not 
whether he is agreeing with himself but whether he 

is getting anywhere with his control over nature. 
(Skinner, 1945, p. 293). 
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Pragmatism, the philosophical position that “the truth 
value of a statement is a function of how well the statement pro- 
motes effective action” (Moore, 2008, p. 400), continues to be a 
primary criterion by which behavior analysts judge the value of 
their findings (Leigland, 2010; Moxley, 2004).* 

In reality, there are many kinds of behaviorism—structur- 
alism, methodological behaviorism, and forms of behaviorism 
that use cognitions as causal factors (e.g., cognitive behavior 
modification and social learning theory)—in addition to the 
radical behaviorism of Skinner. Structuralism and methodolog- 
ical behaviorism do reject all events that are not operationally 
defined by objective assessment. Structuralists avoid mental- 
ism by restricting their activities to descriptions of behavior. 
They make no scientific manipulations; accordingly, they do not 
address questions of causal factors. Methodological behaviorists 
differ from the structuralists by using scientific manipulations to 
search for functional relations between events. Uncomfortable 
with basing their science on unobservable phenomena, some 
early behaviorists either denied the existence of “inner variables” 
or considered them outside the realm of a scientific account. 
Such an orientation is often referred to as methodological 
behaviorism. 

Methodological behaviorists acknowledge the existence 
of mental events but do not consider them in the analysis of 
behavior (Skinner, 1974). Methodological behaviorists’ reliance 
on public events, excluding private events, restricts the knowl- 
edge base of human behavior and discourages innovation in the 
science of behavior. Methodological behaviorism is restrictive 
because it ignores areas of major importance for an understand- 
ing of behavior. 

Contrary to another common misconception, Skinner did 
not object to cognitive psychology’s concern with private events 
(i.e., events taking place “inside the skin”) (Moore, 2000). Skin- 
ner was the first behaviorist to view thoughts and feelings (he 
called them “private events’’) as behavior to be analyzed with the 
same conceptual and experimental tools used to analyze publicly 
observable behavior, not as phenomena or variables that exist 
within and operate according to principles of a separate mental 
world. “I contend that my toothache is just as physical as my 
typewriter” (Skinner, 1945, p. 294). 

Essentially, Skinner’s behaviorism makes three major 
assumptions regarding the nature of private events: (a) Private 
events such as thoughts and feelings are behavior; (b) behav- 
ior that takes place within the skin is distinguished from other 
(“public”) behavior only by its inaccessibility; and (c) private 
behavior is influenced by (i.e., is a function of) the same kinds 
of variables as publicly accessible behavior. 


We need not suppose that events which take place within 
an organism’s skin have special properties for that reason. 
A private event may be distinguished by its limited accessi- 
bility but not, so far as we know, by any special structure of 
nature. (Skinner, 1953, p. 257) 


By incorporating private events into an overall conceptual 
system of behavior, Skinner created a radical behaviorism that 
includes and seeks to understand all human behavior. “What 
is inside the skin, and how do we know about it? The answer 


is, I believe, the heart of radical behaviorism” (Skinner, 1974, 
p. 218). The proper connotations of the word radical in radi- 
cal behaviorism are far-reaching and thoroughgoing, connoting 
the philosophy’s inclusion of all behavior, public and private. 
Radical is also an appropriate modifier for Skinner’s form of 
behaviorism because it represents a dramatic departure from 
other conceptual systems in calling for 


probably the most drastic change ever proposed in our 
way of thinking about man. It is almost literally a matter 
of turning the explanation of behavior inside out. 
(Skinner, 1974, p. 256) 


Skinner and the philosophy of radical behaviorism 
acknowledge the events on which fictions such as cognitive 
processes are based. Radical behaviorism does not restrict the 
science of behavior to phenomena that can be detected by more 
than one person. In the context of radical behaviorism, the term 
observe implies “coming into contact with” (Moore, 1984). 
Radical behaviorists consider private events such as thinking 
or sensing the stimuli produced by a damaged tooth to be no 
different from public events such as oral reading or sensing the 
sounds produced by a musical instrument. According to Skinner 
(1974), “What is felt or introspectively observed is not some 
nonphysical world of consciousness, mind, or mental life but 
the observer’s own body” (pp. 18-19). 

The analysis of private events is a major aspect of rad- 
ical behaviorism and indispensable for a comprehensive sci- 
ence of behavior (Palmer, 2011). Moore (1980, 2015) stated 
it concisely: 


For radical behaviorism, private events are those 
events wherein individuals respond with respect to 
certain stimuli accessible to themselves alone. . . . The 
responses that are made to those stimuli may themselves 
be public, i.e., observable by others, or they may be 
private, i.e., accessible only to the individual involved. 
Nonetheless, to paraphrase Skinner (1953), it need not 
be supposed that events taking place within the skin 
have any special properties for that reason alone... . 
For radical behaviorism, then, one’s responses with 
respect to private stimuli are equally lawful and alike 
in kind to one’s responses with respect to public 
stimuli. (1980, p. 460) 


[T]hese events are critical to understanding behavior in all 
its complexity. Just as importantly, they need not be formu- 
lated in different terms and with different concepts that are 
publicly observable behavior events. (2015, p. 18) 


Scientists and practitioners are affected by their own 
social context, and institutions and schools are dominated by 
mentalism (Heward & Cooper, 1992; Kimball, 2002). A firm 
grasp of the philosophy of radical behaviorism, in addition 
to knowledge of principles of behavior, can help the scientist 
and practitioner resist the mentalistic approach of dropping the 
search for controlling variables in the environment and drifting 
toward explanatory fictions in the effort to understand behav- 
ior. The principles of behavior and the procedures presented 
in this text apply equally to public and private events. Radical 
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behaviorism is the philosophical position underlying the content 
presented in this text. 

As Friman (2017) noted, Skinner’s behaviorism viewed 
behavior as a natural science. 


By taking this stand, he was promoting a larger 

idea, specifically that behavior was solely a physical 
phenomenon brought about, maintained, strengthened, 
or weakened solely by physical (environmental) 
events. In other words, he was promoting the idea 

that behavior is a function of environmental circum- 
stances and their context. This is the most powerful idea 
ever invented by mankind for understanding, knowing, 
and approaching human behavior especially when it is a 
problem. (p. 176) 


A thorough discussion of radical behaviorism is far beyond 
the scope of this text. The serious student of applied behavior 
analysis will devote considerable study to Skinner’s original 
writings and to other authors who have critiqued, analyzed, and 
extended the philosophical foundations of the science of behav- 
ior.” (See Box 1.1 for Don Baer’s perspectives on the meaning 
and importance of radical behaviorism.) 


Applied Behavior Analysis 


The first study to report the human application of principles of 
operant behavior was conducted by Fuller (1949). The subject 
was an 18-year-old boy with profound developmental disabil- 
ities who was described in the language of the time as a “veg- 
etative idiot.” He lay on his back, unable to roll over. Fuller 
filled a syringe with a warm sugar-milk solution and injected 
a small amount of the fluid into the young man’s mouth every 
time he moved his right arm (that arm was chosen because 
he moved it infrequently). Within four sessions the boy was 
moving his arm to a vertical position at a rate of three times 
per minute. !? 


The attending physicians . . . thought it was impos- 

sible for him to learn anything—according to them, 

he had not learned anything in the 18 years of his 
life—yet in four experimental sessions, by using the 
operant conditioning technique, an addition was made 

to his behavior which, at this level, could be termed 
appreciable. Those who participated in or observed the 
experiment are of the opinion that if time permitted, 
other responses could be conditioned and discriminations 
learned. (Fuller, 1949, p. 590) 


During the 1950s and into the early 1960s researchers 
used the methods of the experimental analysis of behavior to 
determine whether the principles of behavior demonstrated in 
the laboratory with nonhuman subjects could be replicated with 
humans. According to Thompson and Hackenberg (2009), “the 
field of applied analysis emerged from the experimental analysis 
of behavior, like Adam’s rib” (p. 271). 

Much of the early research with human subjects was 
conducted in clinic or laboratory settings. Although the par- 
ticipants typically benefited from these studies by learning 


new behaviors, the researchers’ major purpose was to deter- 
mine whether the basic principles of behavior discovered in 
the laboratory operated with humans. For example, Sidney 
Bijou (1955, 1957, 1958)!! researched several principles 
of behavior with typically developing subjects and people 
with intellectual disabilities; Don Baer (1960, 1961, 1962) 
examined the effects of punishment, escape, and avoidance 
contingencies on preschool children; and Ogden Lindsley 
(1956; Lindsley & Skinner, 1954) assessed the effects of 
operant conditioning on the behavior of adults with schizo- 
phrenia. These early researchers clearly established that the 
principles of behavior are applicable to human behavior, and 
they set the stage for the later development of applied behavy- 
ior analysis. 

The branch of behavior analysis that would later be 
called applied behavior analysis (ABA) can be traced to 
the 1959 publication of Ayllon and Michael’s paper titled 
“The Psychiatric Nurse as a Behavioral Engineer.” The authors 
described how direct care personnel in a state hospital used a 
variety of techniques based on the principles of behavior to 
improve the functioning of residents with psychotic disorders 
or intellectual disabilities. During the 1960s many researchers 
began to apply principles of behavior in an effort to improve 
socially important behavior, but these early pioneers faced 
many problems. Laboratory techniques for measuring behavior 
and for controlling and manipulating variables were some- 
times unavailable, or their use was inappropriate in applied 
settings. As a result, the early practitioners of applied behavior 
analysis had to develop new experimental procedures as they 
went along. There was little funding for the new discipline, and 
researchers had no ready outlet for publishing their studies, 
making it difficult to communicate among themselves about 
their findings and solutions to methodological problems. 
Most journal editors were reluctant to publish studies using an 
experimental method unfamiliar to mainstream social science, 
which relied on large numbers of subjects and tests of statis- 
tical inference. 

Despite these problems it was an exciting time, and 
major new discoveries were being made regularly. For exam- 
ple, many pioneering applications of behavior principles to 
education occurred during this period (see, e.g., O'Leary & 
O’Leary, 1972; Ulrich, Stachnik, & Mabry 1974), from which 
were derived teaching procedures such as contingent teacher 
praise and attention (Hall, Lund, & Jackson, 1968), token 
reinforcement systems (Birnbrauer, Wolf, Kidder, & Tague, 
1965), curriculum design (Becker, Engelmann, & Thomas, 
1975), and programmed instruction (Bijou, Birnbrauer, 
Kidder, & Tague, 1966; Markle, 1962). The basic methods 
for reliably improving student performance developed by 
those early applied behavior analysts provided the foundation 
for behavioral approaches to curriculum design, instructional 
methods, classroom management, and the generalization and 
maintenance of learning that continue to be used decades later 
(cf., Twyman, 2013). 

University programs in behavior analysis were begun in 
the 1960s and 1970s at Arizona State University, Florida State 
University, the State University of New York at Stony Brook, 
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What Is Behaviorism? 


Don Baer loved the science of behavior. He loved to write 
about it, and he loved to talk about it. Don was famous for his 
unparalleled ability to speak extemporaneously about complex 
philosophical, experimental, and professional issues in a way 
that always made thorough conceptual, practical, and human 
sense. He did so with the vocabulary and syntax of a great author 
and the accomplished delivery of a master storyteller. The only 
thing Don knew better than his audience was his science. 

On three occasions, in three different decades, graduate 
students and faculty in the special education program at The 
Ohio State University were fortunate to have Professor Baer 
serve as Distinguished Guest Faculty for a doctoral seminar, 
Contemporary Issues in Special Education and Applied Behav- 
ior Analysis. The questions and responses that follow were 
selected from transcripts of two of Professor Baer’s three OSU 
teleconference seminars. 


If a person on the street approached you and asked, 
“What’s behaviorism?” how would you reply? 


The key point of behaviorism is that what people do can 
be understood. Traditionally, both the layperson and the 
psychologist have tried to understand behavior by seeing it as 
the outcome of what we think, what we feel, what we want, 
what we calculate, and etcetera. But we don’t have to think 
about behavior that way. We could look upon it as a process 
that occurs in its own right and has its own causes. And those 
causes are, very often, found in the external environment. 

Behavior analysis is a science of studying how we can 
arrange our environments so they make very likely the behav- 
iors we want to be probable enough, and they make unlikely 
the behaviors we want to be improbable. Behaviorism is under- 
standing how the environment works so that we can make 
ourselves smarter, more organized, more responsible; so we 
can encounter fewer punishments and fewer disappointments. 
A central point of behaviorism is this: We can remake our envi- 
ronment to accomplish some of that much more easily than we 
can remake our inner selves. 


An interviewer once asked Edward Teller, the physicist 
who helped develop the first atomic bomb, “Can you 
explain to a nonscientist what you find so fascinating 
about science, particularly physics?” Teller replied, 
“No.” I sense that Teller was suggesting that a nonsci- 
entist would not be able to comprehend, understand, or 


the University of Illinois, Indiana University, the Univer- 
sity of Kansas, The Ohio State University, the University of 
Oregon, the University of Southern Illinois, the University of 
Washington, West Virginia University, and Western Michigan 
University, among others. Through their teaching and research, 
faculty at each of these programs made major contributions to 
the rapid growth of the field.!? 


appreciate physics and his fascination with tt. If a nonsci- 
entist asked you, “What do you find so fascinating about 
science, particularly the science of human behavior?” 
what would you say? 


Ed Morris organized a symposium on just this topic a 
couple of years ago at the Association for Behavior Analysis 
annual convention, and in that symposium, Jack Michael 
commented on the fact that although one of our discipline’s 
big problems and challenges is communicating with our 
society about who we are, what we do, and what we can do, 
he didn’t find it reasonable to try to summarize what behavior 
analysis is to an ordinary person in just a few words. He gave 
us this example: Imagine a quantum physicist is approached 
at a cocktail party by someone who asks, “What is quantum 
physics?” Jack said that the physicist might very well answer, 
and probably should answer, “I can’t tell you in a few words. 
You should register for my course.” 

I’m very sympathetic with Jack’s argument. But I also 
know, as someone who’s confronted with the politics of relating 
our discipline to society, that although it may be a true answer, 
it’s not a good answer. It’s not an answer that people will hear 
with any pleasure, or indeed, even accept. . . . Therefore, I think 
we have to engage in a bit of honest show business. So, if I had 
to somehow state some connotations of what holds me in the 
field, I guess I would say that since I was a child I always found 
my biggest reinforcer was something called understanding. 
I liked to know how things worked. And of all of the things in 
the world there are to understand, it became clear to me that the 
most fascinating was what people do. I started with the usual 
physical science stuff, and it was intriguing to me to understand 
how radios work, and how electricity works, and how clocks 
work, etcetera. But when it became clear to me that we could 
also learn how people work—not just biologically, but behay- 
iorally—I thought that’s the best of all. Surely, everyone must 
agree that that’s the most fascinating subject matter. That there 
could be a science of behavior, of what we do, of who we are? 
How could you resist that? 


Adapted from “Thursday Afternoons with Don: Selections from Three Tele- 
conference Seminars on Applied Behavior Analysis” by W. L. Heward & 
C. L. Wood (2003). In K. S. Budd & T. Stokes (Eds.), A Small Matter of Proof: 
The Legacy of Donald M. Baer (pp. 293-310). Reno, NV: Context Press. Used 
by permission. 


Two significant events in 1968 mark that year as the for- 
mal beginning of contemporary applied behavior analysis. First, 
the Journal of Applied Behavior Analysis (JABA) began publica- 
tion. JABA was the first journal in the United States to deal with 
applied problems that gave researchers using methodology from 
the experimental analysis of behavior an outlet for publishing 
their findings. JABA was and continues to be the flagship journal 
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of applied behavior analysis. Many of the early articles in JABA 
became model demonstrations of how to conduct and interpret 
applied behavior analysis, which in turn led to improved appli- 
cations and experimental methodology. 

The second major event of 1968 was the publication of 
the paper “Some Current Dimensions of Applied Behavior 
Analysis” by Donald M. Baer, Montrose M. Wolf, and Todd 
R. Risley. These authors, the founding fathers of the new dis- 
cipline, recommended criteria for judging the adequacy of 
research and practice in applied behavior analysis and outlined 
the scope of work they envisioned for those engaged in the sci- 
ence. Their iconic paper is the most widely cited publication in 
applied behavior analysis and generally regarded as the standard 
description of the discipline. 


CHARACTERISTICS OF APPLIED 
BEHAVIOR ANALYSIS 


Baer, Wolf, and Risley (1968) recommended that applied behav- 
ior analysis be applied, behavioral, analytic, technological, 
conceptually systematic, effective, and capable of appropriately 
generalized outcomes. In 1987 Baer and colleagues reported 
that the “seven self-conscious guides to behavior analytic con- 
duct” (p. 319) they had offered 20 years earlier “remain func- 
tional; they still connote the current dimensions of the work 
usually called applied behavior analysis” (p. 314). The seven 
dimensions they posed continue to serve as useful and relevant 
signposts for identifying research in applied behavior analysis. 


Applied 


The applied in applied behavior analysis signals ABA’s commit- 
ment to effecting improvements in behaviors that enhance and 
improve people’s lives. To meet this criterion, the researcher 
or practitioner must select behaviors to change that are socially 
significant for participants: social, language, academic, daily 
living, self-care, vocational, and/or recreation and leisure behav- 
iors that improve the day-to-day life experience of the partic- 
ipants and/or affect their significant others (parents, teachers, 
peers, employers) in such a way that they behave more posi- 
tively with and toward the participant. 


Behavioral 


At first it may seem superfluous to include such an obvious 
criterion—of course applied behavior analysis must be behav- 
ioral. However, Baer and colleagues (1968) made three import- 
ant points relative to the behavioral criterion. First, not just any 
behavior will do; the behavior chosen for study must be the 
behavior in need of improvement, not a similar behavior that 
serves as a proxy for the behavior of interest or the subject’s 
verbal description of the behavior. Behavior analysts conduct 
studies of behavior, not studies about behavior. For example, in 
a study evaluating the effects of a program to teach school chil- 
dren to get along with one another, an applied behavior analyst 
would directly observe and measure clearly defined classes of 
interactions between and among the children instead of using 
indirect measures such as the children’s answers on a sociogram 


or responses to a questionnaire about how they believe they get 
along with one another. 

Second, the behavior must be measurable; the precise and 
reliable measurement of behavior is just as critical in applied 
research as it is in basic research. Applied researchers must meet 
the challenge of measuring socially significant behaviors in their 
natural settings, and they must do so without resorting to the 
measurement of nonbehavioral substitutes. 

Third, when changes in behavior are observed during an 
investigation, it is necessary to ask whose behavior has changed. 
Perhaps only the behavior of the observers has changed. “Explicit 
measurement of the reliability of human observers thus becomes 
not merely good technique, but a prime criterion of whether the 
study was appropriately behavioral” (Baer et al., 1968, p. 93). 
Or perhaps the experimenter’s behavior has changed in an 
unplanned way, making it inappropriate to attribute any observed 
change in the subject’s behavior to the independent variables that 
were manipulated. The applied behavior analyst should attempt 
to monitor the behavior of all persons involved in a study. 


Analytic 


A study in applied behavior analysis is analytic when the 
experimenter has demonstrated a functional relation between 
the manipulated events and a reliable change in some measur- 
able dimension of the targeted behavior. In other words, the 
experimenter must be able to control the occurrence and non- 
occurrence of the behavior. Sometimes, however, society does 
not allow the repeated manipulation of important behaviors to 
satisfy the requirements of experimental method. Therefore, 
applied behavior analysts must demonstrate control to the great- 
est extent possible, given the restraints of the setting and behav- 
ior; and then they must present the results for judgment by the 
consumers of the research. The ultimate issue is believability: 
Has the researcher achieved experimental control to demonstrate 
a reliable functional relation? 

The analytic dimension enables ABA not only to demon- 
strate effectiveness but also to provide the “acid test proof” of 
functional and replicable relations between the interventions it 
recommends and socially significant outcomes. 


Because we are a data- and design-based discipline, we 
are in the remarkable position of being able to prove 
that behavior can work in the way that our technology 
prescribes. We are not theorizing about how behay- 

ior can work; we are describing systematically how it 
has worked many times in real-world applications, in 
designs too competent and with measurement systems 
too reliable and valid to doubt. Our ability to prove that 
behavior can work that way does not, of course, estab- 
lish that behavior cannot work any other way: we are 
not in a discipline that can deny any other approaches, 
only in one that can affirm itself as knowing many of 
its sufficient conditions at the level of experimental 
proof... our subject matter is behavior change, and we 
can specify some actionable sufficient conditions for it. 
(D. M. Baer, personal communication, October 21, 1982, 
emphasis in original) 
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Technological 


A study in applied behavior analysis is technological when all 
of its operative procedures are identified and described with suf- 
ficient detail and clarity “such that a reader has a fair chance of 
replicating the application with the same results” (Baer, Blount, 
Detrich, & Stokes, 1987, p. 320). 


It is not enough to say what is to be done when 

the subject makes response R;; it is essential also 
whenever possible to say what is to be done if the 
subject makes the alternative responses, Ro, R3, etc. 
For example, one may read that temper tantrums in 
children are often extinguished by closing the child 
in his room for the duration of the tantrums plus ten 
minutes. Unless that procedure description also states 
what should be done if the child tries to leave the 
room early, or kicks out the window, or smears feces 
on the walls, or begins to make strangling sounds, etc., 
it is not precise technological description. (Baer et al., 
1968, pp. 95-96) 


No matter how powerful its effects in any given study, 
a behavior change method will be of little value if practi- 
tioners are unable to replicate it. The development of a rep- 
licable technology of behavior change has been a defining 
characteristic and continuing goal of ABA from its inception. 
Behavioral tactics are replicable and teachable to others. Inter- 
ventions that cannot be replicated with sufficient fidelity to 
achieve comparable outcomes are not considered part of the 
technology. 

A good check of the technological adequacy of a proce- 
dural description is to have a person trained in applied behav- 
ior analysis carefully read the description and then act out the 
procedure in detail. If the person makes any mistakes, adds any 
operations, omits any steps, or has to ask any questions to clarify 
the written description, then the description is not sufficiently 
technological and requires improvement. 


Conceptually Systematic 


Although Baer and colleagues (1968) did not state so explic- 
itly, a defining characteristic of applied behavior analysis 
concerns the types of interventions used to improve behavior. 
Although an infinite number of tactics and specific procedures 
can be used to alter behavior, almost all are derivatives and/or 
combinations of a relatively few basic principles of behavior. 
Thus, Baer and colleagues recommended that research reports 
of applied behavior analysis be conceptually systematic, 
meaning that the procedures for changing behavior and any 
interpretations of how or why those procedures were effective 
should be described in terms of the relevant principle(s) from 
which they were derived. 

Baer and colleagues (1968) provided a strong rationale for 
the use of conceptual systems in applied behavior analysis. First, 
relating specific procedures to basic principles might enable the 
research consumer to derive other similar procedures from the 
same principle(s). Second, conceptual systems are needed if 


a technology is to become an integrated discipline instead of a 
“collection of tricks.” Loosely related collections of tricks do not 
lend themselves to systematic expansion, and they are difficult 
to learn and to teach. 


Effective 


An effective application of behavioral techniques must improve 
the behavior under investigation to a practical degree. “In appli- 
cation, the theoretical importance of a variable is usually not at 
issue. Its practical importance, specifically its power in alter- 
ing behavior enough to be socially important, is the essential 
criterion” (Baer et al., 1968, p. 96). Whereas some investigations 
produce results of theoretical importance or statistical signifi- 
cance, to be judged effective an applied behavior analysis study 
must produce behavior changes that reach clinical or social 
significance. 

How much a given behavior of a given subject needs to 
change for the improvement to be considered socially import- 
ant is a practical question. Baer and colleagues stated that the 
answer is most likely to come from the people who must deal 
with the behavior; they should be asked how much the behavior 
needs to change. The necessity of producing behavioral changes 
that are meaningful to the participant and/or those in the partic- 
ipant’s environment has pushed behavior analysts to search for 
“robust” variables, interventions that produce large and consis- 
tent effects on behavior (Baer, 1977a). 

When they revisited the dimension of effectiveness 20 years 
later, Baer and colleagues (1987) recommended that the effec- 
tiveness of ABA also be judged by a second kind of outcome: 
the extent to which changes in the target behaviors result in 
noticeable changes in the reasons those behaviors were selected 
for change originally. If such changes in the subjects’ lives do 
not occur, ABA may achieve one level of effectiveness yet fail 
to achieve a critical form of social validity (Wolf, 1978). 


We may have taught many social skills without examin- 
ing whether they actually furthered the subject’s social 
life; many courtesy skills without examining whether 
anyone actually noticed or cared; many safety skills 
without examining whether the subject was actually 
safer thereafter; many language skills without measur- 
ing whether the subject actually used them to interact 
differently than before; many on-task skills without 
measuring the actual value of those tasks; and, in 
general, many survival skills without examining 

the subject’s actual subsequent survival. (Baer et al., 
1987, p. 322) 


Generality 


A behavior change has generality if it lasts over time, appears 
in environments other than the one in which the intervention 
that initially produced it was implemented, and/or spreads 
to other behaviors not directly treated by the intervention. A 
behavior change that continues after the original treatment 
procedures are withdrawn has generality. And generality is 
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evident when changes in targeted behavior occur in nontreat- 
ment settings or situations as a function of treatment proce- 
dures. Generality also exists when behaviors change that were 
not the focus of the intervention. Although not all instances 
of generality are adaptive (e.g., a beginning reader who has 
just learned to make the sound for the letter p in words such 
as pet and ripe, might make the same sound when seeing the 
letter p in the word phone), desirable generalized behavior 
changes are important outcomes of an applied behavior anal- 
ysis program because they represent additional dividends in 
terms of behavioral improvement. Strategies and tactics for 
promoting desirable generalization of behavior changes are 
detailed in Chapter 30. 

More than 50 years have passed since Baer, Wolf, and 
Risley (1968) proposed these seven dimensions as the defining 
features of applied behavior analysis. For insightful discussion 
of the usefulness of these dimensions for contemporary ABA 
research, see Axelrod (2017), Cataldo (2017), Critchfield and 
Reed (2017), and Friman (2017). 


Other Characteristics of ABA 


Applied behavior analysis offers society an approach toward 
solving problems that is accountable, public, doable, empower- 
ing, and optimistic (Heward, 2005). These characteristics should 
make behavior analysts “feel good” and increase the extent to 
which decision makers and consumers in many areas look to 
applied behavior analysis as a valuable and important source of 
knowledge for achieving improved outcomes. 


Accountable 


Applied behavior analysts’ commitment to effectiveness, 
their focus on accessible environmental variables that reli- 
ably influence behavior, and their reliance on direct and 
frequent measurement to detect changes in behavior yield 
an inescapable and socially valuable form of accountability. 
Direct and frequent measurement—the foundation and most 
important component of ABA practices—enables behavior 
analysts to identify their successes and, equally important, 
their failures so they can make changes in an effort to change 
failure to success (Bushell & Baer, 1994; Greenwood & 
Maheady, 1997). 


Failure is always informative in the logic of behavior 
analysis, just as it is in engineering. The constant reac- 
tion to lack of progress [is] a definitive hallmark 

of ABA. (Baer, 2005, p. 8) 


Gambrill (2003) described the sense of accountability and 
self-correcting nature of applied behavior analysis very well. 


Applied behavior analysis is a scientific approach to 
understanding behavior in which we guess and critically 
test ideas, rather than guess and guess again. It is a process 
for solving problems in which we learn from our mistakes. 
Here, false knowledge and inert knowledge are not 
valued. (p. 67) 


Public 
“Everything about ABA is visible and public, explicit and 
straightforward. . .. ABA entails no ephemeral, mystical, or 


metaphysical explanations; there are no hidden treatments: 
there is no magic” (Heward, 2005, p. 322). The transparent, 
public nature of ABA should raise its value in fields such as 
education, parenting and child care, employee productivity, 
geriatrics, health and safety, and social work—to name only a 
few—whose goals, methods, and outcomes are of vital interest 
to many constituencies. 


Doable 


Classroom teachers, parents, coaches, workplace supervisors, 
and sometimes the participants themselves implemented the 
interventions found effective in many ABA studies. This 
demonstrates the pragmatic element of ABA. “Although ‘doing 
ABA’ requires far more than learning to administer a few simple 
procedures, it is not prohibitively complicated or arduous. As 
many teachers have noted, implementing behavioral strategies in 
the classroom . . . might best be described as good old-fashioned 
hard work” (Heward, 2005, p. 322). 


Empowering 


ABA gives practitioners real tools that work. Knowing how 
to do something and having the tools to accomplish it instills 
confidence in practitioners. Seeing the data showing behavioral 
improvements in one’s clients, students, or teammates, or in 
oneself, not only feels good but also raises one’s self-assurance 
in assuming even more difficult challenges in the future. 


Optimistic 
Practitioners knowledgeable and skilled in behavior analysis 


have genuine cause to be optimistic for four reasons. First, as 
Strain and Joseph (2004) noted: 


The environmental view promoted by behaviorism is 
essentially optimistic; it suggests that (except for gross 
genetic factors) all individuals possess roughly equal 
potential. Rather than assuming that individuals have some 
essential internal characteristic, behaviorists assume that 
poor outcomes originate in the way the environment and 
experience shaped the individual’s current behavior. Once 
these environmental and experiential factors are identified, 
we can design prevention and intervention programs to 
improve the outcomes. . . . Thus, the emphasis on external 
control in the behavioral approach . . . offers a conceptual 
model that celebrates the possibilities for each individual. 
(Strain et al., 1992, p. 58) 


Second, direct and continuous measurement enables 
practitioners to detect small improvements in performance that 
might otherwise be overlooked. Third, the more often a practi- 
tioner uses behavioral tactics with positive outcomes (the most 
common result of behaviorally based interventions), the more 
optimistic she becomes about the prospects for future success. 
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A sense of optimism, expressed by the question “Why 
not?” has been a central part of ABA and has had an 
enormous impact on its development from its earliest 
days. Why can’t we teach a person who does not yet 

talk to talk? Why shouldn’t we go ahead and try to 

change the environments of young children so that they 
will display more creativity? Why would we assume that 
this person with a developmental disability could not learn 
to do the same things that many of us do? Why not try to 
do it? (Heward, 2005, p. 323) 


Fourth, ABA’s peer-reviewed literature provides many 
examples of success in teaching students who had been con- 
sidered unteachable. ABA’s continuous record of achievements 
evokes a legitimate feeling of optimism that future developments 
will yield solutions to behavioral challenges that are currently 
beyond the existing technology. For example, in response to 
the perspective that some people have disabilities so severe and 
profound that they should be viewed as ineducable, Don Baer 
offered this perspective: 


Some of us have ignored both the thesis that all 
persons are educable and the thesis that some 
persons are ineducable, and instead have exper- 
imented with ways to teach some previously 
unteachable people. Those experiments have 
steadily reduced the size of the apparently ineduca- 
ble group relative to the obviously educable group. 
Clearly, we have not finished that adventure. Why 
predict its outcome, when we could simply pursue it, 
and just as well without a prediction? Why not 
pursue it to see if there comes a day when there is 
such a small class of apparently ineducable persons 
left that it consists of one elderly person who is put 
forward as ineducable. If that day comes, it will be a 
very nice day. And the next day will be even better. 
(D. M. Baer, February 15, 2002, personal communi- 
cation, as cited in Heward, Alber-Morgan, & Konrad, 
2017, p. 404) 


A DEFINITION OF APPLIED BEHAVIOR 
ANALYSIS 


We began this chapter by stating that applied behavior analysis 
is a science with a dual purpose of understanding and improv- 
ing socially important behavior. We then described attitudes, 
assumptions, and methods that are fundamental to scientific 
inquiry, briefly reviewed the development of the science and 
philosophy of behavior analysis, and examined the characteris- 
tics of ABA. All of that provided necessary context for offering 
the following definition of applied behavior analysis: 


Applied behavior analysis is the science in which tactics 
derived from the principles of behavior are applied system- 
atically to improve socially significant behavior and exper- 
imentation is used to identify the variables responsible for 
behavior change. 


This definition includes six key components. First, applied 
behavior analysis is a science, which means ABA research- 
ers and practitioners are guided by the attitudes and methods 
of scientific inquiry. Second, all behavior change procedures 
are described and implemented in a systematic, technological 
manner. Third, not any means of changing behavior qualifies 
as applied behavior analysis: The field circumscribes only 
those tactics conceptually derived from the basic principles 
of behavior. Fourth, the focus of applied behavior analysis is 
socially significant behavior. The fifth and sixth parts of the 
definition specify the twin goals of applied behavior analysis: 
improvement and understanding. Applied behavior analysis 
seeks to make meaningful improvement in important behavior 
and to achieve an analysis of the factors responsible for that 
improvement. 8 


Four Interrelated Domains of Behavior 
Analytic Science and Professional Practice 
Guided by That Science 


The science of behavior analysis and its application to human 
problems consists of four domains: the three branches of behav- 
ior analysis—radical behaviorism, EAB, and ABA—and profes- 
sional practice in various fields that is informed and guided by 
that science. Figure 1.2 identifies defining features and charac- 
teristics of each domain. Although most behavior analysts work 
primarily in one or two of the domains shown in Figure 1.2, it is 
common for a behavior analyst to function in multiple domains 
at one time or another (Hawkins & Anderson, 2002; Moore & 
Cooper, 2003). 

Radical behaviorism, the philosophical domain of behav- 
ior analysis, entails pursuing theoretical and conceptual issues. 
Examples of theoretical and conceptual behavior analysis are 
Glenn’s (2004) exploration on the interplay of behavioral and 
cultural practices; Schlinger’s (2008b) analysis of listening as 
behaving verbally; Dillenburger and Keenan’s (2005) discussion 
of bereavement; Layng’s (2017) theoretical account of emotions 
and emotional behavior; and M. Malott’s (2016) examination of 
leadership. 

The experimental analysis of behavior is the basic 
research branch of the science. Basic research consists of exper- 
iments in (mostly) laboratory settings with both human and 
nonhuman subjects with a goal of discovering, extending, and 
clarifying fundamental principles of behavior. Examples of the 
wide range of topics investigated by EAB researchers include 
how organisms direct their attention in complex environments 
(Shahan, 2013); choice making (Mazur & Fantino, 2014); 
remembering and forgetting (White, 2013); delay discounting 
(Odum, 2011); and variability of operant behavior (Neuringer & 
Jensen, 2013)” + 

Applied behavior analysts conduct experiments aimed 
at discovering and clarifying functional relations between 
socially significant behavior and its controlling variables, with 
which they can contribute to the further development of humane 
and effective technologies of behavior change. Examples of 
ABA research include Tarbox, Wallace, and Williams’s (2003) 


Radical Behaviorism 
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Experimental Analysis 
of Behavior (EAB) 


Province 


Primary activity 


Primary goal 
and product 


Secondary goals 


Agreement 
with existing 
database 


Testability 


| Precision 


Theory and philosophy 


Conceptual and philo- 
sophical analysis 


Theoretical account of 
all behavior consistent 
with existing data 


Identify areas in which 
empirical data are 
absent and/or conflict 
and suggest resolutions 


As much as possible, but 
theory must go beyond 
database by design 


Partially—All behav- 
ior and variables of 
interest are not acces- 
sible (e.g., phylogenic 
contingencies) 


Applied Behavior 
Analysis (ABA) 


Figure 1.2 Comparisons and relationships among the four domains of behavior analysis science and practice. 


Practice Guided by 
Behavior Analysis 


Basic research 


Design, conduct, inter- 
pret, and report basic 
experiments 


Discover and clarify basic 
principles of behavior; 
functional relations 
between behavior and 
controlling variables 


Identify questions for 
EAB and/or ABA to 
investigate further; raise 
theoretical issues 


Complete—Although 
differences among data 
sets exist, EAB provides 
the basic research 
database 


Mostly—Technical 
limitations preclude 
measurement and exper- 
imental manipulation of 
some variables 


* 


Translational Behavior Analysis 


Applied research 


Design, conduct, inter- 
pret, and report applied 
experiments 


A technology for 
improving socially sig- 
nificant behavior; func- 
tional relations between 
socially significant 
behavior and controlling 
variables 


Identify questions for 
EAB and/or ABA to 
investigate further; raise 
theoretical issues 


Complete—Although 
differences among data 
sets exist, ABA provides 
the applied research 
database 


Mostly—Same lim- 
itations as EAB plus 
those posed by applied 
settings (e.g., ethical 
concerns, uncontrolled 
events) 


Most < 


ory attempts to account 
for all behavior 


Wide scope because the- 


As much scope as the 
EAB database enables 


As much scope as the 
ABA database enables 


Least < 


Minimal precision is 
possible because experi- 
mental data do not exist 
for all behavior encom- 
passed by theory 


*Translational behavior analysis bridges between EAB and ABA (see Mcllvane, 2009). 


As much precision as 
EAB’s current tech- 
nology for experimen- 
tal control and the 
researcher's skills enable 


As much precision as 
ABA's current tech- 
nology for experimen- 
tal control and the 
researcher's skills enable 


Helping people behave 
more successfully 


Design, implement, 
and evaluate behavior 
change programs 


Improvements in the 
lives of participants/ 
clients as a result 

of changes in their 
behavior 


Increased efficiency in 
achieving primary goal; 
may identify questions 
for ABA and EAB 


As much as possible, 
but practitioners must 
often deal with situa- 
tions not covered by 
existing data 


Partially—All behavior 
and variables of interest 
are not accessible (e.g., 
a student's home life) 


Least 


Narrow scope because 
practitioner's primary 
focus is helping the 
specific situation 


Most 


Maximum precision 
is needed to change 
behavior most effec- 
tively in specific 
instance 


assessment and treatment of elopement (running or walking 
away from a caregiver without permission) by individuals 
with disabilities; Romanowich and Lamb’s (2015) research 
on the effects of various schedules of reinforcement on 
smokers’ abstinence; Crabtree, Alber-Morgan, and Konrad’s 
(2010) experiment on the effects of self-monitoring of story 
elements on the reading comprehension of high school stu- 
dents with learning disabilities; and Poling and colleagues’ 
teaching giant African pouched rats to find unexploded 
landmines (Poling et al., 2010), detect patients infected with 


tuberculosis (Poling et al., 2017), and find people trapped 
under debris from collapsed structures resulting from natural 
disasters, acts of war or terrorism, or engineering mistakes 
(La Bonde eral’, 2015): 

“Behavior analysis is not merely the sum of its basic and 
applied research and conceptual programs. It is their interrela- 
tionship, wherein each branch draws strength and integrity from 
the others. With the unity of behavior analysis clarified, the whole 
of behavior analysis emerges as greater than the sum of its parts” 
(Morris, Todd, Midgley, Schneider, & Johnson, 1990, p. 136). 
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Translational research is evidence of the symbiotic relation 
between the basic and applied domains. Translational research 
includes basic research that “bridges” basic and applied areas 
and applied research that translates the knowledge derived 
from basic research “into state-of-the-art clinical practices 
for use in the community” (Lerman, 2003, p. 415). Ina review 
of translational research on token reinforcement, Hackenberg 
(2018) suggested that all three domains of behavior analy- 
sis science benefit when basic and applied research inform 
each other. 


Unlike standard translational research, based on a 
unidirectional model in which the analysis moves from 
laboratory to the applied realm, work in the area of 

token systems is best served by a bidirectional inter- 

play between laboratory and applied research, where 
applied questions inspire research on basic mechanisms. 
When based on and contributing to an analysis, applied 
research on token economies can be on the leading edge 
of theoretical advances, helping set the scientific research 
agenda. (p. 393)!° 


The delivery of professional services informed by behav- 
ior analysis occurs in the fourth domain. Behavior analytic 
practitioners design, implement, and evaluate behavior change 
programs that consist of behavior change tactics derived 
from fundamental principles of behavior discovered by basic 
researchers and experimentally validated for their effects on 
socially significant behavior by applied researchers. An example 
is when a therapist providing home-based treatment for a child 
with autism embeds frequent opportunities for the child to use 
emerging social and language skills in the context of naturalistic, 
daily routines and ensures that the child’s responses are followed 
with reinforcing events. Another example is a classroom teacher 
trained in behavior analysis who uses positive reinforcement 
and stimulus fading to teach students to identify and classify 
fish into their respective species by the shape, size, and location 
of their fins. 

Although each of the four domains can be defined and 
practiced in its own right, none of the domains are, or should 
be, completely independent of and uninformed by developments 


in the others. Both the science and the effective application of 
its findings benefit when the four domains are interrelated and 
influence one another. 


The Promise and Potential of ABA 


In a paper titled “A Futurist Perspective for Applied Behavior 
Analysis,” Jon Bailey (2000) stated that 


It seems to me that applied behavior analysis is more rele- 
vant than ever before and that it offers our citizens, parents, 
teachers, and corporate and government leaders advan- 
tages that cannot be matched by any other psychological 
approach. . . . | know of no other approach in psychology 
that can boast state-of-the-art solutions to the most trou- 
bling social ills of the day. (p. 477) 


We, too, believe that ABA’s pragmatic, natural science 
approach to discovering environmental variables that reliably 
influence socially significant behavior and to developing a tech- 
nology to take practical advantage of those discoveries offers 
humankind its best hope for solving many of its problems. It 
is important to recognize that the knowledge afforded by ABA 
of “how behavior works” is incomplete, even at the level of 
fundamental principles, as is the technology for changing behay- 
ior derived from that knowledge. There are aspects of behavior 
about which relatively little is known, and additional research, 
both basic and applied, is needed to clarify, extend, and fine-tune 
all existing knowledge. 

Nevertheless, applied behavior analysis research and prac- 
tice have improved human performance and the quality of partic- 
ipants’ lives across a wide range of areas. Figure 1.3 lists some 
of the more than 230 applied behavior analysis research topics 
identified by an informal literature search and survey (Heward 
& Critchfield, 2019). As diverse and impressive as this A-to-Z 
list of topics may seem, no behavioral problem or learning goal 
has been solved completely, and many important problems and 
challenges await analysis. The remainder of this text presents 
a foundation of knowledge that can lead to a fuller and more 
complete understanding of the still young and promising science 
of applied behavior analysis. 


Figure 1.3 ABA: Improving people’s quality of life from A to Z. 


A Cc 

ADHD (Bicard & Neef, 2002), aging (Baker & LeBlanc, 2014), caffeinism (Foxx & Rubinoff, 1979), cancer prevention 
aggression (Brosnan & Healy, 2011), AIDS (DeVries et al., 1991), | (Lombard et al., 1991), child abuse (Van Camp et al., 2003), 
alcohol abuse (Fournier et al., 2004), Alzheimer’s (LeBlanc et al., climate change (Heward & Chance, 2010), coaching athletes 
2006), anorexia nervosa (Solanto et al., 1994), animal training (Stokes et al., 2010), college teaching (Kellum et al., 2001), 
(Protopopova et al., 2016), autism (Ahearn & Tiger, 2013), compassion (Geller, 2012), cooperative learning (Maheady 
aviation safety (Rantz & Van Houten, 2013) et al., 2006), creativity (Winston & Baker, 1985), crying 

" (Bowman, 2013), culturally responsive social skills instruction 


baseball (Heward, 1978), basketball (Kladopoulos & eel dai 

McComas, 2001), bedtime refusal (Friman et al., 1999), D 

behavioral pharmacology (Roll, 2014), bicycle safety (Okinaka = dance (Quinn et al., 2015), delusional speech (Travis & Sturmey 
& Shimazaki, 2011), blackjack skills (Speelman et al., 2015), 2010), dementia (Engelman et al., 1999), depression (Follette & 
braille (Scheithauer & Tiger, 2014), breast cancer detection Darrow, 2014), developmental disabilities (Kurtz & Lind, 2013) 

(Bones et al., 2016), bruxism (Barnoy et al., 2009), bullying diabetes (Raiff et al., 2016), drug addiction (Silverman et al. 
(Ross et al., 2009) 2011), dyslexia (Denton & Meindl, 2016) 
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E 


education (Heward et al., 2005), elopement (Kodak et al., 


2004), energy conservation (Staats et al., 2000), enuresis 
(Friman & Jones, 2005) 


F 

fear of dental procedures (Conyers et al., 2004), feeding dis- 
orders (Volkert & Piazza, 2012), figure skating (Hume et al., 
1985), fire safety (Garcia et al., 2016), food bank donations 
(Farrimond & Leland, 2006), football (Ward & Carnes, 2002), 
foster-family care (Hawkins et al., 1985) 


G 

gambling (Dixon et al., 2015), gamification (Morford et al., 
2014) gender-based violence (Szabo et al., 2019), gerontology 
(Gallagher & Keenan, 2000), golf (Simek et al., 1994), gun 
safety (Miltenberger et al., 2005) 


H 

hairpulling (Rapp et al., 1999), handwriting (Trap et al., 1978), 
happiness (Parsons et al., 2012), headaches (Fitterling et al., 
1988), highway safety (Van Houten et al., 1985), homework 
(Alber et al., 2002), horseback riding (Kelley & Miltenberger, 
2016), horse training (Fox & Belding, 2015), hostage negotia- 
tion (Hughes, 2006), hygiene (Fournier & Berry, 2013) 


| 

impulsivity (Barry & Messer, 2003), incontinence (Adkins & 
Mathews, 1997), industrial safety (Fox et al., 1987), infant 

' care (Dachman, et al., 1986), infection control (Babcock et al., 
/ 1992), intellectual disabilities (Frederick et al., 2013) 


J 
job finding (Azrin et al., 1975), joint attention (Taylor & Hoch, 
2008), juvenile justice (Kirigin et al., 1982) 


K 

keyboard training (DeFulio, 2011), kidnapping prevention 
(Gunby et al., 2010), kitchen skills (Trask-Tyler et al., 1994), 
» kleptomania (Kohn, 2006) 


L 

landmine detection (Edwards et al., 2015), language acquisition 
(Drasgow, et al., 1998), learning disabilities (Wolfe et al., 2000), 
leisure skills Schleien et al., 1981), littering (Powers et al., 1973), 
lockdown drill procedures (Dickson & Vargo, 2017) 


M 

martial arts (BenitezSantiago & Miltenberger, 2016), math 
(Hunter et al., 2016), medical procedures (Hagopian & 
Thompson, 1999), medical training (Levy et al., 2016), mental 
health (A-tjak et al., 2015), music skills (Griffith et al., 2018), 

| myopia (Collins et al., 1981) 


N 

nail biting (Heffernan & Lyons, 2016), nervous habits (see nail 
biting, hair pulling), noise (Ring et al., 2014), noncompliance 
(Mace et al., 1988), nutrition (Horne et al., 2009) 


O 

obesity (De Luca & Holborn, 1992), observational learning 
(DeQuinzio & Taylor, 2015), obsessive-compulsive disorder 
(Penney et al., 2016), organizational behavior management 
| (Rodriguez, 2011), otitis media (O'Reilly, 1997) 


P 

panic disorders, parenting (Miltenberger & Crosland, 2014), 
pediatrics (Friman & Piazza, 2011), phobias (Tyner et al., 
2016), physical activity (Kuhl et al., 2015), physical education 
(McKenzie et al., 2009), pica (Hagopian et al., 2011), play skills 
(Davis-Temple et al., 2014), Prader-Willi syndrome (Page et al., 
1983), problem solving (Axe et al., in press), procrastination 
(Johnson et al., 2016), public health (Biglan & Glenn, 2013), 
public speaking (Mancuso & Miltenberger, 2016) 


Q 

quality control (Kortick & O’Brien, 1996) quantitative analysis 
skills (Fienup & Critchfield, 2010), question asking/answering 
(Ingvarsson et al., 2007) 


R 

x-ray sheilding, reading (Twyman et al., 2005), recycling 
(O'Conner et al., 2010), robbery (Schnelle et al., 1979), rugby 
(Mellalieu et al., 2006), rumination (Woods et al., 2013) 


S 

safe sex (Honnen & Kleinke, 1990), school-wide behavior 
support (Freeman et al., 2016), seat-belt use (Van Houten 

et al., 2010), second language acquisition (May et al., 

2016), self-injury (Lerman & lwata, 1993), self-management 
(Reynolds et al., 2014), sexual abuse (Lumley et al., 1998), shar- 
ing (Marzullo-Kerth et al., 2011), sleep disorders (Piazza et al., 
1997), soccer (Brobst & Ward, 2002), spelling (McNeish et al., 
1992), stereotypy (Ahearn et al., 2003), stuttering (Wagaman 
et al., 1995), substance abuse (Roll et al., 2009), sustainability 
(Leeming et al., 2013), swimming (Hume & Crossman, 1992) 


T 
tantrums (Williams, 1959), TB detection (Poling et al., 2017), 
teacher training (Kretlow et al., 2012), thumb sucking (Friman, 
2000), tobacco use (Romanowich & Lamb, 2015), toileting 
(Greer et al., 2016), toothbrushing (Poche et al., 1982), Tourette 
syndrome (Azrin & Peterson, 1988), track and field (Scott et al., 
1997), traumatic brain injury (Heinicke et al., 2009) 

U 

urban search and rescue (Edwards et al., 2016), 

universal precautions (Luke & Alavosius, 2011) 


V 


vandalism (Mayer et al., 1983), visual acuity (Collins et al., 
1981), vocal tics (Wagaman et al., 1995), vocational training 
(Cullen et al., 2017), voice disorders, (Shriberg, 1971)” 


Ww 

weight loss (VanWormer, 2004), workplace safety (Abernathy & 
Lattal, 2014), writer’s block (Didden et al., 2007), writing skills 
(Hansen & Wills, 2014) 

xX 

x-ray shielding (Greene & Neistat, 1983) 


Y 

yielding to pedestrians (Bennett et al., 2014), yoga (Downs, 
2015), youth sports (Luiselli et al., 2011) 

Z 

zoo animal welfare (Maple & Segura, 2015) 


Notes: Adapted from “ABA from A-to-Z" by W. L. Heward and T. S. Critchfield, 2019. Manuscript in progress. References with three or more coau- 


thors cited as “et al.” 
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SUMMARY 


Science: Basic Characteristics and a Definition 


1. 


Different types of scientific investigations yield knowledge 
that enables the description, prediction, and/or control of 
the phenomena studied. 


. Descriptive studies yield a collection of facts about the 


observed events that can be quantified, classified, and 
examined for possible relations with other known facts. 


. Knowledge gained from a study that finds the systematic 


covariation between two events—termed a correlation— 
can be used to predict the probability that one event will 
occur based on the occurrence of the other event. 


. Results of experiments that show that specific manipula- 


tions of one event (the independent variable) produce a 
reliable change in another event (the dependent variable), 
and that the change in the dependent variable was unlikely 
the result of extraneous factors (confounding variables)—a 
finding known as a functional relation—can be used to 
control the phenomena under investigation. 


. The behavior of scientists in all fields is characterized by a 


common set of assumptions and attitudes: 


¢ Determinism—the assumption that the universe is a 
lawful and orderly place in which phenomena occur as a 
result of other events. 


¢ Empiricism—the objective observation of the phenom- 
ena of interest. 


¢ Experimentation—the controlled comparison of 
some measure of the phenomenon of interest (the 
dependent variable) under two or more different con- 
ditions in which only one factor at a time (the indepen- 
dent variable) differs from one condition to another. 


¢ Replication—repeating experiments (and independent 
variable conditions within experiments) to determine the 
reliability and usefulness of findings. 


e Parsimony—simple, logical explanations must be ruled 
out, experimentally or conceptually, before more comi- 
plex or abstract explanations are considered. 


¢ Philosophic doubt—continually questioning the 
truthfulness and validity of all scientific theory and 
knowledge. 


A Brief History of Behavior Analysis 


6. Behavior analysis consists of three major branches: behav- 


iorism, the experimental analysis of behavior (EAB), and 
applied behavior analysis (ABA). 


. Watson espoused an early form of behaviorism known 


as stimulus—response (S—R) psychology, which did 
not account for behavior without obvious antecedent 
causes. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


We 


names 
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. Skinner founded the experimental analysis of behavior 


(EAB), a natural science approach for discovering orderly 
and reliable relations between behavior and the environ- 
mental variables of which it is a function. 


. EAB is characterized by these methodological features: 


¢ Rate of response is the most common dependent 
variable. 


¢ Repeated or continuous measurement is made of 
carefully defined response classes. 


* Within-subject experimental comparisons are used 
instead of designs comparing the behavior of experi- 
mental and control groups. 


¢ The visual analysis of graphed data is preferred over 
statistical inference. 


¢ A description of functional relations is valued over 
formal theory testing. 


Through thousands of laboratory experiments, 
Skinner and his colleagues and students discovered 
and verified the basic principles of operant behavior 
that provide the empirical foundation for behavior 
analysis today. 


Skinner wrote extensively about a philosophy for a science 
of behavior he called radical behaviorism. Radical behav- 
iorism attempts to explain all behavior, including private 
events such as thinking and feeling. 


Methodological behaviorism is a philosophical position 
that considers behavioral events that cannot be publicly 
observed to be outside the realm of the science. 


Mentalism is an approach to understanding behavior that 
assumes that a mental, or “inner,” dimension exists that 
differs from a behavioral dimension and that phenomena 
in this dimension either directly cause or at least mediate 
some forms of behavior; it relies on hypothetical con- 
structs and explanatory fictions. 


Pragmatism is the philosophical position that the truth or 
value of a scientific statement is determined by the extent 
to which it promotes effective action. 


The first published report of the application of operant 
conditioning with a human subject was a study by Fuller 
(1949), in which an arm-raising response was conditioned 
in an adolescent with profound disabilities. 


The formal beginnings of applied behavior analysis 

can be traced to 1959 and the publication of Ayllon and 
Michael’s article “The Psychiatric Nurse as a Behavioral 
Engineer.” 


Contemporary applied behavior analysis (ABA) began in 
1968 with the publication of the first issue of the Journal 
of Applied Behavior Analysis (JABA). 


Chapter 1 * Definition and Characteristics of Applied Behavior Analysis 


Characteristics of Applied Behavior Analysis 


18. Baer, Wolf, and Risley (1968) stated that a research study 


or behavior change program should meet seven defining 
dimensions to be considered applied behavior analysis: 


* Applied—investigates socially significant behaviors 
with immediate importance to the subject(s). 


* Behavioral—entails precise measurement of the actual 
behavior in need of improvement and documents that it 
was the subject’s behavior that changed. 


* Analytic—demonstrates experimental control over the 
occurrence and nonoccurrence of the behavior—that is, 
if a functional relation is demonstrated. 


* Technological—the written description of all procedures 
used in the study is sufficiently complete and detailed to 
enable others to replicate it. 


Conceptually systematic—behavior change interven- 
tions are derived from basic principles of behavior. 


Effective—improves behavior sufficiently to produce 
practical results for the participant/client. 


Generality—produces behavior changes that last over 
time, appear in other environments, and/or spread to 
other behaviors. 


19. ABA offers society an approach toward solving many 
of its problems that is accountable, public, doable, 


empowering, and optimistic. 


EP ROE NEED TSE REDE SE CEES TEE 


KEY TERMS 


applied behavior analysis (ABA) explanatory fiction 


behaviorism functional analysis 
determinism functional relation 
empiricism hypothetical construct 
experiment mentalism 


experimental analysis of behavior (EAB) 


oe sceopooosasnane Sse DEE EEO EE SEE EOSIN ES CSS STS 
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MULTIPLE-CHOICE QUESTIONS 


1. The level of investigation that involves the collection 
of facts about observed events that can be quantified, 
classified, and examined for possible relations with other 
known facts, and often suggests hypotheses or questions 
for additional research is: 
a. Prediction 
b. Experimentation 
c. Description 
d. Control 
Hint: (See “Science: Basic Characteristics and a Definition’) 


methodological behaviorism 
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A Definition of Applied Behavior Analysis 


20. Applied behavior analysis is the science in which tactics 
derived from the principles of behavior are applied sys- 
tematically to improve socially significant behavior and 
experimentation is used to identify the variables responsi- 
ble for behavior change. 


21. Behavior analysts work in one or more of four interrelated 
domains: behaviorism (theoretical and philosophical 
issues), the experimental analysis of behavior (basic 
research), applied behavior analysis (applied research), 
and professional practice (providing behavior analytic 


services to consumers). 


22. Translational research bridges basic and applied research 


and informs both domains. 


23. ABA’s natural science approach to discovering environ- 


mental variables that reliably influence socially significant 
behavior and developing a technology to take practical 
advantage of those discoveries offers humankind its best 
hope for solving many of its problems. 


24. Applied behavior analysis research and practice 


have improved human performance and the quality 

of participants’ lives across a wide range of areas, 

but no problem has been solved completely, and many 
important problems, challenges, and opportunities 
remain. 


parsimony 
philosophic doubt 
pragmatism 

radical behaviorism 
replication 


science 


2. The level of investigation that demonstrates correlation 
between events and is based on repeated observations is: 


a. Prediction 

b. Experimentation 

c. Description 

d. Control 

Hint: (See “Science: Basic Characteristics and a Definition’) 
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The level of investigation in which functional relations can 
be derived is: 


a. Prediction 

b. Experimentation 

c. Description 

d. Control 

Hint: (See “Science: Basic Characteristics and a Definition”) 


. The overall goal of is to achieve a thorough 
understanding of the phenomenon under study. 
a. Behavior Analysis 
b. Science 


c. Experimentation 
d. Functional relationships 
Hint: (See “Science: Basic Characteristics and a Definition”) 


. A functional relation means that: 


a. Specific change in the independent variable can reliably 
be produced by specific manipulations in the dependent 
variable, and the change in the independent variable 
was unlikely to be the result of confounding variables. 


b. Specific change in the dependent variable can reliably be 
produced by specific manipulations in the independent 
variable, and the change in the independent variable was 
unlikely to be the result of confounding variables. 


c. Specific change in the dependent variable can reliably 
be produced by specific manipulations in the indepen- 
dent variable, and the change in the dependent variable 
was unlikely to be the result of confounding variables. 

d. Specific change in the dependent variable can reliably 
be produced by specific manipulations in the confound- 
ing variable, and the change in the dependent variable 
was unlikely to be the result of the independent variable. 


Hint: (See “Science: Basic Characteristics and a Definition’) 


. This is the assumption upon which science is predicated, 


the idea that the universe is a lawful and orderly place in 
which all phenomena occur as a result of other events. 


a. Mentalism 

b. Determinism 

c. Empiricism 

d. Philosophic Doubt 


Hint: (“Science: Basic Characteristics and a Definition’’) 


. This is the idea that simple, logical explanations must be 


ruled out, experimentally or conceptually, before more 
complex or abstract explanations are considered. 


a. Philosophic Doubt 
b. Experimentation 
c. Replication 

d. Parsimony 


Hint: (“Science: Basic Characteristics and a Definition’) 


8. This is the branch of behavior analysis that focuses on 


10. 


11. 


12. 


basic research: 

a. Applied behavior analysis 

b. Behaviorism 

c. Experimental analysis of behavior 

d. Radical behaviorism 

Hint: (See “A Brief History of Behavior Analysis”) 


. The S-R-S model of psychology is also known as: 


a. Three-term contingency 

b. Watsonian psychology 

c. Respondent behavior model 

d. Reflexive behavior model 

Hint: (See “A Brief History of Behavior Analysis”) 

This person is considered to be the founder of experimental 
analysis of behavior. 

a. John B. Watson 

b. B.F. Skinner 

c. Ivan Pavlov 

d. Don Baer 

Hint: (See “A Brief History of Behavior Analysis”’) 

This is the approach to understanding behavior that 
assumes that a mental or “inner” dimension exists that 
differs from a behavioral dimension and that phenomena 


in this dimension either directly cause or at least mediate 
some forms of behavior. 


a. Radical behaviorism 

b. Methodological behaviorism 

c. Structuralism 

d. Mentalism 

Hint: (See “A Brief History of Behavior Analysis’’) 


These events marked the formal beginning of contemporary 
applied behavior analysis. 


a. “Some Current Dimensions of Applied Behavior 
Analysis,” by Baer, Wolf, and Risley, was published 
and “The Psychiatric Nurse as a Behavioral Engineer,” 
by Ayllon and Michael, was published. 


b. “The Psychiatric Nurse as a Behavioral Engineer,” 
by Ayllon and Michael, was published and Fuller 
conducted a study in which human application of 
operant behavior occurred. 


c. The Journal of Applied Behavior Analysis began 
publication and “Some Current Dimensions of Applied 
Behavior Analysis,” by Baer, Wolf, and Risley, was 
published. 


d. The Journal of Applied Behavior Analysis began 
publication and “The Psychiatric Nurse as a Behavioral 
Engineer,” by Ayllon and Michael, was published. 


Hint: (See “A Brief History of Behavior Analysis”) 
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This is the defining characteristic of behavior analysis that 
focuses on investigating socially significant behaviors 
with immediate importance to the participant(s). 


a. Effective 
b. Applied 
c. Behavioral 
d. Analytical 


Hint: (See “Characteristics of Applied Behavior Analysis”) 


Describe the goal of science and the three different levels 
of investigation. 


Hint: (“Science: Basic Characteristics and a Definition”) 


. State and describe the different scientific attitudes. 


Hint: (Science: Basic Characteristics and a Definition’’) 


. Discuss similarities and differences between differ- 


ent perspectives on behavior (i.e., mentalism, radi- 
cal behaviorism, methodological behaviorism, and 
structuralism). 


Hint: (See “A Brief History of Behavior Analysis’) 


NOTES 


1. 


Spurious correlations result from measures of two randomly chosen vari- 
ables closely tracking each other (e.g., rise in annual organic food sales and 
increased incidence of autism [Redditor Jasonp55, 2018], per capita marga- 
rine consumption and the divorce rate in Maine [Vigen, 2015]). For more 
silly examples, see Vigen (2015); for detailed explanation of the fallacy, 
see West, Bergstrom, and Bergstrom (2010). 

Skinner (1953) noted that although telescopes and cyclotrons give us a 
“dramatic picture of science in action” (p. 12), and science could not have 
advanced very far without them, such devices and apparatus are not science 
themselves. “Nor is science to be identified with precise measurement. We 
can measure and be mathematical without being scientific at all, just as we 
may be scientific without these aids” (p. 12). Scientific instruments bring 
scientists into greater contact with their subject matter and, with measure- 
ment and mathematics, enable a more precise description and control of 
key variables. 

Informative and interesting descriptions of the history of behavior analy- 
sis can be found in Goodall (1972); Guericio (2018); Hackenberg (1995); 
Michael (2004); Morris, Todd, Midgley, Schneider, and Johnson (1990); 
Mountjoy and Cone (1997); Risley (2005); Sidman (2002); Skinner (1956, 
1979): Stokes (2003); Vargas, Vargas, and Knapp (2017); and in a special 
section of articles in the Fall 2003 issue of The Behavior Analyst. 

For interesting biographies and scholarly examinations of J. B. Watson’s 
life and contributions to behavior analysis, see Catania (1993); Morris 
(2013); and Morrow (2017). 

In The Behavior of Organisms, Skinner called the conditioning of respon- 
dent behavior Type S conditioning and the conditioning of operant behavior 
Type R conditioning, but these terms were soon dropped. Respondent and 
operant conditioning and the three-term contingency are further defined 
and discussed in Chapter 2. 

Most of the methodological features of the experimental approach pio- 
neered by Skinner (e.g., rate of response as the primary dependent vari- 
able, within-subject experimental comparisons, visual analysis of graphic 
data displays) continue to characterize both basic and applied research in 


14. 


4. 


This is the defining characteristic of applied behavior 
analysis that demonstrates experimental control over 
the occurrence and non-occurrence of the behavior. 


a. Effective 
b. Analytic 
c. Applied 


d. Experimentation 


Hint: (See “Characteristics of Applied Behavior Analysis”) 


Discuss the origin of applied behavior analysis, including 
early history of the field and defining events of contempo- 
rary applied behavior analysis. 


Hint: (See “A Brief History of Behavior Analysis”) 
Compare and contrast the different branches of behavior 
analysis. 

Hint: (See “A Brief History of Behavior Analysis”) 


. State and describe each of the defining dimensions of 


applied behavior analysis. 


Hint: (See “Characteristics of Applied Behavior Analysis”) 


behavior analysis (e.g., Fisher, Piazza, & Roane, 2011; Lattal, 2013; Perone 
& Hursh, 2013). The five chapters in Part III describe how applied behavior 
analysts use this experimental approach. 

Skinner, considered by many the most eminent psychologist of the 
20th century (Haagbloom et al., 2002), authored or coauthored 291 
primary-source works and a three-volume autobiography (Particulars of My 
Life, 1976; The Shaping of a Behaviorist, 1979; A Matter of Consequences, 
1983). Many of Skinner’s books are available as name-your-price products at 
the B. F. Skinner Foundation (bfskinner.org). Numerous biographical books 
and articles have been written about Skinner, both before and after his death. 
Burrhus Frederic Skinner (1904-1990): A Thank You by Fred Keller (1976), 
B. F. Skinner—The Last Few Days by his daughter Julie Vargas (1990), 
B. F. Skinner, Organism by Charles Catania (1992), B. F: Skinner: A Life by 
Daniel Bjork (1997), Skinner as Self-Manager by Robert Epstein (1997), 
Burrhus F. Skinner: Shaper of Behaviour by Frederik Toates (2009), and 
a series of articles in the 2017, Quarter | issue of Operants provide inter- 
esting and informative insights about Skinner and his work. Morris, Altus, 
and Smith (2005) detail Skinner’s contributions to ABA. Smith and Morris 
(2018) maintain a continually updated bibliography of citations, references, 
and other materials related to Skinner’s writings. 

The nature and role of pragmatism in radical behaviorism are long-running 
topics of discussion and debate (e.g., Barnes-Holmes, 2000; Baum, 2017; 
Moore, 2008; Schoneberger, 2016; Tourinho & Neno, 2003; Zuriff, 1980). 
Excellent discussions and spirited debates about the meaning and useful- 
ness of radical behaviorism as the philosophy for a science of behavior 
can be found in Baum (2011, 2017); Catania and Harnad (1988); Chiesa 
(1994); Diller and Lattal (2008); Johnston (2013); Moore (2008, 2011); 
Palmer (2011); and Rachlin (2018). 

Boyle and Greer (1983) published an extension of Fuller’s classic study 
with comatose patients. 


. Sidney Bijou’s remarkable career and the numerous ways he contributed 


to the founding and advancement of behavior analysis are detailed with a 
personal touch by Morris (2008, 2009). 
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. Articles describing the histories of the applied behavior analysis programs 
at five of these universities can be found in the winter 1993 issue of JABA. 
Kenneth Goodall’s article, The Behavior Shapers (1972), provides a fasci- 
nating look at the work of some of the field’s pioneers. 

. Furman and Lepper (2018) take issue with this definition of applied behav- 
ior analysis because it “employs a subjective criterion (i.e., social relevance 
and/or importance to man and society), taking it out of the realm of an 
objective science” (p. 103) and suggest that ABA be defined as “the scien- 
tific study of behavior change, using the principles of behavior, to evoke or 
elicit a targeted behavioral change” (p. 104). We believe socially significant 
behavior is an essential element of the definition of ABA. The science 
and the philosophy of behaviorism it embodies purposely seek behavior 


14. 


Ils. 


16. 


changes that improve people’s quality of life (Kimball & Heward, 1993; 
Skinner, 1971; 1974). Objective (i.e., scientific) methods for assessing and 
validating the social importance of behaviors targeted for change and the 
extent to which behavior change improves participants’ quality of life are 
described in Chapters 3 and 10. 

Choice making in concurrent schedules, delay discounting, and operant 
variability are discussed in Chapters 13, 27, and 29, respectively. 

For perspectives on the status, role, and future of basic research in behavior 
analysis, see Killeen (2018), Marr (2017), and a special series of articles 
invited by Li, Mahoney, and Poling (2018). 

To learn more about translational research, see Critchfield (201 1a, b) and 
articles in the Fall 2009 and Spring 2011 issues of The Behavior Analyst. 


aero bl Foca 2 


Basic Concepts and Principles 


LEARNING OBJECTIVES 


¢ Define behavior, response, and response class. 


¢ State examples of behavior, response, and response class. 


¢ Define stimulus and stimulus class. 
¢ Define and state examples of positive reinforcement. 
¢ Define and state examples of negative reinforcement. 


Define and state examples of positive punishment. 
e Define and state examples of negative punishment. 
¢ Define and provide examples of stimulus control. 


e Define and provide examples of establishing operations. 


e Describe a behavioral contingency. 
e Describe the respondent conditioning paradigm. 


Describe the operant conditioning paradigm. 


One must understand how the organism is modified by its 
interactions with the environment, how new environment— 
behavior relations are learned and unlearned. 

—Jack L. Michael (2004, p. 1) 


his chapter defines the basic concepts required for a scientific 

analysis of behavior and introduces fundamental principles 

discovered by such analyses. Behavior, the first concept we 
examine, is the most fundamental of all. Because the controlling 
variables of primary importance to applied behavior analysts are 
located in the environment, the concepts of environment and stim- 
ulus are defined next. We then introduce several essential findings 
that the scientific study of behavior—environment relations has 
discovered. Two types of behavior—respondent and operant— 
are described, and the basic ways the environment influences 
each type—respondent conditioning and operant conditioning— 
are introduced. The three-term contingency—a concept for express- 
ing temporal and functional relations between operant behavior 
and environment—and its importance as a focal point in applied 
behavior analysis are then explained.! The chapter’s final section 
recognizes the enormous complexity of human behavior, reminds 
us that behavior analysts possess an incomplete, though ever- 
expanding and progressively sophisticated, knowledge of their 
subject matter, and identifies some of the obstacles and challenges 
faced by those who strive to change behavior in applied settings. 


Define and provide examples of conditioned and unconditioned reinforcement. 


Provide an example of the respondent conditioning paradigm. 


Provide an example of the operant conditioning paradigm. 


BEHAVIOR 


What, exactly, is behavior? In a nutshell, behavior is the activity 
of living organisms. Human behavior is everything people do, 
including how they move and what they say, think, and feel. 
Tearing open a bag of peanuts is behavior, and so is thinking 
how good the peanuts will taste once the bag is open. Reading 
this sentence is behavior, and if you’re holding the book, so is 
feeling its weight and shape in your hands. 

Although words such as activity and movement communi- 
cate the general notion of behavior, a more precise definition 1s 
needed for scientific purposes. How a scientific discipline defines 
its subject matter exerts profound influence on the methods of 
measurement, experimentation, and theoretical analyses that are 
appropriate and possible. 

Building on Skinner’s (1938) definition of behavior as “‘the 
movement of an organism or of its parts in a frame of refer- 
ence provided by the organism or by various external objects 
or fields” (p. 6), Johnston and Pennypacker (1980, 1993, 2009) 
articulated the most conceptually sound and empirically com- 
plete definition of behavior to date. In Strategies and Tactics of 
Behavioral Research, third edition, these authors define behavior 
as follows: 


Behavior is that portion of an organism’s interaction with 
its environment that involves movement of some part of the 
organism (2009, p. 31). 
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Johnston and Pennypacker discuss each part of this defi- 
nition as it relates to researchers and practitioners. The phrase of 
an organism restricts the subject matter to the activity of living 
organisms, leaving notions such as the “behavior” of the stock 
market outside the realm of the scientific use of the term. The 
phrase an organism’s interaction with its environment “avoids 
implying that behavior is a possession of the organism and high- 
lights the requirement for an interactive condition” (2009, p. 31). 
The authors elaborated on this critical part of the definition in 
the second edition of their text: 


Behavior is not a property or attribute of the organism. 

It happens only when there is an interactive condition 
between an organism and its surroundings, which include 
its own body. This means that independent states of the 
organism, whether real or hypothetical, are not behav- 
ioral events, because there is no interactive process. 
Being hungry or being anxious are examples of states that 
are sometimes confused with the behavior that they are 
supposed to explain. Neither phrase specifies an environ- 
mental agent with which the hungry or anxious organism 
interacts, so no behavior is implied. 

Similarly, independent conditions or changes in the 
environment do not define behavioral occurrences because 
no interaction is specified. Someone walking in the rain 
gets wet, but “getting wet” is not an instance of behavior. 
A child may receive tokens for correctly working math 
problems, but “receiving a token” is not behavior. 
Receiving a token implies changes in the environment 
but does not suggest or require change in the child’s 
movement. In contrast, both doing math problems and 
putting the token in a pocket are behavioral events because 
the environment both prompts the child’s actions and is 
then changed by them. (Johnston & Pennypacker, 1993, 
p. 24, emphasis added) 


In addition to excluding static states of the organism, the 
definition does not include bodily movements produced by inde- 
pendent physical forces as behavioral events. For example, being 
blown over by a strong gust of wind is not behavior; given suf- 
ficient wind, nonliving objects and organisms move similarly. 

The phrase movement of some part of the organism identi- 
fied behavior as movement, regardless of scale. “To be observed, 
a response must affect the environment—it must have an effect 
upon an observer or upon an instrument, which in turn can affect 
an observer. This is as true of the contraction of a small group 
of muscle fibers as of pressing a lever or pacing a figure 8” 
(Skinner, 1969, p. 130). Behavior includes movement of body 
parts within the skin not accessible for observation by others. As 
Catania (2013) noted, shifting one’s attention to different instru- 
ments while listening to recorded music need not involve the 
movement of the head or eyes or any other obvious body part. 

The word behavior is usually used in reference to a class 
of responses sharing certain functions (e.g., eating behavior, 
greeting behavior, writing behavior).° The term response refers 
to a specific instance of behavior. A technical definition of 
response is an “action of an organism’s effector. An effector 
is an organ at the end of an efferent nerve fiber that is spe- 
cialized for altering its environment mechanically, chemically, 


or in terms of other energy changes” (Michael, 2004, p. 8, 
italics in original). Human effectors include the striated mus- 
cles (i.e., skeletal muscles such as biceps and quadriceps), 
smooth muscles (e.g., stomach and bladder muscles), and glands 
(e.g., adrenal and pituitary glands). 

Behavior can also be described by its form, or physical 
characteristics. Response topography refers to the physical 
shape or form of behavior. For example, the hand and finger 
movements used to open a bag of peanuts can be described by 
their topographical elements. However, careful observation will 
reveal that the topography differs somewhat each time a person 
opens a bag of snacks. The difference may be slight, but each 
“bag opening response” will vary somewhat from all others. 

Although describing behavior by its topography is some- 
times useful, behavior analysis is characterized by a functional 
analysis of the effects of behavior on the environment. A response 
class is a group of responses with the same function (that is, 
each response in the group produces the same effect on the envi- 
ronment). Some response classes comprise responses of widely 
varying form—imagine the tremendous variability in responses to 
the request to “do something unpredictable” (Neuringer, 2009)— 
whereas the topographical variation among members of other 
response classes is limited (e.g., a person’s signature, proper grip 
for a four-seam fastball). 

Another reason underscoring the importance of a func- 
tional analysis of behavior over a structural or topographical 
description is that two responses of the same topography can 
be completely different behaviors depending on the controlling 
variables. For example, saying “fire” while looking at the letters, 
f-i-r-e, is a vastly different behavior from yelling “Fire!” when 
smelling smoke or seeing flames in a crowded theater. 

Behavior analysts use the term repertoire in at least two 
ways. Repertoire is sometimes used to refer to all of the behay- 
iors a person can do. More often the term denotes a person’s 
collection of knowledge and skills relevant to particular settings 
or tasks. In the latter sense, each of us has learned multiple rep- 
ertoires. For example, each of us has a repertoire of behaviors 
appropriate for informal social situations that differs somewhat 
(or a lot) from the behaviors we use to navigate formal situa- 
tions. And each person has repertoires with respect to language 
skills, academic tasks, everyday routines, recreation, and so on. 
After studying this text, your repertoire of knowledge and skills 
in applied behavior analysis will be advanced. 


ENVIRONMENT 


All behavior occurs within an environmental context; behav- 
ior cannot be emitted in an environmental void or vacuum. 
Johnston and Pennypacker (2009) defined environment and 
noted the implications of that definition for a science of behav- 
ior as follows: 


Environment refers to the full set of physical circum- 
stances in which the organism exists. The term is compre- 
hensive in that any facets of the physical world may be 
considered for their contribution to behavior. The term is 
specific in that for any particular behavior, the focus is 
usually on only those environmental events that are func- 
tionally related to individual responses. . . . 


The relevant environment can even include the 
organism doing the behaving. . . . [W]hen you scratch an 
itch, the stimulation from your skin probably increases the 
effectiveness of relief from the itching as a negative rein- 
forcer and makes the behavior of scratching more likely. 
Our bodies are an ongoing source of antecedent and con- 
sequent environmental events related to responding. This 
fact reminds us that the skin is not an especially important 
boundary in understanding of behavior. (p. 29) 


The environment is a complex, dynamic universe of events 
that differs from moment to moment. When behavior analysts 
describe particular aspects of the environment, they talk in terms 
of stimulus conditions or events. A stimulus is “an energy change 
that affects an organism through its receptor cells” (Michael, 
2004, p. 7). Humans have receptor systems that detect stimulus 
changes occurring outside and inside the body. Exteroceptors 
are sense organs that detect external stimuli and enable vision, 
hearing, olfaction, taste, and cutaneous touch. Two types of sense 
organs sensitive to stimulus changes within the body are intero- 
ceptors, which are sensitive to stimuli originating in the viscera 
(e.g., feeling a stomach ache), and proprioceptors, which enable 
the kinesthetic and vestibular senses of movement and balance. 
Applied behavior analysts most often study the effects of stimu- 
lus changes that occur outside the body. External stimulus con- 
ditions and events are not only more accessible to observation 
and manipulation than are internal conditions, but also are key 
features of the physical and social world in which people live. 

The environment influences behavior primarily by 
stimulus change and not static stimulus conditions. As Michael 
(2004) noted, when behavior analysts speak of the presentation 
or occurrence of a stimulus, they usually mean stimulus change. 


For example, in respondent conditioning the conditioned 
stimulus may be referred to as a tone. However, the rele- 
vant event is actually a change from the absence of tone to 
the tone sounding . . . , and although this is usually under- 
stood without having to be mentioned, it can be overlooked 
in the analysis of more complex phenomena. Operant dis- 
criminative stimuli, conditioned reinforcers, conditioned 
punishers, and conditioned motivative variables are also 
usually important as stimulus changes, not static conditions 
(Michael, 2004, pp. 7-8).* 


Box 2.1, “The Behavioral Stream,” illustrates the contin- 
uous, ever-changing nature of behavior and the environment. 


Classifying and Describing Stimuli 


Stimulus events can be described formally (by their physical 
features), temporally (by when they occur with respect to a 
behavior of interest), and functionally (by their effects on behav- 
ior). Behavior analysts used the term stimulus class to refer to 
any group of stimuli sharing a predetermined set of common 
elements in one or more of these dimensions. 


Formal Dimensions of Stimuli 


Behavior analysts often describe, measure, and manipulate stimuli 
according to their formal dimensions, such as size, color, intensity, 
weight, and spatial position relative to other objects. Stimuli can 
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be nonsocial (e.g., a red light, a high-pitched sound) or socially 
mediated (e.g., a friend asking, “Want some more peanuts?”). 


Temporal Loci of Stimuli 


Because behavior and the environmental conditions that influence 
it occur within and across time, the temporal location of stimulus 
changes is important. In particular, behavior is affected most by 
stimulus changes that occur prior to and soon after the behavior. 
The term antecedent refers to environmental conditions or stim- 
ulus changes that exist or occur prior to the behavior of interest. 
Because behavior cannot occur in an environmental void 
or vacuum, every response takes place in the context of a partic- 
ular situation or set of antecedent conditions. These antecedent 
events play a critical part in learning and motivation, and they do 
so irrespective of whether the learner or someone in the role of 
behavior analyst or teacher has planned or is even aware of them. 


For example, just some of the functionally relevant 

antecedents for a student’s performance on a timed math 

test might include the following: the amount of sleep the 
student had the night before; the temperature, lighting, and 
seating arrangements in the classroom; the teacher remind- 
ing the class that students who beat their personal best 
scores on the test will get a free homework pass; and the 
specific type, format, and sequence of math problems on 
the test. Each of those antecedent variables (and others) has 
the potential to exert a great deal, a little, or no noticeable 
effect on performance as a function of the student’s expe- 
riences with respect to a particular antecedent. (Heward & 

Silvestri, 2005, p. 1135) 

A consequence is a stimulus change that follows a behay- 
ior of interest. Some consequences, especially those that are 
relevant to current motivational states and follow the behavior 
closely in time, have significant influence on future behavior; 
other consequences have little effect. 

Like antecedent stimulus events, consequences may also 
be nonsocial events or socially mediated. In a socially mediated 
contingency, another person presents an antecedent stimulus and/ 
or the consequence for the behavior. Table 2.1 on page 47 shows 
examples of various combinations of nonsocial and socially 
mediated antecedent and consequent events for four behaviors. 


Multiple Functions of Single Stimulus Changes 


Some stimulus changes exert immediate and powerful control 
over behavior, whereas others have delayed effects, or no appar- 
ent effect. Even though we can and often do describe stimuli 
by their physical characteristics (e.g., the pitch and decibel 
level of a tone, the topography of a person’s hand and arm 
movements), stimulus changes are understood best through a 
functional analysis of their effects on behavior. For example, 
the same decibel tone that functions in one environment and set 
of conditions as a prompt for checking the clothes in the dryer 
may function as a warning signal to fasten a seat belt in another 
setting or situation; the same hand and arm motion that produces 
a smile and a “Hi” from another person in one set of conditions 
occasions a scowl and obscene gesture in another. 

Stimulus changes can have one or both of two kinds of 
basic functions or effects on behavior: (a) an immediate but 
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The Behavioral Stream 


Although the concepts of stimulus and response have and practitioners are detectable eslices:’ of the continuous, 
proven useful for conceptual, experimental, and applied ever-dynamic interaction between an organism and its envi- 
analyses of behavior, it is important to recognize that ronment. Mickey Keenan and Karola Dilenbuties (2013) 
stimuli and responses do not exist as discrete events in call this process the “behavioral stream.” They illustrate and 
nature. The stimuli and responses identified by scientists describe it as follows: 


A consequence of this is that we usually 

a continuous process. engage in a mentalistic analysis. That is, we 
attempt to explain a person’s behavior by 
referring to something taking place inside him, 
in his brain or in his mind. 


Upon first meeting a person, we observe only 
a snapshot in his life. 


enables us to retain the image of a person as 
a continuous process. From this perspective, 
the purpose of scientific enquiry is to relate 
segments of this continuous process to the 
independent variables that produce them. 
So-called mental events are part of the contin- 
uous process, not the explanation for what is 


observed. 


Photos and captions from “Behaviour Analysis: A Primer,” by M. Keenan and 


K. Dillenburger, 2013. Copyright 2013 by Celtic Fringe Productions. Used 
by permission. 


The image of a person as a continuous process 
is difficult to retain. 


TABLE 2.1 
or a Combination of Social and Nonsocial 
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Antecedent (Situation) and Consequent Events Can Be Nonsocial (Italicized), Socially Mediated (Boldface), 


‘Environmental situation ~~~ Resuone@ eS 


nvironmental situation Response 


Drink machine 
Five cups on table 


Friend says “Turn left” Turn left 


Friend asks “What time is it?” 


Deposit coins 


“One-two-three-four-five cups” 


“Six-fifteen” 


Consequence 

Cold drink 

Teacher nods and smiles 
Arrive at destination 
Friend says “Thanks” 


From “Individual Behavior, Culture, and Social Change” by S. S. Glenn, 2004, The Behavior Analyst, 27, p. 136. Copyright 2004 by the Association for 


Behavior Analysis. Used by permission. 


temporary effect of increasing or decreasing the frequency 
of the behavior and/or (b) a delayed but relatively permanent 
effect in terms of the frequency of that type of behavior in the 
future (Michael, 1995). For example, a sudden downpour on a 
cloudy day is likely to increase immediately the occurrence of 
all behavior that has resulted in escape from rain in the past, 
such as deploying an umbrella or running for cover under an 
awning. A person caught in the rain without an umbrella may be 
more likely to bring one on cloudy days in the future. 


RESPONDENT BEHAVIOR 


All biologically intact organisms enter the world able to respond 
in predictable ways to certain stimuli; no learning is required. 
These ready-made behaviors protect against harmful stimuli 
(e.g., eyes watering and blinking to remove particles on the 


cornea), help regulate the internal balance and economy of the 
organism (e.g., changes in heart rate and respiration in response 
to changes in temperature and activity levels), and promote repro- 
duction (e.g., sexual arousal). Each of these stimulus—response 
relations, called a reflex, is part of the organism’s genetic endow- 
ment, a product of natural evolution because of its survival value 
to the species. At birth, each intact member of a given species 
comes equipped with the same repertoire of unconditioned (or 
unlearned) reflexes. Reflexes provide the organism with a set 
of built-in responses to specific stimuli; these are behaviors the 
individual organism would not have time to learn. Table 2.2 
shows examples of unconditioned reflexes common to humans. 

The response component of the stimulus—response reflex 
is called respondent behavior. Respondent behavior is behav- 
ior that is elicited by antecedent stimuli. Respondent behavior 
is induced, or brought out, by a stimulus that precedes the 


TABLE 2.2 


Examples of Unconditioned Reflexes in Humans 


Unconditioned Stimulus 
Loud sound or touch to cornea 


Tactile stimulus under lid or chemical 
irritant (smoke) 


Irritation to nasal mucosa 
Irritation to throat 

Low temperature 

High temperature 

Loud sound 


Food in mouth 
Indigestible food in stomach 
Pain stimulus to hand or foot 


A single stimulus that is painful or very 
intense or very unusual 


From Concepts a 
Analysis, Kalamazoo, MI. 


Unconditioned Response 
Eye blink (lid closes) 


Lacrimal gland secretion 
(eyes watering) 


Sneezing 

Coughing 

Shivering, surface vasoconstriction 
Sweating, surface vasodilation 


Contraction of tensor tympani and stapedius 
muscles (reduces amplitude of eardrum vibrations) 


Salivation 

Vomiting 

Hand or foot withdrawal 

Activation syndrome—all of the following: 
Heart rate increase 
Adrenaline secretion 
Liver release of sugar into bloodstream 
Constriction of visceral blood vessels 


Dilation of blood vessels in skeletal 
muscles 


Galvanic skin response (GSR) 
Pupillary dilation (and many more) 


Type of Effector 
Striped muscle 
Gland 


Striped and smooth muscle 
Striped and smooth muscle 
Striped and smooth muscle 
Gland, smooth muscle 
Striped muscles 


Gland 
Striped and smooth muscle 
Striped muscle 


Cardiac muscle 
Gland 
Gland 
Smooth muscle 
Smooth muscle 


Gland 
Smooth muscle 


nd Principles of Behavior Analysis (rev. ed.) by J. L. Michael, 2004, pp. 10-11. Copyright 2004 by Society for the Advancement of Behavior 
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behavior; nothing else is required for the response to occur. 
For example, bright light in the eyes (antecedent stimulus) 
will elicit pupil contraction (respondent). If the relevant body 
parts (i.e., receptors and effectors) are intact, pupil contraction 
will occur every time. However, if the eliciting stimulus is 
presented repeatedly over a short span of time, the strength or 
magnitude of the response will diminish, and in some cases 
the response may not occur at all. This process of gradually 
diminishing response strength is known as habituation. 


Respondent Conditioning 


Formerly neutral stimuli can acquire the ability to elicit respon- 
dents through a learning process called respondent conditioning 
(also called Pavlovian conditioning and classical conditioning). 
While studying the digestive system of dogs, the Russian physiol- 
ogist Ivan Petrovich Pavlov (1849-1936) noticed that the animals 
salivated every time his laboratory assistant opened the cage door 
to feed them. Dogs do not naturally salivate at the sight of someone 
in a lab coat, but in Pavlov’s laboratory they consistently salivated 
when the door was opened. His curiosity aroused, Pavlov (1927) 
designed and conducted an historic series of experiments.” 
Pavlov started a metronome just an instant before feeding 
the dogs. Prior to being exposed to this stimulus—stimulus pairing 
procedure, food in the mouth, an unconditioned stimulus (US), 


elicited salivation, but the sound of the metronome, a neutral 
stimulus (NS), did not. After several trials consisting of the sound 
of the metronome followed by the presentation of food, the dogs 
began salivating in response to the sound of the metronome. The 
metronome had thus become a conditioned stimulus (CS), and 
a conditioned reflex was established.° Respondent conditioning 
is most effective when the NS is presented just before or simul- 
taneous with the US. However, some conditioning effects can 
sometimes be achieved with considerable delay between the onset 
of the NS and the onset of the US, and even with backward con- 
ditioning in which the US precedes the NS. 


Respondent Extinction 


Pavlov also discovered that once a conditioned reflex was estab- 
lished, it would weaken and eventually cease altogether if the con- 
ditioned stimulus was presented repeatedly in the absence of the 
unconditioned stimulus. For example, if the sound of the metronome 
was presented repeatedly without being accompanied or followed 
by food, it would gradually lose its ability to elicit salivation. The 
procedure of repeatedly presenting a conditioned stimulus without 
the unconditioned stimulus until the conditioned stimulus no longer 
elicits the conditioned response is called respondent extinction. 
Figure 2.1 shows schematic representations of respondent 
conditioning and respondent extinction. In this example, a puff 


Figure 2.1 Schematic representation of respondent 
conditioning and respondent extinction. The top 
panel shows an unconditioned reflex: A puff of air 
(unconditioned stimulus, or US) elicits an eye blink 
(an unconditioned response, or UR). Before condi- 
tioning, a clicking sound (a neutral stimulus, or NS) 
has no effect on eye blinking. Respondent condition- 
ing consists of a stimulus—stimulus pairing procedure 
in which the clicking sound is presented repeatedly 
just prior to, or simultaneously with, the air puff. 
The product of respondent conditioning is a condi- 
tioned reflex (CR): In this case the clicking sound has 
become a conditioned stimulus (CS) that elicits an eye 
blink when presented alone. The bottom two panels 
illustrate the procedure and outcome of respondent 
extinction: Repeated presentations of the CS alone 
gradually weaken its ability to elicit eye blinking to 
the point where the CS eventually becomes an NS 
again. The unconditioned reflex remains unchanged 
before, during, and after respondent conditioning. 


Before Conditioning US UR (eye blink) 
5 a 
(air puff) 
NStaw eee eee no eye blink 
(clicking sound) 
Respondent Conditioning NS + US > UR (eye blink) 
(click & air puff) 
\ 
NS + US —_—_—_—_> UR (eye blink) 
(click & air puff) 
ll 
(more trials) 
Product of US : 
Respondent Conditioning (air puff) Cetayesin 
cs ——————— CR (eye blink) 
(clicking sound) 
Respondent Extinction cs eS CR (eye blink) 
(clicking sound) 
u 
CS 
(clicking sound) ——______» CR (eye blink) 
cs 
(clicking sound) ——————> CR (eye blink) 
4 
(more trials) 
Results of US UR (eye blink) 
Respondent Extinction (air puff) 
NS eee See no eye blink 


(clicking sound) 


of air produced by a glaucoma-testing machine is the US for 
the eye blink reflex. The ophthalmologist’s finger pressing the 
button of the machine makes a faint clicking sound. But prior to 
conditioning, the clicking sound is an NS: It has no effect on eye 
blinking. After being paired with the air puff just a few times, the 
finger-on-the-button sound becomes a CS: It elicits eye blinking 
as a conditioned reflex. 

Conditioned reflexes can also be established by stimulus— 
stimulus pairing of an NS with a CS. This form of respon- 
dent conditioning is called higher-order (or secondary) 
conditioning. For example, secondary respondent conditioning 
could occur in a patient who has learned to blink at the click- 
ing sound of the button during the glaucoma-testing situation 
as follows. The patient detects a slight movement of the oph- 
thalmologist’s finger (NS) just before it contacts the button that 
makes the clicking sound (CS). After several NS—CS pairings, 
movement of the ophthalmologist’s finger may become a CS 
capable of eliciting blinking.’ 

The form, or topography, of respondent behaviors changes 
little, if at all, during a person’s lifetime. There are two excep- 
tions: (1) Certain reflexes disappear with maturity, such as that 
of grasping an object placed in the palm of the hand, a reflex 
usually not seen after the age of 3 months (Bijou & Baer, 1965); 
and (2) several unconditioned reflexes first appear later in life, 
such as those related to sexual arousal and reproduction. How- 
ever, during a person’s lifetime an infinite range of stimuli that 
were previously neutral (e.g., the high-pitched whine of the 
dentist’s drill) can come to elicit respondents (e.g., increased 
heartbeat and perspiration). 

Respondents make up a small percentage of the behaviors 
typically of interest to applied behavior analysts. As Skinner 
(1953) pointed out, “Reflexes, conditioned or otherwise, are 
mainly concerned with the internal physiology of the organism. 
We are most often interested, however, in behavior which has 
some effect upon the surrounding world” (p. 59). It is this latter 
type of behavior, and the process by which it is learned, that we 
will now examine. 


OPERANT BEHAVIOR 


A baby in a crib moves her hands and arms, setting in motion 
a mobile dangling above. The baby is literally operating on 
her environment, and the mobile’s movement and musical 
sounds—stimulus changes produced by the baby’s batting at 
the toy with her hands—are consequences of her behavior. 
Her movements are continuously changing as a result of those 
consequences. 

Members of a species whose only way of interacting 
with the world is a genetically determined fixed set of 
responses would find it difficult to survive, let alone thrive, 
in a complex environment that differed from the environment 
in which their distant ancestors evolved. Although respon- 
dent behavior comprises a critically important set of “hard- 
wired” responses, respondent behavior does not provide an 
organism with the ability to learn from the consequences of 
its actions. An organism whose behavior is unchanged by its 
effects on the environment will be unable to adapt to a changing 
environment. 
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Fortunately, in addition to her repertoire of genetically 
inherited respondent behaviors, our baby entered the world with 
some uncommitted behavior that is highly malleable and suscep- 
tible to change through its consequences. This type of behavior, 
called operant behavior, enables the baby over the course of 
her life to learn novel, increasingly complex responses to an 
ever-changing world.® 

Operant behavior is any behavior determined primar- 
ily by its history of consequences. Unlike respondent behay- 
ior, which is elicited by antecedent events, operant behavior is 
selected, shaped, and maintained by the consequences that have 
followed it in the past. 

Unlike respondent behaviors, whose topography and 
basic functions are predetermined, operant behaviors can take 
a virtually unlimited range of forms. The form and function of 
respondent behaviors are constant. By comparison, however, 
the “meaning” of operant behavior cannot be determined by its 
topography. Operants are defined functionally, by their effects. 
Not only does the same operant often include responses of 
widely different topographies (e.g., a diner may obtain a glass 
of water by nodding his head, pointing to a glass of water, or 
saying yes to a waiter), but also, as Skinner (1969) explained, 
the same movements comprise different operants under differ- 
ent conditions. 


Allowing water to pass over one’s hands can perhaps 
be adequately described as topography, but “washing 
one’s hands” is an “operant” defined by the fact that, 
when one has behaved this way in the past, one’s hands 
have become clean—a condition which has become 
reinforcing because, say, it has minimized a threat of 
criticism or contagion. Behavior of precisely the same 
topography would be part of another operant if the 
reinforcement had consisted of simple stimulation 
(e.g., “tickling”) of the hands or the evocation of 
imitative behavior in a child whom one 1s teaching to 
wash his hands. (p. 127) 


Topography is of little or no use in identifying behavior 
as respondent or operant. For example, crying elicited by pain- 
ful stimuli is respondent behavior, but the same tear production 
is operant when a function of parental attention (cf., Bowman, 
Hardesty, & Mendres-Smith, 2013; Epstein, 2012). 

Table 2.3 compares and contrasts defining features and 
key characteristics of respondent behavior and operant behavior. 


Selection by Consequences 


Human behavior is the joint product of (i) the contingencies 
of survival responsible for the natural selection of the spe- 
cies and (ii) the contingencies of reinforcement responsible 
for the repertoires acquired by its members, including (itt) 
the special contingencies maintained by the social environ- 
ment. [ Ultimately, of course, it is all a matter of natural 
selection, since operant conditioning is an evolved process, 
of which cultural practices are special applications. ] 

—B. F. Skinner (1981 p. 502) 


Skinner’s discovery and subsequent elucidation of operant selec- 
tion by consequences have rightly been called “revolutionary” and 
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TABLE 2.3 Comparing and Contrasting Defining Features and Key Characteristics of Respondent and Operant Behavior 
ee 


Characteristics or features 


Definition 
Basic unit 


Examples 


Body parts (effectors) that 
most often produce the 
response (not a defining 
feature) 


Function or usefulness for 
individual organism 


Function or usefulness for 
species 


Conditioning process 


Repertoire limits 


Respondent behavior 
Behavior elicited by antecedent stimuli. 


Reflex: an antecedent stimulus elicits a particular 
response (S-R). 


Newborn’s grasping and suckling to touch; pupil 
constriction to bright light; cough/gag to irritation in 
throat; salivation at smell of food; withdrawing hand 
from painful stimulus; sexual arousal to stimulation. 


Primarily smooth muscles and glands (adrenaline 
squirt); sometimes striated (skeletal) muscles (e.g., 
knee-jerk to tap just below patella). 


Maintains internal economy of the organism; 
provides a set of “ready-made” survival responses 
the organism would not have time to learn. 


Promotes continuation of species indirectly 
(protective reflexes help individuals survive to 
reproductive age) and directly (reflexes related to 
reproduction). 


Respondent (also called classical or Pavlovian) 
conditioning: Through a stimulus—stimulus pair- 
ing procedure in which a neutral stimulus (NS) 
presented just prior to or simultaneous with an 
unconditioned (US) or conditioned (CS) eliciting 
stimulus, the NS becomes a CS that elicits the 
response and a conditioned reflex is created. 
(See Figure 2.1) 


Topography and function of respondents deter- 
mined by natural evolution of species (phylog- 
eny). All biologically intact members of a species 
possess the same set of unconditioned reflexes. 
Although new forms of respondent behavior are 
not learned, an infinite number of conditioned 
reflexes may emerge in an individual's repertoire, 
depending on the stimulus—stimulus pairing the 
individual has experienced (ontogeny). 


Operant behavior 
Behavior selected by its consequences. 


Operant response class: A group of responses 
each of which produces the same effect on the 
environment; described by three-term contin- 
gency relation of antecedent stimulus conditions, 
behavior, and consequence (A-B-C). 


Talking, walking, playing the piano, riding a bike, 
counting change, baking a pie, hitting a curveball, 
laughing at a joke, thinking about a grandparent, 
reading this book. 


Primarily striated (skeletal) muscles; sometimes 
smooth muscles and glands. 


Enables effective interaction and adaptation in 
an ever-changing environment that could not be 
anticipated by evolution. 


Individuals whose behavior is most sensitive to 
consequences are more likely to survive and 
reproduce. 


Operant conditioning: Some stimulus changes 
following a response closely in time increase 
(reinforcement) or decrease (punishment) future 
occurrences of similar responses under similar 
conditions. Previously neutral stimulus changes 
become conditioned reinforcers or punishers as 
result of stimulus—stimulus pairing with other rein- 
forcers or punishers. 


Topography and function of each person's 
repertoire of operant behaviors are selected by 
consequences during the individual's lifetime 
(ontogeny). New and more complex operant 
response classes can emerge. Response products 
of some human operants (e.g., airplanes) enable 
some behaviors not possible by anatomical struc- 
ture alone (e.g., flying). 


“the bedrock on which other behavioral principles rest” (Glenn, 
2004, p. 134). Selectionism “anchors a new paradigm in the life 
sciences. ... A basic tenet of this position is that all forms of life, 
from single cells to complex cultures, evolve as a result of selec- 
tion with respect to function” (Pennypacker, 1994, pp. 12-13). 


Selection by consequences operates during the lifetime of 


the individual organism (ontogeny) and is a conceptual parallel 
to Darwin’s (1872/1958) natural selection in the evolutionary 
history of a species (phylogeny). In response to the question 
“Why do giraffes have long necks?” Baum (2017) provides this 
excellent description of natural selection: 


Darwin’s great contribution was to see that a relatively sim- 
ple mechanism could help explain why phylogeny followed 


the particular course it did. The history of giraffes’ necks, 
Darwin saw, is more than a sequence of changes; it is a his- 
tory of selection. What does the selecting? Not an omnip- 
otent Creator, not Mother Nature, not the giraffes, but a 
natural, mechanical process: natural selection. 

Within any population of organisms, individuals 
vary. They vary partly because of environmental factors 
(e.g., nutrition), and also because of genetic inheritance. 
Among the giraffe ancestors that lived in what is now the 
Serengeti Plain, for instance, variation in genes meant that 
some had shorter necks and some had longer necks. 

As the climate gradually changed, however, new, 
taller types of vegetation became more frequent. The giraffe 
ancestors that had longer necks, being able to reach higher, 


got a little more to eat, on the average. As a result, they 

were a little healthier, resisted disease a little better, evaded 
predators a little better—on the average. Any one individual 
with a longer neck may have died without offspring, but on 
the average longer-necked individuals had more offspring, 
which tended on the average to survive a little better and 
produce more offspring. As longer necks became frequent, 
new genetic combinations occurred, with the result that some 
offspring had still longer necks than those before, and they 
did still better. As the longer-necked giraffes continued to out- 
reproduce the shorter-necked ones, the population consisted 
more and more of longer-necked individuals, and the average 
neck length of the whole population grew. (pp. 59-60) 


Just as natural selection requires a population of individual 
organisms with varied physical features (e.g., giraffes with necks 
of different lengths), operant selection by consequences requires 
variation in behavior. Those behaviors that produce the most 
favorable outcomes are selected and “survive,” which leads to a 
more adaptive repertoire. Natural selection has endowed humans 
with an initial population of uncommitted behavior (e.g., babies 
babbling and moving their limbs about) that is highly malleable 
and susceptible to the influence of the consequences that follow 
it. As Glenn (2004) noted, 


By outfitting humans with a largely uncommitted 
behavioral repertoire, natural selection gave our species 

a long leash for local behavioral adaptations. But the 
uncommitted repertoire of humans would be lethal without 
the .. . susceptibility of human behavior to operant 
selection. Although this behavioral characteristic is shared 
by many species, humans appear to be most exquisitely 
sensitive to behavioral contingencies of selection 
(Schwartz, 1974). (p. 139) 


Operant Conditioning 


Operant conditioning may be seen everywhere in the 
multifarious activities of human beings from birth until 
death. .. . It is present in our most delicate discriminations 
and our subtlest skills; in our earliest crude habits and the 
highest refinements of creative thought. 

—Keller and Schoenfeld (1950, p. 64) 


Operant conditioning refers to the process and selective effects 
of consequences on behavior. '0 From an operant conditioning per- 
spective, a functional consequence is a stimulus change that follows 
a given response closely in time and alters the occurrence of similar 
responses in the future. “In operant conditioning we ‘strengthen’ 
an operant in the sense of making a response more probable or, in 
actual fact, more frequent” (Skinner, 1953, p. 65). If the movement 
and sounds produced by the baby’s batting at the mobile with her 
hands increase the occurrence of hand movements in the direction 
of the toy, operant conditioning has taken place. 

When operant conditioning consists of an increase in 
response rate, reinforcement has taken place, and the conse- 
quence responsible, in this case the movement and sound of 
the mobile, would be called a reinforcer.!! Although operant 
conditioning is used most often to refer to the “strengthen- 
ing” effects of reinforcement, as Skinner described earlier, it 
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also encompasses the principle of punishment. If the mobile’s 
movement and musical sounds resulted in a decrease in the 
rate at which the baby bats it with her hands, punishment has 
occurred, and the mobile’s movement and sound would be 
called punishers. Before we examine the principles of rein- 
forcement and punishment further, it is important to identify 
several important qualifications concerning how consequences 
affect behavior. 


Consequences Affect Only Future Behavior 


Consequences affect only future behavior. Specifically, a behay- 
ioral consequence affects the relative rate at which similar 
responses will be emitted in the future under similar stimulus 
conditions. This point may seem too obvious to merit mention 
because it is both logically and physically impossible for a con- 
sequent event to affect a behavior that preceded it, when that 
behavior is over before the consequent event occurs. Neverthe- 
less, the statement “behavior is controlled by its consequences” 
raises the question. (See Box 2.2, “When the Phone Rings,” for 
further discussion of this apparent logical fallacy.) 


Consequences Select Response Classes, 
Not Individual Responses 


Responses emitted because of the effects of reinforcement will 
differ slightly from previously reinforced responses, but will 
share at least one functional element with the former responses 
to produce the same consequence. 


Reinforcement strengthens responses which differ in topog- 
raphy from the response reinforced. When we reinforce 
pressing a lever, for example, or saying Hello, responses 
differing quite widely in topography grow more proba- 

ble. This is a characteristic of behavior which has strong 
survival value... , since it would be very hard for an 
organism to acquire an effective repertoire if reinforce- 
ment strengthened only identical responses. (Skinner, 
1969p, 131) 


It is the response class—topographically different, but 
functionally similar responses—that is strengthened or weak- 
ened by operant conditioning. “An operant activity is a popula- 
tion of actions all of which have the same environmental effect” 
(Baum, 2017, p. 84). The concept of response class is “implied 
when it is said that reinforcement increases the future frequency 
of the type of behavior that immediately preceded the reinforce- 
ment” (Michael, 2004, p. 9). As will be shown in later chapters, 
the concept of response class is a key to the development and 
elaboration of new behavior. 

If consequences (or natural evolution) selected only a very 
narrow range of responses (or genotypes), the effect would “tend 
toward uniformity and a perfection of sorts” (Moxley, 2004, 
p. 110) that would place the behavior (or species) at risk of 
extinction should the environment change. For example, if the 
mobile’s movement and sound reinforced only arm and hand 
movements that fell within an exact and narrow range of motion 
and no similar movements survived, the baby would be unable 
to contact that reinforcement if one day her mother mounted the 
mobile in a different location above the crib. 
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When the Phone Rings 


The professor was ready to move on to his next point, but a 
raised hand in the front row caught his attention. 


Professor: Yes? 


Student: You said that operant behavior, like talking, writ- 


ing, running, reading, driving a car, most everything we 
do—you said all of those behaviors are controlled by their 
consequences, by things that happen after the response 
was emitted? 


Professor: {said that. Yes. 


Student: Well, [have a hard time with that. When my telephone 


rings and I pick up the receiver, that’s an operant response, 
right? I mean, answering the phone when it rings certainly 
didn’t evolve genetically as a reflex to help our species sur- 
vive. So, we’re talking about operant behavior, correct? 


Professor: Correct. 


Student: All right then. How can we say my picking up my 


telephone is controlled by its consequence? I answer the 
phone because it is ringing. So does everyone. Ringing 
controls the response. And ringing can’t be a consequence 
because it comes before the response. 


The professor hesitated replying just long enough for the 
student to believe himself the hero, nailing a professor 
for pontificating about some theoretical concept that has 
little or no relevance to the everyday real world. Simulta- 
neously sensing victory, other students began to pile on. 


Another Student: How about stepping on the brake when you 


see a stop sign? The sign controls the braking response, 
and that’s not a consequence either. 


A Student from the Back of the Room: And take a common 


classroom example. When a kid sees the problem 2 + 2 
on his worksheet and writes 4, writing the numeral 4 has 
to be controlled by the written problem itself. Otherwise, 
how could anyone learn the correct answers to any ques- 
tion or problem? 


Immediate Consequences Have the Greatest Effect 


Behavior is most sensitive to stimulus changes that occur imme- 
diately after, or within a few seconds of, a response. 


It is essential to emphasize the importance of the immedi- 
acy of reinforcement. Events that are delayed more than 

a few seconds after the response do not directly increase 
its future frequency. When human behavior is apparently 
affected by long-delayed consequences, the change is 
accomplished by virtue of the human’s complex social and 
verbal history, and should not be thought of as an instance 
of the simple strengthening of behavior by reinforcement. 
(p. 110)... [As with reinforcement,] the longer the time 


Most of the Class: Yah, that’s right! 
Professor: (with a wry smile) All of you are correct. . . . So, 
too. am I. 


Someone Else in the Class: What do you mean? 


Professor: That was exactly my next point, and I was hop- 
ing you would pick up on it. (The professor smiled a 
thank you at the student who had started the discussion 
and went on.) All around us, every day, we are exposed 
to countless changing stimulus conditions. The situa- 
tions you’ve described are excellent examples of what 
behavior analysts call stimulus control. When the rate 
of a given behavior is higher in the presence of a given 
stimulus than when that stimulus is absent, we say 
that stimulus control is at work. Stimulus control is a 
fundamentally important principle in behavior analy- 
sis, and it will be the subject of much discussion this 
semester. 

But, and here’s the important point: A discrimina- 
tive stimulus, the antecedent event that comes before the 
response of interest, acquires its ability to evoke a par- 
ticular response class because of its pairing with certain 
consequences in the past. It’s not just the phone’s ring 
that causes you to pick up the receiver. It is the fact that 
in the past, answering the phone when it was ringing was 
followed by the sound of a person’s voice. It’s that person 
talking to you, the consequence of picking up the receiver, 
that selected phone-answering behavior in the first place, 
but you only answer ringing phones. Why? Because 
you have learned that there’s someone on the other end 
only when the phone’s ringing. So we can still speak of 
consequences as having the ultimate control in terms of 
selecting operant behavior, but antecedent stimuli, by 
virtue of being paired with differential consequences, 
can indicate what kind of consequence is likely. This 
concept is called the three-term contingency, and its 
understanding, analysis, and manipulation are central to 
applied behavior analysis. 


delay between the occurrence of the response and the 
occurrence of the stimulus change (between R and 

SP), the less effective the punishment will be in changing 
the relevant response frequency, but not much is known 
about upper limits. (Michael, 2004, p. 36, emphasis in 
original, words in brackets added) 


Consequences Select Any Behavior 


Reinforcement and punishment are “equal opportunity” selec- 
tors. No logical or healthy or (in the long run) adaptive con- 
nection between a behavior and the consequence that functions 
to strengthen or weaken it is necessary. Any behavior that 


immediately precedes reinforcement (or punishment) will be 
increased (or decreased). 

It is the temporal relation between behavior and conse- 
quence that is functional, not the topographical or logical ones. 
“So far as the organism is concerned, the only important property 
of the contingency is temporal. The reinforcer simply follows the 
response. How this is brought about does not matter” (Skinner, 
1953, p. 85, emphasis in original). The arbitrary nature of which 
behaviors are reinforced (or punished) in operant conditioning 
is exemplified by the appearance of idiosyncratic behaviors that 
have no apparent purpose or function. An example is the super- 
stitious routine of a poker player who taps and arranges his cards 
in a peculiar fashion because similar movements in the past were 
followed by winning hands. 


Operant Conditioning Occurs Automatically 


Operant conditioning does not require a person’s awareness. 
“A reinforcing connection need not be obvious to the individual 
[whose behavior is] reinforced” (Skinner, 1953, p. 75, words 
in brackets added). This statement refers to the automaticity 
of reinforcement; that is, behavior is modified by its conse- 
quences regardless of whether the individual is aware that her 
behavior is, or has been, reinforced.!? A person does not have 
to understand or verbalize the relation between her behavior and 
a consequence, or even know that a consequence has occurred, 
for reinforcement to “work.” 


Reinforcement 


Reinforcement is the most important principle of behavior and 
a key element of most behavior change programs designed by 
behavior analysts (Flora, 2004; Northup, Vollmer, & Serrett, 
1993). When a response is followed by a stimulus change that 
results in similar responses occurring more often, reinforcement 
has taken place.'* Sometimes the delivery of just one reinforcer 
results in significant behavior change, although most often 
several responses must be followed by reinforcement before 
significant conditioning will occur. 

Most stimulus changes that function as reinforcers can 
be described operationally as either (a) a new stimulus added 
to the environment (or increased in intensity), or (b) an already 


Type of Stimulus Change 


Present or Increase 
Intensity of Stimulus 
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present stimulus removed from the environment (or reduced in 
intensity).!* These two Operations provide for two forms of rein- 
forcement, called positive and negative (see Figure 2.2). 

In positive reinforcement, a response is followed 
immediately by the presentation of a stimulus that results in sim- 
ilar responses occurring more often. Our baby’s increased batting 
of the mobile with her hands, when doing so produces movement 
and music, is an example of positive reinforcement. Likewise, a 
child’s independent play is reinforced when it increases as a result 
of his parent’s giving praise and attention when he plays. Positive 
reinforcement and procedures for using it to promote desired 
behaviors are described in detail in Chapter 11. 

When a behavior occurs more often because past responses 
have resulted in the withdrawal or termination of a stimulus, 
the operation is called negative reinforcement. Skinner (1953) 
used the term aversive stimulus to refer to, among other things, 
stimulus conditions whose termination functioned as reinforce- 
ment. Let us assume now that a parent programs the mobile 
to automatically play music for a period of time. Let us also 
assume that if the baby bats the mobile with hands or feet, the 
music immediately stops for a few seconds. If the baby bats the 
mobile more often when doing so terminates the music, negative 
reinforcement is at work, and the music can be called aversive. 

Negative reinforcement is characterized by escape or 
avoidance contingencies. Jumping out of the shower when the 
water suddenly becomes too hot is negatively reinforced by 
escape from the burning water. Being sent to the principal’s 
office for acting out may function as negative reinforcement if 
it enables the misbehaving student to avoid an aversive (to him) 
classroom activity. 

The concept of negative reinforcement has confused many 
students of behavior analysis. Much of the confusion can be 
traced to the inconsistent early history and development of the 
term and to psychology and education textbooks and professors 
who have used the term inaccurately.!° The most common mis- 
take is equating negative reinforcement with punishment. To 
help avoid the error, Michael (2004) suggested the following: 


Think about how you would respond if someone asked you 
(1) whether or not you like negative reinforcement; also if 
you were asked (2) which you prefer, positive or negative 


Withdraw or Decrease 
Intensity of Stimulus 


Figure 2.2 Positive and negative reinforce- 
ment and positive and negative punishment 
are defined by the type of stimulus change 
operation that follows a behavior and the 
effect that operation has on future occur- 
rences of that type of behavior. 
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reinforcement. Your answer to the first question should be 
that you do indeed like negative reinforcement, which con- 
sists of the removal or termination of an aversive condition 
that is already present. The term negative reinforcement 
refers only to the termination of the stimulus. In a labora- 
tory procedure the stimulus must, of course, be turned on 
and then its termination can be made contingent upon the 
critical response. No one wants an aversive stimulus turned 
on, but once it is on, its termination is usually desirable. 
Your answer to the second question should be that you 
cannot choose without knowing the specifics of the positive 
and negative reinforcement involved. The common error 

is to choose positive reinforcement, but removal of a very 
severe pain would certainly be preferred over the presen- 
tation of a small monetary reward or an edible, unless the 
food deprivation was very severe. (p. 32, italics and bold 
type in original) 


Negative reinforcement is examined in detail in Chapter 12. 
Remembering that the term reinforcement always means an 
increase in the occurrence of the behavior and that the modifi- 
ers positive and negative describe the type of stimulus change 
operation that best characterizes the consequence (i.e., adding 
or withdrawing a stimulus) should facilitate the discrimina- 
tion of the principles and application of positive and negative 
reinforcement. 

After a behavior has been established with reinforcement, 
it need not be reinforced each time it occurs. Many behaviors are 
maintained at high levels by schedules of intermittent reinforce- 
ment. Chapter 13 describes various schedules of reinforcement 
and their effects on behavior. However, if reinforcement is with- 
held for all members of a previously reinforced response class, 
a procedure based on the principle of extinction, the behavior 
will gradually decrease in rate to its pre-reinforcement level or 
cease to occur altogether. Chapter 24 describes the principle 
of extinction and the use of behavior change tactics based on 
extinction to decrease undesired behavior. 


Punishment 


Punishment, like reinforcement, is defined functionally. When 
a response is followed immediately by a stimulus change that 
results in similar responses occurring less often, punishment 
has taken place. Punishment can be accomplished by either of 
two types of stimulus change operations (see the bottom two 
boxes of Figure 2.2). As with reinforcement, the modifiers in 
the terms positive punishment and negative punishment connote 
neither the intention nor the desirability of the behavior change 
produced; they indicate only that the stimulus change that served as 
the punishing consequence was presented (positive punishment) 
or withdrawn (negative punishment). '° 

As with positive and negative reinforcement, numerous 
behavior change procedures incorporate the two basic punish- 
ment operations. Although some textbooks reserve the term 
punishment for procedures involving positive punishment and 
describe time-out from positive reinforcement and response cost 
as separate “principles” or types of punishment, both methods 
for reducing behavior are based on negative punishment (see 
Chapter 15). 


Reinforcement and punishment can each be accomplished 
by either of two different operations, depending on whether the 
consequence consists of presenting a new stimulus (or increasing 
the intensity of a current stimulus) or withdrawing (or decreasing 
the intensity of) a currently present stimulus in the environment 
(Morse & Kelleher, 1977; Skinner, 1953). Some behavior analysts 
argue that from a functional and theoretical standpoint only two 
principles are required to describe the basic effects of behavioral 
consequences—reinforcement and punishment. '7 However, from 
a procedural perspective (a critical factor for the applied behavior 
analyst), a number of behavior change tactics are derived from 
each of the four operations represented in Figure 2.2. 

Most behavior change procedures involve several prin- 
ciples of behavior (see Box 2.3). It is critical for the behavior 
analyst to have a solid conceptual understanding of the basic 
principles of behavior. Such knowledge permits better analysis 
of current controlling variables as well as more effective design 
and assessment of behavioral interventions that recognize the 
role various principles may be playing in a given situation. 


Stimulus Changes That Function 
as Reinforcers and Punishers 


Because operant conditioning involves the consequences of 
behavior, it follows that anyone interested in using operant 
conditioning to change behavior must identify and control the 
occurrence of relevant consequences. For the applied behavior 
analyst, therefore, an important question becomes: What kinds 
of stimulus changes function as reinforcers and punishers? 


Unconditioned Reinforcement and Punishment 


Some stimulus changes function as reinforcement even though 
the organism has had no particular learning history with those 
stimuli. A stimulus change that can increase future occurrences 
of behavior without prior pairing with any other form of rein- 
forcement is called an unconditioned reinforcer.'* For exam- 
ple, stimuli such as food, water, and sexual stimulation that 
support the biological maintenance of the organism and survival 
of the species often function as unconditioned reinforcers. The 
words can and often in the two previous sentences recognize 
the important qualification that the momentary effectiveness of 
an unconditioned reinforcer is a function of current motivating 
operations. For example, a certain level of food deprivation 
is necessary for the presentation of food to function as a rein- 
forcer. However, food is unlikely to function as reinforcement 
for a person who has recently eaten a heavy meal. The nature 
and functions of motivating operations are described in detail 
in Chapter 16. 

Similarly, an unconditioned punisher is a stimulus 
change that can decrease the future occurrence of any behavior 
that precedes it without prior pairing with any other form of 
punishment. Unconditioned punishers include painful stimula- 
tion that can cause tissue damage (i.e., harm body cells). How- 
ever, virtually any stimulus to which an organism’s receptors are 
sensitive—light, sound, and temperature, to name a few—can 
be intensified to the point that its delivery will suppress behav- 
ior even though the stimulus is below levels that actually cause 
tissue damage (Bijou & Baer, 1965). 
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Distinguishing Between Principles of Behavior and Behavior Change Tactics 


A principle of behavior describes a basic behavior—environment 
relation that has been demonstrated repeatedly in hundreds, 
even thousands, of experiments. A principle of behavior 
describes a functional relation between behavior and one or 
more of its controlling variables (in the form of y = fx) that 
has thorough generality across individual organisms, species, 
settings, and behaviors. A principle of behavior is an empiri- 
cal generalization inferred from many experiments. Principles 
describe how behavior works. Some examples of principles are 
reinforcement, punishment, and extinction. 

In general, a behavior change tactic is a method for oper- 
ationalizing, or putting into practice, one or more principles of 
behavior. A behavior change tactic is a research-based, tech- 
nologically consistent method for changing behavior that has 
been derived from one or more basic principles of behavior and 
that possesses sufficient generality across subjects, settings, 
and/or behaviors to warrant its codification and dissemination. 
Behavior change tactics constitute the technological aspect of 
applied behavior analysis. Examples of behavior change tactics 
include backward chaining, differential reinforcement of other 


Events that function as unconditioned reinforcers and 
punishers are the product of the natural evolution of the species 
(phylogeny). Malott, Tillema, and Glenn (1978) described the 
natural selection of “rewards” and “aversives” as follows:!? 


Some rewards and aversives control our actions because 

of the way our species evolved; we call these unlearned 
rewards or aversives. We inherit a biological structure that 
causes some stimuli to be rewarding or aversive. This struc- 
ture evolved because rewards helped our ancestors survive, 
while aversives hurt their survival. Some of these unlearned 
rewards, such as food and fluid, help us survive by 
strengthening our body cells. Others help our species 
survive by causing us to produce and care for our 
offspring—these stimuli include the rewarding stimulation 
resulting from copulation and nursing. And many unlearned 
aversives harm our survival by damaging our body cells; 
such aversives include burns, cuts and bruises. (p. 9) 


While unconditioned reinforcers and punishers are criti- 
cally important and necessary for survival, relatively few behay- 
iors that constitute the everyday routines of people as they go 
about working, playing, and socializing are directly controlled 
by such events. For example, although going to work each 
day earns the money that buys food, eating that food is far too 
delayed for it to exert any direct operant control over the behav- 
ior that earned it. Remember: Behavior is most affected by its 
immediate consequences. 


Conditioned Reinforcers and Punishers 


Stimulus events or conditions that are present or that occur just 
before, or simultaneous with, the occurrence of other reinforcers 


behavior, shaping, response cost, and time-out from positive 
reinforcement. 

So, principles describe basic scientific laws that delin- 
eate how behavior—environment relations work, and behavior 
change tactics are how applied behavior analysts put the prin- 
ciples to work to help people learn and use socially significant 
behaviors. There are relatively few behavior principles, but 
many derivative behavior change tactics. To illustrate further, 
reinforcement is a behavioral principle because it describes a 
lawful relation between behavior, an immediate consequence, 
and an increased occurrence of the behavior in the future 
under similar conditions. However, issuing of checkmarks in 
a token economy and providing contingent praise are behavior 
change tactics derived from the principle of reinforcement. To 
cite another example, punishment is a principle of behavior 
because it describes the established functional relation between 
the presentation of a consequence and the decreased occur- 
rence of similar behavior in the future. Response cost and time- 
out, in contrast, are tactics for changing behavior based on the 
principle of punishment. 


(or punishers) may acquire the ability to reinforce (or punish) 
behavior when they later occur on their own as consequences. 
Called conditioned reinforcers and conditioned punishers, 
these stimulus changes function as reinforcers and punishers 
because of their prior pairing with other reinforcers or punish- 
ers.” The stimulus—stimulus pairing procedure responsible for 
the creation of conditioned reinforcers or punishers is the same 
as that used for respondent conditioning except that the “out- 
come is a stimulus that functions as a reinforcer [or punisher] 
rather than a stimulus that will elicit a response” (Michael, 2004, 
p. 66, words in brackets added). 

Conditioned reinforcers and punishers are not related 
to any biological need or anatomical structure; their ability to 
modify behavior is a result of each person’s unique history of 
interactions with his or her environment (ontogeny). On the one 
hand, because no two people experience the world in exactly 
the same way, the roster of events that can serve as conditioned 
reinforcers and punishers at any particular time (given a rele- 
vant motivating operation) is idiosyncratic to each individual 
and always changing. On the other hand, to the extent that two 
people have had similar experiences (e.g., schooling, profession, 
the culture in general), they are likely to be affected in similar 
ways by similar events. Social praise and attention are examples 
of widely effective conditioned reinforcers in many cultures. 
Because social attention and approval (as well as disapproval) 
are often paired with many other reinforcers (and punishers), 
they exert powerful control over human behavior and will be 
featured in later chapters when specific behavior change tactics 
are presented. 

Because people who live in a common culture share sim- 
ilar histories, it is not unreasonable for a practitioner to search 
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for potential reinforcers and punishers for a given client among 
classes of stimuli that have proven effective with other simi- 
lar clients. However, in an effort to help the reader establish a 
fundamental understanding of the nature of operant condition- 
ing, we have purposely avoided presenting a list of stimuli that 
may function as reinforcers and punishers. Morse and Kelleher 
(1977) made this important point very well. 


Reinforcers and punishers, as environmental “things,” 
appear to have a greater reality than orderly temporal 
changes in ongoing behavior. Such a view is deceptive. 
There is no concept that predicts reliably when events will 
be reinforcers or punishers; the defining characteristics of 
reinforcers and punishers are how they change behavior 
[italics added]. Events that increase or decrease the sub- 
sequent occurrence of one response may not modify other 
responses in the same way. 

In characterizing reinforcement as the presentation of 
a reinforcer contingent upon a response, the tendency is to 
emphasize the event and to ignore the importance of both 
the contingent relations and the antecedent and subsequent 
behavior. It is how [italics added] they change behavior 
that defines the terms reinforcer and punisher; thus it is the 
orderly change in behavior that is the key to these defini- 
tions. It is not [italics added] appropriate to presume that 
particular environmental events such as the presentation 
of food or electric shock are reinforcers or punishers until 
a change in the rate of responding has occurred when the 
event is scheduled in relation to specified responses. 

A stimulus paired with a reinforcer is said to have 
become a conditioned reinforcer, but actually it is the behav- 
ing subject that has changed, not the stimulus. . . . It is, of 
course, useful shorthand to speak of conditioned reinforc- 
ers . . . just as it is convenient to speak about a reinforcer 
rather than speaking about an event that has followed an 
instance of a specific response and resulted in a subsequent 
increase in the occurrence of similar responses. The latter 
may be cumbersome, but it has the advantage of empirical 
referents. Because many different responses can be shaped 
by consequent events, and because a given consequent event 
is often effective in modifying the behavior of different 
individuals, it becomes common practice to refer to reinforc- 
ers without specifying the behavior that is being modified. 
These common practices have unfortunate consequences. 
They lead to erroneous views that responses are arbitrary 
and that the reinforcing or punishing effect of an event is a 
specific property of the event itself. (pp. 176-177, 180) 


The point made by Morse and Kelleher (1977) is of 
paramount importance to understanding functional behavior— 
environment relations. Reinforcement and punishment are not 
simply the products of certain stimulus events, which are then 
called reinforcers and punishers without reference to a given 
behavior and environmental conditions. There are no inherent 
or standard physical properties of stimuli that determine their 
permanent status as reinforcers and punishers. In fact, a stimulus 
can function as a positive reinforcer under one set of condi- 
tions and a negative reinforcer under different conditions. Just as 
positive reinforcers are not defined with terms such as pleasant 


or satisfying, aversive stimuli are not defined with terms such 
as annoying or unpleasant. The terms reinforcer and punisher 
should not to be used on the basis of a stimulus event’s assumed 
effect on behavior or on the basis of any inherent property of 
the stimulus event itself. Morse and Kelleher (1977) continued: 


When the borders of the table are designated in terms of 
stimulus classes (positive-negative; pleasant—noxious) and 
experimental operations (stimulus presentation—stimulus 
withdrawal), the cells of the table are, by definition, vari- 
eties of reinforcement and punishment. One problem is 

that the processes indicated in the cells have already been 
assumed in categorizing stimuli as positive or negative; a 
second is that there is a tacit assumption that the presen- 
tation or withdrawal of a particular stimulus will have an 
invariant effect. These relations are clearer if empirical 
operations are used to designate the border conditions. .. . 
The characterization of behavioral processes depends upon 
empirical observations. The same stimulus event, under dif- 
ferent conditions, may increase behavior or decrease behav- 
ior. In the former case the process is called reinforcement 
and in the latter the process is called punishment. (p. 180) 


At the risk of redundancy, we restate this important 
concept. Reinforcers and punishers denote functional classes 
of stimulus events, membership to which is not based on the 
physical nature of the stimulus changes or events themselves. 
Indeed, given a person’s individual history and current moti- 
vational state, and the current environmental conditions, “any 
stimulus change can be a ‘reinforcer’ if the characteristics of the 
change, and the temporal relation of the change to the response 
under observation, are properly selected” (Schoenfeld, 1995, 
p. 184). Thus, the phrase “everything is relative” is thoroughly 
relevant to understanding functional behavior—environment 
relations. 


The Discriminated Operant and Three-Term 
Contingency 


We have discussed the role of consequences in influencing the 
future occurrence of behavior. But operant conditioning does 
much more than establish a functional relation between behav- 
ior and its consequences. Operant conditioning also establishes 
functional relations between behavior and certain antecedent 
conditions. 


In contrast to if-A-then-B formulations (such as S-R for- 
mulations), the AB-because-of-C formulation is a general 
statement that the relation between an event (B) and its 
context (A) is because of consequences (C). .. . Applied to 
Skinner’s three-term contingency, the relation between 

(A) the setting and (B) behavior exists because of (C) con- 
sequences that occurred for previous AB (setting-behavior) 
relations. The idea [is] that reinforcement strengthens the 
setting-behavior relation rather than simply strengthening 
behavior. (Moxley, 2004, p. 111) 


Reinforcement selects not just certain forms of behav- 
ior; it also selects the environmental conditions that in 
the future will evoke (make more likely) instances of the 


response class. A behavior that occurs more often under 
some antecedent conditions than it does in others is called 
a discriminated operant. Because a discriminated operant 
occurs more often in the presence of a given stimulus than 
it does in the absence of that stimulus, it is said to be under 
stimulus control. Answering the phone, one of the every- 
day behaviors discussed by the professor and his students in 
Box 2.2, is a discriminated operant. The phone’s ring func- 
tions as a discriminative stimulus (S?) for answering the 
phone. We answer the phone when it is ringing, and we do 
not answer the phone when it is silent. 

Just as reinforcers or punishers cannot be identified 
by their physical characteristics, stimuli possess no inherent 
dimensions or properties that enable them to function as 
discriminative stimuli. Operant conditioning brings behavior 
under the control of various properties or values of anteced- 
ent stimuli (e.g., size, shape, color, spatial relation to another 
stimulus), and what those features are cannot be deter- 
mined a priori. (Stimulus control is described in detail in 
Chapter 17.) 


Any stimulus present when an operant is reinforced 
acquires control in the sense that the rate will be higher 
when it is present. Such a stimulus does not act as a goad; 
it does not elicit the response in the sense of forcing it to 
occur. It is simply an essential aspect of the occasion upon 
which a response is made and reinforced. The difference is 
made clear by calling it a discriminative stimulus (or SP). 
An adequate formulation of the interaction between an 
organism and its environment must always specify 
three things: (1) the occasion upon which a response 
occurs; (2) the response itself; and (3) the reinforcing 
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consequences. The interrelationships among them 
are the contingencies of reinforcement.” (Skinner, 
1969, p. 7) 


The discriminated operant has its origin in the three-term 
contingency. The three-term contingency—antecedent, behay- 
ior, and consequence—is sometimes called the ABCs of behavior 
analysis. Figure 2.3 shows examples of three-term contingencies 
for positive reinforcement, negative reinforcement, positive pun- 
ishment, and negative punishment.”! Most of what the science of 
behavior analysis has discovered about the prediction and control 
of human behavior involves the three-term contingency, which is 
“considered the basic unit of analysis in the analysis of operant 
behavior” (Glenn, Ellis, & Greenspoon, 1992, p. 1332). 

A four-term contingency analysis takes into account moti- 
vating events that make certain stimulus changes momentarily 
more or less reinforcing. The four-term contingency is intro- 
duced in Chapter 11 and described in detail in Chapter 16. 

The term contingency appears in behavior analysis lit- 
erature with several meanings signifying various types of tem- 
poral and functional relations between behavior and antecedent 
and consequent variables (Catania, 2013; Lattal, 1995; Lattal & 
Shahan, 1997; Vollmer & Hackenberg, 2001). Perhaps the most 
common connotation of contingency refers to the dependency of 
a particular consequence on the occurrence of the behavior. When 
a reinforcer (or punisher) is said to be contingent on a particu- 
lar behavior, the behavior must be emitted for the consequence 
to occur. For example, after saying, “Name a carnivorous dino- 
saur,” a teacher’s “Well done!” depends on the student’s response, 
“Tyrannosaurus Rex” (or another dinosaur of the same class). 

The term contingency 1s also used in reference to the tem- 
poral contiguity of behavior and its consequences. As stated 
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Figure 2.3 Three-term contingencies illustrating reinforcement and punishment operations. 
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previously, behavior is selected by the consequences that imme- 
diately follow it, irrespective of whether those consequences 
were produced by or depended on the behavior. This is the 
meaning of contingency in Skinner’s (1953) statement, “So far 
as the organism is concerned, the only important property of the 
contingency is temporal” (1953, p. 85). 


RECOGNIZING THE COMPLEXITY 
OF HUMAN BEHAVIOR 


Behavior—human or otherwise—remains an extremely 
difficult subject matter. 
—B. F. Skinner (1969, p. 114) 


The experimental analysis of behavior has discovered a number 
of basic principles—statements about how behavior works as 
a function of environmental variables. These principles, sev- 
eral of which have been introduced in this chapter, have been 
demonstrated, verified, and replicated in thousands of experi- 
ments; they are scientific facts.*> Tactics for changing behavior 
derived from these principles have been applied to a wide range 
of human behaviors in natural settings and analyzed in increas- 
ingly sophisticated and effective ways. An introduction to what 
has been learned from those applied behavior analyses makes 
up the majority of this book. 

The systematic application of behavior analysis techniques 
sometimes produces behavior changes of great magnitude and 
speed, even for clients whose behavior had been unaffected by 
other forms of treatment and appeared intractable. When such 
a happy (but not rare) outcome occurs, the neophyte behavior 
analyst must resist the tendency to believe that we know more 
than we do about the prediction and control of human behavior. 
As acknowledged in Chapter 1, applied behavior analysis is a 
young science that has yet to achieve anything near a complete 
understanding and technological control of human behavior. 

A major challenge facing applied behavior analysts is 
dealing with human behavior in applied settings where labora- 
tory controls are impossible, impractical, or unethical. Numer- 
ous factors contributing to this difficulty coalesce around the 
complexity of the human repertoire, the myriad controlling vari- 
ables, and individual differences. 


Complexity of the Human Repertoire 


Humans are capable of learning an incredible range of behav- 
iors. Response sequences, sometimes of no apparent logical 
organization, contribute to the complexity of behavior (Skinner, 
1953). In a response chain, effects produced by one response 
influence the emission of other responses. Returning a winter 
coat to the attic leads to rediscovering a scrapbook of old family 
photographs, which evokes a phone call to Aunt Helen, which 
sets the occasion for finding her recipe for apple pie, and so on. 

Verbal behavior may be the most significant contribu- 
tor to the complexity of human behavior (Donahoe & Palmer, 
1994; Palmer, 1991; Skinner, 1957; see Chapter 18). Not only 
is a problem generated when the difference between saying and 
doing is not recognized, but verbal behavior itself is often a 
controlling variable for many other verbal and nonverbal behav- 
iors. By following the contingency described in a verbal rule 


(e.g., “Pull the lever while simultaneously pushing the selec- 
tion button.”) we behave effectively (e.g., operate a complex 
vending machine in a foreign country) without direct experience 
with contingencies. “Rules can be learned more quickly than the 
behavior shaped by the contingencies they describe. . . . Rules 
are particularly valuable when contingencies are complex or 
unclear or for any other reason not very effective” (Skinner, 
1974, p. 129). Skinner referred to behavior controlled by verbal 
statements as rule-governed behavior, distinguishing it from 
contingency-shaped behavior acquired by direct experience 
with contingencies.” 

Operant learning does not always occur as a slow, gradual 
process. Sometimes new, complex repertoires appear quickly 
with little apparent direct conditioning (Epstein, 1991; Sidman, 
1994). Murray Sidman’s groundbreaking research on stimu- 
lus equivalence showed that under certain conditions learners 
acquire new skills and verbal relations without direct instruction 
on those skills and relations (cf., Critchfield, Barnes-Holmes, 
& Dougher, 2018). The new behaviors emerge as a product of 
instruction on related skills and relations. This research, which is 
found under various names in addition to stimulus equivalence, 
including generative instruction, equivalence-based instruction, 
and derived relational responding, is described in Chapters 19 
and 20. 

One type of rapid learning has been called contingency 
adduction, a process whereby a behavior that was initially 
selected and shaped under one set of conditions is recruited by a 
different set of contingencies and takes on a new function in the 
person’s repertoire (Andronis, 1983; Layng & Andronis, 1984). 
Johnson and Layng (1992, 1994) described several examples 
of contingency adduction in which simple (component) skills 
(e.g., addition, subtraction, and multiplication facts, isolating 
and solving for X in a simple linear equation), when taught to 
fluency, combined without apparent instruction to form new 
complex (composite) patterns of behavior (e.g., factoring com- 
plex equations). Lessons can be designed in such a way that 
teaching select concepts and relations results in students acquir- 
ing concepts and relations that were not taught directly. 

Intertwined lineages of different operants combine to form 
new complex operants (Glenn, 2004), which produce response 
products that in turn make possible the acquisition of behav- 
iors beyond the spatial and mechanical restraints of anatomical 
structure. 


In the human case, the range of possibilities may be 
infinite, especially because the products of operant 
behavior have become increasingly complex in the context 
of evolving cultural practices. For example, anatomical 
constraints prevented operant flying from emerging in a 
human repertoire only until airplanes were constructed 

as behavioral products. Natural selection’s leash has 

been greatly relaxed in the ontogeny of operant units. 
(Glenn et al., 1992, p. 1332) 


Complexity of Controlling Variables 


Behavior is selected by its consequences. This mega principle of 
operant behavior sounds deceptively (and naively) simple. How- 
ever, “Like other scientific principles, its simple form masks the 


complexity of the universe it describes” (Glenn, 2004, p. 134). 
The environment and its effects on behavior are complex. 

Skinner (1957) noted that, “(1) the strength of a single 
response may be, and usually is, a function of more than one 
variable and (2) a single variable usually affects more than one 
response” (p. 227). Although Skinner was writing in reference 
to verbal behavior, multiple causes and multiple effects are char- 
acteristics of many behavior—environment relations. 

Many behaviors result from multiple causes. In a phe- 
nomenon called joint control (Lowenkron, 1998, 2006; Palmer, 
2006), two separate but interrelated forms of a person’s own 
verbal behavior combine to acquire stimulus control of a 
response that would not have occurred in the absence of either. 
For example, a person writing on her laptop computer, while 
sitting in the kitchen, needs a particular book for a reference. 
She might say to herself, “I need About Behaviorism” and repeat 
the book’s title (an echoic) several times while walking to the 
room containing her books. She reads the titles (another form 
of verbal behavior) of the books on the bookcase until she emits 
the response, “About Behaviorism,” whose topography matches 
the self-echoic. The occurrence of these two sources of control 
mediates the selection of the desired book.”> 

Concurrent contingencies can also combine to make a 
behavior more or less likely to occur in a given situation. Per- 
haps we finally return our neighbor’s weed trimmer not just 
because he usually invites us in for a cup of coffee, but also 
because returning the tool reduces the “guilt” we feel for keep- 
ing it for 2 weeks. 

Concurrent contingencies often vie for control of 
incompatible behaviors. We cannot watch “Baseball Tonight” 
and study (properly) for an upcoming exam. Although not a 
technical term in behavior analysis, algebraic summation is 
sometimes used to describe the effect of multiple, concur- 
rent contingencies on behavior. The behavior that is emitted 
is thought to be the product of the competing contingencies 
“canceling portions of each other out,” as in an algebraic 
equation. 

Behavioral covariation illustrates one type of multiple 
effect. For example, Lerman, Kelley, Vorndran, and Van Camp 
(2003) found that blocking the emission of one problem behavior 
decreased occurrences of that behavior but produced a collateral 
increase in other topographies of problem behaviors in the same 
response class. As another example of multiple effects, the pre- 
sentation of an aversive stimulus may, in addition to suppressing 
future occurrences of the behavior it follows, elicit respondent 
behaviors and evoke escape and avoidance behaviors—three 
different effects from one event. 

Emotions are often a commingling of respondent and 
operant behaviors. Interactions between respondent and oper- 
ant conditioning further the difficulty of determining causal 
variables. For example Pavlovian (stimulus-reinforcer) contin- 
gencies embedded in stimulus control arrangements may limit 
the effectiveness of operant (response-reinforcer) contingencies 
(see Nevin, 2009). A study by Gorn (1982) suggests the possi- 
bility that pairing music a person likes (unconditioned stimulus 
for positive emotional responses) with an advertisement for a 
product (neutral stimulus) might condition emotional responses 
to the product functioning as a conditioned stimulus that elicits 
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positive emotional responses that influence the operant behavior 
of purchasing the product. 

All of these complex, concurrent, interrelated contingen- 
cies make it difficult for behavior analysts to identify and control 
relevant variables. It should not be surprising that the settings in 
which applied behavior analysts ply their trade are sometimes 
described as places where “reinforcement occurs in a noisy 
background” (Vollmer & Hackenberg, 2001, p. 251). 

Consequently, as behavior analysts, we should recognize 
that meaningful behavior change might take time and many tri- 
als and errors as we work to understand the interrelationships 
and complexities of the controlling variables. Don Baer (1987) 
recognized that some of the larger problems that beset society 
(e.g., poverty, substance addiction, illiteracy), given our pres- 
ent level of technology, might be too difficult to solve at the 
moment. He identified three barriers to solving such complex 
problems: 


(a) We are not empowered to solve these bigger remaining 
problems, (b) we have not yet made the analysis of how 

to empower ourselves to try them, and (c) we have not yet 
made the system-analytic task analyses that will prove cru- 
cial to solving those problems when we do empower our- 
selves sufficiently to try them. .. . In my experience, those 
projects that seem arduously long are arduous because 

(a) I do not have a strong interim reinforcer compared to 
those in the existing system for status quo and must wait 
for opportunities when weak control may operate, even 

so, or (b) I do not yet have a correct task analysis of the 
problem and must struggle through trials and errors. By 
contrast (c) when I have an effective interim reinforcer and 
I know the correct task analysis of this problem, long prob- 
lems are simply those in which the task analysis requires a 
series of many behavior changes, perhaps in many people, 
and although each of them is relatively easy and quick, the 
series of them requires not so much effort as time, and so it 
is not arduous but merely tedious. (pp. 335, 336-337) 


Individual Differences 


People often respond differently, sometimes drastically so, to 
the same set of environmental conditions. This fact is sometimes 
cited as evidence that principles of behavior based on environ- 
mental selection do not exist, at least not in a form that could 
provide the basis for a robust and reliable technology of behay- 
ior change. It is then argued that because people often respond 
differently to the same set of contingencies, control of behavior 
must come from within each person. In refuting that argument, 
Sidman (2013) recognizes the complex learning histories that 
account for much of our individual differences and points to the 
excitement of discovering the scientific principles that link us 
with one another and with the universe at large. 


Among the most powerful sources of behavioral control are 
our individual behavioral histories, in each of us a web of 
such complexity that it tempts us to give up and claim inde- 
pendence. . . . to claim control by an inner self. . . . Rather 
than needing to claim myself as prepotent, I have always 
found it exciting to recognize that I am at one with the rest 
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of the universe and to help discover the laws that relate me 
to everything else. (p. Xvi) 


As each of us experiences varying contingencies of rein- 
forcement (and punishment), some behaviors are strengthened 
(selected by the contingencies) and others are weakened. This 
is the nature of operant conditioning, which is to say, human 
nature. Because no two people ever experience the world in 
exactly the same way, each of us arrives at a given situation 
with a different history of reinforcement. The repertoire of 
behaviors each person brings to any situation has been selected, 
shaped, and maintained by his or her unique history of reinforce- 
ment. Each human’s unique repertoire defines him or her as a 
person. We are what we do, and we do what we have learned to 
do. “He begins as an organism and becomes a person or self as 
he acquires a repertoire of behavior” (Skinner, 1974, p. 231). 

Individual differences in responding to current stimulus 
conditions, then, do not need to be attributed to differences in 
internal traits or tendencies, but to the orderly result of different 
histories of reinforcement. The behavior analyst must also con- 
sider people’s varying sensitivities to stimuli (e.g., hearing loss, 
visual impairment) and differences in response mechanisms 
(e.g., cerebral palsy) and design program components to ensure 
that all participants have maximum contact with relevant contin- 
gencies (Heward, Alber-Morgan, & Konrad, 2017). 


Obstacles to Changing Behavior 
in Applied Settings 


Compounding the difficulty of tackling the complexity of human 
behavior in the “noisy” applied settings where people live, work, 
and play, applied behavior analysts are sometimes prevented 
from implementing an effective behavior change program due to 
logistical, financial, sociopolitical, legal, and/or ethical factors. 
Most applied behavior analysts work for agencies with limited 
resources, which may make the data collection required for a 
more complete analysis impossible. In addition, participants, 
parents, administrators, and even the general public may at times 
limit the behavior analyst’s options for effective intervention 
(e.g., “We don’t want students working for tokens”). Legal or 
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SUMMARY 
Behavior 
1. Behavior is the activity of living organisms. 


2. Technically, behavior is “that portion of an organism’s 
interaction with its environment that involves movement 
of some part of the organism” (Johnston & Pennypacker, 
2009, p. 31). 


3. The term behavior is usually used in reference to a larger 
set or class of responses that share certain topographical 
dimensions or functions. 


4. Response refers to a specific instance of behavior. 


ethical considerations may also preclude determining experi- 
mentally the controlling variables for an important behavior. 
Ethical considerations for behavior analysts are discussed in 
Chapter 31. 

Each of these practical complexities combines with the 
behavioral and environmental complexities previously men- 
tioned to make the applied behavior analysis of socially import- 
ant behavior a challenging task. However, the task need not be 
overwhelming, and few tasks are as rewarding or as important 
for the betterment of humankind. 

It is sometimes argued that the scientific study of behavior 
cannot account for such human creations as beauty and humor 
or the behavior of judging something to be beautiful. Hineline 
(2005), Palmer (2018), Mechner (2017), and Neuringer and 
Jensen (2013) discuss aesthetics and creativity from a behavior 
analytic point of view on topics ranging from literature aesthet- 
ics to the craftsmanship of woodworking tools. 

Will knowing how such amazing behavior is learned some- 
how diminish the quality or enjoyment of the human experience? 
For example, will our increasing knowledge of the variables 
responsible for creative behavior lessen the feelings evoked by a 
powerful painting or a beautiful symphony, or reduce our appre- 
ciation of the artists who produced them? We think not, and we 
encourage you, as you read and study about the basic concepts 
introduced in this chapter and examined in more detail throughout 
the book, to consider Nevin’s (2005) response to how a scientific 
account of behavior adds immeasurably to the human experience: 


At the end of Origin of Species (1859), Darwin invites us to 
contemplate a tangled bank, with its plants and its birds, its 
insects and its worms; to marvel at the complexity, diver- 
sity, and interdependence of its inhabitants; and to feel awe 
at the fact that all of it follows from the laws of reproduc- 
tion, competition, and natural selection. Our delight in the 
tangled bank and our love for its inhabitants are not dimin- 
ished by our knowledge of the laws of evolution; neither 
should our delight in the complex world of human activity 
and our love for its actors be diminished by our tentative 
but growing knowledge of the laws of behavior. Tony 
Nevin, personal communication (December 19, 2005.) 


5. Response topography refers to the physical shape or form 
of behavior. 


6. A response class is a group of responses of varying 
topography, all of which produce the same effect on the 
environment. 


7. Repertoire can refer to all of the behaviors a person can do 
or to a set of behaviors relevant to a particular setting or task. 


Environment 


8. Environment is the physical setting and circumstances in 
which the organism or referenced part of the organism exists. 


D; 


10. 


11. 


12. 


13. 


14. 


15. 


Stimulus is “an energy change that affects an organism 
through its receptor cells” (Michael, 2004, pas 


The environment influences behavior primarily by 
stimulus change, not static stimulus conditions. 


Stimulus events can be described formally (by their 
physical features), temporally (by when they occur), and 
functionally (by their effects on behavior). 


A stimulus class is a group of stimuli that share specified 
common elements along formal, temporal, and/or func- 
tional dimensions. 


Antecedent conditions or stimulus changes exist or occur 
prior to the behavior of interest. 


Consequences are stimulus changes that follow a behavior 
of interest. 


Stimulus changes can have one or both of two basic effects 
on behavior: (a) an immediate but temporary effect of 
increasing or decreasing the current occurrences of the 
behavior and/or (b) a delayed but relatively permanent 
effect in terms of the future occurrences of that type of 
behavior in the future. 


Respondent Behavior 
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Respondent behavior is elicited by antecedent stimuli. 


A reflex is a stimulus—response relation consisting of an 
antecedent stimulus and the respondent behavior it elicits 
(e.g., bright light—pupil contraction). 


All healthy members of a given species are born with the 
same repertoire of unconditioned reflexes. 


An unconditioned stimulus (e.g., food) and the respondent 
behavior it elicits (e.g., salivation) are called uncondi- 
tioned reflexes. 


Conditioned reflexes are the product of respondent con- 
ditioning: a stimulus—stimulus pairing procedure in which 
a neutral stimulus is presented with an unconditioned 
stimulus until the neutral stimulus becomes a conditioned 
stimulus that elicits the conditioned response. 


Pairing a neutral stimulus with a conditioned stimulus 
can also produce a conditioned reflex—a process called 
higher-order (or secondary) respondent conditioning. 


Respondent extinction occurs when a conditioned stimulus 
is presented repeatedly without the unconditioned stimulus 
until the conditioned stimulus no longer elicits the condi- 
tioned response. 


Operant Behavior 


23. 
24. 


Operant behavior is selected by its consequences. 


Unlike respondent behavior, whose topography and basic 
functions are predetermined, operant behavior can take a 
virtually unlimited range of forms. 
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Selection of behavior by consequences operates during 
the lifetime of the individual organism (ontogeny) and is 
a conceptual parallel to Darwin’s natural selection in the 
evolutionary history of a species (phylogeny). 


Operant conditioning, which encompasses reinforcement 
and punishment, refers to the process and selective effects 
of consequences on behavior: 


* Consequences can affect only future behavior. 


* Consequences select response classes, not individual 
responses. 


* Immediate consequences have the greatest effect. 
* Consequences select any behavior that precedes them. 


* Operant conditioning occurs automatically. 


Most stimulus changes that function as reinforcers or pun- 
ishers can be described as either (a) a new stimulus added 
to the environment or (b) an already present stimulus 
removed from the environment. 


Positive reinforcement: A response is followed by the 
presentation of a stimulus that results in similar responses 
occurring more often. 


Negative reinforcement: A response is followed by the 
withdrawal of a stimulus that results in similar responses 
occurring more often. 


The term aversive stimulus is often used to refer to stimulus 
conditions whose termination functions as reinforcement. 


Extinction (withholding all reinforcement for a previously 
reinforced behavior) produces a decrease in response rate 
to the behavior’s pre-reinforcement level. 


Positive punishment: A response is followed by the pre- 
sentation of a stimulus that results in similar responses 
occurring less often. 


Negative punishment: A response is followed by the 
withdrawal of a stimulus that results in similar responses 
occurring less often. 


A principle of behavior describes a functional relation 
between behavior and one or more of its controlling vari- 
ables that has thorough generality across organisms, spe- 
cies, settings, and behaviors. 


A behavior change tactic is a technologically consistent 
method for changing behavior that has been derived from 
one or more basic principles of behavior. 


Unconditioned reinforcers and punishers function irre- 
spective of any prior learning history. 


Stimulus changes that function as conditioned reinforcers 
and punishers do so because of previous pairing with other 
reinforcers or punishers. 


One important function of motivating operations is alter- 
ing the current value of stimulus changes as reinforcement 
or punishment. For example, deprivation and satiation are 
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motivating operations that make food more or less effec- 
tive as reinforcement. 


39. A discriminated operant occurs more often under some 
antecedent conditions than it does under others, an out- 
come called stimulus control. 


40. Stimulus control refers to differential rates of oper- 
ant responding observed in the presence or absence of 
antecedent stimuli. Antecedent stimuli acquire the ability 
to control operant behavior by having been paired with 
certain consequences in the past. 


41. The three-term contingency—antecedent, behavior, and 
consequence—is the basic unit of analysis in the analysis 
of operant behavior. 


42. If a reinforcer (or punisher) is contingent on a particular 


behavior, the behavior must be emitted for the conse- 


quence to occur. 
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KEY TERMS 


automatic reinforcement 
automaticity of reinforcement 
aversive stimulus 

behavior 

behavior change tactic 
conditioned punisher 
conditioned reflex 
conditioned reinforcer 
conditioned stimulus 
consequence 

contingency 
contingency-shaped behavior 
contingent 

deprivation 

discriminated operant 
discriminative stimulus (SP) 
environment 

extinction 


habituation 


higher-order conditioning 


history of reinforcement 
joint control 
motivating operation 
negative punishment 
negative reinforcement 
neutral stimulus 
ontogeny 

operant behavior 
operant conditioning 
phylogeny 

positive punishment 
positive reinforcement 
principle of behavior 
punisher 

punishment 

reflex 

reinforcement 


reinforcer 
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1. The controlling variables of primary importance in applied 


behavior analysis are located in: 
a. The environment 

b. Consequences 

c. The mental states 

d. Reinforcement 

Hint: (See “Behavior’’) 


43. All applied behavior analysis procedures involve manipulation 
of one or more components of the three-term contingency. 


Recognizing the Complexity of Human Behavior 


44. Humans are capable of acquiring a huge repertoire of 
behaviors. Response chains and verbal behavior also make 
human behavior extremely complex. 


45. The variables that govern human behavior are often highly 
complex. Many behaviors have multiple causes. 


46. Individual differences in histories of reinforcement and 
organic impairments also make the analysis and control of 
human behavior difficult. 


47. Applied behavior analysts are sometimes prevented from 
conducting an effective analysis of behavior because of 
practical, logistical, financial, sociopolitical, legal, and/or 
ethical reasons. 
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repertoire 

respondent behavior 
respondent conditioning 
respondent extinction 
response 

response class 
rule-governed behavior 
selectionism 

socially mediated contingency 
stimulus 

stimulus class 

stimulus control 
stimulus—stimulus pairing 
three-term contingency 
unconditioned punisher 
unconditioned reinforcer 


unconditioned stimulus 
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2. Two functionally distinct types of behavior important to 
behavior analysis include: 


a. Learned and operant behavior 

b. Reflexive and respondent behavior 
c. Respondent and operant behavior 
d. Operant and conditioned behavior 
Hint: (See “Behavior’”) 


- Which of the following is considered a property of 
behavior amenable to measurement? 


a. Temporal Weight 

b. Temporal Locus 

c. Temporal Displacement 

d. Temporal Size 

Hint: (See “Behavior”) 

. A____ denotes a set or collection of knowledge and 


skills a person has learned that are relevant to particular 
settings or tasks. 


a. Repertoire 

b. Behavior 

c. Skill 

d. Response 

Hint: (See “Behavior’’) 

. Behavior is affected by stimulus changes that occur 

prior to and immediately after the behavior. The term 
refers to environmental conditions or stimulus 


changes that exist or occur prior to the behavior of 
interest. 


a. Consequence 

b. Stimulus 

c. Event 

d. Antecedent 

Hint: (See “Environment’) 

. A behavior that is elicited by antecedent stimuli and is 
“brought about” by a stimulus that precedes it is: 

a. Operant 

b. Learned 

c. Respondent 

d. New 

Hint: (See “Respondent Behavior”) 

. Aresponse is followed by a stimulus change, the effect of 
which is an increase in the future frequency of behavior. 


Which type of stimulus-change operation most likely 
occurred? 


a. Reinforcement 

b. Punishment 

c. Extinction 

d. Positive Punishment 

Hint: (See “Operant Behavior”) 


reinforcer, while money 
reinforcer. 


. Water is an example of a 
is an example of a 


a. Conditioned, unconditioned 
b. Secondary, primary 

c. Unconditioned, conditioned 
d. Powerful, ineffective 


Hint: (See “Reinforcement”) 


3h 


10. 


11. 


12. 


13. 


14. 


Chapter 2 * Basic Concepts and Principles 63 


Answering the door when you hear the door bell and not 
when it is silent is an example of behavior being under: 


a. Simultaneous prompting 

b. Equivalence 

c. Stimulus control 

d. Premack principle 

Hint: (See “Stimulus Changes That Function as Reinforcers 
and Punishers’”’) 

A behavior that occurs more frequently under some 
antecedent conditions than it does in others is called a(n): 
a. Stimulus control 

b. Operant behavior 

c. Discriminative stimulus 

d. Discriminated operant 

Hint: (See “Stimulus Changes That Function as Reinforcers 
and Punishers’’) 

Food deprivation may alter the momentary effectiveness of 
food as a reinforcer. Food deprivation is an example of a(n): 
a. Satiation trial 

b. Motivating operation 

c. Operant behavior 

d. Experimental control 

Hint: (See “Unconditioned Reinforcement and 
Punishment’) 

The three-term contingency is made of these three terms: 
a. Attitude, behavior, reinforcement 

b. Antecedent, reinforcement, punishment 

c. Antecedent, behavior, consequence 

d. Antecedent, stimulus, control 

Hint: (See “The Discriminated Operant and Three-Term 


Contingency”) 


The term “contingent” as used in behavior analysis 

refers to the dependent relationship of a particular conse- 
quence on the occurrence of behavior and is also used in 

reference to the contiguity of behavior and its 
consequences. 


a. Temporal 
b. Independent 
c. Dependent 
d. False 
Hint: (See “The Discriminated Operant and Three-Term 
Contingency”) 
behavior is elicited by antecedent stimuli. 
a. Operant 
b. Temporal 
c. Respondent 
d. All 
Hint: (See “Behavior’) 
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Conditioned are the product of respondent 


conditioning. 

a. Antecedents 

b. Stimuli 

c. Punishers 

d. Reflexes 

Hint: (See “Behavior’’) 

Operant behavior is selected by its: 
a. Consequences 

b. Antecedents 


eC OE SOSSTSS BSE OOPS ATR ETON 


Discuss the difference between a response and a behavior. 
Hint: (See “Behavior’) 


. Discuss the importance of a functional analysis of behavior 


over a structural analysis of behavior. 

Hint: (See “Behavior’’) 

Describe the effect of positive and negative reinforcement 
on subsequent behavior. 


Hint: (See “Operant Conditioning”) 


. Give an example of negative reinforcement. 


Hint: (See “Operant Conditioning”) 


. Discuss the difference between a principle of behavior and 


a behavior-change tactic. 

Hint: (See “Reinforcement’’) 

Discuss the difference between conditioned and uncondi- 
tioned reinforcers. 


Hint: (See “Reinforcement’’) 


. Describe the effect punishment contingencies have on sub- 


sequent behavior. 


Hint: (See “Selection by Consequences’) 


. State an example of stimulus control. 


Hint: (See “The Discriminated Operant and Three-Term 
Contingency”) 
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The reader should not be overwhelmed by the many technical terms and 
concepts in this chapter. With the exception of respondent behavior, each 
concept introduced in this chapter is explained in detail in subsequent 
chapters. This initial overview of basic concepts and principles provides 
background information that will facilitate understanding portions of the 
text that precede the more detailed explanations. 

In the mid-1960s, Ogden Lindsley proposed what he called the dead man 
test as a way to help teachers determine whether they were targeting real 
behaviors for measurement and change as opposed to inanimate states such 
as “being quiet” (see Chapter 3). According to this test, if a dead man can 
do it, it isn’t behavior. Getting blown over by strong wind is not behavior (a 


c. Conditioners 
d. Respondent 
Hint: (See “Behavior’’) 


Operant conditioning encompasses: 
a. Time and stimuli 

b. Antecedents and phylogeny 

c. Conditioned and unconditioned 
d. Reinforcement and punishment 
Hint: (See “Behavior’’) 


oe = an EEE TONNES OONT TN TION 
conn eca aa Sten aSSbSSONC SOTO EAE HERENDEEN IO INTO CONNIE TEEL OTEAANESIES COSTS INN 


What effects do motivating operations have on reinforcers? 
Hint: (See “Unconditioned Reinforcement and Punishment”) 
Using an example from applied settings, construct a three- 
term contingency. 

Hint: (See “The Discriminated Operant and Three-Term 
Contingency’) 


Diagram an example of respondent conditioning. Identify 
and label the following: unconditioned stimulus, uncon- 
ditioned response, neutral response, conditioned stimulus, 
and conditioned response. 


Hint: (See “Respondent Conditioning”) 


Classify the following behaviors as respondent or operant: 
¢ Talking 

¢ Walking 

¢ Baby’s grasp 

¢ Salivation at smell of food 

¢ Baking a cake 

¢ Pupil constriction 

¢ Thinking about music 

¢ Playing the piano 

¢ Withdrawing hand from a fire 


Hint: (See “Respondent Conditioning”’) 


dead man would also be blown over), but moving arms and hands in front of 
one’s face, tucking and rolling, and yelling “Whoa!” as one is being blown 
over are behaviors. Critchfield (2016) provides some historical background 
on the dead man test and a thoughtful critique of its usefulness as a guide 
for measurement. 

Most behavior analysts use the word behavior both as a mass noun to refer to 
the subject matter of the field in general or a certain type or class of behavior 
(e.g., operant behavior, study behavior) and as a count noun to refer to spe- 
cific instances (e.g., two aggressive behaviors). The word behavior is often 
implied and unnecessary to state. We agree with Friman’s (2004) recommen- 
dation that in most cases, “If the object of our interest is hitting and spitting, 
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let’s just say ‘hitting’ and ‘spitting,’ Subsequently, when we are gathering our 
thoughts with a collective term, we can call them behaviors” (p. 105). 


. Respondent conditioning and the Operant principles mentioned here are 


introduced later in this chapter. 


. Gray (1979) provides an excellent and interesting description of Pavlov’s 


research. “Although the experimental preparations for studying Pavlov- 
ian conditioning have expanded over the years, Pavlov’s work with dogs 
describes many of the key empirical phenomena and theoretical processes 
that modern researchers continue to pursue... . The study of basic asso- 
ciative learning processes underlying Pavlovian conditioning has led to 
numerous insights into experimental design, how learning occurs, and how 
basic processes may lay the foundation for many putative higher forms of 
learning” (K. M. Lattal, 2013, p. 283). 

Unconditioned stimulus and conditioned stimulus are the most commonly 
used terms to denote the stimulus component of respondent relations. How- 
ever, because the terms ambiguously refer to both the immediate evocative 
(eliciting) effect of the stimulus change and its somewhat permanent and 
delayed function-altering effect (the conditioning effect on other stimuli), 
Michael (1995) recommended that the terms unconditioned elicitor (UE) 
and conditioned elicitor (CE) be used when referring to the evocative func- 
tion of these variables. 


. To learn more about Pavlovian conditioning, see the detailed descriptions 


and numerous examples in Gottlieb and Begej (2014), K. M. Lattal (2013), 
and Williams (2014). 

The verb emit is used in conjunction with operant behavior. Its use fits well 
with the definition of operant behavior, allowing reference to the conse- 
quences of behavior as the major controlling variables. The verb elicit is inap- 
propriate to use with operant behavior because it implies that an antecedent 
stimulus has primary control of the behavior. Domjan (2016) argues that the 
terms elicited and emitted are no longer useful or accurate for distinguishing 
classically conditioned (respondent) behavior from operant behavior. 

To learn more about the role of selectionism in Darwinian evolutionary 
biology and Skinnerian behaviorism, see Leao, Laurenti, and Haydu 
(2016); Moxley (2004); and Reese (1994). 

Unless noted otherwise, the term behavior will refer to operant behavior 
throughout the remainder of the text. 

Skinner (1966) used rate of responding as the fundamental datum for his 
research. To strengthen an operant is to make it more frequent. However, 
rate is not the only measurable and malleable dimension of behavior. As we 
will see in Chapters 3 and 4, sometimes the duration, latency, magnitude, 
and/or topography of behavior changes are of practical importance. 
Automaticity of reinforcement and automatic reinforcement are different 
concepts. Automatic reinforcement refers to the hypothesis that a behay- 
ior produces its own reinforcement, as in the sensory stimulation from 
scratching an itch (Vaughn & Michael, 1982). In practice, when an exper- 
imental analysis fails to identify the reinforcers maintaining a problem 
behavior, some form of automatic reinforcement is assumed. Automatic 
reinforcement is described in Chapters 11 and 27. 

Prominent behavior analysts disagree on whether delayed consequences 
can function as reinforcement (Bradley & Poling, 2010). Chapters 11, 29, 
and 30 discuss the use and effects of delayed consequences in behavior 
change programs. 
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Malott and Shane (2014) refer to these two operations as “stimulus addi- 
tion” and “stimulus subtraction.” 

For examples of inaccurate representations of behaviorism and behavior 
analysis and their implications for training practitioners and serving clients, 
see Cooke (1984); Heward (2005); Heward and Cooper (1992); Morris 
(2009); Schlinger (2018); and Todd and Morris (1983, 1992). 

Foxx (1982) introduced the terms Type J punishment and Type II pun- 
ishment for positive punishment and negative punishment, respectively. 
Some behavior analysts and practitioners continue to use Foxx’s terminol- 
ogy. Malott and Shane (2014) refer to negative punishment as a penalty 
contingency. 

Michael (1975) and Baron and Galizio (2005) present cogent arguments 
that positive and negative reinforcement are examples of the same funda- 
mental operant relation. This issue is discussed further in Chapter 12. 
Some authors use the modifiers primary or unlearned to identify uncondi- 
tioned reinforcers and unconditioned punishers. 


. Inaddition to using aversive stimulus as a synonym for a negative reinforcer, 


Skinner (1953) used the term to refer to stimuli whose onset or presentation 
functions as punishment, a practice continued by many behavior analysts 
(e.g., Alberto & Troutman, 2013; Malott & Shane, 2014; Miltenberger, 
2016). The term aversive stimulus (and aversive control when speaking of 
behavior change techniques involving such stimuli) is used widely in the 
behavior analysis literature to refer to one or more of three different behay- 
ioral functions: An aversive stimulus may be (a) a negative reinforcer if its 
termination increases behavior, (b) a punisher if its presentation decreases 
behavior, and/or (c) a motivating operation if its presentation increases 
the current frequency of behaviors that have terminated it in the past (see 
Chapter 16). When speaking or writing technically, behavior analysts must 
be careful that their use of omnibus terms such as aversive does not imply 
unintended functions (Michael, 1995). 

Some authors use the modifiers secondary or learned to identify condi- 
tioned reinforcers and conditioned punishers. 


. Contingency diagrams, such as those shown in Figure 2.3, are an effective 


way to illustrate temporal and functional relations between behavior and 
environmental events. Examples of other types of contingency diagrams 
and suggestions for using them to teach and learn about behavior analysis 
can be found in Goldwater and Acker (1995); Malott and Shane (2014); 
Mattaini (1995); and Toogood (2012). State notation is another means for 
visualizing complex contingency relations and experimental procedures 
(Mechner, 1959; Michael & Shafer, 1995). 

The phrase to make reinforcement contingent describes the behavior of the 
researcher or practitioner: delivering the reinforcer only after the target 
behavior has occurred. 

Like all scientific findings, these facts are subject to revision and even 
replacement should future research reveal more useful data. 

In shaping, a new behavior is gradually acquired through reinforcement 
of successive approximations to a final form (see Chapter 22). Because 
the natural environment seldom (if ever) truly shapes new behavior, con- 
tingency-selected behavior may be a better term than contingency-shaped 
behavior. 


. This example was adapted from Sidener (2006). More examples of joint 


control are presented in Chapter 18, Verbal Behavior. 
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PART 


Selecting, Defining, and Measuring 
Behavior 


Because applied behavior analysts must achieve and document behavior changes that improve 
people’s quality of life, the careful selection and systematic measurement of behavior form the 
operational foundation of applied behavior analysis. Chapter 3 describes the role of assessment in 
behavior analysis, principal assessment methods used by behavior analysts, how to identify and 
assess the social significance of potential target behaviors, how to prioritize target behaviors, and 
how to define the selected behaviors to enable accurate and reliable measurement, and discusses 
options for setting criteria for behavior change. Chapter 4 explains the role of measurement in 
applied behavior analysis, defines measurable dimensions of behavior, describes procedures and 
tools for measuring behavior, and suggests guidelines for selecting a measurement system. Chapter 5 
identifies indicators of trustworthy measurement; elucidates common threats to the validity, accuracy, 
and reliability of behavioral measurement; recommends strategies for combating those threats; and 
describes methods for assessing the accuracy and quality of behavioral measurement. 
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Selecting and Defining Target Behaviors 


LEARNING OBJECTIVES 


¢ Understand and explain the role of assessment in applied behavior analysis. 
e Identify the ethical and professional standards of professional applied behavior analysis. 
° State target intervention outcomes in observable and measurable terms. 


¢ Describe the four major methods for obtaining assessment information. 
e Explain the importance of social validity in regard to selecting target behavior. 
* Describe procedures for assessing the social significance of potential target behaviors. 


Discuss criteria for prioritizing target behaviors. 


Define behavior in observable and measurable terms. 


pplied behavior analysis is concerned with producing 

predictable and replicable improvements in behavior. 

However, not just any behavior will do: Applied behav- 
ior analysts improve socially significant behaviors that have 
immediate and long-lasting effects for the person and for those 
who interact with that person. Such behaviors include language, 
social, motor, and academic skills. 

An important preliminary and universally accepted first step 
involves choosing the right behaviors to target for assessment, 
measurement, and change (Lerman, Iwata, & Hanley, 2013). 
Lerman et al. (2013) state: “Specific target behaviors are usually 
selected because they will improve a person’s quality of life in 
the short and the long term by allowing the person access to new 
reinforcers and additional reinforcing contexts” (p. 86). 

This chapter describes the role of assessment in applied 
behavior analysis, three principal behavioral assessment methods, 
how to assess and prioritize the social significance of potential target 
behaviors, and how to apply criteria for defining target behaviors. 


ROLE OF ASSESSMENT IN APPLIED 
BEHAVIOR ANALYSIS 


Assessment has long been considered the first of four phases in 
any systematic model of instruction that includes: assessment, 
planning, implementation, and evaluation (Stephens, 1976). 
A comprehensive behavioral assessment informs planning, pro- 
vides guidance for implementation, and assists with evaluation. 


Definition and Purpose of Behavioral 
Assessment 


Traditional psychological and educational assessments typically 
involve a series of norm-referenced, criterion-referenced, or 
curriculum-based assessments supplemented by observations, 
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Explain the process for setting criteria for behavior change. 


anecdotal reports, and historical data to determine the learner’s 
strengths and weaknesses within cognitive, academic, social, 
and/or psychomotor domains. 

Conversely, behavioral assessment includes indirect and 
direct procedures such as interviews, checklists, and tests to iden- 
tify and define the specific target behavior. In addition to identify- 
ing behavior(s) to change, comprehensive behavioral assessment 
can uncover functional relations between variables; it provides 
context on the resources, assets, significant others, competing 
contingencies, maintenance and generalization factors, and rein- 
forcers (or punishers) that can be combined to improve the effi- 
ciency of an intervention (Brown, McDonnell, & Snell, 2016).! 

Wacker, Berg, Harding, and Cooper-Brown (2011) indi- 
cated that the principal goal of behavioral assessment is “to 
identify environmental variables that are related to increases or 
decreases in target behavior” (p. 165). Linehan (1977) offered 
a succinct description of the purpose of behavioral assessment: 
“To figure out what the client’s problem is and how to change it 
for the better” (p. 31). Implicit in the definitions of both Wacker 
et al. and Linehan is the concept that behavioral assessment is 
more than an exercise in describing and classifying behavioral 
abilities and deficiencies. In short, behavioral assessment goes 
beyond trying to obtain a psychometric score, grade equivalent 
data, or rating measure, as worthy as such findings might be for 
other purposes. Behavioral assessment discovers the function 
that behavior serves in the person’s environment (e.g., obtaining 
social attention, escaping or avoiding a task). A well-constructed 
and thorough assessment identifies the key variables controlling 
the behavior, and guides the practitioner to manipulate these 
variables to the benefit of the learner. Consequently, subsequent 
interventions can be targeted and have a much better chance 
of success. As Bourret, Vollmer, and Rapp (2004) pointed out: 
“The critical test of . . . assessment is the degree to which it 
differentially indicates an effective teaching strategy” (p. 140). 


Phases of Behavioral Assessment 


Hawkins’s (1979) conceptualization of behavioral assessment 
embraces five phases: (a) screening, (b) defining and quantifying 
problems and establishing outcome criteria, (c) pinpointing the 
target behavior(s), (d) monitoring progress, and (e) following up. 
Although the five phases form a general chronological sequence, 
overlap often occurs. Part III of this book, “Evaluating and Ana- 
lyzing Behavior Change,” describes the monitoring and follow-up 
Stages of assessment. This chapter is concerned primarily with the 
preintervention functions of assessment, the selection and definition 
of a target behavior—the specific behavior selected for change. 

To serve competently, applied behavior analysts must know 
what constitutes socially important behavior, have the technical 
skills to use appropriate assessment methods and instruments, ana- 
lyze the data, and be able to match assessment data with an inter- 
vention strategy. For instance, the remedial reading specialist must 
understand the critical behaviors of a competent reader; be able to 
determine the learner’s progress across a continuum of recognition, 
decoding, and comprehension skills; and deliver appropriate and 
effective instruction. In short, the analyst must be knowledgeable 
about the full scope and sequence of the target behavior. 


Preassessment Considerations 


Before conducting an informal or formal behavioral assessment 
for the purpose of pinpointing a target behavior, the analyst must 
address two fundamental questions. First, who has the authority, 
permission, resources, and skills to complete an assessment and 
intervene with the client? If a practitioner does not have authority or 
permission, then the analyst’s role in assessment and intervention is 
restricted. For example, suppose that a behavior analyst is standing 
in a checkout line where a nearby parent is attempting to manage 
an extremely disruptive child. Does the behavior analyst have the 
authority to conduct an on-the-spot assessment of the problem or to 
suggest an intervention to the parent? No. However, if the same epi- 
sode occurred during an in situ “coaching session” with the parent 


Figure 3.1 Four cornerstones of professional assessment. 


1. Recognize professional limitations. 


3. Maintain confidentiality. e 


receive them. 


e Secure authority to assess the problem or suggest an intervention. 
e Do not test beyond professional training or license capability. 


e Decline to assess under poor conditions (e.g., the learner is ill, the environment is 
distracting, the analyst is rushed for time). 


2. Assume responsibility for your work. e Be careful, be thorough, be conservative with interpretations. 


e Recognize the assumptions for referral and testing (i.e., change of placement, 
generate intervention strategies). 


Hold assessment information in confidence. Obtain a signed and dated Release of 
Information (ROI). 
e Ensure that assessment results are shared only with those who are eligible to 


e Seek guidance in situations where confidentiality is in question 
(e.g., life-threatening situations). 
4. Adhere to professional, administration, ¢ Use technically reliable and valid measures and procedures to assess. 
and ethical standards. ¢ Follow appropriate protocols. 
e Examine and analyze all relevant medical, educational, and historical data. 
Recommend the strongest, but least intrusive interventions. 
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who had requested assistance beforehand, the analyst would be able 
to offer an assessment and recommend an intervention. 

The second question asks: What records, resources, or 
data currently exist that shed light on what has been done in the 
past to identify, treat, and evaluate the target behavior? Mini- 
mally, the behavior analyst should examine and review relevant 
medical, educational, or historical data. Clearly, all medical rea- 
sons for suspected unusual behavior should be ruled out. A child 
with serious, but as yet undetected, eye (amblyopia), ear (otitis 
media), or prenatal conditions (fetal alcohol syndrome) may 
exhibit behavior problems, which, if treated solely by behav- 
ioral measures, will miss the mark. Pretreating these causal fac- 
tors medically allows for better focus in later educational and/ 
or behavioral interventions. 

Likewise, the practitioner should conduct an analysis of 
prior educational and behavioral interventions. Through examin- 
ing narrative reports, analyzing data-based graphs or charts, and 
interviewing caretakers, a skilled analyst can form a picture about 
the nature of the target behavior, its developmental scope and 
sequence, and the reinforcers and punishers that appear to sustain 
the behavior. Further, a judgment can be made about treatment 
integrity with prior interventions. It’s possible that a past interven- 
tion (e.g., extinction) was employed to reduce calling out in class, 
but it was abandoned prematurely because the caretakers misinter- 
preted the gradual decrease in call outs as failure. Alternatively, 
misbehaviors that were addressed successfully in a training envi- 
ronment (e.g., special education classroom, developmental center) 
might surface in an untrained setting (home, community setting) 
because of a past failure to program for generalization. 

In effect, applied behavior analysts must not only recog- 
nize the role of assessment in the four-stage assessment-planning- 
implementation-evaluation continuum, but also recognize all the 
personal, social, and environmental factors that affect behavior. 
Only then can they use their behavior analytic skills to assess and 
ultimately change behavior.” Figure 3.1 outlines the four corner- 
stones of professional assessment. 
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ASSESSMENT METHODS USED BY BEHAVIOR 
ANALYSTS 


Behavior analysts use a variety of assessment methods to identify 
which behavior(s) to target for change. A comprehensive penays 
ioral assessment often includes both indirect and direct methods.” 


Indirect Assessment 


Interviews, checklists, and rating scales are indirect assessment 
approaches because the data obtained from these methods are 
derived from recollections, reconstructions, and/or subjective, 
ordinal-scale ratings of events. Interviews, checklists, and rating 
scales can be conducted with the client and/or with people who 
come into regular contact with the individual (e.g., teachers, 
parents, care providers). 

Indirect assessments include open- and closed-ended for- 
mats (Fryling & Baires, 2016). Open-ended indirect assessments 
encourage the informant to comment freely and without restriction 
on the number, rate, intensity, and duration of target behaviors and 
stimuli and conditions that surround the occurrence of these behav- 
iors (How often does Mario engage in self-injurious behavior ?). 
The informant’s narrative across a range of questions often leads to 
follow-up questions that provide more context and substance on the 
possible variables controlling the behavior (Does Mario’s self-inju- 
rious behavior occur more often when he is at home or at school?). 

Closed-ended indirect assessments require the infor- 
mant to use a Likert scale to rate a series of questions, with 
a total summary score being generated that provides direc- 
tion on possible variables controlling the behavior (e.g., Rate 
the occurrences of Mario’s self-injurious behavior using this 
scale: 5 = frequently; 3 = sometimes; 1 = rarely). Figure 3.2 
compares the relative strengths and limitations of closed- and 
open-ended indirect assessments. 

The following sections describe the roles of the client, 
significant others, and checklists and rating scales with respect 
to conducting indirect assessments. 


Interviewing the Client 


A behavioral interview is often a first and important step in 
identifying potential target behaviors, which can be verified, or 


rejected, by subsequent direct observation or empirical analysis. 
A behavioral interview differs from a traditional interview by 
the type of questions asked and the level of information sought. 
Behavior analysts rely primarily on what and when questions, 
which tend to focus on the environmental conditions that exist 
before, during, and after a behavioral episode. Identifying 
environmental events that correlate with the behavior provides 
valuable information for formulating hypotheses about the con- 
trolling function of these variables and for planning interven- 
tions. Hypothesis generation leads to experimental manipulation 
and the discovery of functional relations (see Chapter 26). 

Figure 3.3 provides examples of the appropriate kinds of 
what and when questions. This sequence of questions was devel- 
oped by a behavioral consultant in response to a teacher who 
wanted to reduce the frequency of her own negative reactions to 
acting out and disruptive students. Similar questions could be 
generated to address situations in homes or community settings 
(Sugai & Tindal, 1993). 


interviewing Significant Others 


Sometimes either the behavior analyst cannot interview the 
client personally or the analyst needs information from others 
who are important in the client’s life (e.g., parents, teachers, 
coworkers). In such cases, the analyst will interview one or more 
of these significant others. When asked to describe a behav- 
ioral problem or deficit, significant others often begin with 
general terms or labels that do not identify specific behaviors 
to change and often imply causal factors thought to be intrinsic 
to the client (e.g., fearful, aggressive, unmotivated, lazy, with- 
drawn). By asking structured questions, the behavior analyst 
helps significant others describe the problem in terms of specific 
behaviors, environmental conditions, and events associated with 
those behaviors. For example, when interviewing parents, the 
following questions help to target instances of “noncompliant” 
and “immature” behaviors. 


¢ What does Derek do that leads you to characterize his 
behavior as immature or noncompliant? 

¢ During what time of day does Derek seem most immature 
(or noncompliant)? 


Figure 3.2 Strengths and limitations of closed- versus open-ended indirect assessments. 


Closed-ended indirect assessments 


Strengths -Assures information about common controlling 


variables with some populations. 

-Quick and easy to administer. 

-Few particular skills are required to administer 
and interpret. 


Limitations 


-Only asks questions about pre-determined variables. 
-May suggest a non-function (i.e., false-positives). 
-May be misused by individuals without training in ABA. 
-Little opportunity to develop rapport with informants. 


Open-ended indirect assessments 
-Information about a wide range of contextual variables 
may be obtained. 


-Opportunity to develop rapport with the informant and to 
listen to his or her experiences. 


-Takes longer than closed-ended indirect assessments. 
-May include information that is not relevant to the function. 
-Interpretation requires behavior analytic skills. 


-Requires clinical interviewing skills to obtain information 
and develop rapport. 


From “The Practical Importance of the Distinction Between Open and Closed-Ended Indirect Assessments,” by M. J. Fryling and N. A. Baires 
2016, Behavior Analysis in Practice, 9(2), p. 150. Copyright 2016 by the Association for Behavior Analysis International. Reprinted by permission. 
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* Are there certain situations or places where Derek is non- * How do you and other family members respond when 


compliant or acts immature? If so, where, and what does Derek exhibits these behaviors? 
he do? * If Derek were to be more mature and independent as 
* How many different ways does Derek act immature (or you would like, what would he do differently than he 
noncompliant)? does now? 
* What’s the most frequent noncompliant behavior that 
Bere 4 P Figure 3.4 is a form with which parents, caregivers, and 


significant others can identify possible target behaviors. 


Figure 3.3 Sample behavioral interview questions. 


Problem Identification Interview Form 


Reason for referral: The teacher requested help reducing her negative attention to acting out and disruptive students who were 
yelling and noncompliant. 


In your own words, can you define the problem behaviors that prompted your request? 
. Are there any other teacher-based behaviors that concern you at this time? 


. When you engage in negative teacher attention (i.e., when you attend to yelling or noncompliant behavior), what usually 


happens immediately before the negative teacher attention occurs? 

. What usually happens after the negative teacher attention occurs? 

. What are the students’ reactions when you yell or attend to their noncompliant behavior? 

. What behaviors would the students need to perform so that you would be less likely to attend to them in a negative way? 
Have you tried other interventions? What has been their effect? 


Figure 3.4 Form for generating possible target behaviors. 


The 5 + 5 Behavior List 


Child’s name: 


Person completing this list: 


Listmaker’s relationship to child: 


5 good things does now 5 things I'd like to see learn to do 
more (or less) often 


Directions: Begin by listing in the left-hand column 5 desirable behaviors your child (or student) does regularly now, things that you 
want him or her to continue doing. Next, list in the right-hand column 5 behaviors you would like to see your child do more Le 
(things that your child does sometimes but should do with more regularity) and/or undesirable behaviors that you want him or her 
to do less often (or not at all). You may list more than 5 behaviors in either column, but try to identify at least 5 in each. 
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In addition to seeking help from significant others to 
identify target behaviors and possible controlling variables, the 
behavior analyst can sometimes use the interview to determine 
the extent to which significant others are willing and able to help 
implement a subsequent intervention. Without the assistance of 
parents, siblings, teacher aides, and staff, it is difficult for behav- 
ior change programs to succeed, or be sustained, with any degree 
of treatment integrity after the initial intervention. 

As an outgrowth of being interviewed, some clients, or 
their significant others, may be asked to complete a question- 
naire or so-called needs assessment survey. Questionnaires 
and needs assessment surveys have been developed in multiple 
human services areas to refine or extend the interview process 
(Altschuld & Witkin, 2000). Results from such assessments 
can be useful in selecting and defining target behaviors further, 
or in determining possible interventions. For example, a client 
seeking behavioral treatment to quit smoking might respond 
to a questionnaire or survey by indicating the number of ciga- 
rettes smoked each day and the conditions under which smoking 
occurs (e.g., morning coffee break, after dinner, stuck in a traffic 
jam). These client-collected data may shed light on antecedent 
conditions correlated with the target behavior. 


Checklists and Rating Scales 


Behavior checklists and rating scales can be used alone or in 
combination with interviews to identify potential target behav- 
iors (Kelley, LaRue, Roane & Gadaire, 2011). A behavior 
checklist provides a description of specific behaviors and the 
conditions under which each behavior occurs (i.e., antecedent 
and consequence events that may affect the frequency, intensity, 
or duration of behaviors). Situation- or program-specific check- 
lists can be created to assess one particular behavior (e.g., tooth 
brushing) or a specific skill area (e.g., social skills). 

A rating scale using a Likert system (e.g., a range of 
numbers from | to 5 or | to 10) attempts to more precisely 
describe and quantify the target behavior of interest by using 
an ordinal scale. When a numerically based scale is employed, 
sometimes a total score is generated to provide further guid- 
ance on the significance of the target behavior. The Functional 
Assessment Checklist for Teachers and Staff (March et al., 2000) 
combines checklist and rating scale components to help specify 
the target behavior and the conditions under which it occurs (see 
Figure 3.5). Still, whenever an ordinal scale is used, and data 
interpretation is involved, behavior analysts are advised to heed 
Merbitz, Morris, and Grip’s (1989) sage advice: 


Ordinal level scales are widely used in rehabilitation. 
Such scales provide a rank order characterization of 
phenomena when the direction of an underlying dimen- 
sion (such as consciousness) can be described and units 
of measurement cannot. Unfortunately, the information 
produced by these scales is difficult to interpret and easy 
to misuse. (p. 308) 


Direct Assessment 


Tests and direct observations are direct assessment approaches 
because the results from these procedures provide measurable 


and validated information about the learner’s behavior. Stan- 
dardized tests and observations are two principal methods of 
direct assessment. 


Standardized Tests 


Literally thousands of standardized tests have been developed 
to assess behavior (cf. Carlson, Geisinger, & Jonson, 2 Oileiays 
Each time a standardized test is administered, the same ques- 
tions and tasks are presented using specified procedures and the 
same scoring criteria are applied. Some standardized tests yield 
norm-referenced scores, meaning that during the test’s develop- 
ment it was administered to a large sample of people selected 
to represent the population for whom the test is designed, and 
scoring tables and charts are produced that can later be refer- 
enced to plot, chart, or calculate results. Scores are charted on 
a normalized curve. See Box 3.1: “Primer on Norm-referenced 
Test Scores”. 

The majority of standardized tests on the market, however, 
are not conducive to behavioral assessment because the results 
cannot be translated directly into target behaviors for instruc- 
tion or treatment. For example, results from standardized tests 
commonly used in the schools, such as the Jowa Tests of Basic 
Skills (Hoover, Dunbar, & Frisbie, 2007), the Peabody Individ- 
ual Achievement Test-R/NU (Markwardt, 2005), and the Wide 
Range Achievement Test-——-5 (WRAT-5) (Wilkinson & Robert- 
son, 2017), might indicate that a fourth-grader is performing at 
the third-grade level in mathematics and at the first-grade level 
in reading. Such information is useful in determining how the 
student performs with these subjects compared to students in 
general, but it indicates neither the specific math nor reading 
skills the student has mastered. Further, it does not provide suf- 
ficient direction with which to launch a remedial, intervention, 
or enrichment program. 

Finally, and from a practical perspective, behavior ana- 
lysts may not be able to administer a given standardized test 
because of licensing requirements. For instance, only a licensed 
psychologist can administer some types of intelligence and per- 
sonality inventories. 

Tests are most useful as behavioral assessment devices 
when they provide a specific measure of the person’s perfor- 
mance of the behaviors of interest. Practitioners often use crite- 
rion-referenced and curriculum-based assessments to help target 
behaviors to change. 


Criterion-referenced Assessment 


A criterion-referenced assessment (CRA) measures a child’s 
skill performance across recognized and commonly accepted 
“developmental milestones.” For instance, most 12-month-old 
toddlers can sit with trunk stability, stand independently, walk 
a few steps, reach across their midline, and roll a ball with an 
adult. In scoring CRAs, teachers might indicate a plus (+), 
minus (—), or (E) for each associated skill measured to signify 
that the skill was present, not present, or emerging, respectively. 


Curriculum-based Assessment 


A curriculum-based assessment (CBA), by contrast, is especially 
useful because the data that are obtained bear uniquely on the 
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Figure 3.5 The Functional Assessment Checklist for Teachers and Staff. 


Step 1 


Step 2 


Step 3 


Step 4 


Step 5 


Functional Assessment Checklist for Teachers and Staff (FACTS-Part A) 


Student/Grade: 
Interviewer: 


Date: 
Respondent(s): 


Student Profile: Please identify at least three strengths or contributions the student brings to school. 


Problem Behavior(s): Identify problem behaviors 


Tardy Fight/physical Aggression Disruptive Theft 


Unresponsive Inappropriate Language Insubordination Vandalism 
Withdrawn Verbal Harassment Work not done Other 
Verbally Inappropriate Self-injury 


Describe problem behavior: 


Identifying Routines: Where, when and with whom problem behaviors are most likely 


Schedule | Activity 
(Times) 


Likelihood of Problem Behavior Specific Problem 


Behavior 


[alia Before School 


Transition 
Recess 


ee eo 


Select 1-3 routines for further assessment: Select routines based on (a) similarity of activities (conditions) 
with ratings of 4, 5 or 6 and (b) similarity of problem behavior(s). Complete the FACTS-Part B for each 
routine identified. 


Functional Assessment Checklist for Teachers & Staff (FACTS-Part B) 


Student/Grade: Date: 
Interviewer: Respondent(s): 


Routine/Activities/Context: Which routine (only one) from the FACTS-Part A is assessed? 


Problem Behavior(s) 


Routine/Activities/Context 


(continued) 
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Figure 3.5 (continued) 


Step 3 


Step 4 


Step 5 


Step 6 


Step 7 


Step 8 


Provide more detail about the problem behavior(s): 


What does the problem behavior(s) look like? 
How often does the problem behavior(s) occur? 
How long does the problem behavior(s) last when it does occur? 


What is the intensity/level of danger of the problem behavior(s)? 


What are the events that predict when the problem behavior(s) will occur? (Predictors) 


Related Issues (setting events) Environmental Features 


___illness Other: ____ reprimand/correction ____structured activity 
____ drug use ____ physical demands  __ unstructured time 
____ negative social socially isolated ____ tasks too boring 
___ conflict at home ____ with peers ___ activity too long 
____ academic failure Other _____ tasks too difficult 


What consequences appear most likely to maintain the problem behavior(s)? 


Things that are Obtained 


adult attention Other: 
peer attention 


Things Avoided or Escaped From 
hard tasks Other: 
reprimands 


preferred activity peer negatives 


money/things physical effort 


adult attention 


SUMMARY OF BEHAVIOR 
Identify the summary that will be used to build a plan of behavior support. 


Setting Events & Predictors Problem Behavior(s) Maintaining Consequence(s) 


How confident are you that the Summary of Behavior is accurate? 


Not very confident Very confident 
1 2 3 4 5 6 


What current efforts have been used to control the problem behavior? 


Strategies for preventing problem behavior Strategies for responding to problem behavior 


schedule change None: Other reprimand None: Other 


office referral 


seating change 


curriculum change detention 


From Functional Assessment Checklist for Teachers and Staff (FACTS), by R. E. March, R. H. Horner, T. Lewis-Palmer, D. Brown, D. Crone, A. W. 
Todd, and E. Carr, 2000, Educational and Community Supports, College of Education, University of Oregon, Eugene, OR. Reprinted with permission. 
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Primer on Norm-Referenced Test Scores 


Percent of cases 
under portions of 


the normal curve 0.13% 


2.14% 


Standard 
deviations —4o —30 -20 


Selected 
percentile 
ranks 0.1 7 


Z,-Scores 


T-scores 
20 30 


50 84 98 99.9 


50 60 70 80 


Figure A Relationship among percentile scores, z-scores, and t-scores in a normal distribution. 
From Educational Assessment of Students, 8th ed., p. 391, by S. M. Brookhart and A. J. Nitko, 2019, Upper Saddle River, NJ: Pearson. Reprinted by 


permission. 


Norm-referenced tests compare an individual’s performance 
with a comparable norm group. These tests provide a num- 
ber of metrics that practitioners use to make decisions. Three 
widely used and reported norm-referenced scores are percen- 
tiles, z-scores, and t-scores. Figure A shows the relationships 
of these and other norm-referenced scores within a normal 
distribution 

Standardized tests yield a number of metrics that prac- 
titioners use to make decisions. Plotted beneath this “normal 
curve” are key measures that can be used to compare an indi- 
vidual’s performance with that of a comparable norm group. 
Test scores of most interest are percentile ranks, z-scores, and 
t-scores. 


Percentile Ranks 


A percentile rank aligns a score with a standard deviation. An 
average score is the 50th percentile, located in the middle of 
the distribution. A score of 84 means that the tester’s score 
was equal to or better than the scores of 84% of the individuals 
who took the standardized test and equates with 1 standard 
deviation above the mean. 


Z-scores 


A z-score represents the number of standard deviations that a 
score appears below, at, or above the mean of the population 
who completed the assessment. A z-score of +1.0 equates to 
1 standard deviation, and also the 84th percentile. 


T-scores 


A t-score is a standard score with an average of 50 and a stan- 
dard deviation of 10. In our highlighted example, a t-score 
of 50, located in the middle of the distribution, equates to a 
percentile score of 50 (read the percentile scale above). 

Two other derived scores of interest are the grade 
equivalent score and the age equivalent (not shown on the 
normal curve). 


Grade Equivalent Score 


A grade equivalent score—say, a GES of 5.5—means that the 
learner’s raw score (the number of total items answered cor- 
rectly on the test) equaled the average number raw score by 
learners midway through fifth grade in the norm group. This 
score does not provide information on the specific skills the 
individual possesses or lacks. 


Age Equivalent Score 


An age equivalent score—say, 4.6—means that the learner’s 
raw score (the number of total items answered correctly on the 
test) equaled the average number of raw score points earned 
by learners midway through age 4 in the norm group. This 
score does not provide information on the specific skills the 
individual possesses or lacks. 

To learn more about standardized testing and interpreting 
norm-referenced scores, see assessment texts such as those by 
Brookhart and Nitko (2019) and Salvia, Ysseldyke, and Bolt (2013). 
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daily or weekly tasks the learner performs in response to planned 
lessons (i.e., the curriculum) (Overton, 2006). A learner taking a 
spelling test at the end of the week based on a 5-day list of words, 
or a middle school student taking a science quiz on a chapter 
from the text, serves as a prime example of curriculum-based 
assessments. CBA data can be collected on a formative basis 
(e.g., to note error patterns or relative progress) or on a summa- 
tive schedule (e.g., yearly) across motor (gross and fine), language 
(receptive and expressive), self-help, or cognition areas. Teach- 
ers would record aggregate scores or, depending on the learner’s 
developmental level, mark associated tasks within the skill area as 
occurring independently, semi-independently, or not at all. 


Direct Observations 


Direct and repeated observations of the client’s behavior in the 
natural environment are useful for determining behaviors to 
assess and ultimately to select as a target behavior. Practitioners, 
however, should be advised that observations could produce 
limited and skewed information. For instance, if data are collected 
only during morning sessions, at home, or with a specific care- 
taker, the “target” behavior may be masked in the afternoon, at 
school, or with different caretakers. Expanding the direct observa- 
tions to include a wider range of settings, persons, or behaviors is 
likely to improve the overall quality of direct assessment, but until 
the manipulation of variables occurs, it will fall short of being a 
complete analysis (Thompson & Borrero, 2011). 

One basic form of direct continuous observation, first 
described by Bijou, Peterson, and Ault (1968), is called ABC, 
or anecdotal observation, recording. With ABC recording, the 
observer records a descriptive, temporally sequenced account 
of all behaviors of interest and the antecedent conditions and 
consequences for those behaviors as those events occur in the 
client’s natural environment (Cooper, 1981). This technique pro- 
duces behavioral assessment data that can be used to identify 
potential target behaviors.* 

Anecdotal observation yields an overall description of a 
client’s behavior patterns. This detailed record of the client’s 
behavior within its natural context provides accountability to 
the individual and to others involved in the behavior change plan 
and is extremely helpful in designing interventions. 

Accurately describing behavioral episodes as they occur 
in real time is aided by using a form to record relevant anteced- 
ents, behaviors, and consequences in temporal sequence. 
For example, Lo (2003) used the form shown in Figure 3.6 
to record anecdotal observations of a fourth-grade special 
education student whose teacher had complained that the boy’s 
frequent talk-outs and out-of-seat behavior were impairing his 
learning and often disrupted the entire class. ABC observa- 
tions can also be recorded on a checklist of specific anteced- 
ents, behaviors, and consequent events individually created for 
the client based on information from interviews and/or initial 
observations (see Figure 27.3). 

ABC recording requires the observer to commit full 
attention to the person being observed for upwards of 20 to 30 
minutes. Hence, a classroom teacher, for example, could not 
easily use this assessment procedure while engaging in other 
activities, such as managing a reading group, demonstrating 


a mathematics problem on the chalkboard, or conducting a 
classwide peer tutoring session. Following are some addi- 
tional guidelines and suggestions for conducting anecdotal 
direct observations: 


* Record everything the client does and says and everything 
that happens to the client. 

¢ Use homemade shorthand or abbreviations to make record- 

ing more efficient, but be sure the notes can be and are accu- 

rately expanded immediately after the observation session. 

Record only actions that are seen or heard, not interpreta- 

tions of those actions. 

* Record the temporal sequence of each response of inter- 
est by writing down what happened just before and just 
after it. 

¢ Record the estimated duration of each instance of the 
client’s behavior. Mark the beginning and ending time of 
each behavioral episode. 

¢ Be aware that continuous anecdotal observation is often 
an obtrusive recording method. Most people behave 
differently when they see someone with a pencil and 
clipboard staring at them. Knowing this, observers should 
be as unobtrusive as possible (e.g., stay a reasonable dis- 
tance away from the subject, avoid eye contact). 

¢ Carry out the observations over a period of several days so 
that the novelty of having someone observe the client will 
lessen and the repeated observations can produce a valid 
picture of day-to-day behavior. 


Ecological Assessment 


Behavior analysts understand that human behavior is a function 
of multiple events and that many events have multiple effects 
on behavior (cf. Michael, 1995). An ecological approach to 
assessment recognizes the complex interrelationships between 
environment and behavior. In an ecological assessment a 
great deal of information is gathered about the person and the 
various environments in which that person lives and works. 
Among the many factors that can affect a person’s behavior are 
physiological conditions, physical aspects of the environment 
(e.g., lighting, seating arrangements, noise level), interactions 
with others, home environment, and past reinforcement history. 
Each of these factors represents a potential area for assessment. 

Although a thorough ecological assessment will provide 
a tremendous amount of descriptive data, the basic purpose of 
assessment—to identify the most pressing behavior problem and 
possible ways to alleviate it—should not be forgotten. It is easy 
to go overboard with the ecological approach, gathering far more 
information than necessary. Ecological assessment can be costly 
in terms of professional and client time, and it may raise ethical 
and perhaps legal questions regarding confidentiality (Koocher 
& Keith-Spiegel, 1998). Ultimately, good judgment must be used 
in determining how much assessment information is necessary. 
Writing about the role of ecological assessment for special edu- 
cation teachers, Heron and Heward (1988) suggested that 


The key to using an ecological assessment is to know 
when to use it... . Educators must strive to become 
keen discriminators of: (1) situations in which a 
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Figure 3.6 Example of an anecdotal ABC recording form. 


Student: Student 4 
Observer: Experimenter 


Time 
2:40 


Date: 3/10/03 
Starting time: 2:40 P.M. 


Antecedents (A) 


T tells students to work quietly 
on their math worksheets 


Behavior (B) 


Walks around the room and 
looks at other students 


Sits down, makes funny noises 
with mouth 


Says to the peer, “What? Me?” 
Stops making noises 


Setting: SED resource room (math period) 
Ending time: 3:00 P.M. 


Consequences (C) 


T says, “Everyone is working, 
but you. | don’t need to tell 
you what to do.” 


A female peer says: “Would 
you please be quiet?” 


Peer continues to work 


Math worksheet Sits in his seat and works quietly | No one pays attention to him 


Math worksheet Pounds on desk with hands 


Math worksheet Yells-out T’s name three times 
and walks to her with his 


worksheet 


Everyone is working quietly Gets up and leaves his seat 


Sits down and works 


Everyone is working quietly Gets up and talks to a peer 
Comes back to his seat and works 


Math worksheet, no one is 
attending to him 


Hand grabs a male peer and asks 
him to help on the worksheet 


SED aide asks him to stop 
Ignored by others 
T helps him with the questions 


T asks him to sit down 
and work 


Ignored by others 


T asks him to sit down 
and work 


Ignored by others 


Peer refuses 


Asks another male peer to help 
him 
Tells T he’s finished the work and 
it’s his turn to work on computer 


Whines about why it’s 
not his turn 


Stands behind a peer who is 
playing computer and watches 
him play the game 


Peer helps him 


T asks him to turn in his work 
and tells him that it’s not his 
turn on the computer 


T explains to him that other 
students are still working on 
the computer and he needs to 
find a book to read 


Ignored by T 


—— 


Adapted from Functional Assessment and Individualized Intervention Plans: Increasing the Behavior Adjustment of Urban Learners 
in General and Special Education Settings (p. 317) by Y. Lo. Unpublished doctoral dissertation. Columbus, OH: The Ohio State 
University. Used by permission. 


planned intervention has the potential for affecting Reactive Effects of Assessment 
student behaviors other than the behavior of concern; 
and (2) situations in which an intervention, estimated 
to be effective if the target behavior were viewed in 

isolation, may be ineffective because other ecological 


variables come into play. (p. 231) 


vation is obtrusive 


77 


Reactivity refers to the effects that an assessment has on the 
behavior being assessed. Reactivity is most likely when obser- 
that is, the person being observed is 
aware of the observer’s presence and purpose (Kazdin, 2010). 
Studies have demonstrated that the presence of observers in 
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applied settings can influence a subject’s behavior (Mercatoris 
& Craighead, 1974; White, 1977). Perhaps the most obtrusive 
assessment procedures are those that require the participant 
to monitor and record her own behavior. Research on self- 
monitoring shows that the procedure commonly affects the 
behavior under assessment (Kirby, Fowler, & Baer, 1991).> 

Research suggests, however, that even when the pres- 
ence of an observer alters the behavior of the person being 
observed, the reactive effects are temporary (e.g., Kazdin, 
1982). Nevertheless, behavior analysts mitigate reactivity 
when they make observations as unobtrusive as possible, 
repeat observations until apparent reactive effects subside, and 
take possible reactive effects into account when interpreting 
the results of observations. 


Choosing an Assessment Method 


In addition to selecting assessment methods and tools for iden- 
tifying target behaviors, behavior analysts conduct assessments 
to guide the design of effective interventions. These require 
the experimental manipulation of variables and are described 
elsewhere in this text: preference and reinforcer assessment 
(Chapter 11), punisher assessment (Chapter 14), and functional 
behavior analysis (Chapter 27). 

Given that multiple possibilities exist across assessment 
methods, how might a behavior analyst proceed when trying to 
weigh the cost-benefit of delaying an intervention in favor of gath- 
ering more “assessment” data versus launching an intervention 
prematurely? Kelley and colleagues (2011) provide a guideline: 


Ultimately, best practice is likely to include an assessment 
package that contains a structured interview, at least one 
rating scale, direct observation, and experimental manipu- 
lation of environmental variables. A combination of these 
assessment methods is likely to produce . . . a desirable 
outcome. (p. 189) 


We restate a key point: Assuming the assessment tools 
have reasonable reliability and validity, and are conducted 
according to professional standards, the most important element 
of the assessment process is not so much the gathering of the 
data as it is the analysis of that data. If the analysis should prove 
not to produce a desirable outcome as Kelley et al. (2011) proj- 
ect, practitioners should not be dismayed. Instead, they should 
try again “based on the original failure and previous ineffective 
analysis. ... Failure is always informative in the logic of behav- 
ior analysis” (Baer, 2005, p. 8). 


ASSESSING THE SOCIAL SIGNIFICANCE 
OF POTENTIAL TARGET BEHAVIORS 


In the past, few questions were asked when a teacher, therapist, 
or other human services professional determined that a client’s 
behavior should be assessed. For a well-trained behavior analyst, 
this is not the case. Because behavior analysts possess an 
effective technology to change behavior, accountability must be 
served. Both the goals and the rationale supporting assessment 
and intervention programs must be open to critical examination 
by the consumers (i.e., clients and their families) and by others 


who may be affected (i.e., society). In selecting target behaviors, 
practitioners should first consider whose behavior is being 
assessed—and changed—and why. 

Target behaviors should not be selected solely for the pri- 
mary benefit of others (e.g., “Be still, be quiet, be docile,” in Winett 
& Winkler, 1972), to simply maintain the status quo (Budd & Baer, 
1976; Holland, 1978), or because they pique the interest of some- 
one in a position to change the behaviors. For instance, assume that 
a newly hired director of an early childhood developmental day- 
care center announces an annual goal that all the children learn to 
talk, when in fact, it would be a more appropriate option to assess 
(and later intervene) on a fuller range of communication options 
(e.g., signing, picture exchange communication). Selecting “speak- 
ing” as a target goal for all children stands in stark contrast to 
the more fundamental and laudable goal of providing specialized 
services to meet the individual needs of the children. 

Admittedly, judgments about which behaviors to change 
are difficult to make. Still, practitioners are not without direc- 
tion when choosing target behaviors. Numerous authors have 
suggested guidelines and criteria for selecting target behaviors 
(e.g., Bailey & Lessen, 1984; Bosch & Fuqua, 2001; Hawkins, 
1984; Komaki, 1998; Rosales-Ruiz & Baer, 1997). Overall, 
these guidelines revolve around this central question: To what 
extent will the proposed behavior change improve the person’s 
life experience? A key answer to this question rests squarely on 
the concept of habilitation. 


Habilitation Defined 


Hawkins (1984) suggested that the potential meaningfulness of 
any behavior change should be considered within the context 
of habilitation, which he defined as “the degree to which the 
person’s repertoire maximizes short- and long-term reinforcers 
for that individual and for others, and minimizes short- and long- 
term punishers.” (p. 284) 

Hawkins (1986) cited several advantages of the definition 
in that it (a) is conceptually familiar to behavior analysts, (b) 
defines treatment using measurable outcomes, (c) is applicable 
to a wide range of habilitative activities, (d) deals with individual 
and societal needs nonjudgmentally, (e) treats adjustment along 
a non-deficit-driven, adaptive continuum, and (f) is culturally 
and situationally relative. 

It is difficult to make a priori decisions about how a par- 
ticular behavior change will contribute to a person’s overall 
habilitation (adjustment, competence). We-simply do not know 
beforehand how useful or functional a given behavior change will 
prove to be in the future (Baer, 1981, 1982), even when its short- 
term utility can be predicted. Applied behavior analysts, how- 
ever, must place the highest importance on the selection of target 
behaviors that are truly useful and habilitative (Hawkins, 1991). 
In effect, if a potential target behavior meets the habilitation stan- 
dard, then the individual is much more likely to (a) acquire addi- 
tional reinforcers in the future and (b) avoid potential punishers. 

From ethical and pragmatic perspectives, any target 
behavior must benefit the person either directly or indirectly. 
Examining prospective target behaviors according to the 
10 key questions described in the following sections should help 
clarify their relative social significance and habilitative value. 
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Figure 3.7 Worksheet for evaluating the social significance of potential target behaviors. 


Client's/Student’s Name: 


Person Completing Worksheet: 


Rater’s Relationship to Client/Student: 


Behavior: 


Considerations 


Is this behavior likely to produce Yes No 
reinforcement in the client's natural 
environment after intervention ends? 


Is this behavior a necessary prereg- 
uisite for a more complex and func- 
tional skill? 


Will this behavior increase the cli- 
ent’s access to environments where 
other important behaviors can be 
acquired and used? 


Will changing this behavior pre- 
dispose others to interact with the 
client in a more appropriate and sup- 
portive manner? 


Is this behavior a pivotal behavior or Yes No 
behavioral cusp? 


Is this an age-appropriate behavior? Yes No 


lf this behavior is to be reduced or Yes No 
eliminated from the client's reper- 

toire, has an adaptive and functional 

behavior been selected to replace it? 


Does this behavior represent the Yes No 
actual problem/goal, or is it only indi- 
rectly related? 


Is this “just talk,” or is it the real Yes No 
behavior of interest? 


Is the goal itself is not a specific Yes No 
behavior (e.g., losing 20 Ib), will this 
behavior help achieve It? 


Summary notes/comments: 


Not Sure 


Not Sure 


Not Sure 


Not Sure 


Not Sure 


Not Sure 
Not Sure 


Not Sure 


Not Sure 


Not Sure 


Figure 3.7 consolidates these questions in a worksheet that can 
be used to evaluate the social significance of target behaviors. 


Will This Behavior Produce Reinforcement 
in the Client’s Natural Environment After 
Treatment Ends? 


To determine whether a particular target behavior is func- 
tional for the client, the behavior analyst, significant others, 
and, whenever possible, the client should ask whether the 


proposed behavior change will be reinforced in the person’s 
daily life. Ayllon and Azrin (1968) called this the relevance 
of behavior rule; it means that a target behavior should be 
selected only when it can be determined that the behavior is 
likely to produce reinforcement in the person’s natural envi- 
ronment. The likelihood that a new behavior will result in 
reinforcement after the behavior change program is termi- 
nated is the primary determinant of whether the new behavior 
will be maintained, thereby having the possibility of long- 
term benefits for that person. 
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Judging whether occurrences of the target behavior will 
be reinforced in the absence of intervention can also help to 
clarify whether the proposed behavior change is primarily for 
the individual’s benefit or for someone else’s. For instance, 
despite parental wishes or pressure, it would be of little value 
to try to teach math skills to a student with severe develop- 
mental disabilities with pervasive deficits in communication 
and social skills. Teaching communication skills that would 
enable the student to have more effective interactions in her 
current environment should take precedence over skills that 
she might be able to use in the future (e.g., making change in 
the grocery store). Sometimes target behaviors are selected 
appropriately not because of their direct benefit to the person, 
but because of an important indirect benefit. Indirect benefits 
can occur in several different ways, as described by the ques- 
tions that follow. 


Is This Behavior a Necessary Prerequisite 
for a Useful Skill? 


Some behaviors that, in and of themselves, are not important are 
targeted for instruction because they are necessary prerequisites 
to learning other functional behaviors. For example, advances 
in reading research have demonstrated that teaching phonemic 
awareness skills has positive effects on the acquisition of read- 
ing skills (e.g., initial, medial, and final isolated and blended 
sound combinations) (National Reading Panel, 2000).° 


Will This Behavior Increase the Client’s Access to 
Environments Where Other Important Behaviors 
Can Be Learned and Used? 


Hawkins (1986) described the targeting of “access behaviors” 
as a means of producing indirect benefits to clients. For exam- 
ple, students in special education are taught to complete their 
assignments, interact politely and appropriately with the teacher 
and their peers, and follow teacher instructions because these 
behaviors not only will help their personal progress in their cur- 
rent setting but also will likely increase their acceptance and 
adaptation into a general education classroom. 


Will Changing This Behavior Predispose 
Others to Interact with the Client in a More 
Appropriate and Supportive Manner? 


Another type of indirect benefit occurs when a behavior change 
is of primary interest to a significant other in the person’s life. 
The behavior change may enable the significant other to behave 
in a manner more beneficial to the person. For example, sup- 
pose a teacher wants the parents of his students to implement 
a home-based instruction program, believing that the students’ 
language skills would improve considerably if their parents 
spent just 10 minutes per night playing a vocabulary game 
with them. In meeting with one student’s parents, however, the 
teacher realizes that although the parents are also concerned 
about poor language skills, they have other and, in their opinion, 
more pressing needs—the parents want their child to clean her 
room and help with the dinner dishes. Even though the teacher 
believes that straightening up a bedroom and washing dishes 


are not as important to the child’s ultimate welfare as language 
development, these tasks may be important target behaviors if 
a sloppy room and a sink full of dirty dishes impede positive 
parent-child interactions (including playing the teacher’s vocab- 
ulary-building games). In this case, the daily chores might be 
selected as the target behaviors for the direct, immediate benefit 
of the parents, with the expectation that the parents will be more 
likely to help their daughter with school-related activities if they 
are happier with her because she straightens her bedroom and 
helps with the dishes. 


Is This Behavior a Behavioral Cusp or 
a Pivotal Behavior? 


Behavior analysts often use a building block method to develop 
repertoires for clients. For example, in teaching a complex skill 
(e.g., two-digit multiplication), simpler and more easily attain- 
able skills are taught first (e.g., addition, regrouping, single-digit 
multiplication), or with shoe tying, crossing laces, making bows, 
and tying knots are taught systematically. As skill elements are 
mastered, they combine into increasingly complex behaviors. 
At any point along this developmental skill continuum when the 
person performs the skill to criterion, reinforcement follows, and 
the practitioner makes the determination to advance to the next 
skill level. As systematic and methodical as this approach has 
proven to be, behavior analysts are researching ways to improve 
the efficiency of developing new behavior (see Chapters 19 
through 23). Choosing target behaviors that are behavioral cusps 
and pivotal behaviors may improve this effort. 


Behavioral Cusps 


A behavioral cusp occurs when a learner performs a new 
behavior that sets the occasion to access reinforcers that other- 
wise would not have been available. Stated in different terms, 
a behavioral cusp can be claimed when what has been learned 
exposes the individual to an array of contingencies that go 
beyond the reinforcers that would have been delivered had only 
the target behavior, ora series of behaviors, occurred. 

Rosales-Ruiz and Baer (1997) defined a behavioral 
cusp as: 


[A] behavior that has consequences beyond the change 
itself, some of which may be considered important. . . . 
What makes a behavior change a cusp is that it exposes 

the individual’s repertoire to new environments, especially 
new reinforcers and punishers, new contingencies, new 
responses, new stimulus controls, and new communities of 
maintaining or destructive contingencies. When some or all 
of those events happen, the individual’s repertoire expands; 
it encounters a differentially selective maintenance of the 
new as well as some old repertoires, and perhaps that leads 
to some further cusps. (p. 534) 


Rosales-Ruiz and Baer (1997) cited crawling, reading, 
and generalized imitation as examples of behavioral cusp 
behaviors because the occurrence of such behaviors “sud- 
denly open the child’s world to new contingencies that will 
develop many new, important behaviors” (p. 535, italics added 
for emphasis). 


Cusps differ from component or prerequisite behaviors in 
that cusps ultimately occasion new behaviors in novel environ- 
ments that bring the learner under different social, sensory, or 
personal stimulus control (Lerman et al., 2013: Twyman, 2011). 
For an infant, specific arm, head, leg, or positional movements 
would be component behaviors for crawling. Crawling is the 
cusp because it enables the infant to contact new stimuli as 
sources of reinforcement (e.g., toys, parents), which, in turn, 
open a host of contingencies that can shape and select other 
adaptive behaviors. These novel contingencies become avail- 
able for the learner and the significant others in the learner’s 
community. 

Rosales-Ruiz and Baer (1997) distinguish between a 
learner performing a series of tasks in a task analysis and 
receiving the associated reinforcer for completing each step, 
and a behavioral cusp that exposes the learner to a host of new 
contingencies of reinforcement altogether. In plain language, 
a cusp might be akin to tree branches shooting off the trunk, 
and providing the possibility of still more offshoots from that 
“cusp” branch. 

Rosales-Ruiz & Baer (1997) sum up the role of identify- 
ing cusps as target behaviors by stating: 


The importance of cusps is judged by (a) the extent of the 
behavior changes they systematically enable, (b) whether 
they systematically expose behavior to new cusps, and (c) 
the audience’s view of whether these changes are import- 
ant for the organism, which in turn is often controlled by 
societal norms and expectations of what behaviors should 
develop in children and when that development should 
happen. (p. 537) 
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By identifying and assessing target behaviors based on 
their cusp potential, practitioners provide “added value” to their 
programming. Bosch and Hixson (2004) proposed a series of 
actions behavior analysts can take and questions to determine 
behavioral cusps (see Figure 3.8). If the answer is “yes” to the 
majority of these questions in the right-hand column, a good 
case can be made that the behavior is a cusp. 

Finally, Twyman (2011) sheds much-needed light on 
how present or future technologies can assist practitioners with 
developing and refining behavioral cusps. Whether it be through 
websites, smartphones, iPad/tablet applications, social media, 
simulations, virtual learning applications, or access to on-line 
instruction, behavior analysts can further explore programming 
cusp behaviors to enhance the learning of those they serve. 


Pivotal Behavior 


A pivotal behavior is a behavior that, once learned, produces 
corresponding modifications or covariations in other adap- 
tive untrained behaviors. For instance, Koegel, Carter, and 
Koegel (2003) indicated that teaching children with autism to 
“self-initiate” (e.g., approach others) may be a pivotal behav- 
ior. Choice making, self-management, and functional commu- 
nication training may also be considered pivotal behaviors in 
that once these behaviors are learned, they are likely to produce 
adaptive variations in untrained settings. 

Koegel and Koegel (2018) and their colleagues have 
examined pivotal behavior assessment and treatment approaches 
across a wide range of areas (e.g., social skills, communication, 
academic performance disruptive behavior) (Koegel, Bradshaw, 
Ashbaugh, & Koegel, 2014; Koegel, Koegel, Green-Hopkins, 


Figure 3.8 Actions and questions to determine behavioral cusps. 


Information Needed 


List the direct reinforcers for that target behavior: Social, 
automatic, conditioned, unconditioned. 


List and describe the environments that the new target 
behavior will access. 


List school personnel or familial beliefs and expectations that 


may strengthen or weaken the value of the behavior change. 


List the audience reactions, such as smiles, laughter, etc. 


List the inappropriate behaviors and their severity, including 
frequency, duration, strength, etc. and the effects on the 
environment. 


List the behaviors/repertoires subsequently affected. 


Approximate number of people physically or emotionally 
affected over time. 


Give a dollar amount by estimating the cost of physical/ 
property damage, health care, lawsuits, etc. over time. 


Behavioral Cusp Determination 


Will the behavior contact new reinforcers? 


Will the behavior allow access to new selective 
environments? 


Does the behavior meet the demands of the social 
community of which the person is a member? 


Is the behavior a reinforcer for the audience? 


ls the behavior interfering with or replacing inappropriate 
behaviors? 


ls the behavior facilitating subsequent learning by being 
either a prerequisite or a component of more complex 
responses? 


Is the behavior impacting a large number of people? 


Is the failure to establish this behavior costly? 


From “The Final Piece to a Complete Science of Behavior: Behavior Development and Behavioral Cusps,” by S. Bosch and M. D. Hixson, 
The Behavior Analyst Today, 5(2), 2004, pp. 244-254. 
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& Barnes, 2010; Koegel, Koegel, & Schreibman, 1991). The 
“longitudinal outcome data from children with autism suggest 
that the presence of pivotal initiation behaviors may be a prog- 
nostic indicator of more favorable long-term outcomes and 
therefore may be ‘pivotal’ in that they appear to result in wide- 
spread positive changes in a number of areas” (Koegel, Carter, 
& Koegel, 2003, p. 134). Improvement in self-initiations may 
be critical for the emergence of untrained response classes, such 
as asking questions and increased production and diversity of 
talking. 

Assessing and targeting pivotal behaviors can be advanta- 
geous for the practitioner and the client. From the practitioner’s 
perspective, it might be possible to assess and then train pivotal 
behaviors within relatively few sessions that would later be emit- 
ted in untrained settings or across untrained responses (Koegel 
et al., 2003). From the client’s perspective, learning a pivotal 
behavior would shorten intervention, provide the person with 
a new repertoire with which to interact with his environment, 
improve the efficiency of learning, and increase the chances of 
coming into contact with reinforcers. As Koegel and colleagues 
concluded: “The use of procedures that teach the child with dis- 
abilities to evoke language learning opportunities in the natural 
environment may be particularly useful for speech and language 
specialists or other special educators who desire ongoing learn- 
ing outside of language teaching sessions” (p. 143). 


Is This an Age-Appropriate Behavior? 


A number of years ago it was common to see adults with 
developmental disabilities being taught behaviors that an adult 
without disabilities would seldom, if ever, do. It was thought— 
perhaps as a by-product of the concept of mental age—that 
a 35-year-old woman with the verbal skills of a 10-year-old 
should play with dolls. Not only is the selection of such target 
behaviors demeaning, but their occurrence reduces the proba- 
bility that other people in the person’s environment will set the 
occasion for and reinforce more desirable, adaptive behaviors, 
leading to a more normal and rewarding life. 

The principle of normalization refers to the use of pro- 
gressively more typical environments, expectations, and proce- 
dures “‘to establish and/or maintain personal behaviors which are 
as culturally normal as possible” (Wolfensberger, 1972, p. 28). 
Normalization is not a single technique, but a philosophical 
position that holds the goal of achieving the greatest possible 
physical and social integration of people with disabilities into 
the mainstream of society. 

In addition to the philosophical and ethical reasons for 
selecting age- and setting-appropriate target behaviors, it should 
be re-emphasized that adaptive, independent, and social behay- 
iors that come into contact with reinforcement are more likely 
to be maintained than are behaviors that do not. For example, 
instruction in leisure-time skills such as sports, hobbies, and 
music-related activities would be more functional for a 17-year- 
old boy than teaching him to play with toy trucks and building 
blocks. An adolescent with those behaviors—even in an adapted 
way—has a better chance of interacting in a typical fashion with 
his peer group, which may help to ensure the maintenance of his 
newly learned skills and provide opportunities for learning other 
adaptive behaviors. 


If the Proposed Target Behavior Is to Be 
Reduced or Eliminated, What Adaptive 
Behavior Will Replace It? 


A practitioner should never plan to reduce or eliminate a behav- 
ior from a person’s repertoire without first (a) determining an 
adaptive behavior that will take its place (also known as the fair 
pair rule) and (b) designing the intervention plan to ensure that 
the replacement behavior is learned. Teachers and other human 
services professionals should be in the business of building pos- 
itive, adaptive repertoires, not merely reacting to and eliminat- 
ing behaviors they find troublesome (Brown et al., 2016). Even 
though a child’s maladaptive behaviors may be exceedingly 
annoying to others, or even damaging physically, those unde- 
sirable responses have proven functional for the child. That is, 
the maladaptive behavior has worked for the child in the past by 
producing reinforcers and/or helping the child avoid or escape 
punishers. A program that only denies that avenue of reinforce- 
ment is a nonconstructive approach. It does not teach adaptive 
behaviors to replace the inappropriate behavior. 

Some of the most effective and recommended methods for 
eliminating unwanted behavior focus primarily on the develop- 
ment of desirable replacement behaviors. Goldiamond (1997) rec- 
ommended that a “constructional” approach—as opposed to an 
eliminative approach—be used for the analysis of and intervention 
into behavioral problems. Under the constructional approach the 
“solution to problems is the construction of repertoires (or their 
reinstatement or transfer to new situations) rather than the elim- 
ination of repertoires” (Goldiamond, 1974, p. 14). Finally, using 
the “fair pair’ rule helps to ensure that maladaptive behaviors are 
replaced with incompatible, adaptive behaviors. 

If a strong case cannot be made for specific, positive 
replacement behaviors, then a compelling case has not been 
made for eliminating the undesirable target behavior. The class- 
room teacher, for example, who wants a behavior change pro- 
gram to maintain students staying in their seats during reading 
period must go beyond the simple notion that “they need to be 
in their seats to do the work.” The teacher must select materials 
and design contingencies that facilitate that goal and motivate 
the students to accomplish their work. 


Does This Behavior Represent the Actual 
Problem or Goal, or Is It Only Indirectly Related? 


An all-too-common error in education is teaching a related 
behavior, not the behavior of interest. Numerous behav- 
ior change programs have been designed to increase on-task 
behaviors when the primary objective should have been to 
increase production or work output. On-task behaviors are cho- 
sen because people who are productive also tend to be on task. 
However, as on task is typically defined, it is possible for a 
student to be on task (e.g., in her seat, quiet, and oriented toward 
or handling academic materials) yet produce little or no work. 
Targeting needed prerequisite skills should not be con- 
fused with selecting target behaviors that do not directly rep- 
resent or fulfill the primary reasons for the behavior analysis 
effort. Prerequisite skills are not taught as terminal behaviors for 
their own sake, but as necessary elements of the desired terminal 
behavior. Related, but indirect, behaviors are not necessary to 
perform the true objective of the program, nor are they really 


intended outcomes of the program by themselves. In attempting 
to detect indirectness, behavior analysts should ask two ques- 
tions: Is this behavior a necessary prerequisite to the intended 
terminal behavior? Is this behavior what the instructional pro- 
gram is really all about? If either question can be answered affir- 
matively, the behavior is eligible for target behavior status. 


Is This Just Talk, or Is It the Real Behavior 
of Interest? 


A characteristic of nonbehavioral therapies is that they rely 
heavily on what people say about what they do and why they do 
it. The client’s verbal behavior is considered important because 
it is believed to reflect the client’s inner state and the mental 
processes that govern the client’s behavior. Therefore, getting a 
person to talk differently about himself (e.g., in a more healthful, 
positive, and less self-effacing way) is viewed as a significant 
step in solving the person’s problem. This change in attitude is 
considered by some to be the primary goal of therapy. 

Behavior analysts, in contrast, distinguish between what 
people say and what they do (Skinner, 1953). Knowing and 
doing are not the same. Getting someone to understand his 
maladaptive behavior by being able to talk logically about it 
does not necessarily mean that his behavior will change in more 
constructive directions. The gambler may know that compulsive 
betting is ruining his life and that his losses would cease if he 
simply stopped placing bets. He may even be able to verbalize 
these facts to a therapist and state convincingly that he will not 
gamble in the future. Still, he may continue to bet. 

Because verbal behavior can be descriptive of what people 
do, it is sometimes confused with the performance itself. A teacher 
at a school for juvenile offenders introduced a new math program 
that included instructional games, group drills, timed tests, and 
self-graphing. The students responded with negative comments: 
“This is stupid,” “Man, I’m not writin’ down what I do,” “I’m 
not even going to try on these tests.” If the teacher had attended 
only to the students’ talk about the program, it would probably 
have been discarded on the first day. But the teacher was aware 
that negative comments about school and work were expected in 
the peer group of adolescent delinquents and that her students’ 
negative remarks had enabled them in the past to avoid tasks they 
thought they would not enjoy. Consequently, the teacher ignored 
the negative comments and attended to and rewarded her students 
for accuracy and rate of math computation when they participated 
in the program. In 1 week, the negative talk had virtually ceased, 
and the students’ math production was at an all-time high. 

There are, of course, situations in which the behavior of 
interest is what the client says. Helping a person reduce the 
number of self-effacing comments he makes and increase the 
frequency of positive self-descriptions is an example of a pro- 
gram in which talk should be the target behavior—not because 
the self-effacing comments are indicative of a poor self-concept, 
but because the client’s verbal behavior is the problem. 

In every case, a determination must be made of exactly which 
behavior is the desired functional outcome of the program: Is it 
a skill or motor performance, or is it verbal behavior? In some 
instances, doing and talking behaviors might be important. A trainee 
applying for a lawn mower repair position may be more likely to get 
a job if he describes verbally how he would fix a cranky starter on 
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a mower. However, once hired, he will hold his job if he is skilled 
and efficient in repairing mowers. It is highly unlikely that a person 
will last long on the job if he talks about how he would fix a lawn 
mower but not be able to do so. Target behaviors must be functional. 


What if the Goal of the Behavior Change 
Program Is Not a Behavior? 


Some of the important changes people want to make in their 
lives are not behaviors, but are instead the result or product of 
certain other behaviors. Weight loss is an example. On the sur- 
face, it appears that target behavior selection is obvious and 
straightforward—losing weight. The number of pounds can 
be measured accurately; but weight, or, more precisely, los- 
ing weight, is not a behavior. Losing weight is not a specific 
response that can be defined and performed; it is the product or 
result of other behaviors—notably reduced food consumption 
and/or increased exercise. Eating and exercise are definable, 
observable behaviors that can be measured in precise units. 

Some otherwise well-designed weight loss programs have 
not been successful because behavior change contingencies were 
placed on the goal—reduced weight—and not on the behaviors 
necessary to produce the goal. Target behaviors in a weight loss 
program should be measures of food consumption and exer- 
cise level, with intervention strategies designed to address those 
behaviors (e.g., De Luca & Holborn, 1992; McGuire, Wing, 
Klem, & Hill, 1999). Weight should be measured and charted 
during a weight loss program, not because it is the target behay- 
ior of interest, but because weight loss shows the positive effects 
of increased exercise or decreased food consumption. 

There are numerous other examples of important goals that 
are not behaviors, but are the end products of other behaviors. Earn- 
ing higher grades, for example, is a goal accomplished by practicing 
skills longer, complying with feedback on practiced performance, 
joining groups that perform even higher than the learner and imi- 
tating that performance, and so forth. Behavior analysts help clients 
achieve their goals by selecting target behaviors that are the most 
directly and functionally related to those goals. 

Some goals expressed by and for clients are not the direct 
product of a specific target behavior, but are broader, more gen- 
eral goals: To be more successful, to have more friends, to be 
creative, to learn good sportsmanship, to develop an improved 
self-concept. Clearly, none of these goals are defined by specific 
behaviors, and all are more complex in terms of their behavioral 
components than losing weight or earning a higher grade. Goals 
such as being successful represent a class of related behaviors 
or a general pattern of responding. They are labels that are 
used to describe people who behave in certain ways. Selecting 
target behaviors that will help clients or students attain these 
kinds of goals is even more difficult than their complexity sug- 
gests because the goals themselves often mean different things 
to different people. Being a success entails a wide variety of 
behaviors. One person may view success in terms of income 
and job title. For another, success means job satisfaction and 
ample leisure time. An important role of the behavior analyst 
during assessment and target behavior identification is to help 
the client select and define personal behaviors, the sum of which 
will result in a better quality of life for the client and others 
evaluating her repertoire in the intended fashion. 
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PRIORITIZING TARGET BEHAVIORS 


Once a “pool” of eligible target behaviors has been identified, deci- 
sions must be made about their relative priority. Sometimes the 
information obtained from behavioral assessment points to one 
particular aspect of the person’s repertoire in need of improvement 
more so than another. More often, though, assessment reveals a 
constellation of related, and sometimes not-so-related, behaviors in 
need of change. Direct observations, along with a behavioral inter- 
view and needs assessment, may produce a long list of important 
behaviors to change. When more than one eligible target behavior 
remains after careful evaluation of the considerations described 
in the previous section, the question becomes, Which behavior 
should be changed first? Judging each potential target behavior in 
light of the following nine questions may help determine which 
behavior deserves attention first, and the relative order in which 
the remaining behaviors will be addressed. 


1. Does this behavior pose any danger to the client or to 
others? Behaviors that cause harm or pose a serious threat 
to the client’s or to others’ personal safety or health must 
receive first priority. 


oe 


How many opportunities will the person have to use this 
new behavior? or How often does this problem behavior 
occur? A student who consistently writes reversed let- 
ters presents more of a problem than does a child who 
reverses letters only occasionally. If the choice is between 
first teaching a prevocational student to pack his lunch or 
to learn how to plan his 2-week vacation each year, the 
former skill takes precedence because the employee-to-be 
may need to pack his lunch every workday. 


oe 


How long-standing is the problem or skill deficit? 
A chronic behavior problem (e.g., bullying) or skill deficit 
(e.g., lack of social interaction skills) should take prece- 
dence over problems that appear sporadically or that have 
just recently surfaced. 


= 


Will changing this behavior produce higher rates of rein- 
forcement for the person? If all other considerations are 
equal, a behavior that results in higher, sustained levels 
of reinforcement should take precedence over a behavior 
that produces little additional reinforcement for the client. 


5. What will be the relative importance of this target behavior 
to future skill development and independent functioning ? 
Each target behavior should be judged in terms of its rela- 
tion (i.e., prerequisite or supportive) to other critical behav- 
iors needed for optimal learning and development and 
maximum levels of independent functioning in the future. 


a 


Will changing this behavior reduce negative or unwanted 
attention from others? Some behaviors are not maladap- 
tive because of anything inherent in the behavior itself, but 
because of the unnecessary problems the behavior causes 
the client. Some people with developmental and motoric 
disabilities may have difficulty at mealtimes with using 
utensils and napkins appropriately, thus reducing opportu- 
nities for positive interaction in public. Granted, public edu- 
cation and awareness are warranted as well, but it would be 
naive not to consider the negative effects of public reaction. 
Also, not teaching more appropriate mealtime skills may be 
a disservice to the person. Idiosyncratic public displays or 


mannerisms may be high-priority target behaviors if their 
modification is likely to provide access to more normalized 
settings or important learning environments. 


. Will this new behavior produce reinforcement for signifi- 


cant others? Even though a person’s behavior should sel- 
dom, if ever, be changed simply for the convenience of 
others or for maintaining the status quo, neither should 
the effect of a person’s behavior change on the significant 
others in his life be overlooked. This question is answered 
best by the significant others themselves because people 
not directly involved in the person’s life would often have 


no idea how rewarding it is to see your 19-year-old 
with severe intellectual disabilities acquire the skill of 
toilet flushing on command or pointing to food when 
she wants a second helping. I suspect that the average 
taxpayer would not consider it “meaningful” to him 

or her for Karrie to acquire such skills. And, although 
we cannot readily say how much Karrie’s being able 
to flush the toilet enhances her personal reinforcement/ 
punishment ratio, I can testify that it enhances mine as 
a parent. (Hawkins, 1984, p. 285) 


How likely is success in changing this target behavior? Some 
behaviors are more difficult to change than others. At least 
four sources of information help assess the level of difficulty 
or, more precisely, predict the ease or degree of success in 
changing a particular behavior. First, what does the literature 
say about attempts to change this behavior? Many of the tar- 
get behaviors that confront applied behavior analysts have 
been studied. Practitioners should stay abreast of published 
research reports in their areas of application. Not only is such 
knowledge likely to improve the selection of proven and effi- 
cient techniques for behavior change, but also it may help to 
predict the level of difficulty or chance of success. 

Second, how experienced is the practitioner? The 
practitioner’s own competencies and experiences with 
the target behavior in question should be considered. A 
teacher who has worked successfully with acting-out, 
aggressive children may have an array of effective behav- 
ior management strategies to employ. However, that same 
teacher may believe that he is less able to improve signifi- 
cant deficits in a student’s written language skills. 

Third, to what extent can important variables in the cli- 
ent’s environment be controlled effectively? Whether a cer- 
tain behavior can be changed is not the question. In an applied 
setting, however, identifying and then consistently manipu- 
lating the controlling variables for a given target behavior will 
determine whether the behavior will be changed. 

Fourth, are the resources available to implement 
and maintain the intervention at a level of fidelity and 
intensity that is long enough to achieve the desired out- 
comes? No matter how expertly designed a treatment plan, 
implementing it without the personnel and other resources 
needed to carry out the intervention properly is likely to 
yield disappointing results. 


How much will it cost to change this behavior? Cost should 
be considered before implementing any systematic behay- 
ior change program. However, a cost-benefit analysis of 
several potential target behaviors does not mean that if 
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a teaching program is expensive, it should not be imple- consumes so much of the child’s day that there is little time 
mented. Major courts have ruled that the lack of public for her to learn communication, leisure, or self-help skills, 
funds may not be used as an excuse for not providing an the fine motor skill objective may be too costly. 


appropriate education to all children regardless of the sever- 
ity of their disability (cf., Yell & Drasgow, 2000). The cost 
of a behavior change program cannot be determined by 
simply adding aggregate expenditures for equipment, mate- 


Developing and Using a Target Behavior 
Ranking Matrix 


rials, transportation, staff salaries, and the like. Consider- | ASSigning a numerical rating to each of a list of potential target 
ation should also be given to how much of the client’s time | behaviors produces a priority ranking of those behaviors. One 
the behavior change program will demand. If, for example, such ranking matrix is shown in Figure 3.9; it is an adaptation of 


teaching a fine motor skill to a child with severe disabilities 4 System described by Dardig and Heward (1981) for prioritizing 
Figure 3.9 Worksheet for prioritizing potential target behaviors. 


Client's/Student’s Name: 


Person Completing Worksheet: 


Rater’s Relationship to Client/Student: 


Directions: Use the key below to rank each potential target behavior by the extent to which it meets or 
fulfills each prioritizations criteria. Add each team member's ranking of each potential target behavior. 
The behavior(s) with the highest total scores would presumably be the highest priority for intervention. 
Other criteria relevant to a particular program or individual's situation can be added, and the criteria can 
be differentially weighted. 


Key: 0 = No/Never; 1 = Rarely; 2 = Maybe/Sometimes; 3 = Probably/Usually; 4 = Yes/Always 


Potential Target Behaviors 
C1) RRR, FO.(2) oh Sh OYA) Bas SEL, 


Prioritization Criteria 


Does this behavior pose dan- 
ger to the individual or to 
others? 


How many opportunities will 
the individual have to use this 
new skill in the natural envi- 
ronment? or How often does 
problem behavior occur? 


How long-standing is the 
problem or skill deficit? 


Will changing this behavior 
produce a higher rate of rein- 
forcement for the individual? 


What is the relative impor- 
tance of this target behavior 
to future skill development 
and independent functioning? 


Will changing this behavior 
reduce negative or unwanted 
attention from others? 


Will changing this behavior 
produce reinforcement for 
significant others? 


How likely it success in chang- 
ing this behavior? 


How much will it cost to @ be dA! 
change this behavior? 
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and selecting learning goals for students with disabilities. Each 
behavior is given a number representing the behavior’s value on 
each of the prioritizing variables (e.g., 0 to 4, with 0 representing 
no value or contribution and 4 representing maximum value or 
benefit). : 

Professionals planning behavior change programs for 
senior citizens would probably insist that target behaviors with 
immediate benefits receive high priority. Educators serving 
secondary students with disabilities would likely advocate for 
behaviors that focus on skill development and independent 
functioning. 

At times, the behavior analyst, the client, and/or signif- 
icant others have conflicting goals. Parents may want their 
adolescent daughter in the house by 10:30 pm on weekends, 
but the daughter may want to stay out until midnight. The 
school may want a behavior analyst to develop a program 
to increase students’ adherence to an unpopular dress code. 
The behavior analyst may believe that elements of the dress 
codes are negotiable, but not to the point that the adminis- 
tration wants the code enforced. Who decides what is best 
for whom? 

One way to minimize conflicts is to obtain client, parent, 
and staff/administration participation in the goal determination 
process. For example, the active participation of parents and 
students, when possible, in the selection of short- and long-term 
goals is desirable. Such participation by all of the significant 
parties can avoid and resolve goal conflicts, not to mention 
the invaluable information the participants can provide rela- 
tive to other aspects of program planning (e.g., identification of 
likely reinforcers). Reviewing the results of assessment efforts 
and allowing each participant to provide input on the relative 
merits of each proposed goal or target behavior can often pro- 
duce consensus on the best direction. Program planners should 
not commit a priori that whatever behavior is ranked first will 
necessarily be considered the highest priority target behavior. 
However, if the important people involved in a person’s life go 
through a ranking process such as the one shown in Figure 3.9, 
they are likely to identify areas of agreement and disagreement, 
which leads to further discussions of target behavior selection 
and concentration on the critical concerns of those involved. 


DEFINING TARGET BEHAVIORS 


Before a behavior undergoes analysis, it must be defined in 
a clear, objective, and concise manner. In constructing target 
behavior definitions, applied behavior analysts must consider the 
functional and topographical implications of their definitions. 


Role and Importance of Target Behavior 
Definitions in Applied Behavior Analysis 


Applied behavior analysis derives its validity from its systematic 
approach to seeking and organizing knowledge about human 
behavior. Validity of scientific knowledge in its most basic 
form implies replication. When predicted behavioral effects 
can be reproduced, principles of behavior are confirmed and 
methods of practice developed. If applied behavior analysts 
employ definitions of behavior not available to other scientists, 


replication is less likely. Without replication, the usefulness 
or meaningfulness of data cannot be determined beyond the 
specific participants themselves, thereby limiting the orderly 
development of the discipline as a useful technology (Baer, 
Wolf, & Risley, 1968). Without explicit, well-written definitions 
of target behaviors, researchers would be unable to accurately 
and reliably measure the same response classes within and across 
studies; or to aggregate, compare, and interpret their data.’ 

Explicit, well-written definitions of target behavior are 
also necessary for the practitioner, who may not be so con- 
cerned with replication by others or development of the field. 
Most behavior analysis programs are not conducted primar- 
ily for the advancement of the field; they are implemented by 
educators, clinicians, and other human services professionals 
to improve the lives of their clients. However, implicit in the 
application of behavior analysis is an accurate, ongoing eval- 
uation of the target behavior, for which an explicit definition 
of behavior is a must. 

A practitioner concerned only with evaluating his efforts 
to provide optimum service to his clients might ask, “As long as 
I know what I mean by [name of target behavior], why must I 
write down a specific definition?” First, a good behavioral defini- 
tion is operational. It provides the opportunity to obtain complete 
information about the behavior’s occurrence and nonoccurrence, 
and it enables the practitioner to apply procedures in a consis- 
tently accurate and timely fashion. Second, a good definition 
increases the likelihood of an accurate and believable evaluation 
of the program’s effectiveness. Not only does an evaluation need 
to be accurate to guide ongoing program decisions, but also the 
data must be believable to those with a vested interest in the pro- 
gram’s effectiveness. Thus, even though the practitioner may not 
be interested in demonstrating an analysis to the field at large, he 
must always be concerned with demonstrating effectiveness (i.e., 
accountability) to clients, parents, and administrators. 


Two Types of Target Behavior Definitions 


Target behaviors can be defined functionally or topographically. 


Function-based Definitions 


A function-based definition designates responses as members 
of the targeted response class solely by their common effect on 
the environment. For example, Irvin, Thompson, Turner, and 
Williams (1998) defined hand mouthing as any behavior that 
resulted in “contact of the fingers, hand, or wrist with the mouth, 
lips, or tongue” (p. 377). Figure 3.10 shows several examples of 
function-based definitions. 

Applied behavior analysts should use function-based defi- 
nitions of target behaviors whenever possible for the following 
reasons: 


* A function-based definition encompasses all relevant 
forms of the response class. However, target behavior 
definitions based on a list of specific topographies might 
omit some relevant members of the response class and/ 
or include irrelevant response topographies. For exam- 
ple, defining children’s offers to play with peers in terms 
of specific things the children say and do might omit 
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Figure 3.10 Function-based definitions of various target behaviors. 


Creativity In Children’s Blockbuilding 
The child behaviors of blockbuilding were defined according to their products, block forms. 
The researchers created a list of 20 arbitrary, but frequently seen forms, including: 


Arch—any placement of a block atop two lower blocks not in contiguity. 


Ramp—a block leaned against another, or a triangular block placed contiguous to 
another, to simulate a ramp. 


Story—two or more blocks placed one atop another, the upper block(s) resting solely 
upon the lower. 


Tower—any story of two or more blocks in which the lowest block is at least twice as 
tall as it is wide. (Goetz & Baer, 1973, pp. 210-211). 


Exercise by Obese Boys 

Riding a stationary bicycle—each wheel revolution constituted a response, which was auto- 
matically recorded by magnetic counters (DeLuca & Holborn, 1992, p. 672). 

Compliance at Stop Signs by Motorists 


Coming to a complete stop—observers scored a vehicle as coming to a complete stop if the 
tires stopped rolling prior to the vehicle entering the intersection (Van Houten & Retting, 
20017 DiclhS/). 


Recycling by Office Employees 

Recycling office paper—number of pounds and ounces of recyclable office paper found in 
recycling and trash containers. All types of paper accepted as recyclable were identified as 
well as examples of nonrecyclable paper (Brothers, Krantz, & McClannahan, 1994, p. 155). 
Safety Skills to Prevent Gun Play by Children 

Touching the firearm—the child making contact with the firearm with any part of his or her 
body or with any object (e.g., a toy) resulting in the displacement of the firearm. 

Leaving the area—the child removing himself or herself from the room in which the firearm was 
located within 10 seconds of seeing the firearm (Himle, Miltenberger, Flessner, & Gatheridge, 


2004, p. 3). 


responses to which peers respond with reciprocal play 

and/or include behaviors that peers reject. 
¢ The outcome, or function, of behavior is most important. 
This holds true even for target behaviors for which form 
or aesthetics is central to their being valued as socially 
significant. For example, the flowing strokes of the cal- 
ligrapher’s pen and the gymnast’s elegant movements 
during a floor routine are important (i.e., have been 
selected) because of their effects or function on others 
(e.g., praise from the calligraphy teacher, high scores 
from gymnastics judges). 
Functional definitions are often simpler and more concise 
than topography-based definitions, which leads to easier 
and more accurate and reliable measurement and sets the 
occasion for the consistent application of intervention. For 
example, in their study on skill execution by college foot- 
ball players, Ward and Carnes (2002) recorded a correct 
tackle according to the clear and simple definition, “if the 
offensive ball carrier was stopped” (p. 3). 


Function-based definitions can also be used in some sit- 
uations in which the behavior analyst does not have direct and 
reliable access to the natural outcome of the target behavior, or 
cannot use the natural outcome of the target behavior for ethical 


or safety reasons. In such cases, a function-based definition 
by proxy can be considered. For example, the natural outcome 
of elopement (i.e., running or walking away from a caregiver 
without consent) is a lost child. By defining elopement as “any 
movement away from the therapist more than 1.5 m without per- 
mission” (p. 240), Tarbox, Wallace, and Williams (2003) were 
able to measure and treat this socially significant target behavior 
in a safe and meaningful manner. 


Topography-based Definitions 

A topography-based definition identifies instances of the 
target behavior by the shape or form of the behavior. Topog- 
raphy-based definitions should be used when the behavior 
analyst (a) does not have direct, reliable, or easy access to the 
functional outcome of the target behavior and/or (b) cannot 
rely on the function of the behavior because each instance of 
the target behavior does not produce the relevant outcome in 
the natural environment or the outcome might be produced 
by other events. For example, Silvestri (2004) defined and 
measured two classes of positive teacher statements accord- 
ing to the words that made up the statements, not according 
to whether the comments produced specific outcomes (see 
Figure’3.11). 
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Figure 3.11 Topography-based definitions for two types of teacher statements. 


Generic Positive Statements 

Generic positive statements were defined as audible statements by the teacher that referred to one or more student's 
behavior or work products as desirable or commendable (e.g., “I’m proud of you!”, “Great job, everyone.”). State- 
ments made to other adults in the room were recorded if they were loud enough to be heard by the students and 
made direct reference to student behavior or work products (e.g., “Aren‘t you impressed at how quietly my students 
are working today?”). A series of positive comments that specified neither student names nor behaviors with less than 2 
seconds between comments was recorded as one statement. For example, if the teacher said, “Good, good, good. |'’m so 
impressed” when reviewing three or four students’ work, it was recorded as one statement. 


Teacher utterances not recorded as generic positive statements included (a) statements that referred to specific behavior 
or student names. (b) neutral statements indicating only that an academic response was correct (e.g., “Okay”, “Correct”), 
() positive statements not related to student behavior (e.g., saying “Thanks for dropping off my attendance forms” toa 
colleague), and (d) incomprehensible or inaudible statements. 


Behavior-Specific Positive Statements 


Behavior-specific positive statements made explicit reference to an observable behavior (e.g., “Thank you for putting 
your pencil away”). Specific positive statements could refer to general classroom behavior (e.g., “You did a great job 
walking back to your seat quietly”) or academic performance (e.g., “That was a super smart answer!”). To be recorded 
as separate responses, specific positive statements were separated from one another by 2 seconds or by differentiation of 
the behavior praised. In other words, if a teacher named a desirable behavior and then listed multiple students who were 
demonstrating the behavior, this would be recorded as one statement (e.g., “Marissa, Tony, and Mark, you did a great job 
of returning your materials when you were finished with them”). However, a teacher's positive comment noting several 
different behaviors would be recorded as multiple statements regardless of the Interval between the end of one comment 
and the start of the next. For example, “Jade, you did a great job cleaning up so quickly; Charles, thanks for putting the 
workbooks away; and class, | appreciate that you /ined up quietly” would be recorded as three positive statements. 


Adapted from The Effects of Self-Scoring on Teachers’ Positive Statements during Classroom Instruction (pp. 48-49) by S. M. Silvestri. 


Unpublished doctoral dissertation. Columbus, OH: The Ohio State University. Used by permission. 


Topography-based definitions can also be used for target 
behaviors for which the relevant outcome is sometimes pro- 
duced in the natural environment by undesirable variations of 
the response class. For example, because a duffer’s very poor 
swing of a golf club sometimes produces a good outcome (i.e., 
the ball lands on the green), it is better to define a correct swing 
by the position and movement of the golf club and the golfer’s 
feet, hips, head, and hands. 

A topography-based definition should encompass all 
response forms that would typically produce the relevant 
outcome in the natural environment. Although topography 
provides an important element for defining target behaviors, 
the applied behavior analyst must be especially careful not 
to select target behaviors solely on the basis of topography 
(see Box 3.2). 


Writing Target Behavior Definitions 


A good definition of a target behavior provides an accu- 
rate, complete, and concise description of the behavior to 
be changed (and therefore measured). It also states what 
is not included in the behavioral definition. Asking aloud 
to be excused from the dinner table is an observable and 
measurable behavior that can be counted. By comparison, 
“exercising good manners” is not a description of any par- 
ticular behavior; it merely implies a general response class 
of polite and socially acceptable behaviors. Hawkins and 
Dobes (1977) described three characteristics of a good target 


behavior definition that are as valid and useful today as they 
were when published more than 40 years ago: 


1. The definition should be objective, referring only to 
observable characteristics of the behavior (and environ- 
ment, if needed) or translating any inferential terms (such 


as “expressing hostile feelings,” “intended to help,” or 
“showing interest in’’) into more objective ones. 


2. The definition should be clear in that it should be readable 
and unambiguous so that experienced observers could 
read it and readily paraphrase it accurately. 


3. The definition should be complete, delineating the 
“boundaries” of what is to be included as an instance of 
the response and what is to be excluded, thereby directing 
the observers in all situations that are likely to occur and 
leaving little to their judgment. (p. 169) 


Stated succinctly, first, a good definition must be objec- 
tive, ensuring that specific instances of the defined target 
behavior are observed and recorded reliably. An objective 
definition increases the likelihood of an accurate and believ- 
able evaluation of program effectiveness. Second, a clear defi- 
nition is technological, meaning that it enables others to use 
and replicate it (Baer et al., 1968). A clear definition therefore 
becomes operational for present and future purposes. Finally, 
a complete definition discriminates between what is and what 
is not an instance of the target behavior. A complete definition 
allows others to record an occurrence of a target behavior, but 
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How Serious Are These Behavior Problems? 


Suppose you are a behavior analyst in a position to design and 


help implement an intervention to change the following four 
behaviors: 


1. A child repeatedly raises her arm, extending and retract- 
ing her fingers toward her palm in a gripping/releasing 
type of motion. 

2. An adult with developmental disabilities pushes his hand 
hard against his eye, making a fist and rubbing his eye 
rapidly with his knuckles. 

3. Several times each day a high school student rhyth- 
mically drums her fingers up and down, sometimes in 
bursts of 10 to 15 minutes in duration. 

4. A person repeatedly grabs at and squeezes another 
person’s arms and legs so hard that the other person 
winces and says “Ouch!” 


How much of a problem does the behavior pose for the 
person or for others who share his or her current and future 
environments? Would you rate each behavior as a mild, moder- 
ate, or serious problem? How important do you think it would 
be to target each of these behaviors for reduction or elimination 
from the repertoires of the four individuals? 


not record instances of nonoccurrence, in a standard fashion. A 
complete definition is a precise and concise description of the 
behavior of interest. Note how the target behavior definitions 
in Figures 3.10 and 3.11 meet the standard for being objective, 
clear, and complete. 

Morris (1985) suggested testing the definition of a target 
behavior by asking three questions: 


1. Can you count the number of times that the behavior 
occurs in, for example, a 15-minute period, a 1-hour 
period, or 1 day? Or, can you count the number of minutes 
that it takes for the child to perform the behavior? That 
is, can you tell someone that the behavior occurred “x” 
number of times or “x” number of minutes today? (Your 
answer should be “‘yes.”) 


2. Will a stranger know exactly what to look for when you 
tell him or her the target behavior you are planning to mod- 
ify? That is, can you actually see the child performing the 
behavior when it occurs? (Your answer should be “yes.”) 


3. Can you break down the target behavior into smaller 
behavioral components, each of which is more specific 
and observable than the original target behavior? (Your 
answer should be “no.”) 


In responding to the suggestion that perhaps a source- 
book of standard target behavior definitions be developed 
because it would increase the likelihood of exact replications 
among applied researchers and would save the considerable 
time spent in developing and testing situation-specific defini- 
tions, Baer (1985) offered the following perspectives. Applied 


Appropriate answers to these questions cannot be found 
in topographical descriptions alone. The meaning and relative 
importance of any operant behavior can be determined only in 
the context of the environmental antecedents and consequences 
that define the behavior. Here is what each of the four people 
in the previous examples were actually doing: 


1. An infant learning to wave “bye-bye.” 

2. A man with allergies rubbing his eye to relieve the 
itching. 

3. A student typing unpredictable text to increase her key- 
boarding fluency and endurance. 

4. A massage therapist giving a relaxing, deep-muscle mas- 
sage to a grateful and happy customer. 


Applied behavior analysts must remember that the 
meaning of any behavior is determined by its function, not its 
form. Behaviors should not be targeted for change on the basis 
of topography alone. 


Note: Examples | and 2 are adapted from Meyer and Evans. 1989. p. 53. 


behavior analysis programs are implemented because someone 
(e.g., teacher, parent, individual himself) has “complained” that 
a behavior needs to be changed. A behavioral definition has 
validity in applied behavior analysis only if it enables observers 
to capture every aspect of the behavior that the complainer is 
concerned with and none other. Thus, to be valid from an applied 
perspective, definitions of target behaviors should be situation 
specific. Attempts to standardize behavior definitions assume an 
unlikely similarity across all situations. 


SETTING CRITERIA FOR BEHAVIOR CHANGE 


Target behaviors are selected for study in applied behavior 
analysis because of their importance to the people involved. 
Applied behavior analysts attempt to increase, maintain, and 
generalize adaptive, desirable behaviors and decrease the 
occurrence of maladaptive, undesirable behaviors. Behavior 
analysis efforts that not only target important behaviors but 
also change those behaviors in a positive and meaningful way 
are said to have social validity.® But how much does a target 
behavior need to change before it makes a meaningful differ- 
ence in the person’s life? 

Van Houten (1979) made a case for specifying the desired 
outcome criteria before efforts to modify the target behavior 
begin. 


This step [specifying outcome criteria] becomes as import- 
ant as the previous step [selecting socially important target 
behaviors] if one considers that for most behaviors there 
exists a range of responding within which performance is 
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most adaptive. When the limits of this range are unknown 
for a particular behavior, it is possible to terminate treatment 
when performance is above or below these limits. Hence, the 
behavior would not be occurring within its optimal range... . 

In order to know when to initiate and terminate a 
treatment, practitioners require socially validated standards 
for which they can aim. (pp. 582, 583) 


Van Houten (1979) suggested two basic approaches to 
determining socially valid goals: (a) assess the performance of 


SUMMARY 
Role of Assessment in Applied Behavior Analysis 
1. Behavioral assessment involves using indirect, direct, and 


empirical methods to identify, define, and determine the 
function of target behaviors. 


2. Behavioral assessment consists of five phases or 
functions: (a) screening, (b) defining and quantifying 
problems or goals, (c) pinpointing the target 
behavior(s) to be treated, (d) monitoring progress, 
and (e) following up. 


3. Before conducting a behavioral assessment, the 
behavior analyst must determine whether she has the 
authority and permission, resources, and skills to assess 
and change the behavior. Past and current records 
related to medical, educational, and historical events 
should be examined and analyzed as part of a complete 
behavioral assessment. 


Assessment Methods Used by Behavior Analysts 


4. Subsumed under the three major methods for assess- 
ment information—indirect, direct, and empirical—are 
(a) interviews, checklists/rating scales, (b) tests and 
direct observations, and (c) functional behavior analysis 
and reinforcer/punishment preference assessment meth- 
ods, respectively. 


5. The client interview is used to determine the client’s 
description of problem behaviors or achievement goals. 
What, when, and where questions are emphasized, focus- 
ing on the actual behavior of the client and the responses 
of significant others to that behavior. 


6. Questionnaires and needs assessment surveys are some- 
times completed by the client to supplement the informa- 
tion gathered in the interview. 


7. Clients are sometimes asked to self-monitor certain situ- 
ations or behaviors. Self-collected data may be useful in 
selecting and defining target behaviors. 


8. Significant others can also be interviewed to gather assess- 
ment information and, in some cases, to find out whether 
they will be willing and able to assist in an intervention. 
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people judged to be highly competent and (b) experimentally 
manipulate different levels of performance to determine empir- 
ically which produces optimal results. 

Regardless of the method used, specifying treatment 
goals before intervention begins provides a guideline for 
continuing or terminating a treatment. Further, setting objec- 
tive, predetermined goals helps to eliminate disagreements 
or biases among those involved in evaluating a program’s 
effectiveness. 


9. Direct observation with a behavior checklist that contains 
specific descriptions of various skills can indicate possible 
target behaviors. 


10. Anecdotal observation, also called ABC recording, yields 
a descriptive, temporally sequenced account of all behav- 
iors of interest and the antecedent conditions and conse- 
quences for those behaviors as those events occur in the 
client’s natural environment. 


11. Ecological assessment entails gathering a large amount 
of information about the person and the environments in 
which that person lives and works (e.g., physiological con- 
ditions, physical aspects of the environment, interactions 
with others, past reinforcement history). A complete eco- 
logical assessment is neither necessary nor warranted for 
most applied behavior analysis programs. 


12. Reactivity refers to the effects that an assessment pro- 
cedure has on the behavior being assessed. Behavior 
analysts should use assessment methods that are as 
unobtrusive as possible, repeat observations until appar- 
ent reactive effects subside, and take possible reactive 
effects into account when interpreting the results of 
observations. 


13. Choose assessment methods that produce reliable and 
valid results, conduct assessments according to profes- 
sional standards, and apply conservative analyses when 
interpreting results. 


Assessing the Social Significance of Potential 
Target Behaviors 


14. Target behaviors in applied behavior analysis must 
be socially significant behaviors that will increase a 
person’s habilitation (adjustment, competence, and 
quality of life). 


15. The relative social significance and habilitative value of a 
potential target behavior can be clarified by viewing it in 
light of the following considerations: 


* Will the behavior be reinforced in the person’s daily 
life? The relevance of behavior rule requires that a tar- 
get behavior produce reinforcement for the person in the 
postintervention environment. 


Is the behavior a necessary prerequisite for a useful 
skill? 
Will the behavior increase the person’s access to envi- 


ronments in which other important behaviors can be 
learned or used? 


Will the behavior predispose others to interact with 


the person in a more appropriate and supportive 
manner? 


* Is the behavior a cusp or pivotal behavior? Behavioral 
cusps have sudden and dramatic consequences that extend 
well beyond the idiosyncratic change itself because they 
expose the person to new environments, reinforcers, con- 
tingencies, responses, and stimulus controls. Learning a 
pivotal behavior produces corresponding modifications or 
covariations in other untrained behaviors. 


¢ Is the behavior age appropriate? 


¢ Whenever a behavior is targeted for reduction or elim- 
ination, a desirable, adaptive behavior must be selected 
to replace it. 


Does the behavior represent the actual problem or 
achievement goal, or is it only indirectly related? 


¢ A person’s verbal behavior should not be confused 
with the actual behavior of interest. However, in 
some situations the client’s verbal behavior should 
be selected as the target behavior because it is the 
behavior of interest. 


¢ If a person’s goal is not a specific behavior, a target 
behavior(s) must be selected that will produce the 
desired results or state. 


Prioritizing Target Behaviors 


16. Assessment often reveals more than one possible behavior 
or skill area for targeting. Prioritization can be accom- 
plished by rating potential target behavior against key 
questions related to their relative danger, frequency, 
long-standing existence, potential for reinforcement, rele- 
vance for future skill development and independent func- 
tioning, reduced negative attention from others, likelihood 
of success, and cost. 


17. Participation by the person whose behavior is to 
be changed, parents and/or other important family 


KEY TERMS 


Chapter 3 * Selecting and Defining Target Behaviors 91 


members, staff, and administration in identifying and 
prioritizing target behaviors can help reduce goal 
conflicts. 


Defining Target Behaviors 


18. 


19. 


20. 


21. 


22. 


23. 


Explicit, well-written target behavior definitions are nec- 
essary for researchers to accurately and reliably measure 
the same response classes within and across studies or to 
aggregate, compare, and interpret their data. 


Good target behaviors definitions are necessary for prac- 
titioners to collect accurate and believable data to guide 
ongoing program decisions, apply procedures consis- 
tently, and provide accountability to clients, parents, and 
administrators. 


Function-based definitions designate responses as mem- 
bers of the targeted response class solely by their common 
effect on the environment. 


Topography-based definitions define instances of the tar- 
geted response class behavior by the shape or form of the 
behavior. 


A good definition must be objective, clear, and complete, 
and must discriminate between what is and what is not an 
instance of the target behavior. 


A target behavior definition is valid if it enables observers 
to capture every aspect of the behavior that the “com- 
plainer” is concerned with and none other. 


Setting Criteria for Behavior Change 


24. 


25. 


26. 


ABC recording ecological assessment 
anecdotal observation function-based definition 
behavior checklist habilitation 

behavioral assessment normalization 


behavioral cusp pivotal behavior 


A behavior change has social validity if it changes some 
aspect of the person’s life in an important way. 


Outcome criteria specifying the extent of behavior change 
desired or needed should be determined before efforts to 
modify the target behavior begin. 


Two approaches to determining socially validated perfor- 
mance criteria are (a) assessing the performance of people 
judged to be highly competent and (b) experimentally 
manipulating different levels of performance to determine 
which produces optimal results. 


reactivity 


relevance of behavior rule 
social validity 
target behavior 


topography-based definition 
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MULTIPLE-CHOICE QUESTIONS 


1. All of the following are assessment methods used in 
applied behavior analysis except: 


a. Direct observation 

b. Interview 

c. Medical evaluations 

d. Checklists 

Hint: (See “Role of Assessment in Applied Behavior 
Analysis’) 


2. Behavior assessment seeks to determine the ofa 


behavior. 
a. Function 
b. Replacement 
c. Importance 
d. Structure 
Hint: (See “Role of Assessment in Applied Behavior 
Analysis’’) 
3. All of the following are competencies of a behavior 
analyst except: 
a. Knowledge of socially important behavior 
b. Technical skills 
c. Ability to conduct statistical analyses of data 
d. 


Ability to match assessment data with intervention 
strategies 


Hint: (See “Phases of Behavioral Assessment’’) 
4. Potential target behaviors should not be selected if the 
primary reason for selection is the: 
a. Benefit of others 
b. Safety of the person 
c. Safety of family members 
d. Potential to increase independence 
Hint: (See “Assessing the Social Significance of Potential 
Target Behaviors’’) 
5. Interview questions should avoid “why” questions because 
these tend to encourage explanations of behavior. 
a. Defensive 
b. Mentalistic 
c. Lengthy 
d. Direct 
Hint: (See “Interviewing the Client’’) 
6. A descriptive and temporally sequenced account of behavior 
in the natural environment is called a(n): 
a. Anecdotal observation 
b. Behavioral assessment 
c. Ecological assessment 
d. Real time observation 


Hint: (See “Direct Observations’’) 


7. The 


10. 


11. 


. The principle of 


of behavior rule states that a target 
behavior should only be selected when it can be 
determined that the behavior will produce natural 
reinforcement. 


a. Functionality 
b. Validity 

c. Relevance 
d. Importance 


Hint: (See “Will This Behavior Likely to Produce 
Reinforcement in the Client’s Natural Environment 
after Treatment Ends?”’) 


determines the degree to which 
a person’s behavior repertoire maximizes short and long- 
term reinforcers for that individual and for others, and 
minimizes short and long-term punishers. 


a. Normalization 
b. Habilitation 
c. Functionality 
d. Justification 


Hint: (See “Assessing the Social Significance of Potential 
Target Behaviors”’) 


. Behavior that exposes an individual to new contingencies, 


reinforcers, and stimulus controls is called: 

a. Pivotal behavior 

b. Access behavior 

c. Behavior cusp 

d. Contingent behavior 

Hint: (See “Is This Behavior a Behavioral Cusp or a Pivotal 
Behavior?) 


When a problem behavior has been targeted for reduction 
or elimination, the behavior analyst must always include 
a(n) in the intervention plan. 


a. Replacement behavior 
b. Appropriate response 
c. Pivotal behavior 

d. Performance criterion 


Hint: (See “If the Proposed Target Behavior is to be 
Reduced or Eliminated, What Adaptive Behavior Will 
Replace It?’’) 


Juan is a six-year-old boy with a developmental disability 
who attends an integrated kindergarten class. Assessments 
have identified four target behaviors. Which behavior 
should be the first target for intervention? 


a. Flicking his fingers in front of his eyes 
b. Bolting from the playground 

c. Toilet training 

d. Humming loudly during group activities 


Hint: (See “Prioritizing Target Behaviors”) 


12. 


13: 


14. 


1. 


ESSAY-TYPE QUESTIONS 


In determining the likelihood of success in changing a 
behavior, all of the following should be considered except: 


a. Research on changing this behavior 

b. Experience of the behavior analyst 

c. Social validity of the behavior 

d. Available resources 

Hint: (See “Prioritizing Target Behaviors”) 

Explicit behavior definitions are important in research of 
applied behavior analysis for all of the following except: 
a. Replication by other scientists 

b. Accurate and reliable measurement of behavior 

c. Comparison of data across studies 

d. Agreement between assessment and intervention data 
Hint: (See “Defining Target Behaviors’) 

A(n) definition designates responses in terms of 
their effect on the environment. 

a. Observable 


b. Function-based 


How does behavioral assessment differ from educational 
or psychological assessment? 


Hint: (See “Role of Assessment in Applied Behavior 
Analysis’) 


. Describe one of the competencies of an applied behavior 


analyst. 

Hint: (See “Phases of Behavioral Assessment’) 
Describe one of the four major assessment methods 
used in behavior analysis. 

Hint: (See “Assessment Methods Used by Behavior 
Analysts’) 

Identify one reason social validity has become a major 
focus of applied behavior analysis. 


Hint: (See “Assessing the Social Significance of Potential 
Target Behavior’) 


NOTE 
1. 


Problem behaviors can be assessed empirically using a three-step process called 
functional behavior assessment that is designed to identify and systematically 
manipulate antecedents and/or consequences that may control the occurrence 
of the problem behavior(s). Chapter 27 describes this process in detail. 
Chapter 31, “Ethical and Professional Responsibilities of Applied Behavior 
Analysts,” examines this important issue in detail. 

Behavior analysts use other forms of assessment to guide the design of effec- 
tive interventions. Those assessment methods are described elsewhere in this 
text: preference assessment and reinforcer assessment (Chapter 11), punisher 
assessment (Chapter 14), and functional behavior assessment (Chapter 27). 
ABC recording is discussed further in Chapter 27, “Functional Behavior 
Assessment.” 

Reactive effects of assessment are not necessarily negative. Self-monitoring 
has become as much a treatment procedure as it is an assessment procedure; 


see Chapter 29. 


15. 


16. 


6. 


Chapter 3 * Selecting and Defining Target Behaviors 


93 


c. Topography-based 

d. Ecological 

Hint: (See “Defining Target Behaviors’’) 

A behaviors has if it changes some aspect of the 
person’s life in an important way. 

a. Social validity 

b. Functional application 

c. Observable benefit 

d. Normalized outcomes 

Hint: (See “Setting Criteria for Behavior Change’”’) 
Outcome criteria should be established before intervention 
commences for all of the following reasons except: 

a. To establish the target performance level 

b. To know when to terminate intervention 

c. To ensure accurate data collection 


d. To ensure agreement on outcomes among 
stakeholders 


Hint: (See “Setting Criteria for Behavior Change’’) 


. Describe the relevance of behavior rule and briefly explain 


how it can be used to identify socially significant behavior. 
Hint: (See “Habilitation Defined’) 


. What is the first question to ask when prioritizing potential 


target behaviors and why should it be asked first? 


Hint: (See “Prioritizing Target Behaviors”’) 


. Explain one research benefit of explicit, well-written defi- 


nitions in applied behavior analysis. 


Hint: (See “Defining Target Behaviors”) 


What are two features of a function-based behavior definition? 


Hint: (See “Two Types of Target Behavior Definitions’’) 


Describe one benefit to setting the criteria for behavior 
change prior to initiating intervention. 


Hint: (See “Setting Criteria for Behavior Change’) 


seen soe? SS 


A target behavior’s indirect benefit as a necessary prerequisite for another 
important behavior should not be confused with indirect teaching. Indirect 
teaching involves selecting a target behavior different from the true pur- 
pose of the behavior because of a belief that they are related (e.g., having 
a student with poor reading skills practice shape discrimination or balance 
beam walking). The importance of directness in target behavior selection 
is discussed later in this section. 

Procedures for measuring behavior accurately and reliably are discussed in 
Chapter 4. 

A third component of social validity concerns the social acceptability of the 
procedures used to change the behavior. Chapter 10 describes procedures 
for assessing the social validity of behavior change. 


Comer ER 4 


Measuring Behavior 


LEARNING OBJECTIVES 


e Describe the functions of measurement in applied behavior analysis. 
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Identify the measurable dimensions of behavior. 


Describe the different procedures for measuring behavior. 


State the advantages and disadvantages of using continuous measurement procedures and 


sampling procedures. 


Explain the procedure for measuring behavior by permanent products. 


Explain computer-assisted measurement of behavior. 


Select the appropriate measurement procedure given the dimensions of the behavior and 


the logistics of observing and recording. 


When I awoke this morning, I looked at the clock to see 
how many hours I slept. (Do I stay in bed or get up? I 
got up.) I showered, shaved, brushed my teeth, and then 
stepped onto the bathroom scale to check my weight. 
(Did I weigh the same, less, or more than yesterday? I 
gained two pounds.) I started thinking about how much 
we humans use measurement in our everyday activities, 
and for the most part we are completely oblivious of the 
behavioral measurement. 

For breakfast I made coffee (4 scoops of coffee, with 
4’ cups of water) and placed an English muffin in the 
toaster (set the toaster to the correct temperature for toast- 
ing the muffin). I had breakfast and then I was off to the 
community rec center for morning exercise. 

I got in the car, and the speedometer told me how 
many miles per hour I was driving. (I stayed within the 
speed limit.) The gas gauge told me how much gas was in 
the tank. (Enough to drive to the rec center, return home, 
and then some.) 

At the rec center, I observed people walking or run- 
ning on the track. One walker recorded the number of laps 
on a hand-held tally counter, a runner glanced at the clock 
after each lap, and several walkers and runners wore digital 
step counters on their wrists. In other places in the rec cen- 
ter, people counted and recorded the number of repetitions 
they compieted with free weights and strength-building 
machines. 

And on and on it goes throughout the day. Mea- 
surement makes up a large portion of our everyday lives. 
It greatly improves the quality of our interaction with our 
environment. Just as measurement is so important in our 


environment, it is the foundation of all activity in applied 
behavior analysis. 


easurement provides the basis for all scientific 
Mises and for the development and success- 
ful application of technologies derived from those 
discoveries. Direct and frequent measurement constitutes the 
foundation for applied behavior analysis. Applied behavior 
analysts use measurement to detect and compare the effects of 
various environmental arrangements on the acquisition, mainte- 
nance, and generalization of socially significant behaviors. 
What is it about behavior that applied behavior analysts 
can and should measure? How should those measures be 
obtained? And, what should we do with these measures once we 
have obtained them? This chapter identifies the dimensions by 
which behavior can be measured and describes the measurement 
methods behavior analysts commonly use. First, we examine the 
definition and functions of measurement in applied behavior 
analysis. 


DEFINITION AND FUNCTIONS 
OF MEASUREMENT IN APPLIED 
BEHAVIOR ANALYSIS 


Measurement is the process of applying quantitative labels 
to describe and differentiate objects and natural events. “TIt] 
involves attaching a number representing the observed extent 
of a dimensional quantity to an appropriate unit. The number 


and the unit together constitute the measure of the object or 
event” (Johnston & Pennypacker, 1993a, p. 95). Measurement 
in applied behavior analysis entails three steps: (a) identifying 
the behavior to be measured, (b) defining the behavior in 
observable terms, and (c) selecting an appropriate observation 
and data-recording method (Gast, 2014). Chapter 3 covered 
selecting and defining target behavior; this chapter details 
measurement methods. 

Bloom, Fischer, and Orme (2003)—who described 
measurement as the act or process of applying quantitative or 
qualitative labels to events, phenomena, or observed properties 
using a standard set of consensus-based rules by which to 
apply labels to those occurrences—pointed out that the concept 
of measurement includes the characteristics of what is being 
measured, the quality and appropriateness of the measurement 
tools, the technical skill of the measurer, and how the measures 
obtained are used. In the end, measurement gives researchers, 
practitioners, and consumers a common means for describing and 
comparing behavior with a set of labels that convey a common 
meaning. 


Researchers Need Measurement 


Dr. Watson: “What do you imagine that means?” 

Sherlock Holmes: “I have no data yet. It is a capital mistake 

to theorize before one has data. Insensibly one begins to twist 

facts to suit theories, instead of theories to suit facts.” 
—from A Scandal in Bohemia by Arthur Conan Doyle 


Measurement is how scientists operationalize empiricism. 
Objective measurement enables (indeed, it requires) scientists 
to describe the phenomena they observe in precise, consistent, 
and publicly verifiable ways. Without measurement, all three 
levels of scientific knowledge—description, prediction, and 
control—would be relegated to guesswork subject to the “indi- 
vidual prejudices, tastes, and private opinions of the scientist” 
(Zuriff, 1985, p. 9). We would live in a world in which the 
alchemist’s suppositions about a life-prolonging elixir would 
prevail over the chemist’s formulated compounds derived from 
experimentation. 

Behavior analysts measure behavior to answer questions 
about the existence and nature of functional relations between 
socially significant behavior and environmental variables. 
Measurement enables comparisons of a person’s behavior 
within and between different environmental conditions, 
thereby affording the possibility of drawing empirically based 
conclusions about the effects of those conditions on behavior. 
For example, Dunlap and colleagues (1994) measured students’ 
task engagement and disruptive behaviors during choice and 
no-choice conditions. Measurement revealed the level of both 
target behaviors during each condition, whether and how 
much the behaviors changed when choice was introduced or 
withdrawn, and how variable or stable the behaviors were during 
each condition. 

The researcher’s ability to achieve a scientific under- 
standing of behavior change depends on her ability to measure 
it. Measurement makes possible the detection and verification 
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of virtually everything that has been discovered about the 
effects of the environment on behavior. The empirical data- 
bases of the basic and applied branches of behavior analysis 
consist of organized collections of behavioral measurements. 
Virtually every graph in the Journal of Applied Behavior Anal- 
ysis and Journal of the Experimental Analysis of Behavior 
displays an ongoing record or summary of behavioral mea- 
surement. In short, measurement provides the very basis for 
learning and talking about behavior in scientifically mean- 
ingful ways.! 


Practitioners Need Measurement 


Behavioral practitioners are dedicated to improving their clients’ 
lives by changing socially significant behaviors. Practitioners 
measure behavior initially to determine the current level of a 
target behavior and whether that level merits intervention. If 
intervention is warranted, the practitioner measures the extent to 
which his efforts are successful. Practitioners measure behavior 
to find out whether and when it has changed; the extent and 
duration of behavior changes; the variability or stability of 
behavior before, during, and after treatment; and whether 
important behavior changes have occurred in other settings or 
situations and spread to other behaviors. 

Frequent measures of behavior during treatment 
(formative assessment) enable dynamic, data-based deci- 
sion making concerning the continuation, modification, or 
termination of treatment. Practitioners also compare mea- 
surements of the target behavior before and after treatment 
(sometimes including pre- and post-treatment measures 
obtained in nontreatment settings or situations) to evaluate 
the overall effects of behavior change programs (summative 
evaluation). 

The practitioner who does not obtain and attend to frequent 
measures of the target behavior is vulnerable to committing 
mistakes: (a) continuing an ineffective treatment when no real 
behavior change has occurred or (b) discontinuing an effective 
treatment because subjective judgment detects no improvement 
(e.g., without measurement, a teacher would be unlikely to 
know that a student’s oral reading rate has increased from 70 to 
80 words per minute). Direct and frequent measurement enables 
practitioners to detect their successes and, equally important, 
their failures so they can make changes to transform failure into 
success (Bushell & Baer, 1994; Greenwood & Maheady, 1997; 
Gast, 2014). 


Our technology of behavior change is also a tech- 
nology of behavior measurement and of experimental 
design; it developed as that package, and as long as 

it stays in that package, it is a self-evaluating enter- 
prise. Its successes are successes of known magnitude; 
its failures are almost immediately detected as failures; 
and whatever its outcomes, they are attributable to 
known inputs and procedures rather than to chance 
events or coincidences. (D. M. Baer, October 21, 1982) 


In addition to enabling ongoing program monitoring 
and data-based decision making, frequent measurement 


96 Part 2 « Selecting, Defining, and Measuring Behavior 


provides other important benefits to practitioners and the cli- Miller (2005) argued that practitioners would be better 
ents they serve: served by casting aside pseudo-educational theories and 
philosophies and adopting a scientific approach. Practi- 

¢ Measurement helps practitioners optimize their effec- tioners who directly and frequently measure the effects 


tiveness. To be optimally effective, a practitioner must 
maximize behavior change efficiency in terms of time 
and resources. Only by maintaining close, continual con- 
tact with relevant outcome data can a practitioner hope to 
achieve optimal effectiveness and efficiency (Bushell & 
Baer, 1994). Commenting on the critical role of direct and 
frequent measurement in maximizing the effectiveness 
of classroom practice, Sidman (2000) noted that teach- 
ers “must remain attuned to the pupil’s messages and be 
ready to try and to evaluate modifications [in instructional 
methods]. Teaching, then, is not just a matter of changing 
the behavior of pupils; it is an interactive social process” 
(p. 23, words in brackets added). Direct and frequent mea- 
surement is the process by which practitioners hear their 
clients’ messages. 
Measurement enables practitioners to verify the legit- 
imacy of treatments touted as “evidence based.” Prac- 
titioners are increasingly expected, and in some fields 
mandated by law, to use evidence-based interventions. 
An evidence-based practice is a treatment or intervention 
that has been shown to be effective by a substantial body 
of high-quality, peer-reviewed scientific research. When 
implementing any treatment, regardless of the type or 
amount of research evidence to support it, practitioners 
can and should verify its effectiveness with the students 
or clients they serve by direct and frequent measurement. 
Measurement helps practitioners identify and terminate 
treatments based on pseudoscience, fad, fashion, or 
ideology. Many controversial treatments and proposed 
cures for people with developmental disabilities and 
autism (e.g., facilitated communication, holding therapy, 
megadoses of vitamins, strange diets, weighted vests, 
dolphin-assisted therapy) have been promoted in the 
absence of sound scientific evidence of effectiveness 
(Foxx & Mulick, 2016). The use of so-called breakthrough 
therapies has led to disappointment and loss of precious 
instructional or therapeutic time and, in some cases, to 
disastrous consequences (Maurice, 1993; Todd, 2012). 
Even though well-controlled studies have shown many 
of these methods to be ineffective, and even though 
these programs are not justified because they lacked 
sound scientific evidence of effects, risks, and benefits, 
parents and practitioners are still bombarded with sincere 
and well-meaning testimonials. Measurement is the 
practitioner’s best ally in the quest to find and verify 
effective treatments, and to root out those whose strongest 
support consists of testimonials and slick Internet ads. 
Practitioners should maintain a healthy skepticism 
regarding claims for effectiveness. See Box 31.5 for 
further discussion on this topic. 

Using Plato’s Allegory of the Cave as a metaphor 
for teachers and other practitioners who use untested and 
pseudo-instructional ideas, Heron, Tincani, Peterson, and 


of their intervention and treatment programs have empir- 
ical support to defend against political or social pressures 
to adopt unproven treatments. In a real sense they arm 
themselves with what Carl Sagan (1996) called a “baloney 
detection kit.” 

¢ Measurement helps practitioners be accountable to cli- 
ents, consumers, employers, and society. Practitioners 
should use direct and frequent measurement of behavior 
to answer parent and caregiver questions about the devel- 
opment of their children or clients. 

¢ Measurement helps practitioners achieve ethical standards. 
Ethical codes of conduct for behavior analytic practitioners 
require direct and frequent measurement of client behavior 
(Behavior Analyst Certification Board, 2018; Chapter 31 this 
text). Determining whether a client’s right to effective treat- 
ment is being honored requires measurement of the behav- 
ior(s) for which treatment was sought or intended. Failing to 
measure the nature and extent of relevant behavior changes 
of clients borders on malpractice. Kauffman (2005) offered 
this perspective on the relationship between measurement 
and ethical practice in education: 


[T]he teacher who cannot or will not pinpoint and measure 
the relevant behaviors of the students he or she is teach- 
ing is probably not going to be very effective. .. . Not to 
define precisely and to measure these behavioral excesses 
and deficiencies, then, is a fundamental error; it is akin 

to the malpractice of a nurse who decides not to measure 
vital signs (heart rate, respiration rate, temperature, and 
blood pressure), perhaps arguing that he or she is too 
busy, that subjective estimates of vital signs are quite 
adequate, that vital signs are only superficial estimates 

of the patient’s health, or that vital signs do not signify 

the nature of the underlying pathology. The teaching 
profession is dedicated to the task of changing behavior— 
changing behavior demonstrably for the better. What can 
one say, then, of educational practice that does not include 
precise definition and reliable measurement of the behav- 
ioral change induced by the teacher’s methodology? It is 
indefensible. (p. 439) 


MEASURABLE DIMENSIONS OF BEHAVIOR 


If a friend asked you to measure a coffee table, you would prob- 
ably ask why he wants the table measured. In other words, what 
does he want measurement to tell him about the table? Does 
he need to know its height, width, and depth? Does he want to 
know how much the table weighs? Perhaps he is interested in 
the color of the table? Each of these reasons for measuring the 
table requires measuring a different dimensional quantity of the 
table (e.g., length, mass, light reflection). 

Behavior, like coffee tables and all entities in the physical 
world, also has features that can be measured. Because behavior 


occurs within and across time, it has three fundamental and 
measurable dimensional quantities. Johnston and Pennypacker 
(2009) described these dimensional quantities as: 


* Repeatability (also called countability): Instances of a 
behavior can occur repeatedly through time (i.e., behavior 
can be counted). 

* Temporal extent: Every instance of behavior occurs 
during some amount of time (i.e., the duration of behavior 
can be measured). 

* Temporal locus: Every instance of behavior occurs at 
a certain point in time with respect to other events (i.e., 
when behavior occurs can be measured). 


Figure 4.1 shows a schematic representation of repeat- 
ability, temporal extent, and temporal locus. Alone and in com- 
bination, these dimensional quantities provide the basic and 
derivative measures used by applied behavior analysts. In the 
following pages these and two other measurable dimensions of 
behavior—its form and strength—will be discussed. 


Measures Based on Repeatability 


Count 


Count is a simple tally of the number of occurrences of a behav- 
ior. Although how often a behavior occurs is often of primary 
interest, measures of count alone may not provide enough 
information to allow behavior analysts to make useful program 
decisions or analyses. For example, data showing that Katie 
wrote correct answers to 5, 10, and 15 long division problems 
over three consecutive math class periods suggests improving 
performance. However, if the three measures of count were 
obtained in observation periods of 5 minutes, 20 minutes, and 
60 minutes, respectively, a much different interpretation of 
Katie’s performance is suggested. Therefore, the observation 
period, or counting time, should always be noted when reporting 
measures of count. 


Rate 


Combining count and observation time yields rate, one of the 
most widely used measures in behavior analysis. Skinner (1953) 
considered rate of response the basic measurement for behav- 
ioral research. He invented the cumulative recorder, a device 
that automatically produced a graphic record of operant response 
rates by his experimental subjects (see Figure 1.1). 


Tine ——— = R3 
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Rate is defined as the number of responses per unit of 
time.” A rate measure is a ratio consisting of the dimensional 
quantity of count (number of responses) and time (observation 
period in which the count was obtained). Converting count 
to rate of response makes measurement more meaningful. For 
example, knowing that Yumi read 95 words correctly and 4 words 
incorrectly in 1 minute, that Lee wrote 250 words in 10 minutes, 
and that Joan’s self-injurious behavior occurred 17 times in 1 hour 
provides important information and context. Expressing the 
previously reported counts of Katie’s performance in math class 
as rate reveals that she correctly answered long division problems 
at rates of 1.0, 0.5, and 0.25 per minute over three consecutive 
class periods. 

Rate is often reported as a count per 30 seconds, count per 
minute, count per hour, count per day—or occasionally, count 
per week, count per month, or count per year. As long as the 
unit of time is standard within or across experiments, rate mea- 
sures can be compared. For example, a student who, over four 
daily class activities of different durations, had 12 talk-outs in 
20 minutes, 8 talk-outs in 12 minutes, 9 talk-outs in 15 minutes, 
and 12 talk-outs in 18 minutes had response rates of 0.60, 0.67, 
0.60, and 0.67 per minute. 

The six rules and guidelines that follow will help research- 
ers and practitioners obtain, describe, and interpret count and 
rate data most appropriately. 


Reference the Counting Time. Behavior analysts use two 
methods to calculate rate of response: (a) total observation time 
and (b) total interresponse time (IRT) (Johnston & Pennypacker, 
2009). Applied behavior analysts most often use the total obser- 
vation time to calculate rate. To calculate total IRT, the analysts 
first measure the IRT for each response (i.e., the amount of time 
that elapses between two consecutive instances of a behavior) 
and then calculate the sum of the IRTs. (See the section on IRT 
later in this chapter.) 

When using rate of response, behavior analysts must 
include the duration of the observation time. Comparing rate 
measures without reference to the counting time can lead to 
faulty interpretations of data. Consider, for example, that Sally 
and Lillian each ran at a rate of 7 minutes per mile. We cannot 
compare their performances without reference to the distances 
they ran. Running | mile at a rate of 7 minutes per mile is a 
different class of behavior than running a marathon (26.2 miles) 
at a rate of 7 minutes per mile. 


Figure 4.1 Schematic representation of the 
dimensional quantities of repeatability, temporal 
extent, and temporal locus. Repeatability is shown by 
a count of four instances of a given response class 
(Rj, Ro, R3, and Ry) within the observation period. 
The temporal extent (i.e., duration) of each response 
is represented by the raised and shaded portions 

of the time line. One aspect of the temporal locus 
(response latency) of two responses (R2 and Ra) is 
represented by the elapsed time ( <—L— ) between 
the onset of two antecedent stimulus events (S; and 
S>) and the initiation of the responses that follow. 
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The counting time used for each session needs to accom- 
pany each rate measure when the counting time changes from 
session to session. For instance, rather than having a set counting 
time to answer arithmetic facts (e.g., a 1-minute timing), the 
teacher records the total time required for the student to com- 
plete an assigned set of arithmetic problems during each class 
session. In this situation, the teacher could report the student’s 
correct and incorrect answers per minute for each session, and 
also report the counting times for each session because they 
changed from session to session. 


Calculate Correct and Incorrect Rates of Response When 
Assessing Skill Development. When a participant has an 
opportunity to make correct and incorrect responses, a rate of 
response for each behavior should be reported. Calculating rates 
of correct and incorrect responses is crucial for evaluating skill 
development because knowing only the correct rate does not 
allow the behavior analyst or participant to assess an improving 
performance. The rate of correct responses alone could show an 
improving performance, but if the rate of incorrect responding 
is also increasing, the improvement may be illusionary. 
Correct and incorrect rate measures together provide important 
information to help the teacher evaluate how well the student 
is progressing. Ideally, the correct response rate accelerates 
toward a performance criterion, while the incorrect response 
rate decelerates to a performance criterion. Also, reporting rate 
correct and rate incorrect enables an assessment of proportional 
accuracy while maintaining the dimensional quantities of the 
measurement (e.g., 20 correct and 5 incorrect responses per 
minute = 80% accuracy, or a multiple of <4 proportional 
accuracy). 

Correct and incorrect response rates provide essential data 
for assessing fluent performance (i.e., proficiency) (Kubina, 2005). 
The assessment of fluency requires measuring the number of 
correct and incorrect responses per unit of time (i.e., proportional 
accuracy). Fluency cannot be evaluated using correct rate only 
because fluent performance must be accurate also. 


Consider the Varied Complexity of Responses. Rate of 
responding is a sensitive and appropriate measure of skill acqui- 
sition and the development of fluent performances only when 
the level of difficulty and complexity from one response to the 
next remains constant within and across observations. The rates 
of response previously discussed have been with whole units in 
which the response requirements are essentially the same from 
one response to the next. Many important behaviors, however, 
are composites of two or more component behaviors, and differ- 
ent situations call for varied sequences or combinations of the 
component behaviors. 

One method for measuring rate of responding that takes 
varied complexity into account for multiple-component behav- 
iors is to count the operations required to achieve a correct 
response. For example, in measuring students’ math calcula- 
tion performance, instead of counting answers to a two-digit 
plus three-digit addition problem with regrouping as correct or 
incorrect, a behavior analyst might consider the number of steps 
that were completed in correct sequence within each problem. 


Helwig (1973) used the number of operations needed to pro- 
duce the answers to mathematics problems to calculate response 
rates. Each session, students were given 20 multiplication and 
division problems selected at random from a set of 120 prob- 
lems. The teacher recorded the duration of time for each session. 
All the problems were of two types:a X b = canda + b = c. 
For each problem, the students were asked to find one of the 
factors: the product, the dividend, the divisor, or the quotient. 
Depending on the problem, finding the missing factor required 
from one to five operations. For example, writing the answer 275 
in response to the problem 55 X 5 = ? would be scored as four 
correct responses because finding the missing factor requires 
four operations: 


1. Multiply the ones: 5 X 5 = 25. 

2. Record the 5 ones and carry the 2 tens. 

3. Multiply the tens: 5 X 5(0) = 25(0). 

4. Add the 2 tens carried and record the sum (27). 


If multiple ways to find the answer were possible, the 
mean number of operations was counted. For example, the 
answer to the problem 4 X ? = 164 can be obtained by two 
operations with multiplication and by four operations with divi- 
sion. The mean number of operations is three. Helwig counted 
the number of operations completed correctly and incorrectly 
per set of 20 problems and reported correct and incorrect rates 
of response. 


Use Rate to Measure Free Operants. Rate of response is a 
useful measure for all behaviors characterized as free oper- 
ants. The term free operant refers to behaviors that have 
discrete onsets and offsets, do not depend on discriminative 
stimuli, involve minimal displacement of the organism in time 
and space (upon completion of one response, the organism is 
immediately ready to emit another response in the class), and 
can be emitted over a wide range of response rates. Skinner 
(1966) used rate of response of free operants as the primary 
dependent variable-when developing the experimental analysis 
of behavior. The bar press by rats and key peck by pigeons 
are typical free operant responses used in nonhuman animal 
laboratory studies. Many socially significant behaviors meet 
the definition of free operants: words read during a 1-minute 
counting period, head slaps per minute, letter strokes written 
in 3 minutes. 

Rate of response is a preferred measurement for free 
operants because it is sensitive to changes in behavior values 
(e.g., oral reading may occur at rates ranging from 0 to 250 or 
more correct words per minute) and because it offers clarity and 
precision by defining a count per unit of time. 


Do Not Use Rate to Measure Behaviors That Occur Within 
Discrete Trials. Rate of response is an inappropriate measure 
for behaviors that can occur only within limited or restricted 
situations. For example, response rates of behaviors that occur 
within discrete trials are controlled by a given Opportunity to 
emit the response. Typical discrete trials used in nonhuman ani- 
mal laboratory studies include moving from one end of a maze 


or shuttle box to another. Applied examples of discrete trials 
include responding to teacher-presented flash cards: answering 
a question prompted by the teacher; and, when presented with 
a sample color, pointing to a color from an array of three colors 
that matches the sample color. In each of these examples, rate 
of response is controlled by the presentation of a discriminative 
stimulus. Because behaviors that occur within discrete trials are 
opportunity bound, measures such as percentage of response 


opportunities in which a response was emitted or trials-to- 
criterion should be used. 


Do Not Use Rate to Measure Continuous Behaviors That 
Occur for Extended Periods. Rate is also a poor measure 
for continuous behaviors that occur for extended periods, 
such as participating in a playground game or being on task 
during a classroom activity. Such behaviors are best measured 
by whether they are “on” or “off” at any given time, yielding 
data on duration or estimates of duration obtained by interval 
recording. 


Celeration 


Just like a car that goes faster when the driver presses the gas 
pedal and slows down when the driver releases the pedal or 
steps on the brake, rates of response accelerate and decelerate. 
Celeration, the root word of acceleration and deceleration, 
is a measure of how rates of response change over time. Rate 
accelerates when a participant responds faster over successive 
counting periods and decelerates when responding slows over 
successive observations. Applied behavior analysts should use 
the terms acceleration or deceleration when describing increas- 
ing or decreasing rates of response. 

Celeration is a measure of count per unit time/per unit of 
time; or expressed another way, rate/per unit of time (Graf & 
Lindsley, 2002; Kubina, 2005). Celeration—changes in rate—is 
a direct measure of dynamic patterns of behavior change, such 
as transitions from one steady state of responding to another and 
the acquisition of fluent levels of performance (Cooper, 2005). 
The Standard Celeration Chart provides a standard format for 
displaying measures of celeration. There are four Standard 
Celeration Charts, showing rate as count (a) per day, (b) per 
week, (c) per month, and (d) per year. These four charts provide 
different levels of magnification for viewing and interpreting 
celeration. Methods for plotting and interpreting celeration data 
on the Standard Celeration Chart are described in Chapter 6. 


Measures Based on Temporal Extent 


Duration 


Duration, the amount of time from the onset to the end point 
of a response, is the basic measure of temporal extent. Applied 
behavior analysts measure duration in standard units of time 
(e.g., Enrique worked cooperatively with his peer tutor for 6 
minutes and 24 seconds today). 

Duration is important when measuring the amount of time 
a person engages in a target behavior. Applied behavior analysts 
measure the duration of target behaviors that a person has been 
engaging in for too long or for too short of a time period, such 
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as a child with developmental disabilities who has tantrums for 
more than an hour at a time, or a student who sticks with an 
academic task for no more than 30 seconds. 

Duration is also an appropriate measure for behaviors that 
occur at very high rates (e.g., rocking, rapid jerks of the head, 
hands, legs) or task-oriented continuous behaviors that occur 
for an extended time (e.g., cooperative play, on-task behavior, 
off-task behavior). 

Behavioral researchers and practitioners commonly 
measure one or both of two kinds of duration measures: total 
duration per session or observation period and duration per 
occurrence. 


Total Duration per Session. Total duration is a measure of the 
cumulative time a person engages in the target behavior. Applied 
behavior analysts use two procedures to measure and report total 
duration. One method involves recording the cumulative amount 
of time a target behavior occurs during an observation period. 
An observer measuring the total time a child engages in solitary 
play during free-play periods activates a stopwatch when the 
child begins to play alone. The moment solitary play ceases, 
the observer stops the stopwatch but does not reset it. When 
the child drifts into solitary play again, the observer restarts the 
stopwatch. The observer continues to start and stop the stop- 
watch, matching the child’s starting and ending bouts of solitary 
play. If the duration of the observation periods remains constant 
(e.g., 10 minutes), total duration per session data can be reported 
in standard units of time (e.g., 6 min and 30 sec of free play). 
If the duration of observation periods varies, total duration per 
session data must be converted to a percentage of total time 
observed (e.g., 6 min and 30 sec of solitary play in a 10-min 
session = 65%). 

Zhou, Iwata, Goff, and Shore (2001) used total duration 
measurement to assess leisure-item preferences of people with 
profound developmental disabilities. They used a stopwatch to 
record physical engagement with an item (i.e., contact between 
both hands and the item) during 2-minute trials. They reported 
total contact in seconds by summing the duration values across 
three 2-minute trials of each assessment. McCord, Iwata, Galen- 
sky, Ellingson, and Thomson (2001) measured the total duration 
in seconds that two adults with severe or profound intellectual 
disabilities engaged in problem behavior (see Figure 6.6). 

The other measure of total duration recording is the 
amount of time a person spends completing a specific task, with- 
out specifying a minimum or maximum observation period—for 
example, the number of minutes a person takes getting washed 
and dressed for work, after arising in the morning, or the time 
needed to complete a flat tire repair by a high school student 
enrolled in a work-study program. 


Duration per Occurrence. Duration per occurrence is a mea- 
sure of the duration of time that each instance of the target 
behavior occurs. For example, Greene, Bailey, and Barber 
(1981) used a sound-recording device to automatically record 
(a) the number of times noisy disruptions by children on a 
school bus exceeded a specified sound threshold and (b) the 
duration in seconds that each disruption remained above that 


100 Part 2 © Selecting, Defining, and Measuring Behavior 


threshold. The researchers used the mean duration per occur- 
rence of noisy disruptions as one measure for evaluating the 
intervention’s effects. 


Selecting and Combining Measures of Count and Duration. 
Measurements of count, total duration, and duration per occur- 
rence provide different views of behavior. Count and duration 
measure different dimensional quantities of behavior, and these 
differences provide the basis for selecting which dimension(s) 
to measure. Count measures repeatability, whereas duration 
recording measures temporal extent. For instance, a teacher 
concerned about a student who is out of her seat “too much” 
could tally each time the student leaves her seat. The behavior 
is discrete and is unlikely to occur at such a high rate that count- 
ing the number of occurrences would be difficult. Because any 
instance of out-of-seat behavior has the potential to occur for an 
extended time and the total time the student is out of her seat is 
a socially significant aspect of the behavior, the teacher could 
also use total duration recording. 

Using count to measure out-of-seat behavior provides the 
number of times the student left her seat. A measure of total 
duration will indicate the amount and proportion of time that 
the student was out of her seat during the observation period. 
Because of the relevance of temporal extent in this case, duration 
would be a better measure than count. The teacher might observe 
that the student left her seat once in a 30-minute observation 
period. One occurrence in 30 minutes might not be viewed as a 
problem. However, if the student left her seat at minute | of the 
observation period and did not return for the remainder of the 
period, a very different view of the behavior is obtained. 

In this situation, duration per occurrence would make 
a better measurement selection than count or total duration 
recording because duration per occurrence measures repeat- 
ability (i.e., the rate of response) and the temporal extent of 
the behavior. A duration-per-occurrence measure will give the 
behavior analyst information on the number of times the stu- 
dent was out of her seat and the duration of each occurrence. 
Duration per occurrence is often preferable to total duration 
because rate is sensitive to the number of instances and dura- 
tion of the target behavior. Further, if a total duration measure 
is useful for other purposes, the individual durations of each 
of the counted and timed occurrences can be summed. How- 
ever, if behavior endurance (e.g., academic responding, motor 
movements) is the major consideration, then total duration 
recording may be sufficient (e.g., 5 minutes of oral reading, 
10 minutes of free writing). 


Measures Based on Temporal Locus 


Temporal locus refers to (a) when a behavior occurs with 
respect to other events of interest (e.g., the latency between 
the onset of an antecedent event and the occurrence of the 
behavior) and (b) the amount of time that elapses between two 
consecutive instances of a response class. These two points of 
reference provide the context for measuring response latency 
and interresponse time (IRT), the two measures of temporal 
locus most frequently reported in the behavior analysis 
literature. 


Latency 


Latency is a measure of the elapsed time between the onset of 
a stimulus and a subsequent response.° Latency is an appro- 
priate measure when the applied behavior analyst is interested 
in how much time occurs between the opportunities to emit 
a behavior and when the subject initiates the target behavior. 
For example, Edwards, La Londe, Cox, Weetjens, and Poling 
(2016) researched the effects of schedules of reinforcement (see 
Chapter 13) on rats’ search for living people trapped under rub- 
ble. “Rats were exposed to 10 trials each day. For each trial, the 
release point for rats and the location of the two human targets, 
who assumed seated positions within the rubble, were selected 
from random lists” (p. 200). The researchers recorded (a) the 
latency from the rat’s search-area release point to finding and 
placing both front paws on a prepositioned human target under 
simulated rubble and (b) after an initiation buzzer sounded, a 
second latency measure from the target back to the release point. 
This significant research emphasized the importance of quickly 
locating and providing treatment to living people who have been 
trapped under rubble. 

Interest can also focus on latencies that are too short. A 
student may answer incorrectly because she does not wait for 
the teacher to complete the questions. An adolescent who, at 
the slightest provocation from a peer, immediately retaliates has 
no time to consider alternative behaviors that could defuse the 
situation and lead to improved interactions. 

Behavior analysts typically report response latency data 
as the mean, median, and range of individual latency measures 
per observation period. For example, Lerman, Kelley, Vorn- 
dran, Kuhn, and LaRue (2002) used a latency measure to assess 
the effects of different reinforcer magnitudes (i.e., 20 seconds, 
60 seconds, or 300 seconds of access to a reinforcer) on post- 
reinforcement pause—the absence of responding for a period 
following reinforcement. The researchers measured the number 
of seconds from the end of each reinforcer-access interval to the 
first instance of the target behavior (a communication response). 
They then calculated and graphed the mean, median, and range 
of response latencies measured during each session (see Lerman 
étial;2002)/p. 41). 


Interresponse Time 


Interresponse time (IRT) is a measure of the amount of 
time that elapses between two consecutive instances of a 
behavior. Like response latency, IRT is a measure of tem- 
poral locus because it identifies when a specific instance of 
behavior occurs with respect to another event (i.e., the previ- 
ous response). Figure 4.2 shows a schematic representation of 
interresponse time. 

Although a direct measure of temporal locus, IRT is func- 
tionally related to rate of response. Shorter IRTs coexist with 
higher rates of response, and longer IRTs occur within lower 
response rates. Applied behavior analysts measure IRT when 
the time between instances of a response class is important. 
IRT provides a basic measure for implementing and evaluat- 
ing interventions using differential reinforcement of low and 
high rates (DRL & DRH), a procedure for using reinforce- 
ment to reduce (i.e., DRL) or increase (i.e., DRH) the rate of 
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responding (see Chapter 25). Like latency data, IRT measures 
are typically reported and displayed graphically by mean (or 
median) and range per observation period. 

IRT measures appear frequently in basic behavior analysis 
experiments examining the effects of schedules of reinforce- 
ment (Tanno, Silberberg, & Sakagami, 2009) and variables 
related to conditioned reinforcement (Bejarano & Hackenberg, 
2007), delayed reinforcement (Lattal & Ziegler, 1982), and pun- 
ishment (Galbicka & Platt, 1984). In one of the few applied 
studies featuring IRT measures, Lennox, Miltenberger, and Don- 
nelly (1987) reduced rapid eating by three adults with profound 
intellectual disabilities. The intervention combined differential 
reinforcement of low rate responding (DRL; see Chapter 25) 
(an IRT of 15 sec was required before an eating response was 
permitted), response interruption (blockage of eating responses 
before 15 seconds had elapsed since the previous response), and 
prompting participants to engage in an incompatible behavior 
(putting their forks down and hands in their laps) during the 
15-second interval. 


Derivative Measures 


Behavior analysts frequently use percentage and trials-to- 
criterion, two forms of data derived from direct measures of 
dimensional quantities of behavior. 


Percentage 


Percentage is a ratio (i.e., a proportion) formed by combin- 
ing the same dimensional quantities, such as count (i.e., a tally 
divided by a tally) or time (i.e., a duration divided by a dura- 
tion; a latency divided by a latency). Percentage expresses 
the proportional quantity of some event in terms of the num- 
ber of times the event occurred per 100 opportunities that the 
event could have occurred. For example, if a student correctly 
answered 39 of 50 exam items, an accuracy percentage would 
be calculated by dividing the number of correct answers by 
the total number of items and multiplying that product by 100 
(39 + 50 = 0.78 X 100 = 78%). 

Percentage is frequently used in applied behavior analy- 
sis to report participants’ response accuracy. For example, Ward 
and Carnes (2002) used a percentage-of-correct-performances 
measure in their study evaluating the effects of goal setting and 
public posting on skill execution of three defensive skills by 
linebackers on a college football team. The researchers recorded 
counts of correct and incorrect reads, drops, and tackles by 
each player and calculated accuracy percentages based on the 
number of opportunities for each type of play. (Data from this 
study are shown in Figure 9.3.) Petursdottir and Aguilar (2016) 
reported percentage correct data on three kindergarten boys’ 
acquisition of receptive identification tasks as a function of 
the order of stimulus presentation in matching-to-sample trials. 
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Figure 4.2 Schematic representation of three 
interresponse times (IRT). IRT, the elapsed time 
between the termination of one response and the 
initiation of the next response, is a commonly used 
measure of temporal locus. 


Applied behavior analysts often use percentage to 
report the proportion of observation intervals in which the tar- 
get behavior occurred. These measures are typically reported 
as a percentage of intervals within a session (e.g., see 
Figures 6.4 and 12.7). Percentage can also be calculated for 
an entire observation session. In a study analyzing the differ- 
ential effects of reinforcer quality, immediacy, and response 
effort on the impulsive behavior of students with attention- 
deficit/hyperactivity disorder, Neef, Bicard, and Endo (2001) 
reported the percentage of time each student allocated to 
two sets of concurrently available math problems (e.g., time 
allocated to math problems yielding high-quality delayed 
reinforcers + total time possible X 100 = %). 

Percentages are used widely in education, psychology, 
and the popular media, and most people understand proportional 
relationships expressed as percentages. However, percentages 
are often used improperly. Accordingly, we offer several notes 
of caution on the use and interpretation of percentages. 

Percentages most accurately reflect the level of and 
changes in behavior when calculated with a divisor (or denom- 
inator) of 100 or more. However, most percentages used by 
behavior analysts are calculated with divisors much smaller 
than 100. Percentage measures based on small divisors are 
unduly affected by small changes in behavior. For example, a 
change in count of just | response per 10 opportunities changes 
the percentage by 10%. Guilford (1965) cautioned that it is 
unwise to compute percentages with divisors smaller than 20. 
For research purposes, we recommend that whenever possible, 
applied behavior analysts design measurement systems in which 
resultant percentages are based on no fewer than 30 response 
opportunities or observation intervals. 

Changes in percentage may erroneously suggest improv- 
ing performance. For example, an accuracy percentage could 
increase even though the occurrence of incorrect responses 
remains the same or worsens. Consider a student whose accu- 
racy in answering math problems on Monday is 50% (5 of 10 
problems answered correctly) and on Tuesday is 60% (12 of 
20 problems answered correctly). Even with the improved pro- 
portional accuracy, the number of errors increased (from 5 on 
Monday to 8 on Tuesday). 

Although no other measure communicates proportional 
relationships better than percentage, its use as a behavioral 
quantity is limited because a percentage has no dimensional 
quantities.* For example, percentage cannot be used to assess 
the development of proficient or fluent behavior because 
an assessment of proficiency must reference count and time. 
Percentage can, however, show the proportional accuracy of a 
targeted behavior during the development of proficiency. 

Another limitation of percentage as a measure of behavior 
change is that it imposes lower and upper limits on the data. 
For example, using percent correct to assess a student’s 
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reading performance establishes an artificial ceiling on the 
measurement of performance. A learner who correctly reads 
100% of the words she is presented cannot improve according 
to the measure used. 

Different percentages can be reported from the same 
data set, with each percentage suggesting significantly dif- 
ferent interpretations. For example, consider a student who 
scores 4 correct (20%) on a 20-item pretest and 16 correct 
(80%) when the same 20 items are given as a posttest. The 
most straightforward description of the student’s improvement 
from pretest to posttest (60%) compares the two measures 
using the original basis or divisor (20 items). Because the 
student scored 12 more items correct on the posttest than 
he did on the pretest, his performance on the posttest could 
be reported as an increase (gain score) over his pretest per- 
formance of 60%. And given that the student’s posttest score 
represented a fourfold improvement in correct responses, 
some might report the posttest score as a 300% improvement 
of the pretest—a completely different interpretation from an 
improvement of 40%. 

Although percentages greater than 100% are sometimes 
reported, strictly speaking, doing so is incorrect. Although a 
behavior change greater than 100% may seem impressive, it 
is a mathematical impossibility. A percentage is a proportional 
measure of a total set, where x (the proportion) of y (the total) 
is expressed as | part in 100. A proportion of something cannot 
exceed the total of that something or be less than zero (1.e., there 
is no such thing as a negative percentage). Every coach’s favorite 
athlete who “always gives 110%” simply does not exist.° 


Trials-to-Criterion 


Trials-to-criterion is a measure of the number of response 
opportunities needed to achieve a predetermined level of per- 
formance. What constitutes a trial depends on the nature of the 
target behavior, the relevant setting variables and antecedent 
stimuli, and the desired performance level. For a skill such as 
shoe tying, each opportunity to tie a shoe could be considered 
a trial, and trials-to-criterion data are reported as the number 
of trials required for the learner to tie a shoe correctly without 
prompts or assistance. For behaviors involving problem solving 
or discriminations that must be applied across a large number of 
examples to be useful, a trial might consist of a block, or series 
of response opportunities, in which each response opportunity 
involves the presentation of a different exemplar of the prob- 
lem or discrimination. For example, a trial for discriminating 
between the short and long vowel sounds of the letter 0 could 
be a block of 10 consecutive opportunities to respond, in which 
each response opportunity is the presentation of a word con- 
taining the letter 0, with short-vowel and long-vowel o words 
(e.g., hot, boat) presented in random order. Trials-to-criterion 
data could be reported as the number of blocks of 10 trials 
required for the learner to correctly pronounce the o sound in 
all 10 words. Count would be the basic measure from which the 
trials-to-criterion data would be derived. 

Behavior analysts can also use other basic measures as 
trials-to-criterion data (1.e., rate, duration, latency). For exam- 
ple, a trials-to-criterion measure for solving two-digit minus 


two-digit subtraction problems requiring borrowing could be 
the number of practice sheets of 20 randomly generated and 
sequenced problems a learner completes before she is able to 
solve all 20 problems on a single sheet in 3 minutes or less. 

Trials-to-criterion data are often calculated and reported as 
an ex post facto measure of one important aspect of the “cost” of 
a treatment or instructional method. For example, Trask-Tyler, 
Grossi, and Heward (1994) reported the number of instructional 
trials needed by each of three students with visual impairments 
and developmental disabilities to prepare three recipes without 
assistance on two consecutive times over two sessions. Each 
recipe entailed from 10 to 21 task-analyzed steps. 

Trials-to-criterion data are used frequently to compare the 
relative efficiency of two or more treatments or instructional 
methods. For example, by comparing the number of practice 
trials needed for a student to master weekly sets of spelling 
words practiced in two different ways, a teacher could determine 
whether the student learns spelling words more efficiently with 
one method than with another. Sometimes trials-to-criterion data 
are supplemented by information on the number of minutes of 
instruction needed to reach predetermined performance crite- 
ria (e.g., Holcombe, Wolery, Werts, & Hrenkevich 1993; Repp, 
Karsh, Johnson, & Van Laarhoven, 1994). 

Trials-to-criterion measures can also be collected and 
analyzed as a dependent variable throughout a study. For exam- 
ple, R. Baer (1987) recorded and graphed trials-to-criterion on 
a paired-associates memory task as a dependent variable in a 
study assessing the effects of caffeine on the behavior of pre- 
school children. 

Trials-to-criterion data can also be useful for assessing a 
learner’s increasing competence in acquiring a related class of 
concepts. For instance, teaching a concept such as the color red 
to a child could consist of presenting “red” and “not red” items to 
the child and differentially reinforcing correct responses. Trials- 
to-criterion data could consist of the number of “red” and “not 
red” exemplars required before the child achieves a specified 
level of performance with the discrimination. The same instruc- 
tional and data collection procedures could then be used in teach- 
ing other colors to the child. Data showing that the child achieves 
mastery of each newly introduced color in fewer instructional 
trials than required to learn previous colors might be evidence of 
the child’s increasing agility in learning color concepts. 


Definitional Measures 


In addition to the basic and derived dimensions already dis- 
cussed, behavior can also be defined and measured by its form 
and intensity. Neither form (i.e., topography) nor intensity (i.e., 
magnitude) of responding is a fundamental dimensional quantity 
of behavior, but each is an important parameter for defining and 
verifying the occurrence of many response classes. Behavior 
analysts measure the topography or magnitude of a response to 
determine whether the response represents an occurrence of the 
target behavior. Occurrences of the target behavior verified on 
the basis of topography or magnitude are then measured by one 
or more aspects of count, temporal extent, or temporal locus. 
In other words, measuring topography or magnitude is some- 
times necessary to determine whether instances of the targeted 


response class have occurred, but the subsequent quantification 
of those responses is recorded, reported, and analyzed in terms 
of the fundamental and derivative measures of count, rate, dura- 
tion, latency, IRT, percentage, and trials-to-criterion. 


Topography 


Topography, the physical form or shape of a behavior, is a 
measurable and malleable dimension of behavior. Topography 
is a measurable dimension of behavior because responses of 
varying form can be detected from one another. That topogra- 
phy is a malleable aspect of behavior is evidenced by the fact 
that responses of varying form are shaped and selected by their 
consequences. 

A group of responses with widely different topographies 
may serve the same function (i.e., form a response class). For 
example, each of the different ways of writing the word topog- 
raphy shown in Figure 4.3 would produce the same effect on 
most readers. Membership in some response classes, however, 
is limited to responses within a narrow range of topographies. 
Although each of the response topographies in Figure 4.3 would 
meet the functional requirements of most written communica- 
tions, none would meet the standards required of an advanced 
calligraphy student. 


7 OPOCRAPHY 


T opograahy 
Topography 
Legere sich, 
Topography 
Topography 


Figure 4.3 Topography, the physical form or shape of 
behavior, is a measurable dimension of behavior. 
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Topography is of obvious and primary importance in 
activities in which form, style, or artfulness of behavior is valued 
in its own right (e.g., painting, sculpting, dancing, gymnastics). 
Measuring and providing differential consequences for 
responses of varied topographies is also important when the 
functional outcomes of the behavior correlate highly with 
specific topographies. A student who sits with good posture 
and looks at the teacher is more likely to receive positive 
attention and opportunities to participate academically than is 
a student who slouches, with her head on the desk (Schwarz 
& Hawkins, 1970). Basketball players who execute foul shots 
with a certain form make a higher percentage of shots than 
when they shoot idiosyncratically (Kladopoulos & McComas, 
2001; see Figure 6.3). 

Trap, Milner-Davis, Joseph, and Cooper (1978) measured 
the topography of cursive handwriting by first-grade students. 
Plastic transparent overlays were used to detect deviations from 
model letters in lower- and uppercase letters written by the 
children (see Figure 4.4). The researchers counted the number 
of correct letter strokes—those that met all of the specified 
topographical criteria (e.g., all letter strokes contained within 
the 2-mm parameters of the overlay, connected, complete, 
sufficient length)—and used the percentage correct of all strokes 


ep 
oF 


€ i oa 
oe 


& 


Figure 4.4 Examples of outlines on a transparent overlay 
used to measure inside and outside boundaries of manu- 
script letters and an illustration of using the transparent 
overlay to measure the letter m. Because the vertical stroke 
of the letter m extends beyond the outline boundary, it did 
not meet the topographical criteria for a correct response. 
From “The Measurement of Manuscript Letter Strokes” by J. J. Helwig, 
J.C. Johns, J. E. Norman, J. O. Cooper, 1976. Journal of Applied Behavior 
Analysis, 9, p. 231. Copyright 1976 by the Society for the Experimental 
Analysis of Behavior, Inc. Used by permission. 
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written by each student to assess the effects of visual and verbal 
feedback and a certificate of achievement on the children’s 
acquisition of cursive handwriting skills. 


Magnitude 


Magnitude refers to the force or intensity of a response. The 
desired outcomes of some behaviors are contingent on respond- 
ing at or above (or below) a certain intensity or force. A screw- 
driver must be turned with sufficient force to insert or remove 
screws; a pencil must be applied to paper with enough force to 
leave legible marks. In contrast, applying too much torque to a 
misaligned screw or bolt is likely to strip the threads, and push- 
ing too hard with a pencil will break its point. 

Several studies have measured the magnitude of speech 
or other vocalizations that were considered too loud or too 
soft (Koegel & Frea, 1993). Schwarz and Hawkins (1970) 
measured the voice loudness of Karen, a sixth-grade girl who 
spoke so softly in class that her voice was usually inaudible to 
others. Karen’s voice was recorded on videotape during two 
class periods each day. (The videotape was also used to obtain 
data on two other behaviors: face touching and the amount of 
time Karen sat in a slouched position). The researchers then 
played the videotape into an audiotape recorder with a loudness 
indicator and counted the number of times the needle went 
above a specified level on the loudness meter. Schwarz and 
Hawkins used the number (proportion) of needle inflections 
per 100 words spoken by Karen as the primary measure for 
evaluating the effects of an intervention on increasing her voice 
volume during class. 

Edgerton and Wine (2017) used a digital tablet and app 
(Voice Meter Pro™) to measure the voice volume of a boy with 
autism who frequently spoke below conversational volume. The 
app displayed a thermometer gauge that rose and fell with the 
surrounding sound level and signaled changes in voice volume 
with different background colors, an animated figure, and text 
prompts (“Speak up, I can’t hear you!”, ““That’s better”, and 
“That’s too loud!””). The investigators calibrated the app to detect 
three levels of magnitude (too soft, conversational, and too loud) 
and recorded the number of times the boy’s responses to ques- 
tions were spoken at conversational volume. 

Greene, Bailey, and Barber (1981) measured the magni- 
tude of noisy disruptions by middle school students on a school 
bus with an automated sound-recording device. The recording 
device could be adjusted so that only sound levels above a pre- 
determined threshold activated it. The device automatically 
recorded both the number of times outbursts of sound exceeded 
a specified threshold (93 dB) and the total duration in seconds 
that the sound remained above that threshold. When the noise 
level exceeded the specified threshold, a light on a panel that all 
students could see was activated automatically. When the light 
was off, students listened to music during the bus ride; when the 
number of noisy disruptions was below a criterion, they partici- 
pated in a raffle for prizes. This intervention drastically reduced 
the outbursts and other problem behaviors as well. Greene and 
colleagues reported both the number and the mean duration per 
occurrence of noisy disruptions as measures of evaluating the 


intervention’s effects.° Table 4.1 summarizes the measurable 
dimensions of behavior and considerations for their use. 


METHODS FOR MEASURING BEHAVIOR 


Methods for measuring behavior used most often by applied 
behavior analysts involve one or a combination of the following: 
event recording, timing, and various time sampling methods. 


Event Recording 


Event recording encompasses a wide variety of procedures for 
detecting and recording the number of times a behavior of inter- 
est occurs. For example, Cuvo, Lerch, Leurquin, Gaffaney, and 
Poppen (1998) used event recording to measure the effects of 
work requirements and reinforcement schedules on the choice 
behavior of adults with intellectual disabilities and preschool 
children while they were engaged in age-appropriate tasks (e.g., 
adults sorting silverware, children tossing beanbags or jump- 
ing hurdles). The researchers recorded each piece of silverware 
sorted, each beanbag tossed, and each hurdle jumped. 

Event recording is also used to measure discrete trial 
behaviors, in which the count for each trial or opportunity to 
respond is either | or 0, representing the occurrence or nonoc- 
currence of the target behavior. Figure 4.5 shows a form used to 
record the occurrence of imitation responses by a preschooler 
with disabilities and his typically developing peer partner within 
a series of instructional trials embedded into ongoing classroom 
activities (Valk, 2003). For each trial, the observer recorded the 
occurrence of a correct response, no response, an approxima- 
tion, or an inappropriate response by the target child and the 
peer by circling or marking a slash through letters representing 
each behavior. The form also allowed the observer to record 
whether the teacher prompted or praised the target child’s imi- 
tative behavior. 


Considerations for Event Recording. Many applied behavior 
analysts in practice use non-automated event recording. 
It is easy to do. Most people can tally discrete behaviors 
accurately, often on the first attempt. If the response rate is 
not too high, non-automated event recording may not interfere 
with other activities. For example, a teacher can continue with 
instruction while tallying occurrences of the target behavior. 
Event recording provides useful data for most behaviors. 
However, each instance of the target behavior must have 
discrete beginning and ending points. Event recording is 
applicable for target behaviors such as students’ oral responses 
to questions, students’ written answers to math problems, 
and a parent praising a son or daughter’s behavior. Behaviors 
such as humming are hard to measure with event recording 
because an observer would have difficulty determining when 
one hum ends and another begins. Event recording is difficult 
for behaviors defined without specific discrete action or object 
relations, such as engagement with materials during free-play 
activity. Because engagement with materials does not present 
a specific discrete action or object relation, an observer may 
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Target 


Session date: May 21 Sessionno: 16 Observer: Jennie 
child: Jordan Peer: Ethan |OA day: CES) NO 
behavior: Place block on structure Condition: 5-sec time delay _ 


Target 
Code: 


A= ma dew: | = Inappropriate 


N = No response 


C = Correct 


No. cor 


Target behavior: Place sticker on paper 


& No. corrects by peer: ] 


KKK KKK KKK KKK KKK KKK KKK KKK KK KK KR EK KEK KKK KKKKKEKEEEEK 


rects by target child: 


Condition: 5-sec time delay 


Target child's behavior Teacher behavior Peer's behavior | Teacher 
toward aa a praise 


ee 

en tomes ee (NA cP 
| 3 | sc N @ i KPrompt> -< Praise> [© NA 1 | GPraised 

Figure 4.5 Data collection form for recording rrkhe Se ea D e> | Oana aee 

the behavior of two children and a teacher 3 ON romp cree (ON be 

during a series of discrete trials. : 2h Sh "throm KONA Td 

Adapted from The Effects of Embedded Instruction Within | 208) SCN aA) A 

the Context of a Small Group on the Acquisition of Imita- — & L© 

tion Skills of Young Children with Disabilities by J. E. Valk 10 © 

(2003), p. 167. Unpublished doctoral dissertation, The 

Ohio State University. Used by permission. No. corrects by target child:__4 _ No. corrects by peer: & 


have difficulty judging when one engagement starts and ends, 
and then another engagement begins. 

Another consideration with non-automated event record- 
ing is that the target behaviors should not occur at such high 
rates that an observer would have difficulty counting each dis- 
crete occurrence accurately. High-rate behaviors that may be 
difficult to measure with event recording include rapid talking, 
body rocks, and tapping objects. 

Also, event recording does not produce accurate measures 
for target behaviors that occur for extended time periods, such as 
staying on task, listening, playing quietly alone, being out of one’s 
seat, or thumb sucking. Task-oriented or continuous behaviors 
(e.g., being “on task”) are examples of target behaviors for which 
event recording would not be indicated. Classes of continuous 
behaviors occurring across time are usually not a prime concern 
of applied behavior analysts. For example, reading per se is of less 
concern than the number of words read correctly and incorrectly, 
or the number of reading comprehension questions answered 
correctly and incorrectly. Similarly, behaviors that demonstrate 
understanding are more important to measure than “listening 
behavior,” and the number of academic responses a student emits 
during an independent seatwork period is more important than 
being on task. 


Timing 


Behavior analysts use a variety of timing devices and procedures 
to measure duration, response latency, and IRT. 


Timing Duration 


Researchers often use semi-automated computer-driven systems 
for recording durations. Practitioners, however, will most likely 
use non-automated instruments for recording duration. The most 
precise non-automated instrument is a digital stopwatch. Prac- 
titioners can use wristwatches and wall clocks to measure dura- 
tion, but the measures obtained will be less precise than those 
obtained with a stopwatch. 

The procedure for recording the total duration of a tar- 
get behavior per session with a stopwatch is to (a) activate the 
stopwatch when the behavior starts and (b) stop the watch at 
the end of the episode. Then, without resetting the stopwatch, 
the observer starts the stopwatch again at the beginning of the 
second occurrence of the behavior and stops the watch at the end 
of the second episode. The observer continues to accumulate the 
durations of time in this fashion until the end of the observation 
period, and then transfers the total duration of time showing on 
the stopwatch to a data sheet. 


Timing Latency and IRT 


Procedures for measuring latency and IRT are similar to those 
used to measure duration. Measuring latency requires the precise 
detection and recording of the time that elapses from the onset 
of each occurrence of the antecedent stimulus event of interest 
to the onset of the target behavior. Measuring IRTs requires 
recording the precise time that elapses from the termination of 


each occurrence of the target behavior to the onset of the next 
response. 


Time Sampling 


Time sampling refers to a variety of methods for observing 
and recording behavior during intervals or at specific moments 
in time. The basic procedure involves dividing the observation 
period into time intervals and then recording the presence or 
absence of behavior within or at the end of each interval. 

Ethologists originally developed time sampling to study 
the behavior of animals in the field (Altmann, 1974). Because it 
was not possible or feasible to observe the animals continuously, 
these scientists arranged systematic schedules of relatively brief 
but frequent observation intervals. The measures obtained from 
these “samples” are considered representative of the behavior 
during the total time period from which they were collected. 
For example, much of our knowledge about the behavior of 
chimpanzees and other primates is based on data collected with 
time sampling observation methods by researchers such as Jane 
Goodall (1991). 

Applied behavior analysts use three forms of time sam- 
pling—whole-interval recording, partial-interval recording, and 
momentary time sampling.’ 


Whole-Interval Recording 


Whole-interval recording is often used to measure continuous 
behaviors (e.g., cooperative play) and behaviors that occur at 
such high rates that observers have difficulty distinguishing 
one response from another (e.g., rocking, humming) but can 
detect whether the behavior is occurring at any given time. The 
observation period for whole-interval recording is divided 
into a series of brief time intervals (e.g., 5 to 10 seconds). At 
the end of each interval, the observer records whether the tar- 
get behavior occurred throughout the interval. Data obtained 
with whole-interval recording usually underestimate the over- 
all percentage of the observation period in which the behav- 
ior actually occurred. The longer the observation intervals, 
the greater the degree to which whole-interval recording may 
underestimate the actual occurrence of the behavior. 

Data collected with whole-interval recording are reported 
as the percentage of total intervals in which the target behavior 
was recorded as occurring. Because whole-interval recording 
represents the proportion of the entire observation period that 
the person was engaged in the target behavior, whole-interval 
recording data yield an estimate of total duration. For example, 
assume a whole-interval observation period consisted of six 
10-second intervals (a 1-minute time frame). If the target behav- 
ior occurred for four of these 10-second whole intervals and did 
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not occur for the remaining two intervals, the whole-interval 
recording would yield a total duration estimate of 40 seconds. 

Figure 4.6 shows an example of a whole-interval record- 
ing form used to measure the on-task behavior of four students 
during academic seatwork time (Ludwig, 2004). Each minute 
was divided into four 10-second observation intervals; each 
observation interval was followed by 5 seconds in which the 
observer recorded the occurrence or nonoccurrence of target 
behavior during the preceding 10 seconds. The observer first 
watched Student | continuously for 10 seconds, and then she 
looked away during the next 5 seconds and recorded whether 
Student | had been on task throughout the previous 10 seconds 
by circling YES or NO on the recording form. After the 5-second 
interval for recording Student 1’s behavior, the observer looked 
up and watched Student 2 continuously for 10 seconds, after 
which she recorded Student 2’s behavior on the form. The same 
procedure for observing and recording was used for Students 
3 and 4. In this way, the on-task behavior of each student was 
observed and recorded for one 10-second interval per minute. 

Continuing the sequence of observation and recording 
intervals over a 30-minute observation period provided thirty 
10-second measures (i.e., samples) of each student’s on-task 
behavior. The data in Figure 4.6 show that the observer judged 
the four students to have been on task during Session 17 for 87%, 
93%, 60%, and 73% of the intervals, respectively. Although the 
data are intended to represent the level of each student’s behav- 
ior throughout the observation period, it is important to remem- 
ber that each student was observed for a total of only 5 of the 
observation period’s 30 minutes. 

Observers using any form of time sampling should always 
make a recording response of some sort in every interval. For exam- 
ple, an observer using a form such as the one in Figure 4.6 would 
record the occurrence or nonoccurrence of the target behavior in 
each interval by circling YES or NO. Leaving unmarked intervals 
increases the likelihood of losing one’s place on the recording form 
and marking the result of an observation in the wrong interval space. 

All time sampling methods require a timing device to sig- 
nal the beginning and end of each observation and recording 
interval. Observers using pencil, paper, clipboard, and timers 
for interval measurement will often attach a stopwatch to a clip- 
board. However, observing and recording behavior while having 
to look simultaneously at a stopwatch is likely to have a neg- 
ative impact on the accuracy of measurement. As an effective 
solution to this problem, the observer can listen by earphone to 
prerecorded audio cues signaling the observation and recording 
intervals. For example, observers using a whole-interval record- 
ing procedure like the one just described could listen to an audio 
recording with a sequence of prerecorded statements such as 
the following: “Observe Student 1”; 10 seconds later, “Record 
Student 1”; 5 seconds later, “Observe Student 2”; 10 seconds 
later, “Record Student 2”; and so on. 

Tactile prompting devices can also be used to signal 
observation intervals. For example, the Gentle Reminder (dan@ 
gentlereminder.com) and MotivAider (www.habitchange.com) 
are small timing instruments that vibrate at the time intervals 
programmed by the user. 
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Date: May 7_ 
Observer:(Robin )___ 


Experimental condition: 


Obs. start time: 9:42 


On-Task Recording Form 


Group no:__1 Session no: 17 


JOA session: ___Yes X_No 


Baseline On task 


Stop time: 10:12 
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| NO | 
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Figure 4.6 Observation form used for whole- 
interval recording of on-task behavior by four 
students during independent seatwork time. 
Adapted from Smiley Faces and Spinners: Effects of 
Self-Monitoring of Productivity with an Indiscriminable 
Contingency of Reinforcement on the On-Task Behavior 
and Academic Productivity by Kindergarteners During 
Independent Seatwork by R. L. Ludwig, 2004, p. 101. 
Unpublished master’s thesis, The Ohio State University. Used 
by permission. 


Partial-Interval Recording 


When using partial-interval recording, the observer records 
whether the behavior occurred at any time during the interval. 
Partial-interval time sampling is not concerned with how many 
times the behavior occurred during the interval or how long the 
behavior was present, just that it occurred at some point during 
the interval. If the target behavior occurs multiple times during 
the interval, it is still scored as occurring only once. An observer 
using partial-interval recording to measure a student’s disruptive 
behavior would mark an interval if disruptive behavior of any 
form meeting the target behavior definition occurred for any 
amount of time during the interval. That is, an interval would be 
scored as disruptive behavior even if the student were disruptive 
for only 1 second of a 6-second interval. Because of this, data 
obtained with partial-interval recording often overestimate the 
overall percentage of the observation period (i.e., total duration) 
that the behavior actually occurred. 

Partial-interval data, like whole-interval data, are most 
often reported as a percentage of total intervals in which the 
target behavior was scored. Partial-interval data are used to 


YES | NO | GST No TOES no TGS TCH] 
ioe Pee two ves CHO TES TN 


% \ntervals 
on task 


= On-task 


= Off-task 


represent the proportion of the entire observation period in 
which the target behavior occurred, but the results of partial-in- 
terval recording, unlike those of whole-interval recording, do 
not provide any information on duration per occurrence. That is 
because any instance of the target behavior, regardless of how 
brief its duration, will cause an interval to be scored. 

If the observer uses partial-interval recording with brief 
observation intervals to measure discrete responses of short 
duration per occurrence, the data obtained provide a crude esti- 
mate of the minimum rate of responding. For example, data 
showing that a behavior measured by partial-interval recording 
consisting of 6-second contiguous intervals (i.e., successive 
intervals are not separated by time in which the behavior is 
not observed) occurred in 50% of the total intervals indicate 
a minimum response rate of five responses per minute (on the 
average, at least one response occurred in 5 of the 10 intervals 
per minute). Although partial-interval recording often overes- 
timates the total duration, it is likely to underestimate the rate 
of a high-count behavior. This is because an interval in which 
a person made eight nonverbal sounds would be scored the 


Same as an interval in which the person made only one sound. 
When the evaluation and understanding of a target behavior 
require a more sensitive measure than interval recording, use 
rate of response. 

Because an observer using partial-interval recording 
needs to record only that a behavior has occurred at any point 
during each interval (compared to having to watch the behav- 
ior throughout the entire interval with whole-interval record- 
ing), it is possible to measure multiple behaviors concurrently. 
Figure 4.7 shows a portion of a form for measuring four response 
classes by three students, using partial-interval recording with 
20-second intervals. The observer watches Student 1 throughout 
the first 20-second interval, Student 2 for the next 20 seconds, 
and Student 3 for the next 20 seconds. Each student is observed 
for 20 seconds out of each minute of the observation period. 
If a student engages in any of the behaviors being measured at 
any time during an observation interval, the observer marks the 
letter(s) corresponding to those behaviors. If a student engages 
in none of the behaviors being measured during an interval, the 
observer marks N to indicate no occurrences of the target behav- 
iors. For example, during the first interval in which Student 1 
was observed, he said, “Pacific Ocean” (an academic response). 
During the first interval in which Student 2 was observed, she 
left her seat and threw a pencil (a behavior within the response 
class “other disruptive behavior”). Student 3 emitted none of 
the four target behaviors during the first interval that he was 
observed. 


Momentary Time Sampling 


An observer using momentary time sampling records whether 
the target behavior occurred at the moment the time interval 
ends. If conducting momentary time sampling with 1|-minute 
intervals, an observer would look at the person at the 1-minute 
mark of the observation period, determine immediately whether 
the target behavior was occurring, and indicate that decision on 
the recording form. One minute later (i.e., 2 minutes into the 
observation period), the observer would look again at the person 
and then score the presence or absence of the target behavior. 
This procedure would continue until the end of the observation 
period. 


1 2 
Student? |@ TS DNI@TSODNI@@DS DN/IAT S DW 


Student 2 


Student 3 


A TOON ADO? nla t sOn[Or son 


ATSD@ATSONI@TSONIAMODN 


Key: 

A = Academic response 

T =Talk-out 

S = Out-of-seat 

D = Other disruptive behavior 

N = No occurrences of target behaviors 
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As with interval recording methods, data from momentary 
time sampling are typically reported as percentages of the total 
intervals in which the behavior occurred and are used to estimate 
the proportion of the total observation period that the behavior 
occurred. 

A major advantage of momentary time sampling is that the 
observer does not have to attend continuously to measurement, 
whereas interval-recording methods demand the undivided 
attention of the observer. 

Because the person is observed for only a brief moment, 
much behavior will be missed with momentary time sampling. 
Momentary time sampling is used primarily to measure contin- 
uous activity behaviors such as engagement with a task or activ- 
ity, because such behaviors are easily identified. Momentary 
time sampling is not recommended for measuring low-count, 
short-duration behaviors. 

Numerous studies have compared measures obtained by 
momentary time sampling and interval recording using intervals of 
varying duration with measures obtained by continuous duration 
recording (e.g., Alvero, Struss, & Rappaport, 2007; Gunter, Venn, 
Patrick, Miller, & Kelly, 2003; Hanley, Cammilleri, Tiger, & 
Ingvarsson, 2007; Meany-Daboul, Roscoe, Bourret, & Ahearn, 
2007; Powell, Martindale, Kulp, Martindale, & Bauman, 1977; 
Saudargas & Zanolli 1990; Test & Heward, 1984). In general, 
this research has found that momentary time sampling both 
overestimates and underestimates continuous duration measures 
when intervals are greater than 2 minutes. With intervals up to 
1 minute, the data obtained using momentary time sampling 
produce data paths similar to those of continuous duration 
recording. 


Planned Activity Check 


A variation of momentary time sampling, planned activity 
check (PLACHECK), uses head counts to measure “group 
behavior.” A teacher using PLACHECK observes a group of 
students at the end of each time interval, counts the number 
of students engaged in the targeted activity, and records the 
tally with the total number of students in the group. Doke and 
Risley (1972) used data obtained by PLACHECK measure- 
ment to compare group participation in required and optional 


Figure 4.7 Portion of a form used for partial-interval 
recording of four response classes by three students. 
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before-school activities. Observers tallied the number of stu- 
dents in either the required or the optional activity area at the end 
of 3-minute intervals, and then the number of children actually 
participating in an activity in either area. They reported these 
data as separate percentages of children participating in required 
or optional activities. 

Dyer, Schwartz, and Luce (1984) used a variation of 
PLACHECK to measure the percentage of students with 
disabilities living in a residential facility who were engaged in 
age-appropriate and functional activities. As students entered the 
observation area, they were observed individually for as long as 
it took to determine the activity in which they were engaged. The 
students were observed in a predetermined order not exceeding 
10 seconds per student. 

In a study examining the effects of response cards on 
the disruptive behavior of third-graders during daily math 
lessons, Armendariz and Umbreit (1999) recorded at 1-minute 
intervals whether each student in the class was being disrup- 
tive. By combining the PLACHECK data obtained across all of 
the no-response cards (baseline) sessions, and graphing those 
results as the percentage of students who were disruptive at 
each 1-minute mark, and doing the same with the PLACHEK 
data from all response cards sessions, Armendariz and Umbreit 
created a clear and powerful picture of the differences in “group 
behavior” from the beginning to the end of a typical lesson in 
which response cards were or were not used. 

Other variations of the PLACHECK measurement can 
be found in the literature, though they usually are called time 
sampling or momentary time sampling. For example, McKenzie 
and colleagues have developed two PLACHECK-type observa- 
tional systems, SOPLAY and SOPARC, for measuring children’s 
physical activity in school and community play environments 
(McKenzie & Cohen, 2006; McKenzie, Marshall, Sallis, & 
Conway, 2000). “SOPLAY and SOPARC use a group momen- 
tary time-sampling format (i.e., serial observation ‘snapshots’ ) 
to record the physical activity level (i.e., sedentary, walking/ 
moderate, vigorous) in specific target areas. . . . Counting the 
number of people in an area is itself an important endeavor 
because it provides an indication of how well the space/ 
facility supports school or community physical activity goals” 
(McKenzie, 2016, p. 335, 336). 


Recognizing Discrepancies Among and Between Time 
Sampling Measures and Continuous Measurement 


As stated previously, all time sampling methods provide only an 
estimate of the actual occurrence of the behavior. Different time 
sampling procedures produce different results, which can influ- 
ence decisions and interpretations. A series of studies by Rapp 
and colleagues compared the sensitivity of momentary time sam- 
pling (MTS) and partial-interval recording (PIR) to detect small, 
moderate, and large behavior changes evident with continuous 
duration measures (Carroll, Rapp, Colby-Dirksen, & Linden- 
berg, 2009; Devine, Rapp, Testa, Henrickson, & Schnerch, 
2011; Rapp, Colby-Dirksen, Michalski, Carroll, & Lindenberg, 
2008). These researchers compared measures obtained by MTS 


and PIR of various interval sizes (10-sec, 20-sec, 30-sec, 1-min, 
and 2-min intervals) within varied observation periods (10-min, 
30-min, and 60-min sessions) with data provided by continuous 
measurement (duration or event recording). Rapp et al. (2008) 
found that MTS with intervals up to 30 seconds reliably detected 
most moderate and large changes in duration events, that MTS 
with 1-minute intervals detected most large changes in duration 
events, and that PIR with 10-second or larger intervals failed to 
detect a large percentage of changes in duration events. Given 
longer observation sessions, momentary time sampling with 
interval sizes up to 30 sec detected a wide range of changes in 
both duration and rate events. 

Devine and colleagues (2011) found that momentary time 
sampling with interval sizes up to 30 seconds detected a wide 
range of changes in duration and rate events during lengthier 
observation periods. These authors concluded that 


[P]ractitioners with limited resources may measure behav- 
ior changes with smaller interval sizes of PIR or MTS 
during shorter observation sessions or with a larger interval 
size of MTS during lengthier sessions with comparable 
sensitivity. For instance, 10-s MTS during 10-min sessions 
and 30-s MTS during 30-min sessions yielded comparable 
detection of changes in duration events. Similarly, both 
10-s PIR during 10-min sessions and 30-s MTS during 
30-min sessions were sensitive to changes in frequency 
events. In both cases, the number of observations required 
in the 10-min session is the same as the number of observa- 
tions required in the 30-min session. (pp. 120-121) 


Figure 4.8 illustrates just how different the results obtained 
by measuring the same behavior with different time sampling 
methods can be. The shaded bars indicate when the behavior 
was occurring within an observation period divided into 10 
contiguous intervals. The shaded bars reveal the three-dimen- 
sional quantities of behavior: repeatability (seven instances of 
the behavior), temporal extent (the duration of each response) 
and temporal locus (IRT time is depicted by the space between 
the shaded bars). 

Because the time sampling methods used in applied 
behavior analysis are most often viewed and interpreted as mea- 
sures of the proportion of the total observation period in which 
the behavior occurred, it is important to compare the results 
of time sampling methods with those obtained by continuous 
measurement of duration. Continuous measurement reveals that 
the behavior depicted in Figure 4.8 occurred 55% of the time 
during the observation period. When the same behavior during 
the same observation period was recorded using whole-inter- 
val recordings, the measure obtained grossly underestimated 
the actual occurrence of the behavior (i.e., 30% versus 55%), 
partial-interval recording grossly overestimated the actual occur- 
rence (i.e., 70% versus 55%), and momentary time sampling 
yielded a fairly close estimate of actual occurrence of the behay- 
ior (50% versus 55%). 

Although momentary time sampling resulted in a mea- 
sure that most closely approximated the actual behavior, it is 
not always the preferred method. Different distributions of the 
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Measurement Method 


mmm = Actual occurrence of behavior measured by continuous 
duration recording 
+ = Behavior recorded as occurring during the interval 
— = Behavior recorded as not occurring during the interval 


WI = whole-interval recording 
P| = partial-interval recording 
MTS = momentary time sampling 


behavior (i.e., temporal locus) during the observation period, 
even at the same overall count and duration as the session shown 
in Figure 4.8, would result in widely different outcomes from 
each of the three time sampling methods. 

Discrepancies between measures obtained by different 
measurement methods are usually described in terms of the rel- 
ative accuracy or inaccuracy of each method. However, accuracy 
is not the issue here. If the shaded bars in Figure 4.8 represent 
the true value of the behavior, then each of the time sampling 
methods was conducted with complete accuracy and the result- 
ing data are what should be obtained by applying each method. 
An example of the inaccurate use of one of the measurement 
methods would be if the observer using whole-interval recording 
had marked the behavior as occurring in Interval 2, when the 
behavior did not occur according to the rules of whole-interval 
recording. 

But if the behavior actually occurred for 55% of the 
observation period, what should we call the wrong and mis- 
leading measures of 30% and 70% if not inaccurate? In this 
case, the misleading data are artifacts of the measurement 
methods used to obtain them. An artifact is a phenomenon 
that appears to exist because of the way it is examined or mea- 
sured. The 30% measure obtained by whole-interval recording 
and the 70% measure obtained by partial-interval recording 
are artifacts of the way those measures were conducted. The 
fact that data obtained from whole-interval and partial-interval 
recording consistently underestimate and overestimate, respec- 
tively, actual occurrence of behavior as measured by continu- 
ous duration recording is an example of well-known artifacts. 

It is clear that interval measurement and momentary time 
sampling result in some variability in the data analysis, which 
must be considered carefully when interpreting results obtained 
with these measurement methods. Some common causes of 
measurement artifacts and how to avoid them are discussed in 


Chapter 5. 


Figure 4.8 Comparing measures of the same 
behavior obtained by three different time 
sampling methods with measure obtained by 
continuous duration recording. 


MEASURING BEHAVIOR 
BY PERMANENT PRODUCTS 


Behavior can be measured in real time by observing a person’s 
actions and recording responses of interest as they occur. For 
example, a teacher can tally the number of times a student 
raises her hand during a class discussion. Some behaviors can 
be measured in real time by recording their effects on the envi- 
ronment as those effects are produced. For example, a baseball 
batting coach advances a handheld counter each time a batter 
hits a pitch to the right-field side of second base. 

Some behaviors can be measured after they have 
occurred. A behavior that produces consistent effects on 
the environment can be measured after it has occurred if the 
effects, or products left behind by the behavior, remain unal- 
tered until measurement is conducted. For example, if the 
flight of the baseballs hit by a batter during a practice ses- 
sion were not impeded, and the balls were left lying on the 
ground, a hitting instructor could collect data on the batter’s 
performance after the batter’s turn was completed by counting 
each ball found lying in fair territory on the right-field side of 
second base. 

Measuring behavior after it has occurred by observing the 
effects the behavior produced on the environment is known as 
measurement by permanent product. A permanent product is 
a change in the environment produced by a behavior that lasts 
long enough for measurement to take place. 

Although often described erroneously as a method for 
measuring behavior, measurement by permanent product does 
not refer to any particular measurement procedure or method. 
Instead, measurement by permanent product refers to the time 
of measurement (i.e., after the behavior has occurred) and the 
medium (i.e., the effect of the behavior, not the behavior itself) 
by which the measurer comes in contact with (i.e., observes) the 
behavior. All of the methods for measuring behavior described 
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in this chapter—event recording, timing, and time sampling— 
can be used to measure permanent products. 

Permanent products can be natural or contrived outcomes 
of the behavior. Permanent products are natural and important 
outcomes of a wide range of socially significant behaviors in 
educational, vocational, domestic, and community environments. 
Examples in education include compositions written (Dorow & 
Boyle, 1998), computations of math problems written (Skinner, 
Fletcher, Wildmon, & Belfiore, 1996), spelling words written 
(McGuffin, Martz, & Heron, 1997), worksheets completed 
(Alber, Heward, & Hippler, 1999), homework assignments 
turned in (Alber, Nelson, & Brennan, 2002), and test questions 
answered (e.g., Gardner, Heward, & Grossi, 1994). Behav- 
iors such as mopping floors and dishwashing (Grossi & Heward, 
1998), incontinence (Adkins & Matthews, 1997), drawing bath- 
room graffiti (Mueller, Moore, Doggett, & Tingstrom, 2000), 
recycling (Brothers, Krantz, & McClannahan, 1994), stealing 
food (Maglieri, DeLeon, Rodriguez-Catter, & Sevin, 2000), and 
picking up litter (Powers, Osborne, & Anderson, 1973) can also 
be measured by the natural and important changes they make on 
the environment. Carbon monoxide (CO) levels in expired breath 
rise with increasing cigarette consumption and decline with 
abstention. Romanowich and Lamb’s (2015) smoking abstinence 
study measured CO in exhaled breath as a permanent product 
measure. Participants took a deep breath, held it for 20 seconds, 
and then exhaled over 20 seconds into the mouthpiece of a Vita- 
lograph™ CO monitor. 

Many socially significant behaviors have no direct 
effects on the physical environment. Reading orally, sit- 
ting with good posture, and repetitive hand flapping leave 
no natural products in typical environments. Nevertheless, 
measurement of such behaviors can often be accomplished 
using contrived permanent products. For example, by audio 
recording students as they read out loud (Eckert, Ardoin, 
Daly, & Martens, 2002), video recording a girl sitting in class 
(Schwarz & Hawkins, 1970), and video recording a boy flap- 
ping his hands (Ahearn, Clark, Gardenier, Chung, & Dube, 
2003), researchers obtained contrived permanent products for 
measuring these behaviors. 

Contrived permanent products are sometimes useful in 
measuring behaviors that have natural permanent products that 
are only temporary. For example, Goetz and Baer (1973) mea- 
sured variations in the form of children’s block building from 
photographs taken of the constructions made by the children, 
and Twohig and Woods (2001) measured the length of finger- 
nails from photographs of nail-biters’ hands. 


Advantages of Measuring Permanent 
Products 


Frees Practitioner for Other Tasks. Not having to observe and 
record behavior as it occurs enables the practitioner to attend to 
other tasks during the observation period. For example, a teacher 
who audio records students’ questions, comments, and talk-outs 
during a class discussion can concentrate on what her students 
are Saying, provide individual help, and so on. 


Enables Measurement of Some Behaviors That Occur 
at Inconvenient or Inaccessible Times and Places. Many 
socially significant behaviors occur at times and places that are 
inconvenient or inaccessible to the researcher or practitioner. 
Measuring permanent products is indicated when observing 
the target behavior as it occurs would be difficult because the 
behavior occurs infrequently, in various environments, or for 
extended periods. For example, a music teacher can have a gui- 
tar student make audio recordings of portions of daily practice 
sessions at home. 


Measurement May Be More Accurate, Complete, and 
Continuous. Although measuring behavior as it occurs pro- 
vides the most immediate access to data, it does not necessar- 
ily yield the most accurate, complete, and representative data. 
An observer measuring behavior from permanent products can 
take his time, rescore the worksheet, or listen to and watch the 
video recording again. Video recording enables the observer 
to slow down, pause, and repeat portions of the session—to 
literally “hold still” the behavior so it can be examined and 
measured repeatedly if necessary. The observer might see or 
hear additional nuances and aspects of the behavior, or other 
behaviors that he overlooked or missed altogether during live 
performances. 

Video- or audio-recording behavior enables continuous 
measurement of all instances of a target behavior. Having this 
permanent product of all instances lends itself to later scoring 
by using the built-in calibrated digital timer (e.g., on a VCR 
or digital recorder), set to zero seconds (or the first frame) at 
the beginning of the session, to note the exact time of behav- 
ior onset and offset. Further, software programs facilitate data 
collection and analysis based on exact timings. PROCODER is 
a software system for facilitating the collection and analysis of 
video-recorded behavior. According to Miltenberger, Rapp, and 
Long (1999), “With a recording of the exact time of the onset 
and offset of the target behavior in the observation session, we 
are able to report the frequency (or rate) or the duration of the 
behavior” (p. 119).8 


Measurement by permanent product enables data collection 
on more participants. An observer can view a video recording 
once and measure one participant’s behavior, then replay the 
recording and measure a second participant’s behavior. 


Facilitates Data Collection for Interobserver Agreement and 
Treatment Integrity. Video- or audio recording assists with 
data collection tasks such as obtaining interobserver agree- 
ment data (see Chapter 5) and assessing treatment integrity 
(Chapter 10). Permanent products make possible the repeated 
measurement of behavior, eliminating the need to bring multiple 
observers into the research or treatment setting. 


Enables Measurement of Complex Behaviors and Multiple 
Response Classes. Permanent products, especially video record- 
ings of behavior, enable measurement of complex behaviors and 


multiple response classes in busy social environments. Schwarz 
and Hawkins (1970) obtained measures of the posture, voice vol- 
ume, and face touching of an elementary student from videotapes 
taken during two class periods. The three behaviors were targeted 
as the result of operationalizing the girl’s “poor self-esteem.” The 
researchers were able to view the tapes repeatedly and score them 
for different behaviors. In this study, the girl also watched and eval- 
uated her behavior on the videotapes as part of the intervention. 
Figure 4.9 shows an example of a recording form used 
by Silvestri (2004) to measure three types of statements by 
teachers—generic positive, specific positive, and negative— 
from audiotapes of classroom lessons. (See Figure 3.11 for 
definitions of these behaviors.) The movement, multiple 
voices, and general commotion that characterize any class- 
room would combine to make it very difficult, if not impos- 
sible, for a live observer to consistently detect and accurately 


Participant: tl Date of session: 4/23 
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record these behaviors. Each teacher participant in the study 
wore a small wireless microphone that transmitted a signal to 
a receiver connected to a cassette tape recorder. 


Determining Whether Measurement by 
Permanent Product Is Appropriate 


The advantages of measurement by permanent product are 
considerable, and it may seem as though permanent product 
measurement is always preferable to real-time measurement. 
Answering four questions will help practitioners and researchers 
determine whether measurement by permanent product is 
appropriate: Is real-time measurement needed? Can the behavior 
be measured by permanent product? Will obtaining a contrived 
permanent product unduly affect the behavior? How much will 
it cost? 


Exp. condition: Self-scoring generalization 


Observer: _ Susan Date of observation 4/23 Duration of observation:_15:00 Decimal: 15.0 
Transcribe positive and negative statements, corresponding time indexes, and time indexes of repeated statements in the boxes below. 
Generic Positive Time | Time index Specific Positive Time | Time index Negative Time Time index 
(transcribe first index | of repeats (transcribe first index | of repeats (transcribe first index | of repeats 
instance) instance) instance) 
| 1:37) 37 I like how you gh | 
; ok 4: mee 4:15 | helped her. ass 
Bere nets 7:45 | 9:11) Thank you for i ee eae 
1022 1034] not talking, = WE 
Good—nicebig | 7 
ea lowe | Sots] word rss] J prima 
Peautiful — |22lt309[[Youraisedyour | 337 | 433[ 
[hana beautitujob. | [TT 
Good job, that's ca boa cat 
anew word. 
geod ai | 650 Thankyoufor | 456 
9:25 | paying attention. 
:06 | 11:59 | Thank you for 
vat | Ann not writing. 
nites aes hee 
High five 8:00 -—}— pons =i eal 
- ome 
Count: 28 % repeats: 63%] Count: 13 | %repeatt 46% O % repeats: 
No. permin. 1.9 No. permin. O.9 : 
9, 
All Positives: Count: 41 No.permin. 2.7  % repeats Hs 


Figure 4.9 Data collection form for recording count and temporal locus of three classes of ene statements on vid- 


eo-recorded sessions. 


From The Effects of Self-Scoring on Teachers’ Positive Statements During Classroom Instruction by S. M. Silvestri (2004), p. 124. Unpublished doctoral 


dissertation, The Ohio State University. Used by permission. 
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ls Real-Time Measurement Needed? 


A defining feature of applied behavior analysis and one of its 
major strengths is using data-based decisions concerning treat- 
ment procedures and experimental conditions. Data-based deci- 
sion making requires more than direct and frequent measurement 
of behavior; it also requires ongoing and timely access to the 
data those measures provide. Measuring behavior as it occurs 
provides the most immediate access to the data. Although real- 
time measurement by permanent product can be conducted in 
some situations (e.g., counting each batted ball that lands to the 
right of second base as a hitter takes batting practice), measure- 
ment by permanent product will most often be conducted after 
the instructional or experimental session has ended. 

Measures taken from video or audio recordings cannot 
be obtained until the recordings are viewed after a session has 
ended. If treatment decisions are made on a session-by-session 
basis, this behavior-to-measurement delay poses no problem 
as long as the data can be obtained from the recordings prior to 
the next session. However, when moment-to-moment treatment 
decisions must be made according to the participant’s behavior 
during the session, real-time measurement is necessary. 
Consider a behavior analyst trying to reduce the rate of a 
person’s self-injurious behavior (SIB) by providing access 
to a preferred stimulus contingent on increasing durations of 
time without SIB. Accurate implementation of the treatment 
protocol would require the real-time measurement of IRTs. 


Can the Behavior Be Measured 
by Permanent Product? 


Not all behaviors are suitable for measurement by permanent 
product. Some behaviors affect fairly permanent changes in 
the environment that are not reliable for the purposes of mea- 
surement. For example, SIB often produces long-lasting effects 
(bruises, welts, and even torn and bleeding skin) that could be 
measured after the occurrence of the behavior. But accurate 
measures of SIB could not be obtained by examining the cli- 
ent’s body on a regular basis. The presence of discolored skin, 
abrasions, and other such marks would indicate that the person 
had been injured, but many important questions would be left 
unanswered. How many times did SIB occur? Were there acts 
of SIB that did not leave observable marks on the skin? Was 
each instance of tissue damage the result of SIB? These are 
important questions for evaluating the effects of any treatment. 
Yet, they could not be answered with certainty because these 
permanent products are not precise enough for the measurement 
of the SIB. Behaviors suitable for measurement with permanent 
product must meet two rules. 


Rule 1: Each occurrence of the target behavior must pro- 
duce the same permanent product. The permanent product 
must be a result of every instance of the behavior measured. 
All topographical variants of the target behavior and all 
responses of varying magnitude that meet the definition of 
target behavior must produce the same permanent product. 


Measuring an employee’s work productivity by counting the 
number of correctly assembled widgets in his “completed 
work” bin conforms to this rule. An occurrence of the target 
behavior in this case is defined functionally as a correctly 
assembled widget. Measuring SIB by marks on the skin does 
not meet Rule 1 because some SIB responses will not leave 
discernible marks. 


Rule 2: The permanent product can be produced only by the 
target behavior. This rule requires that the permanent product 
cannot result from (a) any behaviors by the participant other than 
the target behavior or (b) the behavior of any person other than 
the participant. Using the number of correctly assembled objects 
in the employee’s “completed work” bin to measure his produc- 
tivity conforms to Rule 2 if the observer can be assured that (a) 
the employee put no objects in his bin that he did not assemble 
and (b) none of the assembled objects in the employee’s bin were 
put there by anyone other than the employee. Using marks on the 
skin as a permanent product for measuring self-injurious behav- 
ior also fails to meet Rule 2. Marks on the skin could be produced 
by other behaviors by the person (e.g., running too fast, which 
caused him to trip and hit his head; stepping in poison ivy) or 
by the behavior of other people (e.g., struck by another person). 


Will Obtaining a Contrived Permanent Product 
Unduly Affect the Behavior? 


Practitioners and researchers should always consider 
reactivity—the effects of the measurement procedure on the 
behavior being measured. Reactivity is most likely when 
observation and measurement procedures are obtrusive. 
Obtrusive measurement alters the environment, which may in 
turn affect the behavior being measured. Permanent products 
obtained using recording equipment, the presence of which 
may cause a person to behave differently, are called con- 
trived permanent products. For example, recording conver- 
sations might encourage the participant to talk less or more. 
But it should be recognized that reactivity to the presence 
of human observers is a common phenomenon, and reactive 
effects are usually temporary (e.g., Haynes & Horn, 1982; 
Kazdin, 1982, 2001). Even so, it is appropriate to antici- 
pate the influence that equipment may have on the target 
behavior. 


How Much Will It Cost to Obtain and: Measure the 
Permanent Product? 


The final issues to address in determining the appropriateness of 
measuring a target behavior by permanent product are availabil- 
ity, cost, and effort. If recording equipment is required to obtain 
a contrived product of the behavior, is it currently available? If 
not, how much will it cost to buy or rent the equipment? How 
much time will it take to learn to use the equipment initially? 
How difficult and time-consuming will it be to set up, store, 
and use the equipment for the duration of the study or behavior 
change program? 


MEASUREMENT TOOLS 
Low-Tech 


The low-tech measurement tools described here have a 
long history in applied behavior analysis. Researchers and 
practitioners will continue to use these “old school” tools 


described here because they are functional, easy to use, and 
inexpensive. 


* Pencil-and-paper data sheets. Observers record occur- 
rences and nonoccurrences of behavior on data sheets 
such as those shown in Figures 4.5, 4.6, 4.7, and 4.9. 

* Wrist counters. Wrist counters are useful for tally- 
ing behavior. Most wrist counters record from 0 to 99 
responses. These counters can be purchased from sporting 
goods stores or large department stores. 

* Hand-tally digital counters. Similar to wrist counters, 
hand-tally digital counters are frequently used in gro- 
cery chain stores, cafeterias, military mess halls, and 
tollgates to tally the number of people served. These 
mechanical counters are available in single or multiple 
channels and fit comfortably in the palm of the hand. 
With practice, applied behavior analysts can operate the 
multiple-channel counters rapidly and reliably with just 
one hand. Digital counters can be obtained from office 
supply stores. 

¢ Abacus wrist and shoestring counters. Landry and 
McGreevy (1984) described two types of abacus counters 
for measuring behavior. The abacus wrist counter is made 
from pipe cleaners and beads attached to a leather wrist- 
band to form an abacus with rows designated as ones and 
tens. An observer can tally from 1 to 99 occurrences of 
behavior by sliding the beads in abacus fashion. Responses 
are tallied in the same way on an abacus shoestring counter 
except the beads slide on shoestrings attached to a key ring, 
which is attached to the observer’s belt, belt loop, or some 
other piece of clothing, such as a buttonhole. 

¢ Masking tape. Practitioners tally behavioral events on 
strips of masking tape attached to their wrists, clothing, 
or desk. 

¢ Pennies, buttons, paper clips. Observer moves one item from 
one pocket to another each time the target behavior occurs. 


High-Tech 


Automated measurement of subjects’ responses has been 
standard practice in the experimental analysis of behavior 
(EAB) since Skinner’s invention of the cumulative recorder 
(Lattal, 2004; Sakagami & Lattal 2016). In today’s EAB 
laboratory, the electro-mechanical cumulative recorder has 
been replaced by computer-based measurement. Researchers 
can program sophisticated hardware and software systems 
to continuously measure rate, duration, latency, IRTs, and/ 
or magnitude on multiple response classes throughout 
experimental sessions. 
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Computer-assisted measurement and data analysis tools 
have become increasingly sophisticated and useful for applied 
behavior analysts. Developers have produced data collec- 
tion and analysis software for observational measurement 
using laptops (Bullock, Fisher, & Hagopian, 2017), mobile 
devices such as tablets and smartphones (Operant Systems 
Inc, 2017), personal digital assistants (PDAs) (Fogel, Milten- 
berger, Graves, & Koehler, 2010), and wearables that auto- 
matically record movement such as steps taken and distance 
traveled (Hayes & Van Camp, 2015; Valbuena, Miltenberger, 
& Solley, 2015). 

Practitioners using apps designed for data collection 
during treatment sessions tap icons on a touchscreen repre- 
senting response classes, prompt levels, and consequences 
(e.g., Richard, Taylor, DeQuinzio, & Katz, 2010; Sleeper 
et al., 2017). At session’s end, practitioners can add notes 
and tap an icon to produce graphs immediately available for 
inspection. 

Developments in microchip technology have advanced 
the measurement and data analysis capabilities of these sys- 
tems, and have made the software increasingly easier to learn 
and apply. Observers using some of these computerized tools 
can record multiple frequency- and duration-based behaviors, 
including discrete trials, the number of responses per unit of 
time, duration, latency, interresponse time (IRT), and fixed and 
variable intervals for time sampling measurement. These sys- 
tems can present data calculated as rate, duration, latency, IRT, 
percentage of intervals, percentage of trials, and conditional 
probabilities (e.g., BDataPro [Bullock, Fisher, and Hagopian, 
2017]; MOOSES [Tapp, Wehby, & Ellis, 1995]; BEST— 
Behavioral Evaluation Strategy and Taxonomy—Sidenar, 
Shabani, & Carr, 2004).’ 

Computer-based (automated) observation and measure- 
ment systems have distinct advantages over non-automated mea- 
surement when working with aggregates of rates of response, 
time-series analyses, conditional probabilities, sequential depen- 
dencies, interrelationships, and combinations of events. Auto- 
mated measurement facilitates the clustering and analyzing of 
complex data sets. Because these systems allow for the simul- 
taneous recording of multiple behaviors across multiple dimen- 
sions, outputs can be examined and analyzed from perspectives 
that would be difficult and time-consuming with paper-and- 
pencil methods. 

In addition to recording and calculating data, some 
systems produce graphs (e.g., Sleeper et al., 2017) and anal- 
yses of interobserver agreement (IOA) (e.g., smaller/larger, 
overall, occurrence, nonoccurrence) and measurement from 
audio and video documents. The automated computer-driven 
systems, as compared to the commonly used paper-and-pen- 
cil data-recording and analysis methods, have the potential 
to improve IRAs, the reliability of observational measure- 
ments, and the efficiency of data calculation (Kahng & 
Iwata, 1998). Automated measurement has the capacity to 
eliminate many human observer errors (e.g., observer drift, 
expectancy bias). 
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We believe continued advances in digital measurement 
tools will further enrich research and practice in applied 
behavior analysis. Some cautions, however, are associated 
with automated measurement of behavior. The equipment must 
be monitored because machines can break, and programmers 
can make mistakes. Kahng, Ingvarsson, Quigg, Seckinger, 
and Teichman (2011) caution applied behavior analysts’ use 
of automated measurement. “Computer programs of this 
kind should be used only if they provide improvements in 
data collection that are of sufficient importance considering 
the behavior of interest and the ultimate goals of data 
collection.” (p. 117). 


SELECTING A MEASUREMENT METHOD 


Although it may go without saying that the measurement 
method must fit the target behavior dimension one wants to 
measure (e.g., event recording for rate, timing for duration and 
latency), selecting the most appropriate measurement method 
for a given situation is hardly as simple as that. Researchers 
and practitioners must also consider the behavior change goals 
and expected direction of behavior change, the relative ease of 
detecting occurrences of the behavior, the environments where 
and times when the behavior will be measured, and the avail- 
ability and skills of personnel who will observe and record the 


Is the problem 


behavior observable? 


Is the problem behavior 
discrete and countable? 


Are observer resources 
sufficient to count each 
instance of problem 
behavior? 


sufficient to 
continuously monitor 
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Are observer resources 
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produce a measurable, physical 
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Start brain storming with 
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Figure 4.10 Decision-making model for selecting measurement procedures for problem behavior. Note: Asterisks denote 


measurement procedures that also generate count data. 


From “A Proposed Model for Selecting Measurement Procedures for the Assessment and Treatment of Problem Behavior” by L. A. LeBlanc, P. B. Raetz 
T. P. Sellers, and J. E. Carr, 2016, Behavior Analysis in Practice, 9(1), p. 79. Copyright 2016 by the Association for Behavior Analysis International. Used 


by permission. 
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behavior (Fiske & Delmolino, 2012; Gast 2014; LeBlanc, Raetz, 
Sellers, & Carr, 2016). We believe the decision-making model 
selecting measurement methods proposed by LeBlanc and col- 
leagues (2016) provides useful guidance to practitioners (see 
Figure 4.10). 

Practitioners should balance the resources required to 
observe and measure the target behavior with the planned uses 
of the data. The old adage “Just because you can, doesn’t mean 
you should” can be a useful guide when designing behavioral 
measurement systems. Recording every response emitted by a 
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SUMMARY 


Definition and Functions of Measurement 
in Applied Behavior Analysis 


a ees 


1 


Measurement is the process of applying quantitative labels to 
observed properties of events using a standard set of rules. 


2. Measurement is how scientists operationalize empiricism. 


3. Without measurement, all three levels of scientific 
knowledge—description, prediction, and control—would 
be relegated to guesswork and subjective opinions. 


4. Applied behavior analysts measure behavior to obtain 
answers to questions about the existence and nature of 
functional relations between socially significant behavior 
and environmental variables. 


5. Practitioners measure behavior before and after treatment 
to evaluate the overall effects of interventions (summative 
evaluation) and frequent measures of behavior during 
treatment (formative assessment) to guide decisions con- 
cerning the continuation, modification, or termination of 
treatment. 


6. Without frequent measures of the behavior targeted for 
intervention, practitioners may (a) continue an ineffec- 
tive treatment when no real behavior change occurred or 
(b) discontinue an effective treatment because subjective 
judgment detects no improvement. 


7. Measurement also helps practitioners optimize their 
effectiveness; verify the legitimacy of practices touted as 
“evidence based”; identify treatments based on pseudosci- 
ence, fad, fashion, or ideology; be accountable to clients, 
consumers, employers, and society; and achieve ethical 
standards. 


Measurable Dimensions of Behavior 


8. Because behavior occurs within and across time, it has 
three dimensional quantities: repeatability (i.e., count), 
temporal extent (i.e., duration), and temporal locus 
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client and every therapist-delivered prompt and consequence is 
not always necessary (Lerman, Dittlinger, Fentress, & Lanagan, 
2011; Najdowski et al., 2009). Collect the type and number of 
behavioral measures needed to make timely, data-based deci- 
sions about treatment effects. Measurement that does not provide 
a decision maker with close, continuous contact with relevant 
outcome data may be superfluous (cf. Bushell & Baer, 1994). 
Additional considerations and issues with respect to 


designing behavioral measurement systems are discussed in 
Chapter 5. 


(i.e., when behavior occurs). These properties, alone and 
in combination, provide the basic and derivative measures 
used by applied behavior analysts. 


9. Count is a simple tally of the number of occurrences of a 
behavior. 


10. Rate is a ratio of count per observation period; it is 
expressed as count per standard unit of time. 


11. Celeration is a measure of the change (acceleration or 
deceleration) in rate of responding per unit of time. 


12. Duration is the amount of time from the onset to the end 
point of a behavior. 


13. Response latency is a measure of the elapsed time between 
the onset of a stimulus and the initiation of a subsequent 
response. 


14. Interresponse time (IRT) is the amount of time that elapses 
between two consecutive instances of a response class. 


15. Percentage, a ratio formed by combining the same dimen- 
sional quantities, expresses the proportional quantity 
of an event in terms of the number of times the event 
occurred per 100 opportunities that the event could have 
occurred. 


16. Trials-to-criterion is a measure of the number of response 
opportunities needed to achieve a predetermined level of 
performance. 


17. Although form (i.e., topography) and intensity of respond- 
ing (i.e., magnitude) are not fundamental dimensional 
quantities of behavior, they are important quantitative 
parameters for defining and verifying the occurrence of 
many response classes. 


18. Topography refers to the physical form or shape of a 
behavior. 


19. Magnitude refers to the force or intensity with which a 
response is emitted. 
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Methods for Measuring Behavior 


20. 


2A 


22. 


23. 


24, 


25. 


Event recording encompasses a wide variety of procedures 
for detecting and recording the number of times a behavior 
of interest is observed. 


Time sampling refers to a variety of methods for observ- 
ing and recording behavior during intervals or at specific 
moments in time. 


Observers using whole-interval recording divide the obser- 
vation period into a series of equal time intervals. At the 
end of each interval, they record whether the target behav- 
ior occurred throughout the entire interval. 


Observers using partial-interval recording divide the 
observation period into a series of equal time intervals. 
At the end of each interval, they record whether behavior 
occurred at any point during the interval. 


Observers using momentary time sampling divide the 
observation period into a series of time intervals. At the 
end of each interval, they record whether the target behav- 
ior is occurring at that specific moment. 


Planned activity check (PLACHECK) is a variation of 
momentary time sampling in which the observer records 
whether each individual in a group is engaged in the target 
behavior. 


Measuring Behavior by Permanent Products 


26. 


27. 


28. 


KEY TERMS 


Measuring behavior after it has occurred by measuring its 
effects on the environment is known as measurement by 
permanent product. 


Measurement of many behaviors can be accomplished 
with contrived permanent products. 


Measurement by permanent product offers numerous 
advantages: The practitioner is free to do other tasks; 
it enables the measurement of behaviors that occur at 
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artifact latency 
celeration magnitude 
count measurement 


discrete trial 
duration 

event recording 
free operant 
frequency 


interresponse time (IRT) 


percentage 


rate 


29. 


30. 


inconvenient or inaccessible times and places; measure- 
ment may be more accurate, complete, and continuous; it 
facilitates the collection of interobserver agreement and 
treatment integrity data; and it enables the measurement of 
complex behaviors and multiple response classes. 


If moment-to-moment treatment decisions must be made 
during the session, measurement by permanent product 
may not be warranted. 


Behaviors suitable for measurement with permanent prod- 
ucts must meet two rules. Rule 1: Each occurrence of the 
target behavior must produce the same permanent product. 
Rule 2: The permanent product can be produced only by 
the target behavior. 


Measurement Tools 


31. 


32. 


33. 


Low tech tools are functional, easy to use, inexpensive, 
and effective for research, education, and treatment. 


High-tech, digital and computer hardware and software 
systems for behavioral measurement and data analysis 
have become increasingly sophisticated and easier to use. 


Developers have produced data collection and analysis 
software for observational measurement using laptops, 
handheld computers, personal digital assistants (PDAs), 
and desktop computers. These measurement tools provide 
the researcher and practitioner with advantages of practical 
value, efficiency, and ease of use. 


Selecting a Measurement Method 


34. 
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measurement by permanent product 
momentary time sampling 


partial-interval recording 


When selecting a measurement method, consider the 
behavior change goals and expected direction of behav- 
ior change, the relative ease of detecting occurrences of 
the behavior, the environments where and times when 
the behavior will be measured, and the availability and 
skills of personnel who will observe and record the 
behavior. 


repeatability 
temporal extent 
temporal locus 
time sampling 
topography 
trials-to-criterion 


whole-interval recording 


planned activity check (PLACHECK) 
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MULTIPLE-CHOICE QUESTIONS 


1. In order to be understood by everyone, measurement units 


must be: 

a. Analytic 

b. Subjective 

c. Labeled 

d. Unambiguous 


Hint: (See “Definition and Functions of Measurement in 
Applied Behavior Analysis”’) 


. Practitioners need measurement in order to do all of the 


following except 

a. Operationalize empiricism 

b. Identify current behavior levels 
c. Evaluate treatment 

d. Make accurate decisions 


Hint: (See “Definition and Functions of Measurement in 
Applied Behavior Analysis”’) 


. The dimensional quantity of duration is described as: 


a. Temporal locus 

b. Repeatability 

c. Temporal extent 

d. Count 

Hint: (See “Measurable Dimensions of Behavior’) 
is obtained by combining observation 


time with a tally of the number of occurrences of 
behavior. 


a. Response latency 
b. Rate 

c. Celeration 

d. Whole interval 


Hint: (See “Measurable Dimensions of Behavior’) 


. All of the following are procedures for measuring behav- 


ior except 

a. Event recording 

b. Timing 

c. Time sampling 

d. Observation 

Hint: (See “Measurement Tools”) 


. Devices such as wrist counters, masking tape, buttons, 


and calculators are all useful when using 
recording procedures. 


a. Event 

b. Timing 

c. Rate 

d. Duration 

Hint: (See “Measurement Tools”’) 
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The procedure that measures behavior after it has occurred 
by measuring its effect on the environment is called: 


a. Measurement artifact 
b. Planned activity check 
c. Permanent product 

d. Time sampling 


Hint: (See “Measuring Behavior by Permanent Products’’) 


. Which of the following is a reason not to use permanent 


product measurement? 

a. Moment-to-moment decisions must be made 

b. The behavior has a direct effect on the environment 

c. The product can only be produced by the target behavior 
d. Each behavior occurrence results in the same product 


Hint: (See “Measuring Behavior by Permanent Products’’) 


. One advantage of using computer-based observation and 


measurement systems is 

a. Many practitioners are familiar with it 

b. Data can be clustered and analyzed easily 

c. It is less expensive 

d. It is less intrusive than traditional methods 

Hint: (See “Measurement Tools’’) 

Computer-assisted measurement of behavior allows the 
observer to do all of the following except: 

a. Collect permanent products 

b. Record multiple behaviors simultaneously 

c. Aggregate data easily 

d. Analyze data from different perspectives 

Hint: (See “Measurement Tools”’) 

You are preparing to observe Jennifer, a second-grade 
student who has difficulty following teacher direction 
and frequently wanders around the classroom. The target 


behavior for Jennifer is being out of her seat. What is the 
most appropriate method for measuring this target behavior? 


a. Event recording 

b. Interresponse time 

c. Duration 

d. Planned activity check 

Hint: (See “Measurement Tools”) 

Marco is learning to assemble computer components as part 
of his new job at a factory. Speed and accuracy are critical 


skills for him to learn in assembling his component. What is 
the most appropriate method for measuring his behavior? 


a. Whole interval recording 
b. Duration 

c. Permanent product 

d. Response latency 


Hint: (See “Measurement Tools”) 
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ESSAY-TYPE QUESTIONS 


Ls 


NOTES 


ile 
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Describe one of the functions of measurement for practi- 
tioners of applied behavior analysis. 


Hint: (See “Definition and Functions of Measurement in 
Applied Behavior Analysis”) 

Define the three fundamental dimensional qualities of 
behavior. 

Hint: (See “Measurable Dimensions of Behavior’) 
Describe a one-time sampling procedure for measuring 


behavior and the appropriate conditions for using that 
procedure. 


Hint: (See “Measurement Tools”’) 
Describe three advantages of measuring behavior by 
permanent product. 


Hint: (See “Measuring Behavior by Permanent Products”) 


Measurement is necessary but not sufficient for scientific understanding. 
(See Chapter 7.) 


. Noting that frequency has been used in the behavior analysis literature to 


refer to both count and rate, a practice that may cause confusion and mis- 
understanding, Merbitz, Merbitz, and Pennypacker (2016) made a case for 
limiting the use of frequency to mean rate. Carr, Nosik, and Luke (2018) 
surveyed the usage of frequency in contemporary behavior-analytic journals 
and textbooks and found that “the predominant usage of frequency was as 
count, not rate” (p. 436). Carr et al. encouraged behavior analysts to use 
frequency as a synonym for count. 


. Latency is most often used to describe the time between the onset of an 


antecedent stimulus change and the initiation of a response. However, the 
term can be used to refer to any measure of the temporal locus of a response 
with respect to any type of antecedent event. See Johnston and Pennypacker 
(2009). 


. Because percentages are ratios based on the same dimensional quantity, the 


dimensional quantity is canceled out and no longer exists in the percentage. 
For example, an accuracy percentage created by dividing the number of 
correct responses by the number of response opportunities removes the 
actual count. However, ratios created from different dimensional quantities 
retain the dimensional quantities of each component. For example, rate 
retains a count per unit of time. See Johnston and Pennypacker (2009) for 
further explication. 


. When a person reports a percentage in excess of 100% (e.g., “Our mutual 


6 
fund grew by 120% during the recent bear market’), he is probably using 
the misstated percentage in comparison to the previous base unit, not as a 
proportion of it. In this example, the mutual fund’s 20% increase makes its 
value 1.2 times greater than its value when the bear market began. 


. Researchers sometimes manipulate and control response topography and 


magnitude to assess their possible effects as independent variables. Piazza, 
Roane, Kenney, Boney, and Abt (2002) analyzed the effects of different 
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5. Give two reasons for the increasing popularity of 


computer-assisted measurement systems. 


Hint: (See “Measurement Tools’’) 


6. John, a fourth-grade teacher, wants to collect data on task 


completion for seven of his students. John has 34 students 
and does not have access to an aide or teaching assistant. 
Identify the most appropriate recording procedure for John 
to use and explain why it is the best choice. 


Hint: (See “Measurement Tools”) 
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response topographies on the occurrence of pica (e.g., ingesting nonnu- 
tritive matter that may threaten one’s life) by three females. Pica items 
were located in various places, which required the subjects to respond in 
different ways (e.g., reach, bend over, get on the floor, open a container) to 
obtain them. Pica decreased when more elaborate response topographies 
were required to obtain pica items. Van Houten (1993) reported that when 
a boy with a long history of intense and high occurrences of face slapping 
wore |.5-pound wrist weights, face slapping immediately dropped to zero. 
Studies such as these suggest that problem behaviors may be reduced when 
engaging in those behaviors requires more effortful responses in terms of 
topography or magnitude (Friman & Poling, 1995). 


. A variety of terms appear in the applied behavior analysis literature to 


describe measurement procedures that involve observing and recording 
behavior within or at the end of specified intervals. Some authors use time 
sampling to refer only to momentary time sampling. We consider whole- 
interval and partial-interval recording as time sampling methods because 
the data they produce provide a representative “sample” of the target behav- 
ior during the observation period. 


. Edwards and Christophersen (1993) described a time-lapse videotape 


recorder (TLVCR) that automatically records on one 2-hour tape time 
samples of behavior over observation periods ranging from 2 to 400 hours. 
A TLVCR programmed to record over a 12-hour period would record for 
0.10 of each second. Such a system can be useful for recoding very low 
rate behaviors and for behaviors that occur over long periods (e.g., a child’s 
sleep behavior). 


. Descriptions of a variety of computer-assisted behavioral measure- 


ment systems can be found in Bullock, Fisher, and Hagopian (2001); 
Emerson, Reever, and Felce (2000); Gravlee, Zenk, Woods, Rowe, and 
Schultz (2006); Kahng & Iwata (1998, 2000); Noldus, Trienes, Hen- 
driksen, and Jansen (2000); Sidenar, Shabani, and Carr (2004): Tapp & 
Walden (2000); and Tapp and Wehby (2000). 


CHAPTER 5 


Improving and Assessing the Quality 


of Behavioral Measurement 


LEARNING OBJECTIVES 


¢ State and describe the elements of useful scientific measurement. 


* Identify threats to measurement validity. 


¢ Describe various threats to the accuracy and reliability of measurement. 
* Identify and explain ways to assess the accuracy and reliability of behavioral measurement. 
* Identify and explain how to assess interobserver agreement (IOA) for a variety of data sets. 


Three friends—John, Tim, and Bill—took a bicycle ride 
together. At the end of the ride John looked at his handle- 
bar-mounted bike computer and said, “We rode 68 miles. 
Excellent!” “My computer shows 67.5 miles. Good ride, 
fellas!” Tim replied. As he dismounted and rubbed his 
rump, the third biker, Bill, said, ““Gee whiz, I’m sore! We 
must’ ve ridden 100 miles!” A few days later, the three 
friends completed the same route. After the second ride, 
John’s computer showed 68 miles, Tim’s computer read 
70 miles, and Bill, because he wasn’t quite as sore as he 
was after the first ride, said they had ridden 90 miles. 
Following a third ride on the same country roads, John, 
Tim, and Bill reported distances of 68, 65, and 80 miles, 
respectively. 


he data obtained by measuring behavior are the primary 

material with which behavioral researchers and practi- 

tioners guide and evaluate their work. Applied behavior 
analysts measure socially significant behaviors to help deter- 
mine which behaviors need to be changed, to detect and com- 
pare the effects of various interventions on behaviors targeted 
for change, and to evaluate the acquisition, maintenance, and 
generalization of behavior changes. 

Because so much of what the behavior analyst does either 
as a researcher or as a practitioner depends on measurement, 
concerns about the legitimacy of the data it produces must be 
paramount. Do the data meaningfully reflect the original rea- 
son(s) for measuring the behavior? Do the data represent the 
true extent of the behavior as it actually occurred? Do the data 
provide a consistent picture of the behavior? In other words, can 
the data be trusted? 

Chapter 4 identified the measurable dimensions of behay- 
ior and described the measurement methods most often used in 
applied behavior analysis. This chapter focuses on improving 


and assessing the quality of behavioral measurement. We begin 
by defining the essential indicators of trustworthy measure- 
ment: validity, accuracy, and reliability. Next, common threats 
to measurement are identified and suggestions for combating 
these threats are presented. The chapter’s final sections detail 
procedures for assessing the accuracy, reliability, and believ- 
ability of behavioral measurement. 


INDICATORS OF TRUSTWORTHY 
MEASUREMENT 


How trustworthy were the measures reported by the three bicy- 
clists in the opening anecdote? Which of the three friends’ data 
would contribute to a scientific account of the miles they had 
ridden? To contribute to a scientific account of events, mea- 
surement must be valid, accurate, and reliable. Were the three 
friends’ measurements characterized by validity, accuracy, and 
reliability? 


Validity 


Measurement has validity when it yields data directly relevant 
to the phenomenon measured and to the reason(s) for measuring 
it. Determining the validity of measurement revolves around 
this basic question: Was a relevant dimension of the behay- 
ior that is the focus of the investigation measured directly and 
legitimately? 

Did the measurements of miles ridden by the three bicy- 
clists have validity? Because the bikers wanted to know how 
far they had ridden each time, number of miles ridden was a 
relevant, or valid, dimension of their riding behavior. Had the 
bikers’ primary interest been how long or how fast they had 
ridden, the number of miles ridden would not have been a valid 
measure. John and Tim’s use of their bike computers to measure 
directly the miles they rode was a valid measure. Because Bill 
used an indirect measure (the relative tenderness of his backside) 


123 


124 Part2 ¢ Selecting, Defining, and Measuring Behavior 


to determine the number of miles he had ridden, the validity of 
Bill’s mileage data is suspect. A direct measure of the actual 
behavior of interest will always possess more validity than an 
indirect measure, because a direct measure does not require an 
inference about its relation to the behavior of interest, whereas 
an indirect measure always requires such an inference. Although 
soreness may be related to the distance ridden, because it is also 
influenced by such factors as the time on the bike saddle, the 
roughness of the road, riding speed, and how much (or little) a 
person has ridden recently, soreness as a measure of mileage 
has little validity. 

Valid measurement in applied behavior analysis requires 
three equally important elements: (a) measuring directly a 
socially significant target behavior (see Chapter 3), (b) measur- 
ing a dimension (e.g., rate, duration) of the target behavior rele- 
vant to the question or concern about the behavior (see Chapter 
4), and (c) ensuring that the data are representative of the behav- 
ior’s occurrence under conditions and during times that are most 
relevant to the question or concern about the behavior. When 
any of these elements are suspect or lacking—no matter how 
accurate and reliable was the measurement that produced the 
data—the validity of the resultant data is compromised, perhaps 
to the point of being meaninglessness. 


Accuracy 


When used in the context of measurement, accuracy refers to 
the extent to which the observed value (i.e., the quantitative 
labels produced by measuring an event) match the true value 
of the event. In other words, measurement is accurate to the 
degree that it corresponds to the true value of the thing mea- 
sured. Obtaining a true value requires special observation and 
recording procedures that “must be at least somewhat different 
from those used to produce the data being evaluated, and the 
differences must have the effect of minimizing the possibility 
of error to an uncommon degree” (Johnston & Pennypacker, 
2009, pp. 136-137). 

How accurate were the three bikers’ measures of miles 
ridden? Because each biker reported a different measure of the 
same event, all of their data could not be accurate. Skeptical of 
the training miles the three cyclists were claiming, a friend of 
theirs, Lee, drove the same country roads with GPS-calibrated 
odomenter in his car. At the end of the route, the odometer read 
58 miles. Using the measure obtained by the odometer as the 
true value of the route’s distance, Lee determined that none of 
the three cyclists’ measures was accurate. Each rider had over- 
estimated the true mileage. 

By comparing the mileage reported by John, Tim, and 
Bill with the true value of the route’s distance, Lee discovered 
not only that the riders’ data were inaccurate but also that the 
data reported by all three riders were contaminated by a par- 
ticular type of measurement error called measurement bias. 
Measurement bias refers to nonrandom measurement error— 
that an error in measurement that is likely to be in one direction. 
Random measurement error is just as likely to overestimate the 
true value of an event as it is to underestimate it. Because John, 
Tim, and Bill consistently overestimated the actual miles they 
had ridden, their data contained measurement bias. 


Reliability 


Reliability refers to the consistency of measurement, specif- 
ically, the extent to which repeated measurement of the same 
event yields the same value. In other words, reliable mea- 
surement is consistent measurement. Like validity and accu- 
racy, reliability is a relative concept; it is a matter of degree. 
The closer the values obtained by repeated measurement of 
the same event are to one another, the greater the reliability. 
Conversely, the more observed values from repeated mea- 
surement of the same event differ from one another, the less 
the reliability. 

How reliable were the bicyclists’ measurements? Because 
John obtained the same value, 68 miles, each time he mea- 
sured the same route, his measurement had complete reli- 
ability. Tim’s three measures of the same ride—67.5, 70, and 
65 miles—differed from one another by as much as 5 miles. 
Therefore, Tim’s measurement was less reliable than John’s. 
Bill’s measurement system was the least reliable of all, yielding 
values for the same route ranging from 80 to 100 miles. 


Relative Importance of Validity, Accuracy, 
and Reliability 


Behavioral measurement should provide legitimate data for 
evaluating behavior change and guiding research and treatment 
decisions. Data of the highest quality (i.e., data that are most 
useful and trustworthy for advancing scientific knowledge or for 
guiding practice) are produced by valid, accurate, and reliable 
measurement (see Figure 5.1). Validity, accuracy, and reliability 
are relative concepts; each can range from high to low. 

Measurement must be both valid and accurate for the 
data to be trustworthy. If measurement is not valid, accu- 
racy is moot. Accurately measuring a behavior that is not the 
focus of the investigation, accurately measuring an irrelevant 
dimension of the target behavior, or accurately measuring the 
behavior under circumstances or at times not representative 
of the conditions and times relevant to the analysis will yield 
invalid data. Conversely, the data obtained from measuring a 
meaningful dimension of the right behavior under the relevant 
circumstances and times are of little use if the observed val- 
ues provide an inaccurate picture of the behavior. Inaccurate 
measurement renders invalid the data obtained by otherwise 
valid measurement. 

Reliability should never be confused with accuracy. 
Although John’s bicycle computer provided totally reliable 
measures, it was also totally inaccurate. Concern about the reli- 
ability of data in the absence of a prior interest in their accu- 
racy suggests that reliability is being mistaken for accuracy. The 
question for a researcher or someone who is reading a published 
study is not “Are the data reliable?” but “‘Are the data accurate?” 
(Johnston & Pennypacker, 1993a, p. 146) 

If accuracy trumps reliability—and it does—why should 
researchers and practitioners be concerned with the reliabil- 
ity of measurement? Although high reliability does not mean 
high accuracy, poor reliability reveals problems with accuracy. 
Because Tim and Bill’s measurements were not reliable, we 
know that at least some of the data they reported could not be 
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Figure 5.1 Valid, accurate, and reliable measurement yields the most trustworthy and 


Measurement that is... 


... yields data that are... 


useful data for science and science-based practice. 


-.. Most useful for advancing scientific knowledge 
and guiding data-based practice. 


Niels Moh 


... meaningless for the purposes for which 
measurement was conducted. 


... always wrong.? 


... sometimes wrong.° 


a. If adjusted for consistent measurement error of standard size and direction, inaccurate data may still be usable. 


b. If the accuracy of every datum in a data set can be confirmed, reliability is a moot point. In practice, however, 
that is seldom possible; therefore, knowing the consistency with which observers applied a valid and accurate 
measurement system contributes to the level of confidence in the overall trustworthiness of the data set. 


c. User is unable to separate good data from bad. 


accurate, knowledge that could and should lead to checking the 
accuracy of their measurement tools and procedures. 

Highly reliable measurement means that whatever degree 
of accuracy (or inaccuracy) exists in the measurement system 
will be revealed consistently in the data. If it can be determined 
that John’s computer reliably obtains observed values higher 
than the true values by a constant amount or proportion, the 
data could be adjusted to accommodate for that constant degree 
of inaccuracy. 

The next two sections of the chapter describe methods for 
combating common threats to the validity, accuracy, and reliabil- 
ity of behavioral measurement. 


THREATS TO VALID MEASUREMENT 


The validity of behavioral data is threatened when measurement 
is indirect, when the wrong dimension of the target behavior is 
measured, or when measurement 1s conducted in such a way that 
the data it produces are an artifact of the actual events. 


Indirect Measurement 


Indirect measurement occurs when the researcher or practi- 
tioner measures a proxy, or stand-in, for the actual behavior 
of interest. Indirect measurement provides secondhand or “fil- 
tered” information that requires the researcher or practitioner 
to make inferences about the relationship between the event 
that was measured and the actual behavior of interest (Komaki, 
1998). For example, a student’s responses to a questionnaire 
designed to measure his socialization with peers is an indi- 
rect measure. Using a student’s score on a standardized math 
achievement test as an indicator of mastery is another exam- 
ple of indirect measurement. Both of these illustrations require 
an inference about the behavior being measured and the actual 
behavior of interest. By contrast, a student’s score on a prop- 
erly constructed test consisting of math problems from recently 


covered curriculum content is a direct measure requiring no 
inferences about what it means with respect to her performance 
in the curriculum (e.g., Hosp & Hosp, 2003; Stecker, Fuchs, & 
Fuchs, 2005). Conversely, direct measurement occurs when 
the behavior measured is exactly the same as the behavior that 
is the focus of the investigation or behavior change program. 

Indirect measurement rarely occurs in applied behay- 
ior analysis because meeting the applied dimension of ABA 
includes the targeting and meaningful (1.e., valid) measure- 
ment of socially significant behavior. Sometimes, however, the 
researcher or practitioner has no direct and reliable access to 
the behavior of interest and so must use some form of indi- 
rect measurement. Because researchers studying adherence to 
medical regimens cannot directly observe and measure patients’ 
behavior in their homes, they rely on self-reports for their data 
(e.g., La Greca & Schuman, 1995).! 

Indirect measurement is sometimes used to make infer- 
ences about private events or affective states. For example, 
Green and Reid (1996) used direct measures of smiling to rep- 
resent “happiness” by persons with profound multiple disabil- 
ities. However, research on private events does not necessarily 
involve indirect measurement. A research participant who has 
been trained to observe his own private events is measuring the 
behavior of interest directly (e.g., Kostewicz, Kubina, & Cooper, 
2000; Kubina, Haertel, & Cooper, 1994). 

Whenever indirect measurement is used, it is the research- 
er’s responsibility to provide evidence that the event measured 
directly reflects, in some reliable and meaningful way, some- 
thing about the behavior for which the researcher wishes to draw 
conclusions (Johnston & Pennypacker, 2009). In other words, it 
is incumbent upon the researcher to provide a convincing case 
for the validity of her data. Although it is sometimes attempted, 
the case for validity cannot be achieved by simply attaching the 
name of the thing one claims to be measuring to the thing actu- 
ally measured. With respect to that point, Marr (2003) recounted 
this anecdote about Abraham Lincoln: 


126 Part2 * Selecting, Defining, and Measuring Behavior 


“Sir, how many legs does this donkey have?” 

“Four, Mr. Lincoln.” 

“And how many tails does it have?” 

“One, Mr. Lincoln.” 

“Now, sir, what if we were to call a tail a leg; how many 
legs would the donkey have?” 

“Five, Mr. Lincoln.” 

“No sir, for you cannot make a tail into a leg by calling it 
one.” (pp. 66-67) 


Measuring the Wrong Dimension 
of the Target Behavior 


The validity of behavioral measurement is threatened much 
more often by measuring the wrong dimension of the behavior 
of interest than it is by indirect measurement. Valid measure- 
ment yields data that are relevant to the questions about the 
behavior one seeks to answer through measurement. Validity is 
compromised when measurement produces values for a dimen- 
sion of the behavior ill suited for, or irrelevant to, the reason for 
measuring the behavior. 

Johnston and Pennypacker (1980) provided an excellent 
example of the importance of measuring a dimension that fits the 
reasons for measurement. “Sticking a ruler in a pot of water as 
the temperature is raised will yield highly reliable measures of 
the depth of the water but will tell us very little about the chang- 
ing temperature” (p. 192). While the units of measurement on a 
ruler are well suited for measuring length, or in this case, depth, 
they are not at all valid for measuring temperature. If the purpose 
of measuring the water is to determine whether it has reached 
the ideal temperature for making a pot of tea, a thermometer is 
the correct measurement tool. 

If you are interested in measuring a student’s academic 
endurance with oral reading, counting the number of correct and 
incorrect words read per minute without measuring and report- 
ing the total time that the student read will not provide valid data 
on endurance. Number of words read per minute alone does not 
fit the reason for measuring reading (i.e., academic endurance). 
To measure endurance, the practitioner would need to report 
the duration of the reading period (e.g., 30 minutes). Similarly, 
measuring the percentage of trials on which a student makes 
a correct response would not provide valid data for answering 
questions about the student’s developing fluency with a skill, 
whereas measuring the number of correct responses per minute 
and the changing rates of responding (celeration) would. 


Measurement Artifacts 


Directly measuring a relevant dimension of a socially significant 
target behavior does not guarantee valid measurement. Validity 
is reduced when the data—no matter how accurate or reliable 
they are—do not give a meaningful (i.e., valid) representation 
of the behavior. When data give an unwarranted or mislead- 
ing picture of the behavior because of the way measurement 
was conducted, the data are called an artifact. As introduced 
in Chapter 4, a measurement artifact is something that appears 
to exist because of the way it is measured. Discontinuous mea- 
surement, poorly scheduled measurement periods, and using 


insensitive or limiting measurement scales are common Causes 
of measurement artifacts. 


Discontinuous Measurement 


Because behavior is a dynamic and continuous phenomenon that 
occurs and changes over time, continuous measurement is the gold 
standard in behavior analysis research and practice. Continuous 
measurement is measurement that detects all instances of the tar- 
get behavior during the observation period. The development of 
automated data collection technologies has produced an increas- 
ing trend of continuous measurement in behavioral research 
(Kostewicz, King, Datchuk, & Brennan, 2016). 

Discontinuous measurement is any form of measure- 
ment in which some instances of the response class(es) of inter- 
est may not be detected. A data set produced by discontinuous 
measurement—no matter how accurate and reliable—may be 
an artifact of the measurement system rather than accurate por- 
trayal of behavioral events. 

In the absence of automatic measurement technology, contin- 
uous measurement may be prohibited by factors such as the nature 
of the target behavior being measured; the number of participants 
or clients whose behavior must be measured; the number, duration, 
and scheduling of observation periods; and the availability of trained 
personnel to observe and collect the data. In such instances, applied 
behavior analysts typically use one of the three primary methods 
of discontinuous measurement described in Chapter 4 (partial- 
interval recording, whole-interval recording, or momentary time 
sampling). 

A study by Thomson, Holmber, and Baer (1974) demon- 
strated the extent of artificial variability in a data set that may be 
caused by discontinuous measurement. A single, highly expe- 
rienced observer used three different procedures for scheduling 
time sampling observations to measure the behavior of four 
subjects (two teachers and two children) in a preschool setting 
during 64-minute sessions. Thomson and colleagues called the 
three time sampling procedures contiguous, alternating, and 
sequential. With each time sampling procedure, one fourth of 
the observer’s time (i.e., 16 minutes) was assigned to each of 
the four subjects. 

When the contiguous observation schedule was used, the 
observer recorded the behavior of Subject 1 throughout the first 
16 minutes of the session, recorded the behavior of Subject 2 
during the second 16 minutes, and so on until all four students 
had been observed. In the alternating mode, Subjects 1 and 2 
were observed in alternating intervals during the first half of 
the session, and Subjects 3 and 4 were observed in the same 
fashion during the last half of the session. Specifically, Subject 1 
was observed during the first 4 minutes, Subject 2 during the 
next 4 minutes, Subject 1 during the next 4 minutes, and so on 
until 32 minutes had expired. The same procedure was then used 
for Subjects 3 and 4 during the last 32 minutes of the session. 
The sequential approach systematically rotated the four subjects 
through 4-minute observations. Subject 1 was observed during 
the first 4 minutes, Subject 2 during the second 4 minutes, Sub- 
ject 3 during the third 4 minutes, and Subject 4 during the fourth 
4 minutes. This sequence was repeated 4 times to give the total 
of 64 minutes of observation. 
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To arrive at the percentage of artifactual variance in the 
data associated with each time sampling schedule, Thomson and 
colleagues (1974) compared the observer’s data with “actual 
rates” for each subject produced by continuous measurement 
of each subject for the same 64-minute sessions. Results of the 
study showed clearly that the contiguous and alternating sched- 
ules produced the most unrepresentative (and, therefore, less 
valid) measures of the target behaviors (often more than 50% 
variance from continuous measurement), whereas sequential 
sampling procedure produced results that more closely resem- 
bled the data obtained through continuous recording (from 
4% to 11% variance from continuous measurement) (cf. Tiger 
et al., 2013). 

In spite of its inherent limitations, discontinuous mea- 
surement is used in many studies in applied behavior analysis 
in which individual observers measure the behavior of multi- 
ple subjects within the same session. Minimizing the threat to 
validity posed by discontinuous measurement requires careful 
consideration of when observation and measurement periods 
should be scheduled. Infrequent measurement, no matter how 
accurate and reliable it is, often yields results that are an artifact. 
Although a single measure reveals the presence or absence of 
the target behavior at a given point in time, it may not be repre- 
sentative of the typical value for the behavior.” As a general rule, 
observations should be scheduled on a daily or frequent basis, 
even if for only brief periods. 

Ideally, all occurrences of the behavior of interest should 
be recorded. However, when available resources preclude 
continuous measurement throughout an observation period, 
the use of sampling procedures is necessary. A sampling 
procedure may be sufficient for decision making and analy- 
sis if the samples represent a valid approximation of the true 
parameters of the behavior of interest. When measurement 
cannot be continuous throughout an observation period, it is 
generally preferable to sample the occurrence of the target 
behavior for numerous brief observation intervals that are 
evenly distributed throughout the session than it is to use lon- 
ger, less frequent intervals (Thomson et al., 1974; Thompson, 
Symons, & Felce, 2000). For example, measuring a subject’s 
behavior in thirty 10-second intervals equally distributed 
within a 30-minute session will likely yield more representa- 
tive data than will observing the person for a single 5-minute 
period during the half hour. 

Measuring behavior with observation intervals that are too 
short or too long may result in data that grossly over- or underes- 
timates the true occurrence of behavior. For example, measuring 
off-task behavior by partial-interval recording with 10-minute 
intervals may produce data that make even the most diligent of 
students appear to be highly off task. 


Poorly Scheduled Measurement Periods 


The observation schedule should be standardized to provide 
an equal opportunity for the occurrence or nonoccurrence of 
the behavior across sessions and consistent environmental 
conditions from one observation session to the next. When 
neither of these requirements is met, the resultant data may not 
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represent actual events and therefore be invalid. If observation 
periods are scheduled at times when and/or places where the 
frequency of behavior is atypical, the data may not represent 
periods of high or low responding. For example, measuring 
students’ being on task during only the first 5 minutes of each 
day’s 20-minute cooperative learning group activity may yield 
data that make on-task behavior appear higher than it actually 
is Over the entire activity. 

When data will be used to assess the effects of an inter- 
vention or treatment, the most conservative observation times 
should be selected. That is, target behaviors should be measured 
during those times when their rates of occurrence is most likely 
to be different from the desired or predicted outcomes of the 
treatment. Measurement of behaviors targeted for reduction 
should occur during times when those behaviors are most likely 
to occur at their highest response rates. Conversely, behay- 
lors targeted for increase should be measured when high-rate 
responding is least likely. If an intervention is not planned—as 
might be the case in a descriptive study—it is important to select 
the observation times most likely to yield data that are generally 
representative of the behavior. 


Insensitive and/or Limited Measurement Scales 


Artifactual data may result from using measurement scales that 
cannot detect the full range of relevant values or that are insen- 
sitive to meaningful changes in behavior. Data obtained with a 
measurement scale that does not detect the full range of relevant 
performances may incorrectly imply that behavior cannot occur 
at levels below or above obtained measures because the scale 
has imposed an artificial floor or ceiling on performance. For 
example, measuring a student’s oral reading fluency by giving 
him a 100-word passage to read in 1 minute may yield data that 
suggest that his maximum performance is 100 wpm. 

Measurement scales that are over- or under-sensitive to 
relevant changes in behavior may produce misleading data 
showing that meaningful behavior change has (or has not) 
occurred. For example, using a percentage measure scaled 
in 10% increments to evaluate the effects of an intervention 
to improve quality control in a manufacturing plant may not 
reveal important changes in performance if improvement in 
the percentage of correctly fabricated widgets from a base- 
line level of 92% to a range of 97% to 98% is the differ- 
ence between unacceptable and acceptable (i.e., profitable) 
performance. 


THREATS TO ACCURATE AND RELIABLE 
MEASUREMENT 


Human error is the biggest threat to the accuracy and reliability 
of data in applied behavior analysis. Unlike the experimental 
analysis of behavior, in which measurement is typically auto- 
mated, most investigations in applied behavior analysis use 
human observers to measure behavior.* Factors that contribute 
to human measurement error include poorly designed measure- 
ment systems, inadequate observer training, and expectations 
about what the data should reveal. 
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Poorly Designed Measurement System 


Unnecessarily cumbersome and difficult-to-use measurement 
systems create needless loss of accuracy and reliability. Collect- 
ing behavioral data in applied settings requires attention, keen 
judgment, and perseverance. The more taxing and difficult a 
measurement system is to use, the less likely an observer will 
be to consistently detect and record all instances of the target 
behavior. Simplifying the measurement system as much as pos- 
sible minimizes measurement errors. 

The complexity of measurement includes such variables 
as the number of individuals observed, the number of behaviors 
recorded, the duration of observation periods, and/or the dura- 
tion of the observation intervals, all of which may affect the 
quality of measurement. For instance, observing several individ- 
uals is more complex than observing one person; recording sev- 
eral behaviors is more complex than recording a single behavior; 
and using contiguous 5-second observation intervals with no 
time between intervals to record the results of the observation 
is more difficult than a system in which time is reserved for 
recording data. 

Specific recommendations for reducing complexity 
depend on the particular nature of the study. However, when 
using time sampling measurements, applied behavior analysts 
can consider modifications such as decreasing the number of 
simultaneously observed individuals or behaviors, decreasing 
the duration of the observation sessions (e.g., from 30 minutes 
to 15 minutes), and increasing the duration of time intervals 
(e.g., from 5 to 10 seconds). Requiring more practice during 
observer training, establishing a higher criterion for mastery of 
the observational code, and providing more frequent feedback 
to observers may also reduce the possible negative effects of 
complex measurement. 


Inadequate Observer Training 


Careful attention must be paid to the selection and training of 
observers. Explicit and systematic observer training is essen- 
tial for the collection of trustworthy data. Observation and 
coding systems require observers to discriminate the occur- 
rence and nonoccurrence of specific classes of behaviors or 
events against an often complex and dynamic background of 
other behaviors or events and to record their observations onto 
a data sheet. Observers must learn the definitions for each 
behavior or event to be measured; a coded notation system or 
icon for each event to be measured; a common set of recording 
procedures (e.g., marking a data collection form, keystrokes, 
touching icons); and how to correct recording errors (e.g., 
writing a plus sign instead of a minus sign, hitting the F6 key 
instead of the F5 key, touching the ‘on-task’ icon instead of 
the ‘off-task’ icon). 


Selecting Observers Carefully 


Admittedly, applied researchers often scramble to find data 
collectors, but not all volunteers should be accepted into train- 
ing. Potential observers should be interviewed to determine 
past experiences with observation and measurement activities, 


current schedule and upcoming commitments, work ethic and 
motivation, and overall social skills. The interview might 
include a pretest to determine current observation and skill 
levels. This can be accomplished by having potential observ- 
ers watch short video clips of behaviors similar to what they 
may be asked to observe and noting their performance against 
a criterion. 


Training Observers to an Objective Standard 
of Competency 


Observer trainees should meet a specified criterion for recording 
before conducting observations in applied settings. During train- 
ing, observers should practice recording numerous examples 
and nonexamples of the target behavior(s) and receive a critique 
and performance feedback. Observers should have numerous 
practice sessions before actual data collection. Training should 
continue until a predetermined criterion is achieved (e.g., 95% 
accuracy for two or three consecutive sessions). For example, 
in training observers to measure the completion of preventive 
maintenance tasks of heavy equipment by military personnel, 
Komaki (1998) required three consecutive sessions of at least 
90% agreement with a true value. 

Various methods can be used to train observers. These 
include sample vignettes, narrative descriptions, video model- 
ing sequences (see Chapter 21), role playing, and practice ses- 
sions in the environment in which actual data will be collected. 
Practice sessions in natural settings are especially beneficial 
because they allow both observers and participants to adapt to 
each other’s presence and may reduce the reactive effects of the 
presence of observers on participants’ behavior. For example, 
observer trainees for a study that measured appropriate and inap- 
propriate recruiting responses by middle school students with 
learning disabilities during collaborative learning group (CLG) 
activities and three types of peer reactions to those responses 
(instructional feedback, praise, and negative statements) were 
required to first demonstrate accurate recording (using experi- 
menters’ descriptions of events as true values) and then practice 
observing and measuring those events in a classroom similar to 
the experimental setting: 


The first author created two observation training scripts, 
each script consisting of 10 vignettes of various CLG 
interactions the observers were likely to encounter. The 
vignettes included appropriate and inappropriate recruiting 
responses and various types of peer reactions to recruiting. 
The first author discussed the first script with the observ- 
ers, demonstrating how each vignette should be marked 
on the recording form. Each observer then independently 
completed a recording form based on the second script. 
When the observers achieved at least 95% interobserver 
agreement (IOA) with the first author on the frequency 
and type of student recruiting and peer assistance, they 
began practicing data collection in the language arts 
classroom during CLG activities. Baseline data collection 
began when the observers attained 95% IOA with the 

first author in the classroom. (Wolford, Alber, & Heward, 
2001, p. 164) 
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The following steps are an example of a systematic 
approach for training observers. 


Step 1 Trainees read the target behavior definitions and become 
familiar with data collection forms, procedures for recording 
their observations, and the proper use of any measurement or 
recording devices (e.g., tape recorders, stopwatches, laptops, 
PDAs, bar code scanners). 


Step 2 Trainees practice recording simplified narrative descrip- 
tions of behavioral vignettes until they obtain 100% accuracy 
over a predetermined number of instances. 


Step 3 Trainees practice recording longer, more complex narra- 
tive descriptions of behavioral vignettes until they obtain 100% 
accuracy for a predetermined number of episodes. 


Step 4 Trainees practice observing and recording data from vid- 
eorecorded or role-played vignettes depicting the target behav- 
ior(s) at the same speed and complexity as will occur in the 
natural environment. Training vignettes should be scripted and 
sequenced to provide trainees practice in making increasingly 
difficult discriminations between the occurrence and nonoccur- 
rence of the target behavior(s). Having trainees rescore the same 
series of vignettes a second time and comparing the reliability 
of their measures provides an assessment of the consistency 
with which the trainees are applying the measurement system. 
Trainees remain at this step until their data reach pre-established 
accuracy and reliability criteria. (If the study involved collecting 
data from natural permanent products such as compositions or 
academic worksheets, Steps 2 through 4 should provide train- 
ees with practice scoring increasingly extensive and more diffi- 
cult-to-score examples.) 


Step 5 The final training step of observer training is practice 
in collecting data in the natural environment. An experienced 
observer accompanies the trainee and simultaneously and inde- 
pendently measures the target behaviors. Each practice session 
ends with the trainee and experienced observer comparing their 
data sheets and discussing any questionable or heretofore unfore- 
seen instances. Training continues until a pre-established criterion 
of agreement between the experienced observer and the trainee 
is achieved (e.g., at least 90% for three consecutive sessions). 


Providing Ongoing Training to Minimize 

Observer Drift 

Over the course of a study, observers sometimes alter, often 
unknowingly, the way they apply a measurement system. Called 
observer drift, these unintended changes in the way data are 
collected may produce measurement error. Observer drift usu- 
ally entails a shift in the observer’s interpretation of the defini- 
tion of the target behavior from that used in training. Observer 
drift occurs when observers expand or compress the original 
definition of the target behavior. For example, observer drift 
might be responsible for the same behaviors by a child that were 
recorded by an observer as instances of noncompliance during 
the first week of a study being scored as instances of compliance 
during the study’s final week. Observers are usually unaware of 
the drift in their measurement. 


Occasional observer retraining or booster sessions 
throughout the investigation can minimize observer drift. Con- 
tinued training provides the opportunity for observers to receive 
frequent feedback on the accuracy and reliability of measure- 
ment. Ongoing training can occur at regular, prescheduled inter- 
vals (e.g., every Friday morning) or randomly. 


Unintended Influences on Observers 


Ideally, data reported by observers have been influenced only 
by the actual occurrences and nonoccurrences of the target 
behavior(s) observers have been trained to measure. In reality, 
however, a variety of unintended and undesired influences on 
observers can threaten the accuracy and reliability of the data 
they report. Common causes of this type of measurement error 
include an observer’s presuppositions about the expected out- 
comes of the data and awareness that others are measuring the 
same behavior. 


Observer Expectations 


Observer expectations that the target behavior should occur at 
a certain level under particular conditions, or change when a 
change in the environment has been made, pose a major threat 
to accurate measurement. For example, if an observer believes 
or predicts that a teacher’s implementation of a token economy 
should decrease inappropriate student behavior, she may record 
fewer inappropriate behaviors during the token reinforcement 
condition than she would have recorded otherwise without hold- 
ing that expectation. Data influenced by an observer’s expecta- 
tions or efforts to obtain results that will please the researcher 
are characterized by measurement bias. 

The surest way to minimize measurement bias caused 
by observer expectations is to use naive observers. A totally 
naive observer is a trained observer who is unaware of the 
study’s purpose and/or the experimental conditions in effect 
during a given phase or observation period. Researchers should 
inform observer trainees that they will receive limited information 
about the study’s purpose and why that is. However, maintaining 
observers’ naiveté is often difficult and sometimes impossible. 

When observers are aware of the purpose or hypothesized 
results of an investigation, measurement bias can be minimized 
by using target behavior definitions and recording procedures 
that will give a conservative picture of the behavior (e.g., 
whole-interval recording of on-task behavior with 10-second 
rather than 5-second intervals), frank and repeated discussion 
with observers about the importance of collecting accurate data, 
and frequent feedback to observers on the extent to which their 
data agree with true values or data obtained by naive observ- 
ers. Observers should not receive feedback about the extent to 
which their data confirm or run counter to hypothesized results 
or treatment goals. 


Observer Reactivity 

Measurement error resulting from an observer's aware- 
ness that others are evaluating the data he reports is called 
observer reactivity. Like reactivity that may occur when 
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participants are aware that their behavior is being observed, 
the behavior of observers (i.e., the data they record and report) 
can be influenced by the knowledge that others are evaluating 
the data. For example, knowing that the researcher or another 
observer is watching the same behavior at the same time, or 
will monitor the measurement through video or audio recording 
later, may produce observer reactivity. If the observer antici- 
pates that another observer will record the behavior in a certain 
way, his data may be influenced by what he anticipates the other 
observer may record. 

Monitoring observers as unobtrusively as possible on 
an unpredictable schedule helps reduce observer reactivity. 
Separating multiple observers by distance or partition reduces 
the likelihood that their measures will be influenced by one 
another’s during an observation. One-way mirrors in some 
research and clinical settings eliminate visual contact between 
the primary and secondary observers. If audio or video record- 
ings are made of sessions, the secondary observer can measure 
the behavior at a later time and the primary observer never 
has to come into contact with the secondary observer. In set- 
tings where one-way mirrors are not possible, and where audio 
or video recording may be intrusive, the secondary observer 
might begin measuring the behavior at a time unknown to the 
primary observer. For example, if the primary observer begins 
measuring behavior with the first interval, the secondary 
observer could start measuring behavior after 10 minutes have 
elapsed. The intervals used for comparisons would begin at 
the 10-minute mark, ignoring those intervals that the primary 
observer recorded beforehand. 


ASSESSING THE ACCURACY AND RELIABILITY 
OF BEHAVIORAL MEASUREMENT 


After designing a measurement system that will produce a valid 
representation of the target behavior and training observers to 
use it in a manner likely to yield accurate and reliable data, 
the researcher’s next measurement-related tasks are evaluating 
the extent to which the data are, in fact, accurate and reliable. 
Essentially, all procedures for assessing the accuracy and reli- 
ability of behavioral data entail some form of “measuring the 
measurement system.” 


Assessing the Accuracy of Measurement 


Measurement is accurate when the observed values (i.e., the 
numbers obtained by measuring an event) match the true val- 
ues of the event. The fundamental reason for determining the 
accuracy of data is obvious: No one wants to base research 
conclusions or make treatment decisions on faulty data. More 
specifically, conducting accuracy assessments serves four inter- 
related purposes. First, it is important to determine early in an 
analysis whether the data are good enough to serve as the basis 
for making experimental or treatment decisions. The first person 
the researcher or practitioner must try to convince that the data 
are accurate is herself. Second, accuracy assessments enable the 
discovery and correction of specific instances of measurement 
error. The two other approaches to assessing the quality of data, 
to be discussed later in this chapter—reliability assessments and 


interobserver agreement—can alert the behavior analyst to the 
likelihood of measurement errors, but neither approach identi- 
fies errors. Only the direct assessment of measurement accuracy 
allows practitioners or applied researchers to detect and correct 
faulty data. 

A third reason for conducting accuracy assessments 1s to 
reveal consistent patterns of measurement error, which can lead 
to calibration of the measurement system. Calibration entails 
comparing the data produced by a measurement system to a 
known standard or true value and, if necessary, adjusting the 
measurement system so the data it produces match the known 
standard. For example, knowing that John’s bicycle computer 
reliably obtained a measure of 68 miles for a route with a true 
value of 58 miles led not only to the cyclists correcting the data 
in hand (in this case, confessing to one another and to their 
friend Lee that they had not ridden as many miles as previously 
claimed) but to their calibrating John’s bicycle computer (in this 
case, by adjusting the wheel circumference setting) so that future 
measures would be accurate. 

Mudford, Zeleny, Fisher, Klum, and Owen (2011) cal- 
ibrated human observers by comparing their data to criterion 
(known) values for the same events. Five experienced and five 
novice observers participated in the study. Observers viewed 
scripted video records of two adults playing the roles of therapist 
and client and recorded three behaviors (client pinches therapist, 
model prompts by the therapist, and verbal+ gestural prompts 
by the therapist) with laptop computers. Each observer recorded 
data from 10 different. 10-minute samples viewed in random 
order. Each sample included a varied number of pinches (from 
0 to 80) and 15 randomly distributed therapist prompts. 

Because the video samples were scripted, the number 
of occurrences for each of the three target behaviors was a 
known (true) value. What the researchers called the “crite- 
rion records” for when each response occurred during the 
video samples to 1-second precision were obtained by three 
of the study’s co-authors independently, noting times of occur- 
rence while playing, pausing, rewinding, and replaying the 
video recordings. “The only discrepancies between indepen- 
dent records were on the exact second in which responses 
occurred. Discrepancies were resolved by authors reviewing 
the sample together until agreement on time of occurrence was 
achieved” (p. 575). 

After using three different IOA methods and linear regres- 
sion to compare each observer’s data with “criterion records,” 
Mudford et al. (2011) concluded that, “Experienced observers 
demonstrated almost uniformly high levels of agreement with 
criterion records, accuracy, and precision. . .. Novice observers, 
as a group, did not fall far behind the experienced observers in 
accuracy and precision” (p. 582). Although these results apply 
only to the measurement system and observers assessed, the 
study demonstrates a calibration method that may improve the 
accuracy of data collected by applied behavior analysts. 

Calibrating any measurement tool, whether it is a mechan- 
ical device or human observer, entails comparing the data 
obtained by the tool against a true value. The measure obtained 
by the GPS-odometer served as the true value for calibrating 
John’s bike computer. Calibration of a timing device such as a 
stopwatch or countdown timer could be made against a known 
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standard: the “ytterbium clock.”* If no differences are detected 
when comparing the timing device against the ytterbium clock, 
or if the differences are tolerable for the intended purposes of 
measurement, then calibration is satisfied. If significant differ- 
ences are found, the timing device would need to be reset to the 
standard. We recommend frequent accuracy assessments in the 
beginning stages of an analysis. Then, if the assessments have 
produced high accuracy, less frequent assessments can be con- 
ducted to check the calibration of the recorders. 

A fourth reason for conducting accuracy assessments 
is to assure consumers that the data are accurate. Including 
the results of accuracy assessments in research reports helps 
readers judge the trustworthiness of the data being offered for 
interpretation. 


Establishing True Values 


True values for some behaviors are evident and universally 
accepted. For example, obtaining the true values of correct 
responses in academic areas such as math and spelling is 
straightforward. The correct response to the arithmetic problem 
2 + 2 = ? has a true value of 4, and the Oxford English Dic- 
tionary 1s a source of true values for assessing the accuracy of 
measuring the spelling of English words.> 

Although not universal, true values for many socially 
significant behaviors of interest to applied researchers and prac- 
titioners can be established conditionally on local context. For 
example, the correct response to the question “Name the three 
starches recommended as thickeners for pan gravy” on a quiz 
given to students in a culinary school has no universal true value. 
Nevertheless, a true value relevant to the students taking the quiz 
can be found in the instructor’s course materials. 


There is no single way to obtain true values. There are only 
the requirements that: (a) the procedure must be different 
from that used to collect the data that are being evaluated; 
and (b) the procedure must incorporate extraordinary steps 
that avoid or remove possible sources of error. (Johnston & 
Pennypacker, 2009, p. 146) 


True values for each of the preceding examples were 
obtained through sources independent of the measures to be 
evaluated. Establishing true values for many behaviors studied 
by applied behavior analysts is difficult because the process for 
determining a true value must be different from the measure- 
ment procedures used to obtain the data one wishes to compare 
to the true value. 

This is difficult, true, but not impossible. For example, 
in Mudford and colleagues’ (2011) calibration study described 
earlier, true values were obtained by a procedure that (a) was dif- 
ferent from the procedure used by the observers to record their 
data (the researchers paused and replayed the video recordings 
while measuring the behaviors), and (b) incorporated extraor- 
dinary steps (the researcher resolved discrepant measures by 
viewing the video together until reaching agreement within 
1-second tolerance). 

It can be easy to mistake as true values, measures that only 
appear to be so. For example, suppose that four well-trained and 
experienced observers view a video recording of teacher and 


student interactions. Their task is to identify the true value of 
all instances of teacher praise contingent on academic accom- 
plishments. Each observer views the recording independently 
and counts all occurrences of contingent teacher praise. After 
recording their respective observations, the four observers share 
their measurements, discuss disagreements, and suggest reasons 
for the disagreements. The observers independently record con- 
tingent praise a second time. Once again they share and discuss 
their results. After repeating the recording and sharing process 
several times, all observers agree that they have recorded every 
instance of teacher praise. However, the observers did not pro- 
duce a true value of teacher praise for two reasons: (1) The 
observers could not calibrate their measurement of teacher 
praise to an independent standard of teacher praise, and (2) the 
process used to identify all instances of teacher praise may be 
biased (e.g., one of the observers may have convinced the others 
that her measures represented the true value). When true values 
cannot be established, researchers must rely on reliability assess- 
ments and measures of interobserver agreement to evaluate the 
quality of their data. 


Accuracy Assessment Procedures 


Determining the accuracy of measurement is a straightforward 
process of calculating the correspondence of each measure, or 
datum, assessed to its true value. For example, a researcher or 
practitioner assessing the accuracy of the score for a student’s 
performance on a 30-word spelling test reported by a grader 
would compare the grader’s scoring of each word on the test 
with the true value for that word found in a dictionary. Each 
word on the test that matched the correct letter sequence (i.e., 
orthography) provided by the dictionary and was marked correct 
by the grader would be an accurate measure by the grader, as 
would each word marked incorrect by the grader that did not 
match the dictionary’s spelling. If the original grader’s scoring 
of 29 of the test’s 30 words corresponded to the true values for 
those words, the grader’s measure would be 96.7% accurate. 

Although an individual researcher or practitioner can 
assess the accuracy of the data she has collected, multiple 
independent observers are often used. Brown, Dunne, and 
Cooper (1996) described the procedures they used to assess the 
accuracy of measurement in a study of oral reading comprehen- 
sion as follows: 


An independent observer reviewed one student’s audio-tape 
of the delayed one-minute oral retell each day to assess our 
accuracy of measurement, providing an assessment of the 
extent that our counts of delayed retells approximated the 
true value of the audio-taped correct and incorrect retells. 
The independent observer randomly selected each day’s 
audiotape by drawing a student’s name from a hat, then 
listened to the tape and scored correct and incorrect retells 
using the same definitions as the teacher. Observer scores 
were compared to teacher scores. If there was a discrepancy 
between these scores, the observer and the teacher reviewed 
the tape (i.e., the true value) together to identify the source 
of the discrepancy and corrected the counting error on the 
data sheet and the Standard Celeration Chart. The observer 
also used a stopwatch to time the duration of the audio-tape 
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to ensure accuracy of the timings. We planned to have the 
teacher re-time the presentation or retell and recalculate 
the frequency per minute for each timing discrepancy 

of more than 5 seconds. All timings, however, met the 
5-second accuracy definition. (p. 392) 


Reporting Accuracy Assessments 


In addition to describing procedures used to assess the accuracy 
of the data, researchers should report the number and percent- 
age of measures that were checked for accuracy, the degree of 
accuracy found, the extent of measurement error detected, and 
whether those measurement errors were corrected in the data. 
Brown and colleagues (1996) used the following narrative to 
report the results of their accuracy assessment: 


The independent observer and the teacher achieved 100% 
agreement on 23 of the 37 sessions checked. The teacher 
and the observer reviewed the tape together to identify the 
source of measurement errors for the 14 sessions containing 
measurement discrepancies and corrected the measurement 
errors. Accurate data from the 37 sessions rechecked were 
then displayed on the Standard Celeration Charts. The mag- 
nitude of the measurement errors was very small, often a 
difference of | to 3 discrepancies. (p. 392) 


A full description and reporting of the results of accu- 
racy assessment helps readers of the study evaluate the accuracy 
of all the data included in the report. For example, suppose a 
researcher reported that she conducted accuracy checks on a 
randomly selected 20% of the data, found those measures to be 
97% accurate with the 3% error being nonbiased, and corrected 
the assessed data as needed. A reader of the study would know 
that 20% of the data are 100% accurate and be fairly confident 
that the remaining 80% of the data (i.e., all of the measures that 
were not checked for accuracy) is 97% accurate. 


Assessing the Reliability of Measurement 


Measurement is reliable when it yields the same values across 
repeated measures of the same event. Reliability is established 
when the same observer measures the same data set repeatedly 
from archived response products such as audiovisual products 
and other forms of permanent products. The more frequently a 
consistent pattern of observation is produced, the more reliable 
the measurement. Conversely, if similar observed values are not 
achieved with repeated observations, the data are considered 
unreliable. This leads to a concern about accuracy, which is the 
primary indicator of quality measurement. 

But, as we have pointed out repeatedly, reliable data are 
not necessarily accurate data. As the three bicyclists discovered, 
totally reliable (i.e., consistent) measurement may be totally 
wrong. Relying on the reliability of measurement as the basis 
for determining the accuracy of measurement would be, as the 
philosopher Wittgenstein (1953) noted, “As if someone were to 
buy several copies of the morning paper to assure himself that 
what it said was true” (p. 94). 

In many research studies and most practical applications, 
however, checking the accuracy of every measure is not possible 
or feasible. In other cases, true values for measures of the target 


behavior may be difficult to establish. When confirming the 
accuracy of each datum is not possible or practical, or when true 
values are not available, knowing that a measurement system has 
been applied with a high degree of consistency contributes to 
confidence in the overall trustworthiness of the data. Although 
high reliability cannot confirm high accuracy, discovering a low 
level of reliability signals that the data are then suspect enough 
to be disregarded until problems in the measurement system can 
be determined and repaired. 

Assessing the reliability of behavioral measurement 
requires either a natural or contrived permanent product so the 
observer can re-measure the same events. For example, reliabil- 
ity of measurement of variables such as the number of adjectives 
or action verbs in students’ essays could be accomplished by 
having an observer rescore essays. The reliability of the mea- 
surement reporting the number and type of response prompts 
and feedback statements parents presented to their children at 
the family dinner table could be evaluated by having an observer 
replay and rescore videos of the family’s mealtime and compare 
the data obtained from the two measurements. 

Observers should not re-measure the same permanent 
product soon after measuring it the first time. Doing so might 
result in the measures from the second scoring being influenced 
by what the observer remembered from the initial scoring. To 
avoid such unwanted influence, a researcher can insert several 
previously scored essays or videos randomly into the sequence 
of “new data” being recorded by observers. 


USING INTEROBSERVER AGREEMENT TO 
ASSESS BEHAVIORAL MEASUREMENT 


Interobserver agreement is the most commonly used indicator 
of measurement quality in applied behavior analysis (Kostewicz 
et al., 2016). Interobserver agreement (IOA) refers to the 
degree to which two or more independent observers report the 
same observed values after measuring the same events. 


Benefits and Uses of IOA 


Obtaining and reporting interobserver agreement serves four 
distinct purposes. First, IOA can function as a basis for deter- 
mining the competence of new observers. As noted earlier, a 
high degree of agreement between a newly trained observer 
and an experienced observer provides an objective index of the 
extent to which the new observer is measuring the behavior in 
the same way as experienced observers. 

Second, systematic assessment of IOA over the course of 
a study can detect observer drift. When observers who obtained 
the same, or nearly the same, observed values when measuring 
the same behavioral events at the beginning of a study (i.e., IOA 
was high) obtain different measures of the same events later in 
the study (i.e., IOA is now low), one of the observers may be 
using a definition of the target behavior that has drifted. Deteri- 
orating IOA assessments cannot indicate with assurance which 
of the observer’s data are being influenced by drift (or any other 
reason for disagreement), but the information reveals the need 
for further evaluation of the data and/or for retraining and cali- 
bration of the observers. 
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Third, knowing that two or more observers consistently 
obtained similar data increases confidence that the definition of 
the target behavior was clear and unambiguous and the measure- 
ment code and system not too difficult. Fourth, for studies that 
employ multiple observers as data collectors, consistently high 
levels of IOA increase confidence that variability in the data is 
not a function of which observer(s) happened to be on duty for 
any given session, and therefore that changes in the data more 
likely reflect actual changes in the behavior. 

The first two reasons for assessing IOA are proactive: 
They help researchers determine and describe the degree to 
which observers have met training criteria and detect possible 
drift in observers’ use of the measurement system. The second 
two purposes or benefits of IOA are as summative descriptors of 
the consistency of measurement across observers. By reporting 
the results of IOA assessments, researchers enable consumers 
to judge the relative believability of the data as trustworthy and 
deserving of interpretation. 


Requisites for Obtaining Valid |OA Measures 


A valid assessment of IOA depends on three equally important 
criteria. Although these criteria are perhaps obvious, it is none- 
theless important to make them explicit. Two observers (usually 
two, but there may be more) must (a) use the same observation 
code and measurement system, (b) observe and measure the 
same participant(s) and events, and (c) observe and record the 
behavior independent of any influence from one other. 


Observers Must Use the Same Measurement System 


Interobserver agreement assessments conducted for any of the 
four previously stated reasons require observers to use the same 
definitions of the target behavior, observation procedures and 
codes, and measurement devices. Beyond using the same mea- 
surement system, all observers participating in IOA measures 
used to assess the believability of data (as opposed to evaluating 
the observer trainees’ performance) should have received identi- 
cal training with the measurement system and achieved the same 
level of competence in using it. 


Observers Must Measure the Same Events 


The observers must observe the same subject(s) at precisely the 
same observation intervals and periods. IOA for data obtained 
by real-time measurement requires that both observers be in the 
setting simultaneously. Real-time observers must be positioned 
such that each has a similar view of the subject(s) and envi- 
ronment. Two observers sitting on opposite sides of a class- 
room, for example, might obtain different measures because the 
different vantage points enable only one observer to see or hear 
some occurrences of the target behavior. 

Observers must begin and end the observation period at 
precisely the same time. Even a difference of a few seconds 
between observers may produce significant measurement 
disagreements. To remedy this situation, the timing devices 
could be started simultaneously and outside the observation 
setting, but before data collection begins, with the understanding 
that the data collection would actually start at a prearranged time 
(e.g., exactly at the beginning of the fifth minute). Alternatively, 


but less desirably, one observer could signal the other at the 
exact moment the observation is to begin. 

A common and effective procedure is for both observers 
to listen by earphones to prerecorded audio cues signaling the 
beginning and end of each observation interval (see Chapter 4). 
An inexpensive splitter device that enables two earphones to 
be plugged into the same playback device allows observers to 
receive simultaneous cues unobtrusively and without depending 
on one another. 

When assessing IOA for data obtained from perma- 
nent products, the two observers do not need to measure the 
behavior simultaneously. For example, the observers could 
each watch and record data from the same video or audio 
recording at different times. Procedures must be in place, 
however, to ensure that both observers watched or listened 
to the same recording and that they started and stopped their 
independent observations at precisely the same point(s) on 
the tapes. Ensuring that two observers measure the same 
events when the target behavior produces natural permanent 
products, such as completed academic assignments or wid- 
gets manufactured, would include procedures such as clearly 
marking the session number, date, condition, and subject’s 
name on the product and guarding the response products to 
ensure they are not disturbed until the second observer has 
obtained his measure. 


Observers Must Be Independent 


The third essential ingredient for valid IOA assessment is 
ensuring that neither observer is influenced by the other’s 
measurements. Procedures must be in place to guarantee each 
observer’s independence. For example, observers conducting 
real-time measurement of behavior must not be seated or posi- 
tioned so closely to one another that either observer can detect 
or be influenced by the other observer’s recordings (Johnston 
& Pennypacker, 2009). 

Giving the second observer academic worksheets or 
written assignments that have already been marked by another 
observer would violate the observers’ independence. To main- 
tain independence, the second observer must score photocopies 
of unadulterated and unmarked worksheets or assignments as 
completed by the subjects. 


Methods for Calculating IOA 


Numerous methods for calculating IOA are available, each of 
which provides a somewhat different view of the extent and 
nature of agreement and disagreement between observers (¢.g., 
Hartmann, 1977; Hawkins & Dotson, 1975; Page & Iwata, 1986; 
Mudford, Taylor, & Martin, 2009; Repp, Dietz, Boles, Dietz, & 
Repp, 1976). The following explanation of different IOA for- 
mats is organized by the three major methods for measuring 
behavioral data described in Chapter 4: event recording, timing, 
and interval recording or time sampling. Although other statis- 
tics are sometimes used, the percentage of agreement between 
observers is by far the most common convention for reporting 
IOA in applied behavior analysis.° Therefore, we have provided 
the formula for calculating a percentage of agreement for each 
type of IOA. 
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The formula for calculating mean duration-per-occurrence 
IOA is similar to the one used to determine mean count-per- 
interval IOA: 


Dur IOA RI + Dur IOA R2 + Dur IOA Rn 
n responses with Dur IOA 


x 100 = mean duration-per-interval IOA % 


Using this formula to calculate the mean duration-per- 
occurrence IOA for the two observers’ timing data of the five 
responses just presented would entail the following steps: 


1. Calculate duration-per-occurrence IOA for each 
response!’ R by 290) 0.35°='0/83)'R2)15°21'= 0.71; 
R337) =-9'= 0:78; R4,°14 > 14 = 1.0; and R5, 
14 + 17 = 0.82 

2. Add the individual IOA percentages for each occurrence: 
0.83 O71 + 0.78 + 1.00 + 0.82 = 4.14 


3. Divide the sum of the individual IOAs per occurrence by 
the total number of responses for which two observers 
measured duration: 4.14 + 5 = 0.828 


4. Multiply by 100 and round to the nearest whole number: 
0.828 X 100 = 83% 


This basic formula is also used to compute the mean 
latency-per-response IOA or mean IRT-per-response IOA 
for latency and IRT data. An observer’s timings of latencies 
or IRTs in a session should never be added and the total 
time compared to a similar total time obtained by another 
observer as the basis for calculating IOA for latency and 
IRT measures. 

In addition to reporting mean agreement per occurrence, 
IOA assessment for timing data can be enhanced with informa- 
tion about the range of differences between observers’ timings 
and the percentage of responses for which the two observers 
each obtained measures within a certain range of error. For 
example, a researcher might report: “Mean duration-per- 
occurrence IOA for Participant 1’s compliance was 87% (range 
across responses, 63% to 100%), and 96% of all timings 
obtained by the second observer were within + /—2 seconds 
of the primary observer’s measures.” 


IOA for Data Obtained by Interval 

Recording/Time Sampling 

Three techniques commonly used by applied behavior analysts 
to calculate IOA for interval data are interval-by-interval IOA, 
scored-interval IOA, and unscored-interval IOA. 


Interval-by-Interval IOA. When using an interval-by-interval 
IOA (sometimes referred to as the point-by-point, total inter- 
val, or block-by-block method), the primary observer’s record 
for each interval is matched to the secondary observer’s record 
for the same interval. The formula for calculating interval-by- 
interval IOA is as follows: 


Number of intervals agreed 


Number of intervals agreed + number of intervals disagreed 


x 100 = interval-by-interval IOA % 


The hypothetical data in Figure 5.3 show the interval-by- 
interval method for calculating IOA based on the record of two 
observers who recorded the occurrence (X) and nonoccurrence 
(O) of behavior in each of 10 observation intervals. The observ- 
ers’ data sheets show that they agreed on the occurrence or the 
nonoccurrence of the behavior for seven intervals (Intervals 
2,3, 4,5, 7,9, and 10). Interval-by-interval IOA for this data set 
is 70% (i.e.,7 + [7 + 3] X 100 = 70%). 

Interval-by-interval IOA is likely to overestimate the 
actual agreement between observers measuring behaviors that 
occur at very low or very high rates. This is because inter- 
val-by-interval IOA is subject to random or accidental agree- 
ment between observers. For example, with a behavior whose 
actual frequency of occurrence is only about 1 or 2 intervals per 
10 observation intervals, even a poorly trained and unreliable 
observer who misses some of the few occurrences of the behay- 
ior and mistakenly records the behavior as occurring in some 
intervals when the behavior did not occur is likely to mark most 
intervals as nonoccurrences. As a result of this chance agree- 
ment, interval-by-interval IOA is likely to be quite high. Two 
IOA methods that minimize the effects of chance agreements 
for interval data on behaviors that occur at very low or very 
high rates are scored-interval IOA and unscored-interval IOA 
(Hawkins & Dotson, 1975). 


Figure 5.3 When calculating interval-by-interval OA, the number of intervals in which 
both observers agreed on the occurrence or the nonoccurrence of the behavior (shaded 
intervals: 2, 3, 4, 5, 7, 9, 10) is divided by the total number of observation intervals (10). 


Interval no. > 


Observer 1 


Observer 2 


Interval-by-interval IOA for the data shown here is 70% (7/10). 


X = behavior was recorded as occurring during interval 


0 = behavior was recorded as not occurring during interval 
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Scored-Interval IOA. Only those intervals in which either or 
both observers recorded the occurrence of the target behavior 
are used in calculating scored-interval IOA. An agreement 
is counted when both observers recorded that the behav- 
ior occurred in the same interval, and each interval in which 
one observer recorded the occurrence of the behavior and the 
other recorded its nonoccurrence is counted as a disagreement. 
For example, for the data shown in Figure 5.4, only Intervals 
1, 3, and 9 would be used in calculating scored-interval IOA. 
Intervals 2, 4,5, 6, 7, 8, and 10 would be ignored because both 
observers recorded that the behavior did not occur in those 
intervals. Because the two observers agreed that the behavior 
occurred in only one (Interval 3) of the three scored intervals, 
the scored-interval IOA measure is 33% (1 interval of agree- 
ment divided by the sum of | interval of agreement plus 2 inter- 
vals of disagreement X 100 = 33%). 

For behaviors that occur at low rates, scored-interval IOA is 
a more conservative measure of agreement than interval-by-inter- 
val IOA. This is because scored-interval IOA ignores the intervals 
in which agreement by chance is highly likely. For example, using 
the interval-by-interval method for calculating IOA for the data 
in Figure 5.4 would yield an agreement of 80%. To avoid over- 
inflated and possibly misleading IOA measures, we recommend 
using scored-interval interobserver agreement for behaviors that 
occur at frequencies of approximately 30% or fewer intervals. 


Unscored-Interval IOA. Only intervals in which either or both 
observers recorded the nonoccurrence of the target behavior 


are considered when calculating unscored-interval IOA. An 
agreement is counted when both observers recorded the nonoc- 
currence of the behavior in the same interval, and each interval 
in which one observer recorded the nonoccurrence of the behav- 
ior and the other recorded its occurrence is counted as a dis- 
agreement. For example, only Intervals 1, 4, 7, and 10 would be 
used in calculating the unscored-interval IOA for the data in 
Figure 5.5 because at least one observer recorded the nonoc- 
currence of the behavior in each of those intervals. The two 
observers agreed that the behavior did not occur in Intervals 
4 and 7. Therefore, the unscored-interval IOA in this example is 
50% (2 intervals of agreement divided by the sum of 2 intervals 
of agreement plus 2 intervals of disagreement X 100 = 50%). 

For behaviors that occur at relatively high rates, 
unscored-interval IOA provides a more stringent assessment of 
interobserver agreement than does interval-by-interval IOA. To 
avoid overinflated and possibly misleading IOA measures, we 
recommend using unscored-interval interobserver agreement for 
behaviors that occur at frequencies of approximately 70% or 
more of intervals. 

The equations for calculating each of the aforementioned 
types of IOA are summarized in Figure 5.6. 


Considerations in Selecting, Obtainirg, 
and Reporting Interobserver Agreement 


The guidelines and recommendations that follow are organized 
under a series of questions concerning the use of interobserver 
agreement to evaluate the quality of behavioral measurement. 


Figure 5.4 Scored-interval IOA is calculated using only those intervals in which either 
observer recorded the occurrence of the behavior (shaded intervals: 1, 3, 9). Scored- 
interval IOA for the data shown here is 33% (1/3: interval 3). 


Scored-Interval |OA 


Interval no. > 


Observer 1 


Observer 2 


X = behavior was recorded as occurring during interval 


O = behavior was recorded as not occurring during interval 


Figure 5.5 Unscored-interval IOA is calculated using only those intervals in which either 
observer recorded the nonoccurrence of the behavior (shaded intervals: 1, 4, 7, 10). 
Unscored-interval IOA for the data shown here is 50% (2/4: Interval 4 and 7). 


Unscored-Interval |OA 


Interval no. > 


Observer 1 


Observer 2 


X = behavior was recorded as occurring during interval 


0 = behavior was recorded as not occurring during interval 
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Type of IOA 


Total count 


Figure 5.6 Equations for calculating various types of interobserver agreement (IOA). 


Equation 


Int 1 1OA + Int 210A + Intn lOA . 


smaller count 


100 
larger count 


100 


Mean count-per-interval 


n intervals 


Exact count-per-interval 


number of intervals of 100% IOA 8 


number of trials (items) agreed 


100 


n intervals 


x 100 


Trial-by-trial 


Total duration 


total number of trials (items) 


Dur |OA R1 + Dur lOA R2 + Dur IOA Rn 


shorter duration 
longer duration 


Mean duration-per-occurrence 


Interval-by-zinterval 


n responses with Dur lOA 


number of intervals agreed 


x 100 


100 


Scored-interval 


no. intervals agreed + no. intervals disagreed 


no. of intervals in which either observer 
recorded the occurrence of the behavior 


x 100 


total number of all scored intervals 


no. of intervals in which either observer 
recorded the nonoccurrence of the behavior 


Unscored-interval 


How Often and When Should IOA Be Obtained? 


Interobserver agreement should be assessed during each condi- 
tion and phase of a study and be distributed across days of the 
week, times of day, settings, and observers. Scheduling IOA 
assessments in this manner ensures that the results will provide a 
representative (i.e., valid) picture of all data obtained in a study. 
It is generally recommended that IOA be obtained for a mini- 
mum of 20% of a study’s sessions, and preferably between 25% 
and 33% of sessions (Gast & Ledford, 2014; Kennedy, 2005; 
Poling, Methot, & LeSage, 1995). 

In general, studies using data obtained with real-time mea- 
surement will have IOA assessed for a higher percentage of ses- 
sions than studies with data obtained from permanent products. 

The frequency with which data should be assessed using 
interobserver agreement will vary depending on the complexity 
of the measurement code, the number and experience of observ- 
ers, the number of conditions and phases, and the results of the 
IOA assessments themselves. More frequent IOA assessments 
are expected in studies that involve complex or new measurement 
systems, inexperienced observers, and numerous conditions and 
phases. If appropriately conservative methods for obtaining and 
calculating IOA reveal high levels of agreement early in a study, the 
number and proportion of sessions in which IOA is assessed may 


total number of all unscored intervals 


x 100 


decrease as the study progresses. For instance, IOA assessment 
might be conducted in each session at the beginning of an analysis, 
and then reduced to a schedule of once per four or five sessions. 


For What Variables Should |OA Be Obtained 
and Reported? 


In general, researchers should obtain and report IOA at the same 
levels at which they report and discuss the results of their study. 
For example, a researcher analyzing the relative effects of two 
treatment conditions on two behaviors of four participants in 
two settings should report IOA outcomes on both behaviors 
for each participant separated by treatment condition and set- 
ting. This would enable consumers of the research to judge the 
relative believability of the data within each component of the 
experiment. 


Which Method of Calculating IOA Should Be Used? 


No method for calculating IOA is universally superior. Because 
each method shows systematic bias in various situations 
(Mudford, Martin, Hui, & Taylor, 2009), behavior analysts 
should select IOA method(s) best suited to the measurement 
system and resulting data set. As a general rule, more stringent 
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and conservative methods of calculating IOA should be used 
over methods that are likely to overestimate actual agreement 
as a result of chance. With event recording data, we recommend 
reporting overall IOA on a trial-by-trial or item-by-item basis, 
perhaps supplemented with separate IOA calculations for cor- 
rect responses and incorrect responses. For data obtained by 
interval or time sampling measurement, we recommend sup- 
plementing interval-by-interval IOA with scored-interval IOA 
or unscored-interval IOA, depending on the relative frequency 
of the behavior. In situations in which the primary observer 
scores the target behavior as occurring in approximately 30% 
or fewer intervals, scored-interval IOA provides a conservative 
supplement to interval-by-interval IOA. Conversely, when the 
primary observer scores the target behavior as occurring in 
approximately 70% or more of the intervals, unscored-interval 
IOA should supplement interval-by-interval IOA. If the rate at 
which the target behavior occurs changes from very low to very 
high, or from very high to very low, across conditions or phases 
of a study, reporting both unscored-interval and scored-interval 
IOA may be warranted. 

If in doubt about which form of IOA to report, calculating 
and presenting several variations will help readers make their 
own judgments regarding the believability of the data. How- 
ever, if the acceptance of the data for interpretation or decision 
making rests on which formula for calculating IOA is chosen, 
serious concerns about the data’s trustworthiness exist and must 
be addressed. 


What Are Acceptable Levels of IOA? 


Carefully collected and conservatively computed IOA assess- 
ments increasingly enhance the believability of a data set 
as agreement approaches 100%. The usual convention in 
applied behavior analysis is to expect independent observers 
to achieve a mean of no less than 80% agreement when using 
observational recording. However, as Kennedy (2005) pointed 
out, “There is no scientific justification for why 80% is neces- 
sary, only a long history of researchers using this percentage 
as a benchmark of acceptability and being successful in their 
research activities” (p. 120). 

According to Kleinmann et al. (2009), “convention stip- 
ulates that mean [interobserver] agreement should equal or 
exceed 85%” (p. 474). However, various factors at work in 
a given situation may make an 85% criterion too low or too 
high (Kazdin, 2011). For example, 95% IOA on the number 
of words contained in student compositions should raise seri- 
ous questions about the trustworthiness of the data. IOA at 
or very near 100% is needed to enhance the believability of 
count data obtained from permanent products. However, some 
analysts might accept data with a mean IOA below 80% for the 
simultaneous measurement of multiple behaviors by several 
subjects in a complex environment, especially if it is based on 
a sufficient number of individual IOA assessments with a small 
range (e.g., 75% to 83%). 

The degree of behavior change revealed by the data 
should also be considered when determining an acceptable level 
of interobserver agreement. When behavior change from one 


condition to another is small, the variability in the data might 
represent inconsistent observation more than actual change 
in the behavior. Therefore, the smaller the change in behay- 
ior across conditions, the higher the criterion should be for an 
acceptable IOA percentage. 


How Should IOA Be Reported? 


IOA scores can be reported in narrative, table, and graphic form. 
Whichever format is chosen, it is important to note how, when, 
and how often interobserver agreement was assessed. 


Narrative Description. The most common approach for report- 
ing IOA is a simple narrative description of the mean and 
range of agreement percentages. For example, Craft, Alber, 
and Heward (1998) described the methods and results of IOA 
assessments in a study in which four dependent variables were 
measured as follows: 


Student recruiting and teacher praise. A second observer 
was present for 12 (30%) of the study’s 40 sessions. The 
two observers independently and simultaneously observed 
the 4 students, recording the number of recruiting responses 
they emitted and teacher praise they received. Descrip- 
tive narrative notes recorded by the observers enabled 
each recruiting episode to be identified for agreement 
purposes. Interobserver agreement was calculated on an 
episode-by-episode basis by dividing the total number 

of agreements by the total number of agreements plus 
disagreements and multiplying by 100%. Agreement for 
frequency of student recruiting ranged across students 
from 88.2% to 100%; agreement for frequency of recruited 
teacher praise was 100% for all 4 students; agreement for 
frequency of nonrecruited teacher praise ranged from 
93.3% to 100%. 

Academic work completion and accuracy. A second 
observer independently recorded each student’s work 
completion and accuracy for 10 (25%) sessions. Interob- 
server agreement for both completion and accuracy on the 
spelling worksheets was 100% for all 4 students. (p. 403) 


Table. An example of reporting interobserver agreement 
outcomes in table format is shown in Table 5.1. Krantz and 
McClannahan (2003) reported the range and mean IOA com- 
puted for three types of social interactions by three children 
across each experimental condition. 


Graphic Display. Interobserver agreement can be represented 
visually by plotting the measures obtained by the secondary 
observer on a graph of the primary observer’s data, as shown 
in Figure 5.7. Looking at both observers’ data on the same 
graph reveals the extent of agreement between the observers 
and the existence of observer drift or bias. The absence of 
observer drift is suggested in the hypothetical study shown 
in Figure 5.6 because the secondary observer's measures 
changed in concert with the primary observer’s measures. 
Although the two observers obtained the same measure 
on only 2 of the 10 sessions in which IOA was assessed 
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TABLE 5.1 Example of Reporting Interobserver Agreement Results for Each Dependent Variable by Participant and 


Experimental Condition 


Range and Mean Percentage Interobserver Agreement on Scripted Interaction, Elaborations, and Unscripted Interaction 


by Child and Condition 


ee 


Condition 


Baseline Teaching New recipient Script fading New activities 
Type of Range M Range M Range M Range M Range M 
interaction 
Scripted 
David 88-100 94 100 100 
Jeremiah 89-100 98 100 —* 
Ben 80-100 98 90 —9 
Elaborations 
David 75-100 95 87-88 88 90-100 95 
Jeremiah 83-100 95 92-100 96 4 
Ben 75-100 05 95 —9 
Unscripted 
David 100 100 87-88 88 97-100 98 98-100 99 
Jeremiah 100 100 88-100 94 93-100 96 98 
Ben 100 100 100 92-93 92 98-100 99 


“No data are available for scripted responses and elaborations in the script-fading condition, because interobserver agreement was obtained after scripts were 
removed (i.e., because scripts were absent, there could be only unscripted responses). 


From “Social Interaction Skills for Children with Autism: A Script-Fading Procedure for Beginning Readers,” by P. J. Krantz and L. E. McClannahan, 1998, 
Journal of Applied Behavior Analysis, 31, p. 196. Copyright 1998 by the Society for the Experimental Analysis of Behavior, Inc. Reprinted by permission. 
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Figure 5.7 Plotting the second observer's data on a graph 
with the primary observer's data provides a visual represen- 
tation of the extent and nature of interobserver agreement 
and may enable assessment of observer drift, bias, and con- 
clusions regarding the presence of a functional relation. 


(Sessions 3 and 8), the fact that neither observer consistently 
reported measures that were higher or lower than the other 
suggests the absence of observer bias. An absence of bias is 
usually indicated by a random pattern of overestimation and 
underestimation. 

In addition to revealing observer drift and bias, a third 
way that graphically displaying IOA assessments can enhance 


the believability of measurement is illustrated by the data in 
Figure 5.6. When the data reported by the primary observer 
show clear change in the behavior between conditions or 
phases and all of the measures reported by the secondary 
observer within each phase fall within the range of observed 
values obtained by the primary observer, confidence increases 
that the data represent actual changes in the behavior mea- 
sured rather than changes in the primary observer’s behavior 
due to drift or extra-experimental contingencies. Likewise, 
graphic displays that reveal disagreement among observers 
can reduce viewers’ confidence that a functional relationship 
is evident (Ledford, Artman, Wolery, & Wehby, 2012; Ledford 
& Wolery, 2013). 

Although published research reports in applied behavior 
analysis seldom include graphic displays of IOA,® measures 
in creating and using such displays during a study constitute a 
simple and direct way for researchers to detect patterns in the 
consistency (or inconsistency) with which observers are measur- 
ing behavior that might not be as evident in comparing a series 
of percentages. 


Which Approach Should Be Used for Assessing 
the Quality of Measurement: Accuracy, Reliability, 
or Interobserver Agreement? 


Assessments of the accuracy of measurement, the reliability 
of measurement, and the extent to which different observers 
obtain the same measures each provide different indications 
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of data quality. Ultimately, the reason for conducting any type 
of assessment of measurement quality is to obtain quantitative 
evidence that can be used for the dual purposes of improving 
measurement during the course of an investigation and judging 
and convincing others of the trustworthiness of the data. 

After ensuring the validity of what they are measuring 
and how they are measuring it, applied behavior analysts should 
choose to assess the accuracy of measurement whenever pos- 
sible, rather than reliability or interobserver agreement. If it 
can be determined that all measurements in a data set meet an 
acceptable accuracy criterion, questions regarding the reliability 
of measurement and interobserver agreement are moot. For data 
confirmed to be accurate, conducting additional assessments of 
reliability or IOA is unnecessary. 

When assessing the accuracy of measurement is not possi- 
ble because true values are unavailable, an assessment of reliabil- 
ity provides the next best quality indicator. If natural or contrived 
permanent products can be archived, applied behavior analysts 
can assess the reliability of measurement, allowing consumers to 
know that observers have measured behavior consistently from 
session to session, condition to condition, and phase to phase. 

When true values and permanent product archives are 
unavailable, interobserver agreement provides a level of believ- 
ability for the data. Although IOA is not a direct indicator of the 
validity, accuracy, or reliability of measurement, it has proven to 
be a valuable and useful research tool in applied behavior analy- 
sis. Reporting interobserver agreement has been an expected and 
required component of published research in applied behavior 
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SUMMARY 


Indicators of Trustworthy Measurement 


1. To be most useful for science, measurement must be valid, 
accurate, and reliable. 


2. Valid measurement in ABA encompasses three equally 
important elements: (a) measuring directly a socially sig- 
nificant target behavior, (b) measuring a dimension of the 
target behavior relevant to the question or concern about 
the behavior, and (c) ensuring that the data are represen- 
tative of the behavior under conditions and during times 
most relevant to the reason(s) for measuring it. 


3. Measurement is accurate when observed values, the data 
produced by measuring an event, match the true state, or 
true values, of the event. 


4. Measurement is reliable when it yields the same values 
across repeated measurement of the same event. 


Threats to Valid Measurement Validity 


5. Indirect measurement—measuring a behavior different 
from the behavior of interest—threatens validity because it 
requires the researcher or practitioner to make inferences 
about the relationship between the measures obtained and 
the actual behavior of interest. 


analysis for several decades. In spite of its limitations, “the 
homely measures of observer agreement so widely used in 
the field are exactly relevant” (Baer, 1977, p. 119) to efforts to 
develop a robust technology of behavior change. 


Percentage of agreement, in the interval-recording para- 
digm, does have a direct and useful meaning: how often do 
two observers watching one subject, and equipped with the 
same definitions of behavior, see it occurring or not occur- 
ring at the same standard times? The two answers, “They 
agree about its occurrence X% of the relevant intervals, and 
about its nonoccurrence Y% of the relevant intervals,” are 
superbly useful. (Baer, 1977, p. 118) 


There are no reasons to prevent researchers from using 
multiple assessment procedures to evaluate the same data set. 
When time and resources permit, a combination of data assess- 
ments may be desirable. Applied behavior analysts can use any 
possible combination assessment (e.g., accuracy plus IOA, reli- 
ability plus IOA). In addition, some aspects of the data set could 
be assessed for accuracy or reliability, while other aspects are 
assessed with IOA. The previous example of accuracy assess- 
ment reported by Brown and colleagues (1996) included assess- 
ments for accuracy and IOA. Independent observers recorded 
correct and incorrect student-delayed retells. When IOA was 
less than 100%, data for that student and session were assessed 
for accuracy. IOA was used as an assessment to enhance believ- 
ability, and also as a procedure for selecting data to be assessed 
for accuracy. 


6. A researcher who uses indirect measurement must provide 
evidence that the behavior measured directly reflects, 
in some reliable and meaningful way, something about 
the behavior for which the researcher wishes to draw 
conclusions. 


7. Measuring a dimension of the behavior that is ill suited 
for, or irrelevant to, the reason for measuring the behavior 
compromises validity. 


8. Measurement artifacts are data that give an unwarranted or mis- 
leading picture of the behavior because of the way measure- 
ment was conducted. Discontinuous measurement, poorly 
scheduled observations, and insensitive or limiting measure- 
ment scales are common causes of measurement artifacts. 


Threats to Accurate and Reliable Measurement 


9. Most investigations in applied behavior analysis use 
human observers to measure behavior, and human 
error is the biggest threat to the accuracy and reliability 
of data. 


10. Factors that contribute to measurement error include 
poorly designed measurement systems, inadequate 
observer training, and expectations about what the data 
should look like. 
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11. 


12. 


13. 


14. 


15. 


16. 


Observers should receive systematic training and practice 
with the measurement system and meet predetermined 
accuracy and reliability criteria before collecting data. 


Observer drift—unintended changes in the way an 
observer uses a measurement system over the course of 
an investigation—can be minimized by booster training 
sessions and feedback on the accuracy and reliability of 
measurement. 


An observer’s expectations or knowledge about predicted 
or desired results can impair the accuracy and reliability 
of data. 


Observers should not receive feedback about the extent to 
which their data confirm or run counter to hypothesized 
results or treatment goals. 


Measurement bias caused by observer expectations can be 
avoided by using naive observers. 


Observer reactivity is measurement error caused by an 
observer’s awareness that others are evaluating the data he 
reports. 


Assessing the Accuracy and Reliability 
of Behavioral Measurement 


17. 


18. 


19. 


20. 


21. 


22. 


Researchers and practitioners who assess the accuracy of 
their data can (a) determine early in an analysis whether 
the data are usable for making experimental or treatment 
decisions, (b) discover and correct measurement errors, 
(c) detect consistent patterns of measurement error that 
can lead to the overall improvement or calibration of the 
measurement system, and (d) communicate to others the 
relative trustworthiness of the data. 


Assessing the accuracy of measurement is a straightfor- 
ward process of calculating the correspondence of each 
measure, or datum, assessed to its true value. 


True values for many behaviors of interest to applied 
behavior analysts are evident and universally accepted or 
can be established conditionally by local context. True val- 
ues for some behaviors (e.g., cooperative play) are difficult 
because the process for determining a true value must be 
different from the measurement procedures used to obtain 
the data one wishes to compare to the true value. 


Assessing the extent to which observers are reliably apply- 
ing a valid and accurate measurement system provides a 
useful indicator of the overall trustworthiness of the data. 


Assessing the reliability of measurement requires a natural 
or contrived permanent product so the observer can remea- 
sure the same behavioral events. 


Although high reliability does not confirm high accuracy, 
discovering a low level of reliability signals that the data 
are suspect enough to be disregarded until problems in the 
measurement system can be determined and repaired. 


Using Interobserver Agreement to Assess Behavioral 
Measurement 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33, 


The most commonly used indicator of measurement qual- 
ity in ABA is interobserver agreement (IOA), the degree to 
which two or more independent observers report the same 
observed values after measuring the same events. 


Researchers and practitioners use measures of IOA to (a) 
determine the competence of new observers, (b) detect 
observer drift, (c) judge whether the definition of the target 
behavior is clear and the system not too difficult to use, 
and (d) convince others of the relative believability of the 
data. 


Measuring IOA requires that two or more observers (a) use 
the same observation code and measurement system, (b) 
observe and measure the same participant(s) and events, 
and (c) observe and record the behavior independent of 
influence by other observers. 


There are numerous techniques for calculating IOA, 

each of which provides a somewhat different view of the 
extent and nature of agreement and disagreement between 
observers. 


Percentage of agreement between observers is the most 
common convention for reporting IOA in ABA. 


IOA for data obtained by event recording can be calculated 
by comparing (a) the total count recorded by each observer 
per measurement period, (b) the counts tallied by each 
observer during each of a series of smaller intervals of 
time within the measurement period, or (c) each observer’s 
count of 1 or 9 on a trial-by-trial basis. 


Total count IOA is the simplest and crudest indicator of 
IOA for event recording data, and exact count-per-interval 
IOA is the most stringent for most data sets obtained by 
event recording. 


IOA for data obtained by timing duration, response 
latency, or interresponse time (IRT) is calculated in essen- 
tially the same ways as for event recording data. 


Total duration IOA is computed by dividing the shorter of 
the two durations reported by the observers by the longer 
duration. Mean duration-per-occurrence IOA is a more 
conservative and usually more meaningful assessment of 
IOA for total duration data and should always be calcu- 
lated for duration-per-occurrence data. 


Three techniques commonly used to calculate IOA for 
interval data are interval-by-interval IOA, scored-interval 
IOA, and unscored-interval IOA. 


Because it is subject to random or accidental agreement 
between observers, interval-by-interval IOA is likely to 
overestimate the degree of agreement between observy- 
ers measuring behaviors that occur at very low or very 
high rates. 
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34. Scored-interval IOA is recommended for behaviors that 38. 
occur at relatively low frequencies; unscored-interval IOA 


is recommended for behaviors that occur at relatively high 
frequencies. 


The convention for acceptable IOA has been a minimum 
of 80%, but there can be no set criterion. The nature of 
the behavior being measured and the degree of behavior 
change revealed by the data must be considered when 


35. IOA assessments should occur during each condition and determining an acceptable level of IOA. 


phase of a study and be distributed across days of the 39. 


IOA scores can be reported in narrative, table, and graphic 
week, times of day, settings, and observers. 


form. 


36. Researchers should obtain and report IOA at the same levels 40. 


Researchers can use multiple indices to assess the 
at which they report and discuss the results of their study. 


quality of their data (e.g., accuracy plus IOA, reliability 


| 
37. More stringent and conservative IOA methods should be pS 


used over methods that may overestimate agreement as a 
result of chance. 


KEY TERMS aad —— an 2 oe 


accuracy interval-by-interval IOA scored-interval IOA 
believability mean count-per-interval IOA total count IOA 
calibration mean duration-per-occurrence IOA total duration IOA 


continuous measurement 
direct measurement 
discontinuous measurement 
exact count-per-interval IOA 
indirect measurement 


interobserver agreement (IOA) 


measurement bias 
naive observer 
observed value 
observer drift 
observer reactivity 


reliability 


trial-by-trial IOA 
true value 
unscored-interval IOA 


validity 


cnnecsqnedeSRRENORO TNR 


MULTIPLE-CHOICE QUESTIONS 


1. A(n) measure of the actual behavior of interest 
will always possess more validity than a(n) 
measure. 


a. Indirect, direct 

b. Direct, indirect 

c. Discontinuous, continuous 

d. Reliable, accurate 

Hint: (See “Indicators of Trustworthy Measurement’) 


2. measurement is consistent measurement. 
a. Reliable 
b. Valid 
c. Accurate 
d. Discontinuous 
Hint: (See “Indicators of Trustworthy Measurement”) 
3. When data give an unwarranted or misleading picture of 


the behavior because of the way measurement was con- 
ducted, the data are called a(n): 


a. Bias 
b. Artifact 


c. Drift 
d. Variable 
Hint: (See “Threats to Valid Measurement’) 


. In general, artifacts can be limited by scheduling 


observations. 


. Indirect 


a 
b. Infrequent 
c. Frequent 
d 


. Mini 
Hint: (See “Threats to Valid Measurement’) 


. Cumbersome and difficult to use measurement 


can lead to inaccurate and unreliable measurements. 
a. Agreement 

b. Instruments 

c. Observers 

d. Rulers 


Hint: (See “Threats to Accurate and Reliable 
Measurement’) 
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6. 


. Reliable data are not necessarily 


Which of the following is not a suggested training method 
for potential observers? 


a. Select observers carefully 
b. Train observers to an objective standard of competency 
c. Use a systematic approach to training 


d. Explicit feedback about data collection as it relates to 
the hypothesis 


Hint: (See “Threats to Accurate and Reliable 
Measurement’’) 

data. 
a. Complete 

b. Direct 

c. Accurate 

d. Real 


Hint: (See “Assessing the Accuracy and Reliability of 
Behavioral Measurement’) 


. Which of the following is not a requisite for obtaining 


valid IOA measures? 


a. Use the same observation code and measurement 
system 


b. Observe and measure the same participant(s) and 
events 


ESSAY-TYPE QUESTIONS 


1. 


_sengreoonnorEManngcoaNeoeCoNETEARADAN 


Discuss the difference between accuracy and reliability 
as they relate to behavioral data and suggest methods to 
evaluate both. 


Hint: (See “Threats to Valid Measurement’) 


. Describe threats to measurement validity in applied 


behavior analysis. Suggest ways to control the threats you 
describe. 


Hint: (See “Threats to Valid Measurement’) 


. Describe ways in which applied behavior analysis 


researchers can control human error before, during, and 
after data collection. 


Hint: (See “Threats to Accurate and Reliable Measurement’’) 


CEEEIEORAAATOONRAASNIDOSIENS SSR STEREO RASS PETE EES SECT 


NOTES 


ih. 


to 


Strategies for increasing the accuracy of self-reports can be found in Critch- 
field, Tucker, and Vuchinich (1998) and Finney, Putnam, and Boyd (1998). 


. Single measures, such as pretests and posttests, can provide valuable infor- 


mation on a person’s knowledge and skills before and after instruction 
or treatment. The use of probes, occasional but systematic measures, to 
assess maintenance and generalization of behavior change is discussed in 
Chapter 30. 


. We recommend using automatic data-recording devices whenever possible. 


For example, to measure the amount of exercise by boys on stationary 


10. 


4. 


SO EE RISERS OO EIS SO SEES SC SSE I SESS SSS DOSES St ES OCS SOCN 


c. Observe and record the behavior independent of any 
influence from one another 

d. Score all IOA while sitting side-by-side with another 
observer 


Hint: (See “Assessing the Accuracy and Reliability of 
Behavioral Measurement’) 


. Which of the following is a limitation of total count IOA? 


a. Provides no assurance that the two observers recorded 
the same instances of behavior 


b. Useful for event recording 
c. Is expressed as percentage of agreement 


d. Is calculated by dividing the smaller of the counts by 
the larger count and multiplying by 100 


Hint: (See “Methods for Calculating IOA’”’) 
Which of the following is the most stringent method for 


calculating IOA for behaviors that occur at relatively high 
rates and obtained by interval recording? 


a. Total Count IOA 

b. Unscored-interval IOA 

c. Scored-interval IOA 

d. Mean Count-per-Interval IOA 

Hint: (See “Methods for Calculating IOA”’) 


soon goo S oS HESSESOAIO ASAE OC DOL RAISON SIO STEERS 


Discuss a training strategy you might use for data 
collectors/observers. 


Hint: (See “Threats to Accurate and Reliable 
Measurement’’) 


. Discuss the role of collecting interobserver agreement 


measures in applied behavior analysis research. List vari- 
ous methods for assessing agreement for event recording 
and interval recordings of behavioral data. 


Hint: (See “Using Interobserver Agreement to Assess 
Behavioral Measurement’) 


bicycles, DeLuca and Holborn (1992) used magnetic counters that auto- 
matically recorded the number of wheel revolutions. 


. The official time in the United States can be accessed through the National 


Institute of Standards and Technology: https://www.time.gov/. The ytter- 
bium clock, though experimental at this time, exceeds the accuracy of the 
cesium clock with respect to emerging standards of uncertainty, stability, 
and reproducibility! 


. The preferred spelling of a word may change (e.g., judgement becomes 


judgment), but in such cases a new true value is established. 
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6. IOA can be calculated by product-moment correlations, which range from 
+1.0 to —1.0. However, expressing IOA by correlation coefficients has two 
major weaknesses: (a) High coefficients can be achieved if one observer 
consistently records more occurrences of the behavior than the other, and 
(b) correlation coefficients provide no assurance that the observers agreed 
on the occurrence of any given instance of behavior (Poling, Methot, & 
LeSage, 1995). Hartmann (1977) described the use of kappa (k) as a mea- 
sure of IOA. The k statistic was developed by Cohen (1960) as a procedure 
for determining the proportion of agreements between observers that would 
be expected as a result of chance. However, the k statistic is seldom reported 
in the behavior analysis literature. 

- Multiple terms are used in the applied behavior analysis literature for the 
same methods of calculating IOA, and the same terms are sometimes used 


with different meanings. We believe the IOA terms used here represent the 
discipline’s most used conventions. To point out and preserve some mean- 


ingful distinctions among variations of IOA measures, we have introduced 
several terms. 


. Ahand search by Artman, Wolery, and Yoder (2012) of all publications in 


the Journal of Applied Behavior Analysis from 1977 through May 2009 
found just four studies that presented both observers’ data on the same 
graph (Luce, Delquardi, & Hall, 1980: Tertinger, Greene, & Lutzker, 1984; 
Van Houten & Nau, 1980; Van Houten & Rolider, 1984). Artman et al. dis- 
covered no articles containing graphic displays of IOA published in JABA 
in the 25 years following the two instances in 1984. 
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PART 


Evaluating and Analyzing 
Behavior Change 


Part 2 described considerations and procedures for selecting and defining target behaviors; outlined 
detailed methods for measuring behavior; and examined techniques for improving, assessing, and 
reporting the veracity of behavioral measurement. The product of these measurements—data—is the 
medium with which behavior analysts work. But what does the behavior analyst do with data? The 
five chapters in Part 3 focus on the presentation and interpretation of behavioral data and the design, 
conduct, and evaluation of experiments that analyze the effects of interventions. 

Chapter 6 presents the purpose and benefits of graphic displays of behavioral data, identifies 
the types of graphs used in applied behavior analysis, explains how to construct line graphs, and 
describes how behavior analysts interpret graphically displayed data. 

Although measurement and graphic displays can reveal whether, when, and to what extent 
behavior has changed, they alone cannot reveal what brought about the behavior change. Chapters 
7 through 10 are devoted to what constitutes analysis in applied behavior analysis. Chapter 7 
outlines the concepts and assumptions underlining the analysis of behavior; describes the requisite 
components of an experiment in behavior analysis; and explains how researchers and practitioners 
apply steady state strategy and the three elements of basic logic—prediction, verification, and 
replication—to seek and verify functional relations between behavior and its controlling variables. 
Chapters 8 and 9 describe the logic and operation of reversal, multielement, multiple baseline, and 
changing criterion designs—the most commonly used experimental designs in applied behavior 
analysis. Chapter 10 covers a wide range of topics requisite to a more complete understanding of 
behavioral research. Beginning with the assumption that the research methods of any science should 
reflect the characteristics of its subject matter, we examine the importance of analyzing the behavior 
of the individual client or research participant, discuss the value of flexibility in experimental design, 
identify common confounds to the internal validity of experiments, present methods for assessing 
the social validity of behavior changes and the treatments that accomplish them, and explain how 
replication is used to determine the external validity of research. Chapter 10 concludes with a series 
of issues and questions to consider in evaluating the “goodness” of a published applied behavior 
analysis study. 


147 


a forget eal & per owt 
& th abnle a . Recor af the count 

the tape ome « mg ty HO 
We (he “Mico w shang BOTY 


1 WEiLn vs Oe Rdleesr: & 
{eee Ut yee Oe 4 ine 
Pr ea! CA’ a ing? 
=a Pres Chor & : 
h t el a i 
¢. Sciwerl 4m i. Ve on 
a, Mean ( aad ve @traad WaA Bae 

. tier ew Vina ge “ohwililaaly a a 


ce hermes cet acee) ee! roladity i Pores ab tne, 
Pontenian : Knerui dv ou) =Aper’ a — uete } ‘ Ales bint Ds ts: a... < — 
ae i ; ; = o/ 
7 % me Vind ped ~OWewds ae aod Reda 
‘s j ¥ i Vu tore ree Vseeaiicorau » ” 


. nites 

PATS (> waceath oe a? “cote o —/;, pot 2 ia ins tad tr4 aw tors, _ ”" 
; anal pans Seat Kgs fe ete) Be Gee yw + geowruirca 4 a Patio <= tee 
i? indin ces « on Ee egoerqent (et ; . : 
Mt ied. (See Thremie io ) 049) ome amy) wate 0.6) tr melas a4 Petieee] ep _ 


t i > 454 “" b ieee Orage yn 
‘¢ pe Re ous im ws. ar A ts a m 4 


clers. Gay Ser 3 ¢ ‘ Ketievewsl Wesert meni) 
: data 2itestau 
> Hiw (Aa T tees & = «@ Pi \! : ‘ 


‘Sine { ‘ magye es, Delian cmt Helter (2) vel g 
nei Pads fun > ote @ heey PBeacalls PiowaA? Be womber of shee] revileee 
ten 7: . we © Fine fC Radal nee verte Toned Sates on be accede ieee 
‘ Pee Kien igette Af Siandlerr= and Teetacdigy: (age 
ae tat tye ex) ; + Hheray Chee, okey, HF peiineents! w# Thad tune; 108 " 
‘. ‘eof ; t0tum ak lene ee orm she 4 & 
ui ong Me 


PART 


Evaluating and Analyzing 
Behavior Change 


Part 2 described considerations and procedures for selecting and defining target behaviors; outlined 
detailed methods for measuring behavior; and examined techniques for improving, assessing, and 
reporting the veracity of behavioral measurement. The product of these measurements—data—is the 
medium with which behavior analysts work. But what does the behavior analyst do with data? The 
five chapters in Part 3 focus on the presentation and interpretation of behavioral data and the design, 
conduct, and evaluation of experiments that analyze the effects of interventions. 

Chapter 6 presents the purpose and benefits of graphic displays of behavioral data, identifies 
the types of graphs used in applied behavior analysis, explains how to construct line graphs, and 
describes how behavior analysts interpret graphically displayed data. 

Although measurement and graphic displays can reveal whether, when, and to what extent 
behavior has changed, they alone cannot reveal what brought about the behavior change. Chapters 
7 through 10 are devoted to what constitutes analysis in applied behavior analysis. Chapter 7 
outlines the concepts and assumptions underlining the analysis of behavior; describes the requisite 
components of an experiment in behavior analysis; and explains how researchers and practitioners 
apply steady state strategy and the three elements of basic logic—prediction, verification, and 
replication—to seek and verify functional relations between behavior and its controlling variables. 
Chapters 8 and 9 describe the logic and operation of reversal, multielement, multiple baseline, and 
changing criterion designs—the most commonly used experimental designs in applied behavior 
analysis. Chapter 10 covers a wide range of topics requisite to a more complete understanding of 
behavioral research. Beginning with the assumption that the research methods of any science should 
reflect the characteristics of its subject matter, we examine the importance of analyzing the behavior 
of the individual client or research participant, discuss the value of flexibility in experimental design, 
identify common confounds to the internal validity of experiments, present methods for assessing 
the social validity of behavior changes and the treatments that accomplish them, and explain how 
replication is used to determine the external validity of research. Chapter 10 concludes with a series 
of issues and questions to consider in evaluating the “goodness” of a published applied behavior 
analysis study. 
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Constructing and Interpreting Graphic 


Displays of Behavioral Data 


LEARNING OBJECTIVES 


¢ State the purpose and list the benefits of graphic displays of behavioral data. 
e List and describe the types of graphs used in applied behavior analysis. 
¢ Given a set of behavioral data, select the appropriate data display to communicate quantitative relations. 


¢ Given an unlabeled line graph, provide the correct labels. 


Describe and discuss the proper use of a scatterplot. 


pplied behavior analysts quantify behavior change with 
Ac and repeated measurement. The product of these 

measurements, called data, is the medium with which 
applied behavior analysts work. In everyday usage, the word 
data refers to a wide variety of often imprecise and subjective 
information offered as facts. In scientific usage data means “‘the 
results of measurement, usually in quantified form” (Johnston 
& Pennypacker, 1993a, p. 365).! 

Because behavior change is dynamic and ongoing, behav- 
ior analysts—practitioners and researchers—strive for direct and 
continuous contact with the target behavior. The data obtained 
throughout a behavior change program or research study provide 
the empirical basis for every important decision: continue with 
the current intervention or experimental condition, implement 
a different intervention or condition, or reinstitute a previous 
condition. But making valid and reliable decisions from raw data 
(a series of numbers) is difficult, if not impossible, and ineffi- 
cient. Inspecting a long sequence of numbers will reveal only 
large and abrupt changes in performance, or no change at all, and 
important features of behavior change can easily be overlooked. 

Consider the three data sets that follow, each one a series 
of numbers representing consecutive measures of different tar- 
get behaviors. The first data set shows the number of responses 
emitted under two conditions (A and B): 


Condition A 
(AAG) WS, WSs SiO) 114 WO, WS, W241, 
IomS0ni23 20; IO; WG, OZ, 120, 
120, 127 sy, 2 


—_—_—_— 


Condition B 
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Describe and state the purpose of semilogarithmic charts. 


Describe and discuss the proper use of a cumulative record. 


Given a set of behavioral data, construct and label a line graph. 
Given a set of behavioral data, select the most appropriate graphic display. 
Given a graphic display of behavioral data, identify the presence or absence of variability, level, and trends. 


The second data set comprises consecutive measures of the 
percentage of correct responses: 


60; 62? 78, 35,-60"90,-65/ So, 9OPIZ 


The third data set consists of responses per minute mea- 
sured on successive school days: 


2542, S380 20520; Alin Some Dio Lee WO msg aa 


What do these numbers tell you? What conclusions can 
you draw from each data set? How long did it take you to reach 
your conclusions? How sure of them are you? What if the data 
sets contained many more measures to interpret? How likely is it 
that others interested in the behavior change program or research 
study would reach the same conclusions? How could these data 
be directly and effectively communicated to others? This chap- 
ter will show you how graphs can help answer these questions. 

Graphs—telatively simple formats for visually displaying 
relationships among and between a series of measurements and 
relevant variables—help people “make sense” of quantitative 
information. Graphs are the major device with which applied 
behavior analysts organize, store, interpret, and communicate 
the results of their work. Figure 6.1 includes a graph for each 
of the three data sets presented previously. The top graph 
reveals a level of responding during Condition B lower than 
measured during Condition A. The middle graph shows an 
upward trend over time in correct responding. Evident in the 
bottom graph is a variable pattern of responding, characterized 
by higher response rates during the first part of each week and 
a decreasing trend toward the end of each week. The graphs 
in Figure 6.1 illustrate three fundamental properties of behav- 
ior change over time—level, trend, and variability—-each of 
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Figure 6.1 Graphic displays of three sets of hypothetical 
data illustrating changes in the level of responding 
across conditions (top), trend (middle), and cyclical 
variability (bottom). 


which will be discussed in detail later in the chapter. Graphic 
displays of behavioral data are an effective means of illus- 
trating and communicating these aspects of behavior change. 


PURPOSE AND BENEFITS OF GRAPHIC 
DISPLAYS OF BEHAVIORAL DATA 


In essence, the function of the graph is to communicate, in 
a readily assimilable and attractive manner, descriptions 
and summaries of data that enable rapid and accurate 


analysis of the facts. 
—Parsonson and Baer (1978, p. 134) 


The graphic display and visual analysis of behavioral data 
provide seven benefits to behavior analysts and their clients. 
First, plotting each measure of behavior ona graph soon after 


the observational period gives the practitioner or researcher 
immediate access to an ongoing visual record of the partici- 
pant’s behavior. Continual assessment of behavior change 
enables treatment and experimental decisions to be respon- 
sive to the participant’s performance. Graphs provide the 
“close, continual contact with relevant outcome data” that 
can lead to “measurably superior instruction” (Bushell & 
Baer, 1994, p. 9). 

Second, direct and continual contact with the data in a 
readily analyzable format enables the researchers to explore 
interesting variations in behavior as they occur. Some of the 
most important research findings about behavior have been 
made because scientists followed the leads suggested by their 
data instead of following predetermined experimental plans 
(Sidman, 1960, 1994; Skinner, 1956). 

Third, graphs, like statistical analyses of behavior change, 
are judgmental aids: devices that help practitioners, research- 
ers, and consumers interpret the results of a study or treatment 
(Michael, 1974). In contrast to the statistical tests of inference 
used in group comparison research, however, visual analysis of 
graphed data takes less time, is relatively easy to learn, imposes 
no predetermined or arbitrary level for determining the signifi- 
cance of behavior change, and does not require the data to meet 
preset mathematical properties or statistical assumptions to be 
analyzed.” 

Fourth, visual analysis is a conservative method for deter- 
mining the significance of behavior change. A behavior change 
deemed statistically significant by a test of mathematical prob- 
abilities may not look very impressive when the data are plotted 
on a graph that reveals great variability and repeated overlaps in 
the data within and across experimental or treatment conditions. 
This screening out of weak variables in favor of robust interven- 
tions has enabled applied behavior analysts to develop a useful 
technology of behavior change (Baer, 1977b).° 

Fifth, graphs enable and encourage independent judg- 
ments and interpretations of the meaning and significance of 
behavior change. Instead of relying on conclusions based on 
statistical manipulations of the data or an author’s interpreta- 
tions, readers of published reports of applied behavior analysis 
can (and should) form independent conclusions based on their 
visual analysis of the data.* 

Sixth, in addition to their primary purpose of displaying 
relationships between behavior change (or lack thereof) and vari- 
ables manipulated by the researcher or practitioner, graphs can 
provide feedback to the people whose behavior the graphs depict. 
Numerous studies have reported improved performance of a 
wide variety of target behaviors by participants when receiving 
graphical feedback, alone or as part of a package intervention, 
(e.g., Gil & Carter, 2016; Grossi & Heward, 1998; Perrin, Fredrick, 
& Klick, 2016; Quinn, Miltenberger, Abreu, & Narozanick, 2017; 
Squires et al., 2008). Graphing one’s own performance has also 
been demonstrated to be an effective intervention for a variety of 
academic and behavior change objectives (e.g., Kasper-Ferguson & 
Moxley, 2002; Stolz, Itoi, Konrad, & Alber-Morgan, 2008). 

Seventh, graphs facilitate the communication, dissemi- 
nation, and comprehension of behavior change among a wide 
variety of recipients (e.g., professionals, parents, government 
agents responsible for policymaking). A well-constructed 
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BOX 6.1 


Page Space Devoted to Graphs by Behavior Analysis Journals 


Open any issue of the Journal of Applied Behavior Analy- 
sis (JABA) to an article reporting an experimental study and 
chances are good you will see a graph. If not, turn a page or two 
in either direction, and a graph will almost certainly appear. 

Does that matter? Yes, it does, for several reasons. First, 
JABA’s frequent use of graphs emphasizes the central role of visual 
analysis of data in applied behavior analysis and underscores 
the obligation that practitioners, researchers, and journal article 
reviewers and editors know how to create and interpret graphic 
displays. Second, graphs take up a greater proportion of page 
space of journals in scientific disciplines (1.e., natural sciences) 
considered more mature, harder, and with greater consensus of 
knowledge than they do in less mature, softer scientific disci- 
plines (i.e., social sciences) (Arsenault, Smith, & Beauchamp, 
2006; Best, Smith, & Stubbs, 2001; Cleveland, 1984; Smith, 
Best, Stubbs, Johnston, & Bastiani-Archibald, 2000). 

How do behavior analysis journals compare to other sci- 
ences in that regard? In an effort to shed light on that question, 
Kubina, Kostewicz, and Datchuk (2008) computed the fractional 
graph area (FGA)—the proportion of page space devoted to 
graphs—in research articles from one randomly selected issue 


graph condenses aggregated individual numerical data so that 
relevant elements are visually salient and can be understood 
commonly. 

Behavior analysts’ heavy reliance on graphic data dis- 
plays puts them in good company with scientists in other natural 
sciences (see Box 6.1). 


GRAPHS USED BY APPLIED BEHAVIOR 
ANALYSTS 


The graphic displays most often used by applied behavior 
analysts are line graphs, bar graphs, cumulative records, ratio 
charts, and scatterplots. 


Line Graphs 


The simple line graph, or frequency polygon, is far and 
away the most common graphic format in applied behavior 
analysis. Based on the Cartesian coordinate system created by 
René Descartes in the 17th century, the line graph is a two- 
dimensional area formed by the intersection of two perpen- 
dicular reference lines, called axes. Any point within the area 
enclosed by the two axes represents a specific relationship 
between values of the two dimensions (or variables) described 
by the intersecting lines. In applied behavior analysis, each 
point on a line graph shows the level of some quantifiable 
dimension of the target behavior (i.e., the dependent variable) 
in relation to a specified point in time and/or environmental 
condition (1.e., the independent variable) in effect when the 
measure was taken. Comparing points on a graph reveals the 


from each of three volume years (1995, 2000, and 2005) from 
11 behavioral journals. Diagrams and flow charts that did not 
graphically present quantitative data were not included. 

Kubina and colleagues (2008) reported a mean FGA of 
057 for all behavior analysis journals, which included journals 
such as Behavior Therapy, Child and Family Therapy, Cognitive 
and Behavioral Practices, and the Journal of Behavior Therapy 
and Experimental Psychiatry (see Figure A). The four behav- 
ior analysis journals with the highest FGA ranged from .08 for 
The Analysis of Verbal Behavior to .17 for the Journal of the 
Experimental Analysis of Behavior (JEAB), equal to the FGAs 
of journals in the natural sciences, which range from .06 in 
geology to .18 in chemistry (as measured by Cleveland [1984)). 

Notable among Kubina and colleagues’ findings is that 
behavior analysis flagship journals’ (JABA and JEAB) compos- 
ite FGA of .147 ranked third among the natural sciences, trail- 
ing only chemistry and physics. While the authors cautioned 
that their results were obtained from just three issues from each 
journal, they noted that Best, Smith, and Stubbs’s (2001) inde- 
pendent assessment of page space devoted to graphs yielded an 
identical measure of .17 for JEAB. 


presence and extent of changes in level, trend, and/or variability 
within and across conditions. 


Parts of a Basic Line Graph 


Although graphs vary considerably in their final appearance, 
all properly constructed line graphs share certain elements. The 
basic parts of a simple line graph are shown in Figure 6.2 and 
described in the following sections. 


1. Horizontal Axis. The horizontal axis, also called the x-axis, 
or abscissa, is a straight horizontal line that most often represents 
the passage of time and the presence, absence, and/or value of the 
independent variable. A defining characteristic of applied behav- 
ior analysis is the repeated measurement of behavior across time. 
Time is also the unavoidable dimension in which all manipula- 
tions of the independent variable occur. On most line graphs the 
passage of time is marked in equal intervals on the horizontal 
axis. The horizontal axis in Figure 6.2 represents consecutive 
observational sessions from 9:00 to 10:00 AM on weekdays. 

The horizontal axis on some graphs represents different 
values of the independent variable instead of time. For example, 
Lalli, Mace, Livezey, and Kates (1998) scaled the horizontal axis 
of one graph in their study from less than 0.5 meter to 9.0 meters 
to show how the occurrence of self-injurious behavior by a girl 
with severe intellectual disabilities decreased as the distance 
between the therapist and the girl increased. 


2. Vertical Axis. The vertical axis, also called the y-axis, or 
ordinate, is a vertical line drawn upward from the left-hand end 
of the horizontal axis. The vertical axis most often represents 
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Figure A Proportion of page space devoted to graphic data displays by journals in various scientific disciplines. (Behavior 


analysis flagship journals are JABA and JEAB.) 


From “An Initial Survey of Fractional Graph and Table Area in Behavioral Journals,” by R. M. Kubina, D. E. Kostewicz, and S. M. D. Datchuk, 2008, 
The Behavior Analyst, 31, p. 65. Copyright 2008 by the Association for Behavior Analysis International. Used by permission. 


a range of values of the dependent variable, which in applied 3. Axis Labels. Axis labels are brief descriptions of the dimen- 
behavior analysis is always some quantifiable dimension of sion represented by and scaled along each axis; most often a 
behavior. The intersection of the horizontal and vertical axes —_ response measure on the vertical axis and experimental condi- 
is called the origin and most often, though not necessarily, tions across time on the horizontal axis. 


represents the zero value of the dependent variable. Each suc- 


cessive point upward on the vertical axis represents a greater 4. Condition Change Lines. Condition change lines are vertical 
value of the dependent variable. The most common practice _ lines drawn upward from the horizontal axis to show points in 
is to mark the vertical axis with an equal-interval scale, in time at which changes in the independent variable occurred. The 
which equal distances on the axis represent equal absolute condition change lines in Figure 6.2 coincide with the intro- 
amounts of behavior change. The vertical axis in Figure 6.2 duction or withdrawal of an intervention the researchers called 
represents the number of 50-meter laps walked in | hour. Token Reinforcement. 
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Figure 1. Laps walked per session for each participant. Triangles indicate days in which the 
participants arrived late to the walking session. 


Figure 6.2 Major parts of a simple line 
graph: (1) horizontal axis, 

(2) vertical axis, (3) axis labels, 

(4) condition change lines, (5) condi- 
tion labels, (6) data points, (7) data 
path, and (8) figure caption. (Original 
figure includes graphs for four addi- 
tional participants.) 

Based on “Using Token Reinforcement to 
Increase Walking for Adults with Intellectual 
Disabilities” by H. Krentz, R. Miltenberger, 
and D. Valbuena, 2016, Journal of Applied 
Behavior Analysis, 49, p. 749. Copyright 2016 
by the Society for the Experimental Analysis of 
Behavior. 
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5. Condition Labels. Condition labels are single words or brief 
descriptive phrases printed along the top of the graph and par- 
allel to the horizontal axis. These labels identify the experimen- 
tal conditions (i.e., the presence, absence, or some value of the 
independent variable) in effect during each phase of the study. 


6. Data Points. Each data point on a graph represents two facts: 
(a) a quantifiable measure of the target behavior recorded during 
a given observation period and (b) the time and/or experimental 
conditions under which that particular measurement was con- 
ducted. Using two data points from Figure 6.2 as examples, we 
can see that John walked 14 laps during Session 6, the last ses- 
sion of the first baseline phase, and 21 laps in Session 7, the first 
session of the first phase of the token reinforcement condition. 


7. Data Path. Connecting successive data points within a given 
condition with a straight line creates a data path. The data path 
represents the level and trend of behavior between successive 
data points and is a primary focus of attention in the interpre- 
tation and analysis of graphed data. Because behavior is rarely 
observed and recorded continuously in applied behavior analysis, 
the data path represents an estimate of the actual course taken 
by the behavior during the time between the two consecutive 
measures. The more measures and resultant data points per unit 
of time (given an accurate observation and recording system), the 
more confidence one can place in the story told by the data path. 


8. Figure Caption. The figure caption is a concise statement 
that, in combination with the axis and condition labels, provides 


the reader with sufficient information to identify the indepen- 
dent and dependent variables. The figure caption should explain 
any symbols (see Figure 6.2) or observed but unplanned events 
that may have affected the dependent variable and point out 
and clarify any potentially confusing features of the graph (see 
Figure 6.6 and Figure 6.7). 


Variations of the Simple Line Graph: 
Multiple Data Paths 


The line graph is a remarkably versatile vehicle for displaying 
behavior change. Whereas Figure 6.2 is an example of the line 
graph in its simplest form (one data path showing a series of suc- 
cessive measures of behavior across time and experimental con- 
ditions), by the addition of multiple data paths, the line graph can 
display more complex behavior—environment relations. Graphs 
with multiple data paths are used frequently in applied behavior 
analysis to show (a) two or more dimensions of the same behavior, 
(b) two or more different behaviors, (c) the same behavior under 
different and alternating experimental conditions, (d) changes in 
target behavior relative to the changing values of an independent 
variable, and (e) the behavior of two or more participants. 


Two or More Dimensions of the Same Behavior. Showing 
multiple dimensions of the dependent variable on the same graph 
enables visual analysis of the absolute and relative effects of the 
independent variable on those dimensions. Figure 6.3 shows the 
results of a study of the effects of training three members of 
a women’s college basketball team proper foul shooting form 


From “The Effects of Form Training on Foul-Shooting 
Performance in Members of a Women’s College 0 


Basketball Team,” by C. N. Kladopoulos and 


Baseline Form Training 
100 100 
80 80 
60 win / Whe 60 
40 40 
20 20 
' @ foul shot data 
0 oS Soar A form data Participant 1 
1 a 
Or MIs = =—-_ SAMS aaa D 2 
$ 100 100 °s 
= = 
Z 80 80 
& 60 5 
= 60 § 
a 40 i Ss 
a 1 40 o 
£ D 
5 20 20 = 
= fen, ee Shes Participant 2 o 
a 0 ia Tas 2 by 
100 apt ae | 100 
Figure 6.3 Graph with multiple data paths 7 
, ; 80 i 80 
showing the effects of the independent 
variable (Form Training) on two dimensions 60 60 
(accuracy and topography) of the depen- 40 
dent variable. 40 
20 20 
Participant 3 
Tae saaee 0 
25 


J. J. McComas, 2001, Journal of Applied Behavior 
Analysis, 34, p. 331. Copyright 2001 by the Society 
for the Experimental Analysis of Behavior, Inc. Used 
by permission. 
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Figure 1. Percentage of shots made (filled circles) and percentage of shots taken with correct 
form (open triangles) across sessions for each participant. 
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(Kladopoulos & McComas, 2001). The data path created by 
connecting the open triangle data points shows changes in the 
percentage of foul shots executed with the proper form, whereas 
the data path connecting the solid data points reveals the per- 
centage of foul shots made. Had the experimenters recorded 
and graphed only the players’ foul shooting form, they would 
not have known whether any improvements in the target behav- 
ior on which training was focused (correct foul shooting form) 
coincided with improvements in the behavior by which the 
social significance of the study would ultimately be judged— 
foul shooting accuracy. By measuring and plotting both form 
and outcome on the same graph, the experimenters were able to 
analyze the effects of their treatment procedures on two critical 
dimensions of the dependent variable. 


Two or More Different Behaviors. Multiple data paths are 
also used to facilitate simultaneous comparison of the effects 
of experimental manipulations on two or more different behay- 
iors. Determining the variation of two behaviors as a function of 
changes in the independent variable is accomplished more easily 
if both behaviors can be displayed on the same graph. Figure 6.4 
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Figure 2. Occurrence of stereotypy for James across attention, demand, and no-attention conditions. Data are 
arrayed as the percentage of intervals of stereotypy on the left y axis and number of signs per sessions on the 


right y axis. 


“vy 


shows the percentage of intervals in which a boy with autism 
exhibited stereotypy (e.g., repetitive body movements, rocking) 
across three conditions, and the number of times that he raised his 
hand for attention (in the attention condition), signed for a break 
(in the demand condition), and signed for access to preferred tan- 
gible stimuli (in the no-attention condition) in a study investigat- 
ing a strategy called functional communication training (Kennedy, 
Meyer, Knowles, & Shukla, 2000).° By recording and graphing 
both stereotypic responding and appropriate behavior, the inves- 
tigators were able to determine whether increases in alternative 
communication responses (raising his hand and signing) were 
accompanied by reductions in stereotypy. Note that Figure 6.4 
includes a second vertical axis showing the dimensional units and 
scaling for signing frequency. Because of the differences in scale, 
readers of dual-vertical axis graphs must view them with care, par- 
ticularly when assessing the magnitude of behavior change. 


Measures of the Same Behavior Under Different Conditions. 
Multiple data paths can also be used to represent measures 
of the same behavior taken under different experimental 
conditions that alternate throughout an experimental phase. 
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Figure 6.4 Graph with multiple data paths show- 

10 ing two different behaviors by one participant 
during baseline and training across three differ- 
ent conditions. Note the different dimensions 

0 and scaling of the dual vertical axes. 

55 From “Analyzing the Multiple Functions of Stereotypical 


Behavior for Students with Autism: Implications for Assess- 
ment and Treatment,” by C. H. Kennedy, K. A. Meyer, 

T. Knowles, and S. Shukla, 2000, Journal of Applied Behavior 
Analysis, 33, p. 565. Copyright 2000 by the Society for the 
Experimental Analysis of Behavior, Inc. Used by permission. 
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Figure 6.5 Graph with multiple data 
paths showing the same behav- 

ior measured under four different 
conditions. 

Based on “The Influence of Therapist Attention 
on Self-Injury during a Tangible Condition” 

by J. W. Moore, M. M. Mueller, M. Dubard, 
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Tangible 


D. S. Roberts, and H. E. Sterling-Turner, 2002, 
Journal of Applied Behavior Analysis, 35, 

p. 285. Copyright 2002 by the Society for 

the Experimental Analysis of Behavior, Inc. 
Used by permission. 


Figure 6.5 shows self-injurious responses per minute by a 
6-year-old girl with developmental disabilities under four dif- 
ferent conditions (Moore, Mueller, Dubard, Roberts, & Sterling- 
Turner, 2002). Graphing an individual’s behavior under multiple 
conditions on the same set of axes allows direct visual compari- 
sons of differences in absolute levels of responding at any given 
time as well as relative changes in performance over time. 
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Figure 6.6 Graph using two data 
paths to show the duration of prob- 
lem behavior (dependent variable) by 
two adults with severe or profound 
intellectual disabilities as noise level 
was increased gradually (indepen- 
dent variable). 

From “Functional Analysis and Treatment 

of Problem Behavior Evoked by Noise”, 

B. E. McCord, B. A. Iwata, T. L. Galensky, 

S. A. Ellingson, and R. J. Thomson, 2001, 
Reproduced with permission of John Wiley 
& Sons Inc. 
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Figure 1. Rate of self-injurious behavior during the initial functional analysis. 


Changing Values of an Independent Variable. Multiple data path 
graphs can also show changes in the target behavior (shown on one 
data path) relative to changing values of the independent variable (rep- 
resented by a second data path). In each of the two graphs in Figure 6.6 
one data path shows the duration of problem behavior 
(plotted against the left-hand y-axis scaled in seconds) relative 
to changes in noise level depicted by a second data path (plotted 
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Figure 4. Results of Debbie's and Sarah's treatment evaluation. Sessions marked A and B near the end of treatment 
indicate two generalization probes in the natural environment; F indicates a follow-up probe. 
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Figure 1. Results of the component analysis for children exposed to the toilet-training package after 
baseline. Each chid's age (in months) is noted parenthetically. 

Figure 6.7 Graph using a combination of data path and bars to display 
changes in three response classes. 

From “A Component Analysis of Toilet-Training Procedures Recommended for Young 
Children,” by B. D. Greer, P. L. Neibert, and C. L. Dozier, 2016, Journal of Applied 
Behavior Analysis, 49, p. 76. Copyright 2016 by the Society for the Experimental Analysis 
of Behavior, Inc. Reprinted by permission. 
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Bar Graphs 


The bar graph, or histogram, is a simple and adaptable for- 
mat for graphically summarizing data. Bar graphs sacrifice 
the presentation of the variability and trends in data (which 
are apparent in a line graph) in exchange for the efficiency of 
summarizing and comparing large amounts of data in a simple, 
easy-to-interpret format. 

Bar graphs are used frequently to summarize the perfor- 
mance of a participant or group of participants during different 
conditions. For example, Figure 6.8 shows the mean percentage 
of spelling worksheet items completed and completed correctly 
by four students during baseline and the combined generaliza- 
tion programming and maintenance conditions that followed 
training each child to recruit teacher attention while working 
(Craft, Alber, & Heward, 1998). 

Bar graphs that present a measure of central tendency, 
such as the mean or median score for each condition, may 
mask important variability in the data. The range of measures 
represented by a mean or median can be shown with range bars 
(e.g., Putnam & Tiger, 2016). Figure 6.9 is an example of a 
bar graph showing both the range of measures and individual 
performances. 

Bar graphs are also used to display discrete sets of data 
that are not related to one another by a common underlying 
dimension by which the horizontal axis could be scaled. For 
example, in a study analyzing the effects of establishing opera- 
tions on preference assessments, Gottschalk, Libby, and Graff 
(2000) used bar graphs to show the percentage of trials in which 


Percent Completion/Accuracy 


Figure 6.8 Bar graph comparing mean levels for two 
dimensions of participants’ performance between 
experimental conditions. 


Based on “Teaching Elementary Students with Developmental 
Disabilities to Recruit Teacher Attention in a General Education 
Classroom: Effects on Teacher Praise and Academic Productivity” by 
M. A. Craft, S. R. Alber, and W. L. Heward, 1998, Journal of Applied 
Behavior Analysis, 37, p. 410. Copyright 1998 by the Society for the 
Experimental Analysis of Behavior, Inc. 
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four children reached toward and picked up different items 
(see Figure 6.10). 

The y-axis on some bar graphs is scaled to negative 
values/below zero to show amount of behavior change above or 
below a baseline average or point in time (e.g., Greer, Neibert, & 
Dozier, 2016; Jessel, Hanley & Ghaemmaghami, 2016; Meindl, 
Ivy, Miller, Neef, & Williamson, 2013). 


Cumulative Records 


In the 1930s, Skinner invented the cumulative recorder, a 
device that automatically draws a graph of a subject’s rate of 
behavior. In a book cataloging 6 years of experimental research 
on schedules of reinforcement, Ferster and Skinner (1957) 
described a cumulative record in the following manner: 


A graph showing the number of responses on the ordinate 
against time on the abscissa has proved to be the most 
convenient representation of the behavior observed in 
this research. Fortunately, such a “cumulative” record 
may be made directly at the time of the experiment. The 
record is raw data, but it also permits a direct inspection 
of rate and changes in rate not possible when the behav- 
ior is observed directly. .. . Each time the bird responds, 
the pen moves one step across the paper. At the same 
time, the paper feeds continuously. If the bird does not 
respond at all, a horizontal line is drawn in the direction 
of the paper feed. The faster the bird pecks, the steeper 
the line. (p. 23) 
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Figure 4. Mean percentage of spelling worksheet items completed and mean 
percentage of accuracy by each student during baseline and combined generalization 
programming and maintenarice conditions. Numbers in parentheses show total 
number of sessions per condition. 
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Figure 2. Total number of breath CO sample <3 ppm across 
each group. Each filled circle represents one participant. 
Gray area represents the mean percentage of breath CO 
samples <3 ppm for each group. 


For decades, cumulative records (graphs produced by a 
cumulative recorder) served as the primary form of data display 
in basic behavior analysis research laboratories around the world 
(Kangas & Cassidy, 2010). Although behavioral researchers 
seldom use cumulative recorders today, the cumulative graphs 
remain an important form of data display in both experimental 
and applied behavior analysis.’ 

A cumulative record is constructed by adding the number 
of responses recorded during each observation period (thus the 
term cumulative) to the total number of responses recorded during 
all previous observation periods. The y-axis value of any data point 
on a cumulative graph represents the total number of responses 
recorded since the beginning of data collection. Exceptions occur 
when the data path resets to the 0 value of the y-axis and begins 
its ascent again. Resets occur when the total number of responses 
has exceeded the upper limit of the y-axis scale, a new condition is 
implemented (Wilson & Gratz, 2016), or a performance criterion 
has been met (Williams, Perez-Gonzales, & Queiroz, 2005). 

Figure 6.12 shows the cumulative number of spelling 
words mastered by a man with intellectual disabilities during 
baseline and two training conditions (Neef, Iwata, & Page, 
1980). The individual mastered | word during the 12 sessions of 
baseline (social praise for correct spelling responses and rewrit- 
ing incorrectly spelled words three times), 22 words under the 
interspersal condition (baseline procedures plus the presentation 
of a previously learned word after each unknown word), and 11 
words under the high-density reinforcement condition (baseline 
procedures plus social praise given after each trial for task- 
related behaviors such as paying attention and writing neatly). 


Figure 6.9 Bar graph comparing mean performance 

by three groups of participants under different 
experimental conditions, with data points representing 
each individual participant. 

From “The Effects of Fixed Versus Escalating Reinforcement 
Schedules on Smoking Abstinence,” by P. Romanowich and 

R. J. Lamb, 2015, Journal of Applied Behavior Analysis, 48, p. 33. 
Copyright 2015 by the Society for the Experimental Analysis 

of Behavior, Inc. Reprinted by permission. 


In addition to the total number of responses recorded at 
any given point in time, cumulative records show the overall and 
local response rates. Rate is the number of responses emitted per 
unit of time, usually reported as responses per minute in applied 
behavior analysis. An overall response rate is the average rate 
of response over a given time period, such as during a specific 
session, phase, or condition of an experiment. Overall rates are 
calculated by dividing the total number of responses recorded 
during the period by the number of observation periods indicated 
on the horizontal axis. In Figure 6.12 the overall response rates 
are 0.46 and 0.23 words mastered per session for the interspersal 
and high-density reinforcement conditions, respectively.® 

On a cumulative record, the steeper the slope, the higher 
the response rate. To produce a visual representation of an over- 
all rate on a cumulative graph, the first and last data points of a 
given series of observations should be connected with a straight 
line. A straight line connecting Points a and c in Figure 6.12 
would represent the learner’s overall rate of mastering spell- 
ing words during the interspersal condition. A straight line 
connecting Points a and e represents the overall rate during the 
high-density reinforcement condition. Relative rates of response 
can be determined by visually comparing one slope to another: 
the steeper the slope, the higher the rate of response. A visual 
comparison of Slopes a—c and a-e shows that the interspersal 
condition produced the higher overall response rate. 

Response rates often fluctuate within a given period. 
The term local response rate refers to the rate of response 
during periods smaller than that for which an overall rate has 
been given. Over the last four sessions of the study shown in 
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Society for the Experimental Analysis of Behavior, Inc. 
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Figure 6.11 Diagram of a cumulative recorder. 


Based on Schedules of Reinforcement, pp. 24-25, by C. B. Ferster and 
B. F. Skinner, 1957, Upper Saddle River, NJ: Prentice Hall. Copyright 1957 
by Prentice Hall. 
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Figure 1. Percentage of approach responses across conditions for Ethan, Daniel, Mark, and Ashley. 


Figure 6.12, the learner exhibited a local rate of responding 
during interspersal training (Slope b-c) that was considerably 
higher than his overall rate for that condition. At the same time 
his performance during the final four sessions of the high- 
density reinforcement condition (Slope d—e) shows a lower local 
response rate than his overall rate for that condition. 

A legend giving the slopes of some representative rates can 
aid considerably in the determination and comparison of relative 
response rates both within and across cumulative curves plotted on 
the same set of axes (e.g., see Kennedy & Souza, 1995, Figure 2). 
However, very high rates of responding are difficult to compare 
visually with one another on cumulative records. 


Although the rate of responding is directly proportional 
to the slope of the curve, at slopes above 80 degrees small 
differences in angle represent very large differences in 
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rate; and although these can be measured accurately, they 
cannot be evaluated easily by [visual] inspection. (Ferster 
& Skinner, 1957, pp. 24—25) 


Even though cumulative records derived from continuous 
recording are the most directly descriptive displays of behav- 
ioral data available, two other features of behavior, in addition to 
the comparison of very high rates, can be difficult to determine 
on some cumulative graphs. One, although the total number of 
responses since data collection began can be easily seen on a 
cumulative graph, the number of responses recorded for any 
given session can be hard to ascertain, given the number of data 
points and the scaling of the vertical axis. Two, gradual changes 
in slope from one rate to another can be hard to detect on cumu- 
lative graphs. 

A cumulative graph may be preferable to a noncumula- 
tive line graph in four situations. First, cumulative records are 
desirable when the total number of responses emitted over time 
is important or when progress toward a specific goal can be mea- 
sured in cumulative units of behavior. The number of new words 
learned, dollars saved, or miles trained for an upcoming marathon 
are examples. One look at the most recent data point on the graph 
reveals the total amount of behavior up to that point in time. 

Second, a cumulative graph may be more effective than 
a noncumulative graph when used as a source of feedback for 
clients. This is because both total progress and relative rate of 
performance are easily detected by visual inspection. Further, a 
cumulative graph does not reveal decelerating trends; it either 
ascends or plateaus. For individuals for whom a decelerating 
trend may signal failure, spark embarrassment, incite retaliation 
by members of a peer group, or otherwise affect them negatively, 
a cumulative graph, with its ascending trend (or plateaus), may 
mitigate concerns. 

Third, a cumulative record should be used when the target 
behavior can occur or not occur just once per observation session. 


Cc — 


Figure 6.12 Cumulative graph of number of spelling 
words learned by a man with intellectual disabilities 
during baseline, interspersal training, and high-density 
reinforcement training. Points a-e have been added 
to illustrate the differences between overall and local 
response rates. 


Based on “The Effects of Interspersal Training Versus High Density 
Reinforcement on Spelling Acquisition and Retention” by 
N. A. Neef, B. A. lwata, and T. J. Page, 1980, Journal of Applied 


60 = Behavior Analysis, 13, p. 156. Copyright 1980 by the Society for the 


Experimental Analysis of Behavior, Inc. Adapted by permission. 


In these instances, the effects of an intervention may be easier to 
detect on a cumulative graph. Figure 6.13 shows the same data 
plotted on a noncumulative graph and a cumulative graph. The 
cumulative graph clearly shows a relation between behavior and 
intervention, whereas the noncumulative graph gives the impres- 
sion of greater variability in the data than really exists. 

Fourth, cumulative records can “reveal the intricate 
relations between behavior and environmental variables” 
(Johnston & Pennypacker, 2009, p. 317). Figure 6.14 is an 
excellent example of how a cumulative graph enables a detailed 
analysis of behavior change (Hanley, Iwata, & Thompson, 
2001). By plotting the data from single sessions cumulatively 
by 10-second intervals, the researchers revealed patterns of 
responding not shown by a graph of session-by-session data. 
Comparing the data paths for the three sessions for which the 
results were graphed cumulatively (Mult #106, Mixed #107, and 
Mixed #1 12) revealed two undesirable patterns of responding (in 
this case, pushing a switch that operated a voice output device 
that said “talk to me, please” as an alternative to self-injurious 
behavior and aggression) that are likely to occur during mixed 
schedules, and the benefits of including schedule-correlated 
stimuli (Mult #106). 


Ratio Charts 


All of the line graphs discussed so far have been equal- 
interval graphs on which the distance between any two consec- 
utive points on each axis is always the same. On the horizontal 
axis, the distance between data on Day | and Day 2 has the 
same distance as Day 11 to Day 12; on the vertical axis, the dis- 
tance between 10 and 20 responses per minute visually matches 
the distance between 35 and 45 responses per minute. On an 
equal-interval graph, equal vertical distances on the vertical axis 
represent equal amounts of behavior change, whether the perfor- 
mance increases or decreases. 


160 Part3 * Evaluating and Analyzing Behavior Change 


Yes 


No 


Figure 6.13 Same set of hypothetical data plotted on 
noncumulative and cumulative graphs. Cumulative 
graphs more clearly reveal patterns of and changes 
in responding for behaviors that can occur only once 
during each period of measurement. 

Based on Working with Parents of Handicapped Children, p. 100, 
by W. L. Heward, J. C. Dardig, and A. Rossett, 1979, Columbus, 
OH: Charles E. Merrill. Copyright 1979 by Charles E. Merrill. 


Cumulative Number of Responses 
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Figure 6.14 Cumulative record used to 
make a detailed analysis and comparison 
of behavior across components of 
multiple- and mixed-reinforcement sched- 
ules within specific sessions of a study. 


Alternative Responses (Cumulative #) 


Based on “Reinforcement Schedule Thinning 
Following Treatment with Functional Communica- 
tion Training” by G. P. Hanley, B. A. Iwata, and 

R. H. Thompson, 2001, Journal of Applied Behavior 
Analysis, 34, p. 33. Copyright 2001 by the Society 
for the Experimental Analysis of Behavior, Inc. Used 
by permission. 


Another way to examine behavior change is by proportional 
or relative change. Ratio scales are well suited for displaying propor- 
tional change. On a ratio scale, equal vertical distances correspond 
to equal ratios of change in the variable being measured. Because 
behavior is measured and charted over time, which progresses in 
equal intervals, the horizontal axis is marked off in equal intervals 
and only the vertical axis has a ratio scaling. Hence, the term ratio 
chart refers to graphs where one axis is scaled proportionally. 

Ratio charts offer a different perspective, namely, detect- 
ing relative behavior change as opposed to absolute change. 


20 


uw 
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Baseline, Intervention, Baseline, Intervention, 
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Days 
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—O— Mixed #112 


10-S Bins 


Figure 4. Cumulative number of alternative responses across schedule components for three sessions from 
Julie's assessment of schedule-correlated stimuli. Open symbols represent two mixed-schedule sessions; filled 
symbols represent a single multiple-schedule session. 


For example, on a ratio chart a doubling of response rate from 4 
to 8 per minute would appear the same as a doubling of 50 to 100 
responses per minute. Likewise, a decrease in responding from 
75 to 50 responses per minute (a decrease of one third) would 
occupy the same distance on the vertical axis as a change from 
12 to 8 responses per minute (a decrease of one third). 

Figure 6.15 shows the same data plotted on an equal- 
interval graph (sometimes called linear, arithmetic, or 
add-subtract charts) and on a ratio chart (sometimes called a 
semilogarithmic chart). The behavior change that appears as an 
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exponential curve on the arithmetic chart is a straight line on 
the ratio chart. The vertical axis in the ratio chart in Figure 6.15 
is scaled by log-base-2 or X2 cycles, which means each cycle 
going up the vertical axis represents a times-2 increase (i.e., a 
doubling or 100% increase) of the cycle below it. 


Standard Celeration Chart 


Ogden Lindsley developed the Standard Celeration Chart to 
provide a standardized means of charting and analyzing how 
frequency of behavior changes over time (Lindsley, 1971). The 
Standard Celeration Chart is a standard ratio chart with six X10 


Semilogarithmic Graph 


Figure 6.15 Same set of data plotted on equal-interval 
arithmetic scale (left) and on equal-proportion ratio 
scale (right). 


cycles on the vertical axis that can accommodate response rates 
as low as | per 24 hours (0.000695 per minute) or as high as 
1000 per minute. There are four standard charts, differentiated 
from one another by the scaling on the horizontal axis: a daily 
chart with 140 calendar days, a weekly chart, a monthly chart, 
and a yearly chart. The daily chart shown in Figure 6.16 is used 
most often. 

The size of the chart and the consistent scaling of the 
vertical axis and horizontal axis do not make the Standard 
Celeration Chart standard, as is commonly believed. What 
makes the Standard Celeration Chart standard is its consistent 
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Figure 6.16 Standard Celeration Chart showing basic charting conventions. (See Table 6.1 for descriptions of 


numbered features.) 


Model Standard Celeration Chart created by Douglas E. Kostewicz, Ph.D., BCBA-D, University of Pittsburgh. Used with permission. 
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display of celeration, a linear measure of frequency change 
across time, a factor by which frequency multiplies or divides 
per unit of time. The terms acceleration and deceleration are 
used to describe accelerating performances or decelerating 
performances. 

A line drawn from the bottom left corner to the top right 
corner has a slope of 34° on all Standard Celeration Charts. 
The slope has a celeration value of <2 (read as “times-2”). 
Celerations are expressed with a multiplication or division 
sign. A X2 celeration represents a doubling in frequency per 
celeration period. The celeration period for the daily chart is 
per week; per month for the weekly chart, per 6 months for the 
monthly chart, and per 5 years for the yearly chart. 

An instructional decision-making system, called 
precision teaching, has been developed for use with the 
Standard Celeration Chart. Precision teaching is predicated 
on the position that (a) learning is best measured as a change 
in response rate, (b) learning most often occurs through pro- 
portional changes in behavior, and (c) past changes in per- 
formance can project future learning. Figure 6.16 shows how 
precision teaching could be implemented to help a student 
working on 1|-digit by 1-digit multiplication facts. Several 
times per week the student is given a worksheet of 1|-digit 
by 1-digit multiplication problems and told to write the 
answer to as many problems as he can in | minute. Prac- 
tice sheets during the initial phase consist of problems with 


digits 0 through 3 (e.g., 3 X 3,2 X 1,0 X 1). When the stu- 
dent attains the dual aims of 80 correct digits and 0 incor- 
rect digits per minute, he starts practicing problems with 
increased range of digits and his performance is assessed 
with practice sheets sampling more diverse populations of 
problems. He completes the skill by reaching the aim on 
the final set of worksheets sampling all possible 1-digit by 
1-digit multiplication problems. The Standard Celeration 
Chart shows the student took the longest to reach aim on 
the first set of sheets but attained each successive aim more 
quickly and with fewer errors. The reduced variability, called 
bounce by precision teachers, for corrects for each succes- 
sive type of practice sheet suggests tighter stimulus control. 
Table 6.1 provides names and descriptions of chart features in 
Figure 6.16.” 


Scatterplots 


A scatterplot is a graphic display that shows the relative distri- 
bution of individual measures in a data set with respect to the 
variables depicted by the x- and y-axes. Data points on a scat- 
terplot are unconnected. Scatterplots show how much changes 
in the value of the variable depicted by one axis correlate with 
changes in the value of the variable represented by the other 
axis. Patterns of data points falling along lines on the plane or 
clusters suggest certain relationships. For example, a scatterplot 


TABLE 6.1 Features of the Standard Celeration Chart as Used by Precision Teaching (numbers refer to Figure 6.16) 


Chart Feature 


Description and Charting Convention 


1. Acceleration target frequency Responses to accelerate noted by a solid dot on the appropriate day line. 


. Deceleration target frequency Responses to decelerate noted by an X on the appropriate day line. 


2 
3. Time bar 
4. No count (zero) 


Amount of recording time per day noted by a solid horizontal line on the charted day. 
No behaviors intended to accelerate (i.e., dots) or decelerate (i.e., X) occurred during the 


recording period and charted at a distance +2 below the time bar. 


5. Condition change line 
a new intervention. 


6. Condition change label 
7. Celeration lines 


Vertical line drawn between the last charted day of one intervention and the first charted day of 


Brief description of condition/intervention. 


A straight line representing celeration or deceleration of five or more charted days with each 


condition. Drawn by free hand on paper charts; automatically calculated with linear regression 
algorithm and plotted on some digital charts. 


8. Bounce lines 


A measure of performance variability. Parallel straight lines above and below a celeration 


line that encompass the majority of performance. The up bounce line is placed through the 
upmost point (excluding outliers); the down bounce line is placed through the downmost point 


(excluding outliers). 
9. Acceleration frequency aim 


A symbol representing the desired frequency aim and date to achieve it for the behavior targeted 


for acceleration. Drawn with horizontal bar on the desired frequency and date and intersecting 


caret pointing up. 
10. Deceleration frequency aim 


A symbol representing the desired frequency aim and date to achieve it for the behavior targeted 


for deceleration. Drawn with horizontal bar on the desired frequency and date and intersecting 


caret pointing down. 
11. Celeration value 


Ze 


Bounce values 


A measure of performance change per minute per week. Accelerations (X) and decelerations (a) 
represented on the chart or text show the change in performance. 


The total amount of bounce (i.e., variability) around celeration and deceleration in a condition. 
Always written as an “X” value. 
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TABLE 6.1 (continued) 


Chart Feature 

13. Charted blanks Supervisor 

Adviser 

Manager 

Organization 

Division 

Room 

Counter 

Performer 

Charter 

Counted 

Small Unlabeled Blanks 
Large Unlabeled Blanks 


Description and Charting Convention 


Sees the chart monthly, advises the adviser or manager 
Advises the manager or performer 

Works with the performer daily 

Where the recorded performance takes place 

Location or subdivision within the organization 

Room where the recording takes place 

Counts the performer's behavior 

Learner whose performance appears on the chart 
Person who creates chart with recorded performance data 
Description of performer’s counted behavior 

Optional: performer's age 

Optional: additional relevant information 


Douglas Kostewicz, Ph.D., BCBA, University of Pittsburgh. Used with permission. 


graph presented by Meindl et al. (2013) suggested that the more 
fluent a student’s performance on a set of training flash cards, 
the greater the decrease in fluency with a generalization set of 
cards (see Figure 6.17). This finding supported the researchers’ 
hypothesis that the student’s responses to the training set flash- 
cards were controlled in part by irrelevant stimulus features of 
the cards (e.g., specific words in specific locations on the cards), 
which inhibited generalization to new cards. 


Scatterplots can also reveal relationships among different 
subsets of data. For example, Boyce and Geller (2001) created 
the scatterplot shown in Figure 6.18 to see how the behavior 
of individuals from different demographic groups related to a 
ratio of driving speed and following distance that represents one 
element of safe driving (e.g., the proportion of data points for 
young males falling in the at-risk area of the graph compared 
to the proportion of drivers from other groups). Each data point 


Individual Student Change in Fluency by Final Fluency Measure 


Change in Fluency (Final Gen Rate-Final Training Rate) 


10 12 14 16 


20 22 24 26 


Fluency (Correct per Minute) 


Figure. 2 Scatterplot depicting the relation between student fluency on the final training timing and the 


change in fluency from the final timing to the generalization test 


Figure 6.17 Scatterplot depicting the relation between student fluency on the final training 
timing and the change in fluency from the final timing to a generalization test. 

From “An Examination of Stimulus Control in Fluency-based Strategies: SAFMEDS and Generalization, " by 

J. N. Meindl, J. W. Ivy, N. Miller, N. A. Neef, and R. L. Williamson, 2013, Journal of Behavioral Education, 22, 

p. 238. Copyright 2013 by Springer Sciences+Business Media. Used by permission. 
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Figure 6.18 Scatterplot showing how the behavior 

of individuals from different demographic groups 
relates to a standard measure of safe driving. 

Based on “A Technology to Measure Multiple Driving Behaviors 
without Self-Report or Participant Reactivity” by T. E. Boyce and 
E. S. Geller, 2001, Journal of Applied Behavior Analysis, 34, p. 49. 
Copyright 2001 by the Society for the Experimental Analysis of 
Behavior, Inc. Used by permission. 


Following Distance in Meters 


shows a single driver’s behavior in terms of speed and following 
distance and whether the speed and following distance combina- 
tion is considered safe or at risk for accidents. Such data could 
be used to target interventions for certain demographic groups. 

Applied behavior analysts sometimes use scatterplots 
to discover the temporal distribution of a target behavior 
(e.g., Kahng et al., 1998; Symons, McDonald, & Wehby, 
1998; Touchette, MacDonald, & Langer, 1985). Touchette and 
colleagues described a procedure for observing and recording 
behavior that produces a scatterplot that graphically shows 
whether the behavior’s occurrence is typically associated 
with certain time periods. The use of scatterplot recording is 
described further in Chapter 27. 


Other Graphic Displays 


Applied behavior analysts have used a wide variety of other 
graphic formats to display and communicate behavioral data. 
For example, Raiff, Jarvis, and Dallery (2016) created a 
graphic display consisting of a horizontal sequence of boxes 
in different shades to illustrate patients’ adherence to antidia- 
betic medication routines during baseline and treatment condi- 
tions. Romanowich and Lamb (2015) illustrated and compared 
participants’ ratings on a treatment acceptability scale with a 
horizontal bar graph. Still other graphic formats are bubble 
graphs (e.g., Schultz, Kohn, & Musto, 2017), dot charts (see 
Figure A in Box 6.1), and frequency profiles (Epstein, Mejia, & 
Robertson, 2017). 

Applied behavior analysis researchers have devised creative 
combinations of graphic devices to present data in ways that invite 
insights, understanding, and analyses of their findings. Interesting 
and effective examples can be found in Cammilleri and Hanley 
(2005 [Figure 29.10 in this text]; Fahmie, Iwata, and Mead (2016, 
Figure 1); Ingvarsson, Kramer, Carp, Pétursdéttir, and Macias 
(2016, Figures | and 2); Jessel et al. (2016, Figure 5); and Jones, 
Carr, and Feeley (2006, Figure 10).!° 


CONSTRUCTING LINE GRAPHS 


The skills required for constructing effective, distortion-free 
graphic displays are as important as any in the behavior ana- 
lyst’s repertoire.!! As applied behavior analysis has developed, 
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so have certain stylistic conventions and expectations regard- 
ing the construction of graphs. An effective graph presents 
the data accurately, completely, and clearly, and makes the 
viewer’s task of understanding the data as easy as possible. 
The graph maker must strive to fulfill each of these require- 
ments while remaining alert to features in the graph’s design 
or construction that might create distortion and bias—either 
the graph maker’s or that of a future viewer when interpret- 
ing the extent and nature of the behavior change depicted by 
the graph. 

Despite the graph’s prominent role in applied behavior 
analysis, relatively few detailed treatments of how to construct 
behavioral graphs have been published. Notable exceptions 
have been chapters by Parsonson and Baer (1978, 1986) and 
by Spriggs, Lane, and Gast (2014); discussion of graphic dis- 
play tactics by Johnston and Pennypacker (2009); and a recent 
critique of graphs published in behavior analytic journals by 
Kubina, Kostewicz, Brennan, and King (2015). Recommenda- 
tions frora these sources and others (American Psychological 
Association, 2009; Tufte, 2001) contributed to the preparation 
of this section. Additionally, hundreds of graphs published in 
applied behavior analysis journals were examined in an effort 
to discover those features that most clearly communicate the 
story told by the data. 

Although there are few hard-and-fast rules for con- 
structing graphs, adhering to the following conventions will 
result in clear, well-designed graphic displays consistent in 
format and appearance with current practice. Although most 
of the recommendations are illustrated by graphs presented 
throughout this text, we created Figures 6.19 and 6.20 to 
serve as models for most of the practices suggested here. The 
recommendations generally apply to all behavioral graphs. 
However, each data set and the conditions under which the 
data were obtained present their own challenges to the graph 
maker. 


Drawing, Scaling, and Labeling Axes 


Ratio of the Vertical and Horizontal Axes 


The relative length of the vertical axis to the horizontal axis, in 
combination with the scaling of the axes, determines the degree 
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Figure 1. Percent of 10-second intervals of appropriate and inappropriate behavior by an 8-year-old boy. Each interval was 
scored as appropriate, inappropriate, or neither; thus the two behaviors do not always total 100%. 


Figure 6.19 Graph of hypothetical data illustrating various conventions and guidelines for 


constructing behavioral graphs. 
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Figure 1. Number of words read correctly and incorrectly by Ardy and Belva during 1-min 
probes following each session. Arrows below horizontal axes indicate sessions in which 


student used reading material brought from home. 


Figure 6.20 Graph of hypothetical data illustrating various 
conventions and guidelines for constructing behavioral 
graphs. 
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to which a graph will accentuate or minimize variability in a 
given data set. The legibility of a graph is enhanced by a bal- 
anced ratio between the height and width so that data points 
are neither too close together nor too spread apart. Recom- 
mendations in the behavioral literature for the ratio, or relative 
length, of the vertical axis to the horizontal axis range from 5:8 
(Johnston & Pennypacker, 2009) to 3:4 (Katzenberg, 1975). 
Tufte (2001), whose book The Visual Display of Quantitative 
Information is a wonderful storehouse of guidelines and exam- 
ples of effective graphing techniques, also recommends a verti- 
cal axis—to—horizontal axis ratio of 5:8. 

A vertical axis approximately two-thirds the length of the 
horizontal axis works well for most behavioral graphs. When 
multiple sets of axes will be presented one atop another in a 
single figure and/or when the number of data points to be plot- 
ted on the horizontal axis is very large, the length of the vertical 
axis relative to the horizontal axis can be reduced (as shown in 
Figures 6.2 and 6.7). 


Scaling the Horizontal Axis 


The horizontal axis should be marked in equal intervals, with 
each unit representing from left to right the chronological 
succession of equal time periods or response opportunities in 
which the behavior was (or will be) measured and from which 
an interpretation of behavior change is to be made (e.g., days, 
sessions, trials). It is not necessary to mark each point on the 
x-axis. Avoid visual clutter by placing tick marks at regularly 
spaced increments on the outside of the axis and numbering 
them accordingly (e.g., by 5s, 10s, 20s). 

When two or more sets of axes are stacked vertically, and 
each horizontal axis represents the same time frame, unnum- 
bered tick marks on the horizontal axes of the upper tiers are 
permissible. However, each horizontal axis should have the tick 
marks corresponding to those on the bottom tier to facilitate 
comparison of performance across tiers at any given point in 
time (see Figure 6.4). 


Representing Discontinuities of Time on 
the Horizontal Axis 


Behavior change, its measurement, and all manipulations of 
treatment or experimental variables occur within and across 
time. Therefore, time is a fundamental variable in all experi- 
ments that should not be distorted or arbitrarily represented in a 
graphic display. Each equally spaced unit on the horizontal axis 
should represent an equal passage of time. Discontinuities in the 
progression of time should be indicated on the horizontal axis 
by a scale break: an open, unconnected portion of the axis with 
a squiggly line at each end. Scale breaks on the x-axis can also 
be used to signal periods when data were not collected or when 
otherwise regularly spaced data points represent consecutive 
measurements made at unequal intervals (see the numbering of 
school days for the follow-up condition in Figure 6.19). 

When measurement occurs across consecutive obser- 
vations of opportunities for the target behavior to occur 
(e.g., stories read, meals, tennis serves) rather than standard 
units of time, the horizontal axis still serves as a visual represen- 
tation of the progression of time because the data plotted above 


it are recorded one after the other. The text accompanying such 
a figure should indicate the real time in which the consecutive 
measurements were made (e.g., ““Two or three peer tutoring ses- 
sions were conducted each school week’’), and any irregularities 
or discontinuities in that time context should be clearly indicated 
or marked with scale breaks (see Figure 6.20). 


Labeling the Horizontal Axis 


The dimension by which the horizontal axis is scaled is identi- 
fied in a brief label centered below and parallel to the axis. 


Scaling the Vertical Axis 


The most significant feature of an equal-interval graph’s repre- 
sentation of behavior change (or lack thereof) is its vertical axis 
scale. Common practice is to mark the origin at 0 (on cumu- 
lative graphs the bottom on the vertical axis must be 0) and 
then to mark off the vertical axis to accommodate all values 
represented in the data set and the range of socially important/ 
desired performance. Increasing the distance on the vertical axis 
between each unit of measurement magnifies the variability in 
the data, whereas contracting the units of measurement on the 
vertical axis minimizes the portrayal of variability in the data 
set. The graph maker should plot the data set against several dif- 
ferent vertical axis scales, watching for distortion of the graphic 
display that might lead to inappropriate interpretations (Dart & 
Radley, 2017). 

When zero values make no sense, truncate the y-axis. 
“Charting is about being true to the data. Some data never fall 
to zero—the body temperature of a living person, for instance” 
(Yanofsky 2015). Edward Tufte (2017) agrees: “Don’t spend 
a lot of empty vertical space trying to reach down to the zero 
point at the cost of hiding what is going on in the data line 
fESelt. (hp.): 

The social significance of various levels of behavior 
change for the behavior being graphed should be considered 
when scaling the vertical axis. If relatively small numerical 
changes in performance are socially significant, the y-axis scale 
should reflect a smaller range of values. For example, to dis- 
play data most effectively from a training program in which an 
industrial employee’s percentage of correctly executed steps in a 
safety checklist increased from an unsafe preintervention range 
of 85% to 90% to a nearly accident-free postintervention level 
of 98% to 100%, the vertical axis should focus on the 80% to 
100% range. However, the scaling of the vertical axis should be 
contracted when small numerical changes in behavior are not 
socially important and the degree of variability obscured by the 
compressed scale is of little interest. 

Horizontal numbering of regularly spaced tick marks 
on the outside of the vertical axis reduces visual clutter and 
facilitates reading the scale. The vertical axis should not be 
extended beyond the tick mark indicating the highest value on 
the axis scale. 

When the data set includes several measures of O, start 
the vertical axis at a point slightly above the horizontal axis to 
keep data points from falling directly on the axis. This produces a 
neater graph and heips the viewer discriminate 0-value data points 
from those representing measures close to 0 (see Figure 6.19). 


Chapter 6 * Constructing and Interpreting Graphic Displays of Behavioral Data 167 


In most instances, scale breaks should not be used on 
the vertical axis. However, when two sets of data with widely 
different and nonoverlapping ranges are displayed against the 
same y-axis, a scale break can be used to separate the range of 
measures encompassed by each data set (see Figure 6.20). 

In multiple-tier graphs, equal distances on each vertical 
axis should represent equal changes in behavior to aid the com- 
parison of data across tiers. Also, whenever possible, similar 
positions on each vertical axis of multiple-tier graphs should 
represent similar absolute values of the dependent variable. 
When the differences in behavioral measures from one tier to 
another would result in one or more overly long vertical axes, 
scale breaks can be used to highlight the difference in absolute 
values and aid point-to-point comparisons across tiers. 

When the range values to be plotted on the vertical axis 
exceed what could be displayed on an axis of proportional length 
to the horizontal axis, it may be possible to split the range across 
two or more tired graphs. For example, in a study on compli- 
ance, Cook, Rapp, and Schulze (2015) presented their results 
in a three-tiered graph. (see Figure 9.11 this text). The vertical 
axis of the top tier is scaled in seconds, the middle tier in min- 
utes, and the bottom tier in hours. The horizontal axis of each 
descending tier represented a continuation of time as measured 
by experimental trials. 


Labeling the Vertical Axis 


A brief label, printed and centered to the left and parallel to 
the vertical axis, should identify the dimension by which the 
axis is scaled. On multiple-tiered graphs, one label identify- 
ing the dimension portrayed on all of the vertical axes can be 
centered along the axes as a group. Additional labels identi- 
fying the different behaviors (or some other relevant aspect) 
graphed within each set of axes are sometimes printed to the 
left and parallel to each vertical axis. These individual tier 
labels should be printed to the right of and in smaller sized 
font than the label identifying the dimension by which all of 
the vertical axes are scaled. 


Identify Experimental Conditions 


Condition Change Lines 


Vertical lines extending upward from the horizontal axis indi- 
cate changes in treatment or experimental procedures. Condition 
change lines should be placed after (to the right of) the data 
point representing the last measure prior to the change in con- 
ditions signified by the line and before (to the left of) the data 
point representing the first measure obtained after the change in 
procedure. In this way, data points fall clearly on either side of 
change lines and never on the lines themselves. Drawing condi- 
tion change lines to a height equal to the height of the vertical 
axis helps the viewer estimate the value of data points near the 
top of the vertical axis range. 

Condition change lines can be drawn with either solid 
or dashed lines. However, when an experiment or a treatment 
program includes relatively minor changes within an ongoing 
condition, a combination of solid and dashed lines should be 
used to distinguish the major and minor changes in conditions. 


For example, the solid lines in Figure 6.19 change from base- 
line, to Self-Record, to Self-Record+Tokens, to Follow-Up 
conditions, and dashed lines indicate changes in the sched- 
ule of reinforcement from CRF, to VR 5, to VR 10 within the 
Self-Record+Tokens condition (CRF and VR schedules of rein- 
forcement are described in Chapter 13). 

When the same manipulation of an independent variable 
occurs at different points along the horizontal axes of multiple- 
tiered graphs, a dog-leg connecting the condition change lines 
from one tier to the next makes it easy to follow the sequence and 
timing of events in the experiment (see Figure 6.20). 

Unplanned events that occur during an experiment or treat- 
ment program, as well as minor changes in procedure that do not 
watrant a condition change line, can be indicated by placing small 
arrows, asterisks, or other symbols next to the relevant data points 
(see Figure 6.6) or just under the x-axis (see Figure 6.20). The 
figure caption should explain any special symbols. 


Condition Labels 


Labels identifying the conditions in effect during each period 
of an experiment are centered above the space delineated by the 
condition change lines. Whenever space permits, condition labels 
should be parallel to the horizontal axis. Labels should be brief but 
descriptive (e.g., Contingent Praise is preferable to Treatment), 
and the labels should use the same terms or phrases used in the 
accompanying text describing the condition. Abbreviations may 
be used when space or design limitations prohibit printing the 
complete label. A single condition label should be placed above 
and span across labels identifying minor changes within that con- 
dition (see Figure 6.19). Numbers are sometimes added to condi- 
tion labels to indicate the number of times the condition has been 
in effect during the study (e.g., Baseline 1, Baseline 2). 


Plot Data Points and Draw Data Paths 


Data Points 


When graphing data by hand, behavior analysts must take great 
care to ensure that they plot each data point exactly on the coor- 
dinate of the horizontal and vertical axis values of the measure- 
ment it represents. The inaccurate placement of data points is an 
unnecessary source of error in graphic displays, which can lead 
to mistakes in clinical judgment and/or experimental method. 
Accurate placement is aided by careful selection of graph paper 
with grid lines sized and spaced appropriately for the data to be 
plotted. When many different values must be plotted within a 
small distance on the vertical axis, a graph paper with many grid 
lines per inch should be used. 

Should a data point fall beyond the range of values 
described by the vertical axis scale, it is plotted just above the 
scale it transcends, with the actual value of the measurement 
printed in parentheses next to the data point. Breaks in the data 
path leading to and from the off-the-scale data point also help to 
highlight its discrepancy (see Figure 6.20, Session 19). 

Data points should be marked with bold symbols that are 
easily discriminated from the data path. When only one set of 
data is displayed on a graph, solid dots are most often used. 
When multiple data sets are plotted on the same set of axes, a 
different geometric symbol should be used for each set of data. 
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The symbols for each data set should be selected so that the 
value of each data point can be determined when data points fall 
near or on the same coordinates on the graph (see Figure 6.19, 
Sessions 9-11). 


Data Paths 


Data paths are created by drawing a straight line from the center 
of each data point in a given data set to the center of the next 
data point in the same set. All data points in a given data set are 
connected in this manner with the following exceptions: 


¢ Data points falling on either side of a condition change 
line are not connected. 

¢ Data points should not be connected across a significant 
span of time in which behavior was not measured. To do 
so implies that the resultant data path represents the level 
and trend of the behavior during the span of time in which 
no measurement was conducted. 

¢ Data points should not be connected across discontinuities 
of time in the horizontal axis (see Figure 6.19, 1-week 
school vacation). 

* Data points on either side of a regularly scheduled mea- 
surement period in which data were not collected or were 
lost, destroyed, or otherwise not available (e.g., participant’s 
absence, recording equipment failure) should not be joined 
together (see Figure 6.20, Belva’s graph, baseline condition). 

* Follow-up or postcheck data points should not be con- 
nected with one another (see Figure 6.19) unless they 
represent successive measures spaced in time in the same 
manner as measures obtained during the rest of the exper- 
iment (see Figure 6.20). 


When multiple data paths are displayed on the same graph, 
different styles of lines, in addition to different symbols for the 
data points, may be used to help distinguish one data path from 
another (see Figure 6.19). The behavior represented by each data 
path should be clearly identified, either by printed labels with 
arrows drawn to the data path (see Figures 6.19 and 6.20) or by 
a legend showing models of the symbols and line styles (see 
Figure 6.13). When two data sets travel the same path, their lines 
should be drawn close to and parallel with one another to help 
clarify the situation (see Figure 6.19, Sessions 9-11). 


Write Figure Caption 


Printed below the graph, the figure caption should give a concise 
but complete description of the figure. The caption should also 
direct the viewer’s attention to any features of the graph that 
might be overlooked (e.g., scale changes) and should explain 
the meaning of any added symbols representing special events. 


Print Graph 


Graphs should be printed in only one color—black. Although 
the use of color can enhance the attractiveness of a visual dis- 
play and highlight certain features, it is discouraged in the sci- 
entific presentation of data. Every effort must be made to let 
the data stand on their own. The use of color can encourage 
perceptions of performance or experimental effects that differ 


from perceptions of the same data displayed in black. The fact 
that graphs and charts may be reproduced in journals and books 
is another reason for using black only. 


Constructing Graphs with Computer Software 


Software programs for producing computer-generated graphs 
are available and becoming both increasingly sophisticated and 
easier to use. Most of the graphs displayed throughout this book 
were constructed with computer software. Even though com- 
puter graphics programs offer a tremendous time savings over 
hand-plotted graphs, careful examination should be made of the 
range of scales available and the printer’s capability for both 
accurate data point placement and precise printing of data paths. 

Most computer generated graphs are produced with Mic- 
rosoft Excel software. Detailed task analyses and step-by-step 
instructions for creating behavioral graphs using Microsoft 
Excel are available (e.g., Cole & Witts, 2015; Deochand, 2017; 
Deochand, Costello, & Fuqua, 2015; Dixon et al., 2009; Lo & 
Konrad, 2007). Vanselow and Bourret (2012) developed an online 
interactive tutorial for creating graphs with Excel. 

Software programs that automatically create graphs from 
digitally collected or practitioner-uploaded data save time 
that can be devoted to other clinical or instructional activi- 
ties. Autographing software can help clinicians keep graphs 
of client performance current, enabling more frequent data- 
based decisions (e.g., Pinkelman & Horner, 2016; Sleeper 
et al., 2017). 


INTERPRETING GRAPHICALLY DISPLAYED 
BEHAVIORAL DATA 


The effects of an intervention that produces dramatic, replica- 
ble changes in behavior that last over time are readily seen in 
a well-designed graphic display. People with little or no formal 
training in behavior analysis can read the graph correctly in 
such cases. Many times, however, behavior changes are not 
so large, consistent, or durable. Behavior sometimes changes 
in sporadic, temporary, delayed, or seemingly uncontrolled 
ways; and sometimes behavior may hardly change at all. 
Graphs displaying these kinds of data patterns often reveal 
equally important and interesting subtleties about behavior and 
its controlling variables. 

Behavior analysts employ a systematic form of examina- 
tion known as visual analysis to interpret graphically displayed 
data. Visual inspection of graphs showing behavior change in 
relation to the presence or absence of environmental variables is 
the primary method of data analysis in behavior analysis. Visual 
analysis of data from an applied behavior analysis experiment is 
conducted to answer two questions: (a) Did behavior change in 
a meaningful way? and (b) if so, to what extent can that behav- 
ior change be attributed to the independent variable? Although 
there are no formalized rules for visual analysis, the dynamic 
nature of behavior, the scientific and technological necessity 
of discovering effective interventions, and the applied require- 
ment of producing socially meaningful levels of performance all 
combine to focus the behavior analyst’s interpretive attention on 
certain properties common to all behavioral data: (a) variability, 
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(b) level, and (c) trend. Visual analysis entails examining each of 
these characteristics both within and across the conditions and 
phases of an experiment.!? 

Before attempting to interpret the data displayed in a 
graph, the viewer should carefully examine the graph’s over- 
all construction. First, read the figure legend, axis labels, and 
all condition labels to obtain a basic understanding of what 
the graph is about. The viewer should then look at the scaling 
of each axis, taking note of the location, numerical value, and 
relative significance of any scale breaks. Did the graph maker 
truncate the vertical axis scale to emphasize behavior change? 


A common complaint of this is that it gives the appearance 
of severity when none exists. First, this is why charts have 
scales. Blaming a chart’s creator for a reader who doesn’t 
look at clearly labeled axes is like blaming a supermarket 
for selling someone food he’s allergic to. 

Second, the degree to which the chart emphasizes 
certain aspects of the data is a judgment of storytelling not 
chart-making. Sure, this mindset can yield misleading dis- 
plays, but how is that different than words? Charts should 
be fair, not impartial. (Yanofsky, 2015, n.p.) 


Next, visually track each data path to determine that 
the data points are properly connected. Does each data point 


represent a single measurement or observation, or are the 
data “blocked” such that each data point represents an aver- 
age or some other summary of multiple measurements? Do 
the data show the performance of an individual subject or 
the average performance of a group of subjects? If blocked 
or group data are displayed, is a visual representation of the 
range or variation of scores provided; or do the data them- 
selves allow determination of the amount of variability that 
was collapsed in the graph? For example, if the horizontal 
axis is scaled in weeks and each data point represents a stu- 
dent’s average score for a week of daily five-word spelling 
tests, data points falling near 0 or at the top end of the closed 
scale, such as 4.8, pose little problem because they can be 
the result of only minimal variability in the daily scores for 
that week. However, data points near the center of the scale, 
such as 2 to 3, can result from either stable or highly variable 
performance. 

Only when satisfied that a graph is properly constructed 
and does not distort the events it represents, should the viewer 
examine the data to discover what they reveal about the behav- 
ior during each condition of the study. If the viewer suspects 
the graph’s construction distorts, overstates, or hides important 
features, interpretive judgments of the data should be withheld 
(see Box 6.2). 


Don't Be Fooled by a Pretty Graph 


Graphic displays tell visual stories of the data they represent. 
But just as storytellers can conceal, distort, and mislead by 
selecting and sequencing biased words and phrases, graph 
makers can make certain interpretations of the data more or 
less likely by manipulating chart features. When looking at 
a graphic display, ask yourself whether it has been designed 
to tell a story that accurately reflects the underlying data, 
or whether it has been designed to tell a story more closely 
aligned with what the designer would like you to believe. Trun- 
cated or expanded axes with questionable scaling; extraneous 
color and depth cues; misleading labels, legends, and captions; 
and bar graphs with y-axes that do not include 0 are common 
forms of graphic foolery (Bergstrom & West, 2018a). 

Why does a bar graph need to include 0 on the dependent 
axis, whereas a line graph need not do so? Our view is that the 
two types of graphs are telling different stories. By its design, 
a bar graph emphasizes the absolute magnitude of values asso- 
ciated with each category, whereas a line graph emphasizes the 
change in the dependent variable (usually the y value) as the 
independent variable (usually the x value) changes. 

For a bar graph to provide a representative impression 
of the values being plotted, the visual weight of each bar—the 
amount of ink on the page, if you will—must be proportional 
to the value of that bar. Setting the axis above zero interferes 
with this. For example, if we create a bar graph with values 15 
and 20 but set the axis at 10, the bar corresponding to 20 has 
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Figure A Country of Baseball’s World Series-Winning 
Teams (1903-2017). 


twice the visual weight of the bar corresponding to 15, despite 
the value 20 being only 4/3 of the value 15. (Bergstrom & 
West, 2018b, n.p.) 

Don’t be fooled by graphs comparing data sets that 
have no logical or possible relation to one another. Figure A, 
for example, compares the number of baseball teams located 


(continued) 
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BOX 6.2 (continued) 


in the United States who have won baseball’s World Series 
with the number of World Series winners from outside the 
United States. The story told by this graph is U.S. dominance 
over the rest of the baseball-playing world. Although the data 
depicted are accurate, they represent dramatically different 
opportunities. From the first World Series in 1903 through 
1968 there were 0 non-U.S. major league teams. The Mon- 
treal Expos competed in the National League from 1969 to 
2004. The Toronto Blue Jays, World Series winners in 1992 
and 1993, have played in the American League since 1977. 

Comparing multiple data sets with different scales for 
each set can create even more deceptive graphs. One must 
be especially careful comparing multiple data series on the 
graph, with different scales for each series. Figure B shows 
a correlation between the use of the herbicide glyphosate 
(trade name Roundup) on corn and soybean crops and the 
increasing incidence of cancer. The authors reported: “We 
found strong correlations for cancers of the liver, kidney, 
bladder/urinary and thyroid. .. . Thyroid and bladder cancers 
especially seem to track with the advent of GE [genetically 
engineered] crops and associated glyphosate applications” 
(Swanson, Leu, Abrahamson, & Wallet, 2014, p. 16, words 
in brackets added). 

Exposure to Roundup may well have serious health con- 
sequences, but whatever they may be, this particular graph is 


not persuasive. First of all, there’s the obvious point that cor- 
relation is not causation. One would find a similar correlation 
between cell phone usage and hypertension, for example— 
or even between cell phone usage and Roundup usage! The 
authors make no causal claims, but we fail to see the value in 
fishing for correlations. 

The left-hand axis doesn’t go to zero. We’ve already 
noted why this is problematic. But it gets worse. Both the scale 
and the intercept of the other vertical axis, at right, have been 
adjusted so that the red curve traces the peaks of the yellow 
bars. Most remarkably, to make the curves do this, the axis has 
to go all the way to negative 10% GE corn planted and negative 
10,000 tons glyphosate used! We’ve noted that the y-axis need 
not go to zero, but if it goes to a negative value for a quantity 
(percentage or tonnage) that can only take on positive values, 
this should set off alarm bells. (Bergstrom & West, 2018b, 
emphasis in original, n.p.) 

While a challenge to everyone, detecting graphic 
sleight of hand is particularly important for applied behavior 
analysts, who rely on visual analysis of graphed data to deter- 
mine the story of behavior change. Fortunately, transparency 
is a characteristic of behavior analytic research and practice. 
Many graphic displays used in behavior analysis provide 
direct access to the original data, which allows the inquisitive 
or skeptical viewer to replot the data on a new set of axes. 
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Figure B Graph suggesting correlation between rising incidence of thyroid cancer and use of herbicide glyphosate on 


U.S. corn and soy crops that are genetically engineered. 


From “Genetically Engineered Crops, Glyphosate and the Deterioration of Health in the United States of America,” by N. L. Swanson, A. Leu 
J. Abrahamson, & B. Wallet, 2014, Journal of Organic Systems, 9, p. 18. Copyright 2014 by Journal of Organic Systems. Used by permission. 


Visual Analysis Within Conditions 


Data within a given condition are examined to determine 
(a) the number of data points, (b) the nature and extent of 
variability in the data, (c) the absolute and relative level of the 


behavioral measure, and (d) the direction and degree of any 
trend(s) in the data.!* 


Number of Data Points 


First, the viewer should determine the quantity of data reported 
during each condition. This entails a simple counting of data points. 
As a general rule, the more measurements of the dependent variable 
per unit of time and the longer the period of time in which mea- 
surement occurred, the more confidence one can have in the data 
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path’s estimation of the true course of behavior change (given, of 
course, a valid and accurate observation and measurement system). 

The number of data points needed to provide a believable 
record of behavior during a given condition also depends on 
how many times the same condition has been repeated during 
the study. As a rule, fewer data points are needed in subsequent 
replications of an experimental condition if the data depict the 
same level and trend in performance that were noted in earlier 
applications of the condition. 

The published literature of applied behavior analysis also 
plays a part in determining how many data points are sufficient. 
In general, briefer phases are required of experiments investi- 
gating relations between previously studied and well-established 
variables if the results are also similar to those of the previous 
studies. More data are needed to demonstrate new findings, 
whether or not new variables are under investigation. 

There are other exceptions to the rule of the-more-data- 
the-better. Ethical concerns do not permit the repeated measure- 
ment of certain behaviors (e.g., self-injurious behavior) under an 
experimental condition in which there is little or no expectation 
for improvement (e.g., during a no-treatment baseline condi- 
tion or a condition intended to reveal variables that exacerbate 
problem behavior). Also, there is little purpose in repeated 
measurement in situations in which the subject cannot logically 
perform the behavior (e.g., measuring the number of correct 
answers to long division problems when concurrent observations 
indicate that the student has not learned the necessary compo- 
nent skills of multiplication and subtraction). Nor are many data 
points required to demonstrate that behavior did not occur when 
in fact it had no opportunity to occur. 

Familiarity with the response class measured and the 
conditions under which it was measured may be the graph 
viewer’s biggest aid in determining how many data points 
constitute believability. The quantity of data needed in a given 
condition or phase is also partly determined by the analytic tac- 
tics employed in a given study. Experimental design tactics are 
described in Chapters 7 through 10. 


Variability 


How often and the extent to which repeated measures of 
behavior yield different outcomes is called variability. A high 
degree of variability (or “bounce”) within a given condition 
usually indicates that the researcher or practitioner has achieved 
insufficient control over the factors influencing the behavior. 
(An important exception is when the purpose of an intervention 
is to produce a high degree of variability.) In general, the greater 
the variability within a given condition, the greater the number 
of data points that are necessary to establish a predictable pattern 
of performance. By contrast, fewer data points are required to 
present a predictable pattern of performance when those data 
reveal relatively little variability. 

Practitioners and researchers should also look for variabil- 
ity in the form of cyclical patterns, such as those shown in the 
bottom panel of Figure 6.1. 


Level 


The value on the vertical axis scale around which a set of 
behavioral measures converge is called level. In the visual anal- 
ysis of behavioral data, level is examined within a condition in 
terms of its absolute value on the y-axis (mean, median, and/ 
or range), the degree of stability or variability, and the extent 
of change from one level to another. The graphs in Figure 6.21 
illustrate four different combinations of level and variability. 
The mean level of a series of behavioral measures within a 
condition can be graphically illustrated by the addition of a mean 
level line: a dashed horizontal line drawn through a series of data 
points within a condition at that point on the vertical axis equaling 
the average value of the series of measures. Although mean level 
lines provide an easy-to-see summary of average performance 
within a given condition or phase, they should be used and inter- 
preted with caution. With highly stable data paths, mean level 
lines pose no serious drawbacks. However, the less variability 
there is within a series of data points, the less need there is for a 
mean level line. For instance, a mean level line would serve little 


Figure 6.21 Four data paths illustrating (A) a low, 
stable level of responding; (B) a high, variable level 
of responding; (©) an initially high, stable level of 
responding followed by a lower, more variable level 
of responding; and (D) an extremely variable pattern 
of responding not indicative of any overall level of 
responding. Dashed horizontal lines on graphs B, C, 
and D represent the mean levels of responding. 
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purpose in Graph A in Figure 6.21. And although mean level lines 
have been added to Graphs B, C, and D in Figure 6.21, Graph B 
is the only one of the three for which a mean level line provides 
an appropriate visual summary of level. The mean level line in 
Graph C is not representative of any measure of behavior taken 
during the phase. The data points in Graph C show a behavior 
best characterized as occurring at two distinct levels during the 
condition and beg for an investigation of the factor(s) responsible 
for the clear change in levels. The mean level line in Graph D is 
inappropriate because the variability in the data is so great that 
only 4 of the 12 data points fall close to the mean level line. 

A median level line is another method for visually sum- 
marizing the overall level of behavior in a condition. Because a 
median level line represents the most typical performance within 
a condition, it is not so influenced by one or two measures that 
fall far outside the range of the remaining measures. Therefore, 
one should use a median level line instead of a mean level line 
to graphically represent the central tendency of a series of data 
points that include several outliers, either high or low. 


Figure 6.22 Data patterns indicating various 
combinations of trend direction, degree, and 
variability: (A) zero trend, high stability; (B) zero 

trend, high variability; (©) gradually increasing stable 
trend; (D) rapidly increasing variable trend; (E) rapidly 
decreasing stable trend; (F) gradually decreasing 
variable trend; (G) rapidly increasing trend followed by 
rapidly decreasing trend; (H) no meaningful trend, too 
much variability and missing data. Split-middle lines of 
progress have been added to Graphs C-F. 


Change in level within a condition is determined by 
calculating the difference in absolute values on the y-axis between 
the first and last data points within the condition. Another method, 
somewhat less influenced by variability in the data, is to compare 
the difference between the median value of the first three data 
points in the condition with the median value of the final three 
data points in the condition (Koenig & Kunzelmann, 1980). 


Trend 


The overall direction taken by a data path is its trend. Trends 
are described in terms of their direction (increasing, decreasing, 
or zero trend), degree or magnitude, and extent of variability of 
data points around the trend. The graphs in Figure 6.22 illustrate 
a variety of trends. The direction and degree of trend in a series 
of graphed data points can be visually represented by a straight 
line drawn through the data. called a trend line or line of prog- 
ress. Several methods for calculating and fitting trend lines to 
a series of data have been developed. One can simply inspect 
the graphed data and draw a straight line that visually provides 
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the best fit through the data. For this freehand method, Lindsley 
(1985) suggested ignoring one or two data points that fall well 
beyond the range of the remaining values in a data series and 
fitting the trend line to the remaining scores. Although the free- 
hand method is the fastest way of drawing trend lines and can 
be useful for the viewer of a published graph, hand-drawn trend 
lines may not always result in an accurate representation of 
trend and are typically not found in graphs of published studies. 

Trend lines can also be calculated using a mathematical for- 
mula called the ordinary least-squares linear regression equation 
(McCain & McCleary, 1979; Parsonson & Baer, 1978). Trend 
lines determined in this fashion have the advantage of complete 
reliability: The same trend line will always result from the same 
data set. The disadvantage of this method is the many mathemat- 
ical operations that must be performed to calculate the trend line. 
A computer program that can perform the equation eliminates the 
time concern in calculating a least-squares trend line. 

A method of calculating and drawing lines of progress that 
is more reliable than the freehand method and much less time 
consuming than linear regression methods is the split-middle 
line of progress. White (2005) developed the split-middle tech- 
nique for use with rate data plotted on semilogarithmic charts, 
and it has proven a useful technique for predicting future behav- 
ior from such data. Split-middle lines of progress can also be 
drawn for data plotted against an equal-interval vertical axis, 
but it must be remembered that such a line is only an estimate 
that summarizes the overall trend (Bailey, 1984). A trend line 
cannot be drawn by any method through a series of data points 
spanning a scale break in the vertical axis and generally should 
not be drawn across scale breaks in the horizontal axis. 

The specific degree of acceleration or deceleration of trends 
in data plotted on semilogarithmic charts can be quantified in 
numerical terms. For example, on the daily Standard Celeration 
Chart, a “‘times-2” celeration means that the response rate is dou- 
bling each week, and a “times-1.25” means that the response rate is 
accelerating by a factor of one fourth each week. A “divide-by-2” 
celeration means that each week the response rate will be one half 
of what it was the week before, and a “divide-by-1.5” means that 
the frequency is decelerating by one third each week. 

There is no direct way to determine visually from data 
plotted on equal-interval charts the specific rates at which trends 
increase or decrease. But visual comparison of trend lines drawn 
through data on equal-interval charts can provide important infor- 
mation about the relative rates of behavior change. 

A trend may be highly stable with all of the data points fall- 
ing on or near the trend line (see Figure 6.22, Graphs C and E). Data 
paths can also follow a trend even though a high degree of variabil- 
ity exists among the data points (see Figure 6.22, Graphs D and F). 


Visual Analysis Between Conditions 


After inspection of the data within each condition or phase of 
a study, visual analysis proceeds with a comparison of data 
between conditions. Drawing proper conclusions entails com- 
paring the previously discussed properties of behavioral data— 
level, trend, and variability between different conditions and 
among similar conditions. 

A condition change line indicates that an independent 
variable was manipulated at a given point in time. To determine 


whether an immediate change in behavior occurred at that point 
in time, one needs to examine the difference between the last 
data point before the condition change line and the first data 
point in the new condition. 

The data are also examined in terms of the overall level of 
performance between conditions. In general, when all data points 
in one condition fall outside the range of values for all data points 
in an adjacent condition (i.e., there is no overlap of data points 
between the highest values obtained in one condition and the low- 
est values obtained in the other condition), there is little doubt that 
behavior changed from one condition to the next. When many 
data points in adjacent conditions overlap one another on the ver- 
tical axis, less confidence can be placed in the effect of the inde- 
pendent variable associated with the change in conditions. '* 

Mean or median level lines can be helpful in examining 
the overall level between conditions. However, using mean 
or median level lines to summarize and compare the overall 
central tendency of data across conditions poses two serious 
problems. First, the viewer of such a visual display must 
guard against letting “apparently large differences among 
measures of central tendency visually overwhelm the pres- 
ence of equally large amounts of uncontrolled variability” 
(Johnston & Pennypacker, 1980, p. 351). Emphasizing mean 
changes in performance in a graphic display can suggest a 
greater degree of experimental control was obtained than is 
warranted by the data. In the top graph of Figure 6.23, half of 


Condition B 


Condition A 


Response Measure 


Time 


Condition A Condition B 


Response Measure 


Time 


Figure 6.23 Inappropriate use of mean level lines, 
encouraging interpretation of a higher overall level 
of responding in Condition B when extreme variability 
(top graph) and trends (bottom graph) warrant 
different conclusions. 
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the data points in Condition B fall within the range of values 
of the measures taken during Condition A, but the mean level 
lines suggest a clear change in behavior. Second, measures 
of central tendency can obscure important trends in the data 
that warrant interpretations other than those suggested by the 
central tendency indicators. Although a mean or median line 
accurately represents the average or typical performance, 
neither provides any indication of increasing or decreasing 
performance. In the bottom graph of Figure 6.23, for exam- 
ple, the mean line suggests a higher level of performance in 
Condition B than in Condition A, but an examination of trend 
yields a very different picture of behavior change within and 
between Conditions A and B. 

Those analyzing behavioral data should also note any 
changes in level that occur after a new condition has been in 
place for some time and any changes in level that occur early 
in a new condition but are later lost. Such delayed or temporary 
effects can indicate that the independent variable must be in 


Figure 6.24 Stylized data paths illustrating the differ- 
ent combinations of change or lack of change in level 
and trend between two adjacent conditions: Graphs A 
and B show no change in either level or trend between 
the two conditions, Graphs C and D show changes in 
level and no change in trend, Graphs E and F depict no 
immediate change in level and a change in trend, and 
Graphs G and H reveal change in both level and trend. 
From “Time-Series Analysis in Operant Research,” by R. R. Jones, R. 
S. Vaught, and M. R. Weinrott, 1977, Journal of Applied Behavior 
Analysis, 10, p. 157. Copyright 1977 by the Society for the Experi- 
mental Analysis of Behavior, Inc. Adapted by permission. 
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place for some time before behavior changes, or that the tem- 
porary level change was the result of an uncontrolled variable. 
Either case calls for further investigation in an effort to isolate 
and control relevant variables. 

Visual analysis of data between adjacent conditions 
includes an examination of the trends exhibited by the data 
in each condition to determine whether the trend found in 
the first condition changed in direction or slope during the 
subsequent condition. In practice, because each data point in 
a series contributes to level and trend, the two characteris- 
tics are viewed in conjunction with one another. Figure 6.24 
presents stylized data paths illustrating four basic combina- 
tions of change or lack of change in level and trend between 
adjacent conditions. Of course, many other data patterns 
could display the same characteristics. Idealized, straight- 
line data paths that eliminate the variability found in most 
repeated measures of behavior have been used to highlight 
level and trend. 


Chapter 6 * Constructing and Interpreting Graphic Displays of Behavioral Data 175 


Visual analysis includes not only an examination and 
comparison of changes in level and trend between adjacent 
conditions, but also an examination of performance across 
similar conditions. Interpreting what the data from an applied 
behavior analysis mean requires more than visual analysis and 
the identification and description of level, trend, and variability. 
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SUMMARY 


1. Applied behavior analysts document and quantify behavior 
change by direct and repeated measurement of behavior, 
and the product of those measurements is called data. 


2. Graphs are relatively simple formats for visually 
displaying relationships among and between a series 
of measurements and relevant variables. 


Purpose and Benefits of Graphic Displays of 
Behavioral Data 


3. Graphing each measure of behavior as it is collected pro- 
vides the practitioner or researcher with an immediate and 
ongoing visual record of the participant’s behavior, allow- 
ing treatment and experimental decisions to be responsive 
to the participant’s performance. 


4. Direct and continual contact with the data in a readily 
analyzable format enables the practitioner or researcher to 
identify and investigate interesting variations in behavior 
as they occur. 


5. As a judgmental aid for interpreting experimental results, 
graphic display is a fast, relatively easy-to-learn method 
that imposes no arbitrary levels of significance for 
evaluating behavior change. 


6. Visual analysis of graphed data is a conservative 
method for determining the significance of 
behavior change; only variables able to produce 
meaningful effects repeatedly are considered 
significant, and weak and unstable variables are 
screened out. 


a 


Graphs enable and encourage independent judgments of the 
meaning and significance of behavior change by others. 


8. Graphs can serve as effective sources of feedback to the 
people whose behavior they represent. 


Graphs Used by Applied Behavior Analysts 


9. Line graphs, the most commonly used format for the 
graphic display of behavioral data, are based on the Carte- 
sian plane, a two-dimensional area formed by the intersec- 
tion of two perpendicular lines. 


10. Major parts of the simple line graph are the horizontal 
axis (also called the x-axis), the vertical axis (also called 
the y-axis), condition change lines, condition labels, data 
points, the data path, and the figure caption. 


When behavior change is demonstrated over the course of a treat- 
ment program or research study, the next question to be asked is, 
Was the change in behavior a function of the treatment or exper- 
imental variable? The remaining chapters of Part Three describe 
strategies and tactics of experimental design used by applied 
behavior analysts in an effort to provide a meaningful answer. 


11. Graphs with multiple data paths on the same set of axes 
are used in applied behavior analysis to show (a) two or 
more dimensions of the same behavior, (b) two or more 
different behaviors, (c) the same behavior under differ- 
ent and alternating experimental conditions, (d) changes 
in target behavior relative to the changing values of an 
independent variable, and (e) the behavior of two or more 
participants. 


12. A second vertical axis, which is drawn on the right-hand 
side of the horizontal axis, is sometimes used to show dif- 
ferent scales for multiple data paths. 


13. Bar graphs are used for two primary purposes: (a) to dis- 
play discrete data not related by an underlying dimension 
that can be used to scale the horizontal axis and (b) to 
summarize and enable easy comparison of the perfor- 
mance of a participant or group of participants during the 
different conditions of an experiment. 


14. Each data point on a cumulative record represents the total 
number of responses emitted by the subject since measure- 
ment began. The steeper the slope of the data path on a 
cumulative graph, the higher the response rate. 


15. Overall response rate refers to the average rate of response 
over a given time period; a local response rate refers to the 
rate of response during a smaller period within a larger 
period for which an overall response rate has been given. 


16. Cumulative records are especially effective for displaying 
data when (a) the total number of responses made over 
time is important, (b) the graph is used as a source of 
feedback to the subject, (c) the target behavior can occur 
only once per measurement period, and (d) a fine analysis 
of a single instance or portions of data from an experiment 
is desired. 


17. Ratio charts use a logarithmic-scaled y-axis so that 
changes in behavior that are of equal proportion (e.g., dou- 
blings of the response measure) are represented by equal 
distances on the vertical axis. 


18. The Standard Celeration Chart is a six-cycle multiply-divide 
graph that enables the standardized charting of celeration, a 
linear measure of frequency change across time, a factor by 
which frequency multiplies or divides per unit of time. 


19. A scatterplot shows the relative distribution of individ- 
ual measures in a data set with respect to the variables 
depicted by the x- and y-axes. 
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Constructing Line Graphs 


20. The vertical axis is drawn to a length approximately two- 
thirds that of the horizontal axis. 


21. The horizontal axis is marked in equal intervals, each 
representing, from left to right, the chronological suc- 
cession of equal time periods within which behavior was 
measured. 


22. Scale breaks on the horizontal axis indicate discontinuities 
of time. 


23. The vertical axis is scaled relative to the dimension of 
behavior measured, the range of values of the measures 
obtained, and the social significance of various levels of 
change in the target behavior. 


24. Condition change lines indicate changes in the treatment 
program or manipulations of an independent variable and 
are drawn to the same height as the vertical axis. 


25. A brief, descriptive label identifies each condition of an 
experiment or behavior change program. 


26. Data points should be accurately placed with bold, solid 
dots. When multiple data paths are used, different geomet- 
ric symbols are used to distinguish each data set. 


27. Data paths are created by connecting successive data 
points with a straight line. 


28. Successive data points should not be connected when 
(a) they fall on either side of a condition change line; 
(b) they span a significant period in which behavior was 
not measured; (c) they span discontinuities of time on the 
horizontal axis; (d) they fall on either side of a regularly 
scheduled measurement period in which data were not col- 
lected or were lost, destroyed, or otherwise not available: 
(e) they fall in a follow-up or postcheck period that is not 
regularly spaced in time in the same manner as the rest of 
the study; or (f) one member of the pair falls outside the 
range of values described by the vertical axis. 


29. The figure caption provides a concise but complete 
description of the graph, giving all of the information 
needed to interpret the display. 


30. Graphs should be printed in black ink only. 


Interpreting Graphically Displayed Behavioral Data 


31. Visual analysis of graphed data attempts to answer two 
questions: (a) did a socially meaningful change in behav- 
ior take place and (b) if so, can the behavior change be 
attributed to the independent variable? 


32. Before beginning to evaluate the data displayed in a graph, 
a careful examination of the graph’s construction should 
be undertaken. If distortion is suspected from the 
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36. 


37. 


38. 
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40. 


41. 


features of the graph’s construction, the data should be 
replotted on a new set of axes before interpretation 1s 
attempted. 


Blocked data and data representing the average perfor- 
mance of a group of subjects should be viewed with the 
understanding that significant variability may have been 
lost in the display. 


Visual analysis of data within a given condition focuses on 
the number of data points, the variability of performance, 
the level of performance, and the direction and degree of 
any trends in the data. 


As a general rule, the more data in a condition and the 
greater the stability of those data, the more confidence 
one can place in the data path’s estimate of behavior 
during that time. The more variability in the behavioral 
measures during a condition, the greater the need for 
additional data. 


Variability refers to the frequency and degree to which 
multiple measures of behavior yield different outcomes. 
A high degree of variability within a given condition usu- 
ally indicates that little or no control has been achieved 
over the factors influencing the behavior. 


Level refers to the value on the vertical axis around which 
a series of data points converges. When the data in a given 
condition all fall at or near a specific level, the behavior is 
considered stable with respect to level; to the extent that 
the behavioral measures vary considerably from one to 
another, the data are described as showing variability with 
respect to level. In cases of extreme variability, no particu- 
lar level of performance is evidenced. 


Mean or median level lines are sometimes added to graphic 
displays to represent the overall average or typical perfor- 
mance during a condition. Mean and median level lines 
should be used and interpreted with care because they can 
obscure important variability and trends in the data. 


Trend refers to the overall direction taken by a data path; 
trends are described in terms of their direction (increasing, 
decreasing, or zero trend), degree (gradual or steep), and 
the extent of variability of data points around the trend. 


Trend direction and degree can be visually represented by 
drawing a trend line, or line of progress, through a series 
of data points. Trend lines can be drawn freehand, using 
the least-squares regression equation, or using a method 
called the split-middle line of progress. Split-middle lines 
of progress can be drawn quickly and reliably and have 
proven useful in analyzing behavior change. 


Visual analysis of data across conditions determines 
whether change in level, trend, and/or variability occurred 
and to what extent any changes were significant. 


Chapter 6 * Constructing and Interpreting Graphic Displays of Behavioral Data 177 
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KEY TERMS 

bar graph graph 

cumulative record independent variable 
cumulative recorder level 

data line graph 

data path local response rate 


dependent variable overall response rate 


equal-interval scale precision teaching 
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MULTIPLE-CHOICE QUESTIONS 


1. Visual analysis is considered a(n) 
determining the social significance of behavior change. 


a. Temporary 

b. Arrangement 

c. Subjective 

d. Conservative 

Hint: (See “Purpose and Benefits of Graphic Displays of 
Behavioral Data’) 


2. Graphic displays of behavioral data are considered simple 
formats for visually displaying 
between a series of measurements and relevant variables. 


a. Time series 

b. Data 

c. Relationships 

d. Graphs 

Hint: (See “Purpose and Benefits of Graphic Displays of 
Behavioral Data’’) 


3. This type of graph is also known as a histogram and is 


useful in summarizing behavioral data allowing for a Ts 


quick comparison of performance across participants and/ 
or conditions. 


a. Line graph 

b. Scatterplot 

c. Cumulative Record 

d. Bar graph 

Hint: (See “Graphs Used by Applied Behavior Analysts”) 
4. This is the most commonly used graph in applied 

behavior analysis. 

a. Line graph 

b. Scatterplot 

c. Cumulative Record 

d. Bar graph 

Hint: (See “Graphs Used by Applied Behavior Analysts”) 


method for 5. 


among and 6. 


8. 


ratio scale 

scatterplot 

split-middle line of progress 
Standard Celeration Chart 
trend 

variability 


visual analysis 
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You are interested in the total number of sight words 
accurately read since your data collection began last week. 
Which graphing convention allows you to most efficiently 
answer your inquiry? 

a. Line graph 

b. Scatterplot 

c. Cumulative Record 

d. Bar graph 

Hint: (See “Graphs Used by Applied Behavior Analysts”) 
You are interested in displaying the mean score for 

both your baseline and experimental condition. For this 


summary, you are willing to sacrifice the presentation and 
detection of variability and trends. 


a. Line graph 

b. Scatterplot 

c. Cumulative Record 

d. Bar graph 

Hint: (See “Graphs Used by Applied Behavior Analysts’’) 
A is depicted on a line graph by an open spot in 
the axis with a squiggly line at each end. 

a. Condition line 

b. Data set 

c. Scale break 

d. Horizontal axis 

Hint: (See “Line Graphs’’) 

This part of a line graph is printed below the graph 


and gives a concise but complete description of the 
figure. 


a. Figure caption 

b. Data set 

c. Scale break 

d. Condition label 

Hint: (See “Line Graphs’) 
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10. 


il. 


12. 


13. 


A chart is a type of semilogarithmic chart useful 
for charting accelerating and decelerating performances 
over time. 


a. Cumulative time 

b. Standard celebration 

c. Scatter plot 

d. Bar graph 

Hint: (See “Graphs Used by Applied Behavior 
Analysts”’) 


In order to produce a visual representation of an overall 
rate on a cumulative graph, the first and last data points of 
a given series of observations should be: 


a. Plotted bilaterally 

b. Plotted using different symbols 

c. Drawn simultaneously 

d. Connected with a straight line 

Hint: (See “Graphs Used by Applied Behavior 
Analysts”’) 

The of each data path on a cumulative record 
represents the different rates of acquisition. 

a. Position 

b. Slope 

c. Scatter 

d. Slant 

Hint: (See “Graphs Used by Applied Behavior 
Analysts”) 


A scatterplot shows the distribution of 
individual measures in a data set with respect to the 
variables depicted by the both the x- and y-axes. 


a. Exact 

b. Scattered 

c. Relative 

d. Standard 

Hint: (See “Graphs Used by Applied Behavior 
Analysts”) 


Scatterplots are useful when applied behavior analysts are 
interested in finding out the distribution of the 
target behavior. In other words, a scatterplot illustrates 
whether the behavior’s occurrence is typically associated 
with certain time periods. 


a. Temporal 
b. Independent 
c. Dependent 
d. False 


Hint: (See “Graphs Used by Applied Behavior 
Analysts”’) 


14. Which part of the line graph is the arrow pointed at? 


aor 


15. 


a. X-axis 

b. Condition change line 

c. Y-axis 

d. Data path 

Hint: (See “Line Graphs’’) 

The type of graph illustrated in this example has disconnected 
data points and is used primarily to depict temporal 


distributions and reveal correlations between variables 
depicted on the y-axis with variables depicted on the x-axis. 


16. 


a. Cumulative record 

b. Line graph 

c. Bar graph 

d. Scatterplot 

Hint: (See “Line Graphs”) 

You are interested in beginning a self-management 
program to help you save money for a new car. Which 


type of graph would provide the most appropriate 
feedback and display your progress toward your goal? 


a. Cumulative record 

b. Line graph 

c. Bar graph 

d. Scatterplot 

Hint: (See “Graphs Used by Applied Behavior Analysts’’) 


17. 


18. 


L 
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You are interested in comparing the effects of two 
different interventions on a single target behavior. You 
have decided to present each of the interventions in an 
alternating fashion and measure the changes in the value 
of your dependent variable. You would like to plot the 
results of both interventions on one graph. Which would 
be the best format to display your results? 

a. Cumulative record 

b. Line graph 

c. Bar graph 

d. Scatterplot 


Hint: (See “Graphs Used by Applied Behavior Analysts’’) 


Which of the following best describes the direction, degree, 


and variability of the data path depicted in this graph? 
25 


State a purpose and a benefit of using graphic displays of 


behavioral data for decision-making over listing a series of 


numbers in a tabular format. 


Hint: (See “Purpose and Benefits of Graphic Displays of 
Behavioral Data’) 


. Describe a benefit of using visual analysis over statistical 


analysis in interpreting the social significance of behavior 
change. 

Hint: (See “Purpose and Benefits of Graphic Displays of 
Behavioral Data’’) 


. List and describe the parts of a basic line graph. 


Hint: (See “Parts of a Basic Line Graph”) 


. Discuss why a line graph is often the most appropriate 


format for communicating changes in level, trend, and the 
variability of behavioral data when comparing the effects 
of intervention A to intervention B. 


Hint: (See “Graphs Used by Applied Behavior Analysts”’) 


19. 
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. Rapidly increasing, variable trend 
. Zero trend 


. Gradually decreasing, variable trend 


ano St 


. Rapidly decreasing, stable trend 
Hint: (See “Interpreting Graphically Displayed Behavioral 
Data’) 


Which of the following best describes the direction, 
degree, and variability of the data path depicted in this 
graph? 


. Rapidly increasing, variable trend 


a 

b. Cannot be determined 

c. Gradually decreasing, variable trend 

d. Rapidly decreasing, stable trend 

Hint: (See “Interpreting Graphically Displayed 
Behavioral Data’’) 
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Discuss a situation in which a cumulative record would 
be preferable to a basic line graph. 

Hint: (See “Graphs Used by Applied Behavior 

Analysts’) 


. Discuss a situation in which a scatterplot would be the 


most appropriate graphic format. 


Hint: (See “Graphs Used by Applied Behavior 
Analysts’) 


. Which type of graph would you select to communicate 


changes in behavior when the behavior can only occur or 
not occur one time during a measurement period? 

Hint: (See “Graphs Used by Applied Behavior 
Analysts’’) 


. Define the following fundamental properties of behavior 


change: variability, level, and trend. 
Hint: (See “Interpreting Graphically Displayed 
Behavioral Data”) 
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NOTES 


i 


Although often used as a singular construction (e.g., “The data shows 
that. . .”), data is the plural form of datum, noun of Latin origin; thus, data 
is used correctly with plural verbs (e.g., “These data are . . .”). 

Skillful visual analysis is not quite so simple, as will become evident later 
in this chapter and in Chapter 10. 

A comparison of the visual analysis of graphed data and inferences based 
on statistical tests of significance is presented in Chapter 10. 

Graphs, like statistics, can also be manipulated to make certain interpreta- 
tions of the data more or less likely. Unlike statistics, however, most graphic 
displays used in behavior analysis provide direct access to the original data, 
which allows the inquisitive or doubtful reader to regraph the data and have 
another look. 

The terms condition and phase are related but not synonymous. Properly 
used, condition indicates the environmental arrangements in effect; phase 
refers to a period within a study or behavior change program. 

Functional communication training is described in Chapter 26. 

To learn the interesting history and role of the cumulative recorder in the 
development of behavior analysis, see Lattal (2004) and Morris and Smith 
(2004). 

Technically, Figure 6.12 does not represent true rates of response because 
the number of words spelled correctly was measured and not the rate, or 
speed, at which they were spelled. However, the slope of each data path 
represents the “rate” of mastering the spelling words in each session within 
the context of a total of 10 new words presented per session. 

For detailed explanations of the Standard Celeration Chart and its use by 
precision teachers see Cooper, Kubina, and Malanga (1998); Graf and 


Mal 


iI, 


3), 


14, 


Lindsley (2002); Johnson and Street (2014); Kubina and Yurich 2012); 
Lindsley (1990, 1992, 1996); and Pennypacker, Gutierrez, and Lindsley 
(2003). 

Perhaps the ultimate example of combining visual display techniques is 
Charles Minard’s use of space-time-story graphics to illustrate the inter- 
relations of six variables during Napoleon’s ill-fated Russian campaign of 
1812-1813 (see Tufte, 1983, p. 41). Tufte called Minard’s graph perhaps 
“the best statistical graphic ever drawn’(p. 40). 

Although most graphs published in behavior analysis journals since the 
mid-1990s and many graphs used by practitioners are constructed with 
computer software, we believe it is essential that applied behavior analysts 
know how to construct graphs by hand. The knowledge and skills required 
to create proper graphs by hand enable effective use of graphing software 
(cited later in this chapter). 

Several studies have evaluated methods for training visual data analysis 
skills to behavior analysis students and practitioners (e.g., Kahng et al., 
2010; Stewart, Carr, Brandt, & McHenry, 2007; Vanselow, Thompson, & 
Karsina, 2011; Wolfe & Slocum, 2015; Young & Daly, 2016). 

When each data point represents multiple measures, a within-session anal- 
ysis of the data may be warranted to detect behavioral functions and trends 
not apparent in aggregated data displays (Fahmie & Hanley, 2008). See 
Figure 6.14 for an example. 

Whether a documented change in behavior should be interpreted as a 
function of the independent variable depends on the experimental design 
used in the study. Strategies and tactics for designing experiments are 
presented in Chapters 7 through 10. 
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Analyzing Behavior Change: Basic 


Assumptions and Strategies 


LEARNING OBJECTIVES 


¢ Define and give examples of the basic assumptions underlying the analysis of behavior. 
e List and describe the three levels of scientific understanding. 


¢ Define and give examples of functional relations. 


¢ Outline how experiments should be conducted to control for various threats to internal validity. 
° Compare and contrast the role of variability in single-subject versus group research designs. 


List and define the essential components of experiments in applied behavior analysis research. 


e Write a specific research question given a behavioral phenomenon of interest. 


° Define and identify steady or stable rate responding. 
e List and define the three elements of baseline logic. 


Identify four types of baseline data patterns. 


Discuss the value of establishing a steady baseline in applied behavior analysis. 
Outline guidelines for establishing a steady baseline in applied behavior analysis research. 


e Discuss how single-subject research employs the inductive logic known as “affirmation of the consequent.” 


¢ Systematically manipulate independent variables and analyze their effects on treatment. 


Our technology of behavior change is also a technology 

of behavior measurement and of experimental design; it 

developed as that package, and as long as it stays in that 

package, it is a self-evaluating enterprise. Its successes 

are successes of known magnitude; its failures are almost 

immediately detected as failures; and whatever its out- 

comes, they are attributable to known inputs and proce- 

dures rather than to chance events or coincidences. 
—Donald M. Baer (personal communication, 

October 21, 1982) 


easurement shows whether, when, and how much 

behavior has changed, but measurement alone does 

not reveal why, or more accurately how, behav- 
ior change occurred. A useful technology of behavior change 
depends upon knowing the specific arrangements of environ- 
mental variables that produced behavior change. Without this 
knowledge, practitioners’ efforts to change behavior would con- 
sist of procedures selected from a bag of tricks with little or no 
generality from one situation to the next. 

The search for and demonstration of functional and reliable 
relations between socially important behavior and its controlling 
variables is a defining characteristic of applied behavior anal- 
ysis. As articulated by Don Baer’s chapter-opening statement, 


a major strength of applied behavior analysis is its insistence 
on experimentation as its method of proof, which enables and 
demands an ongoing, self-correcting search for effectiveness. 
An experimental analysis must be accomplished to deter- 
mine if and how a given behavior functions in relation to spe- 
cific changes in the environment. This chapter introduces basic 
concepts and strategies that underlie the analysis in applied 
behavior analysis.! First, we briefly review some general con- 
ceptions of science and then continue with a discussion of two 
defining features and two assumptions about behavior that dic- 
tate the experimental methods most conducive to the subject 
matter. We identify the essential components of an experiment 
in applied behavior analysis and describe the basic logic that 
guides experimental methods in applied behavior analysis. 


CONCEPTS AND ASSUMPTIONS UNDERLYING 
THE ANALYSIS OF BEHAVIOR 


As discussed in Chapter |, scientists share a set of common 
perspectives that include assumptions about the nature of the 
phenomena they study (determinism), the kind of information 
that should be gathered on the phenomena of interest (empiri- 
cism), the way questions about the workings of nature are most 
effectively examined (experimentation), and how an exper- 
iment’s results should be judged (with parsimony and philo- 
sophic doubt). As Skinner (1953) noted, these attitudes apply to 
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all scientific disciplines: “The basic characteristics of science 
are not restricted to any particular subject matter” (p. 11). 

The overall goal of science is complete understanding of 
the phenomena under study—socially significant behavior, in the 
case of applied behavior analysis. Scientists use a wide variety 
of methods that yield understanding at three levels: description, 
prediction, and control. First, systematic observation enables 
scientists to describe natural phenomena accurately. Descrip- 
tive knowledge of this type yields a collection of facts about 
the observed events—facts that can be quantified arid classified, 
a necessary and important element of any scientific discipline. 

A second level of scientific understanding is achieved 
when repeated observation shows that two events consistently 
covary. That is, the occurrence of one event (e.g., marriage) is 
associated with the occurrence of another event at some reliable 
degree of probability (e.g., longer life expectancy). The system- 
atic covariation between two events—termed a correlation—can 
be used to predict the probability that one event will occur based 
on the presence of the other event. 

The ability to predict successfully is a useful result of 
science; prediction allows preparation. However, the greatest 
potential benefits of science are derived from the third, and 
highest, level of scientific understanding, which comes from 
establishing experimental control. “The experimental method is 
a method for isolating the relevant variables within a pattern of 
events. Methods that depend merely on observed correlations, 
without experimental intervention, are inherently ambiguous” 
(Dinsmoor, 2003, p. 152). 


Experimental Control: The Path to and Goal 
of Behavior Analysis 


Behavior—the interaction between an organism and its 
environment—is best analyzed by measuring behavior change 
produced by imposed variations on the environment. This 
statement embodies the general strategy and the goal of behav- 
ior analytic research: to demonstrate that changes in the tar- 
get behavior occurred because of experimentally manipulated 
changes in the environment. 

Experimental control is achieved when a predictable 
change in behavior (the dependent variable) is reliably pro- 
duced by the systematic manipulation of some aspect of the 
environment (the independent variable). Experimentally deter- 
mining the effects of environmental manipulation on behavior 
and demonstrating that those effects can be reliably produced 
constitute the analysis in applied behavior analysis. An analy- 
sis of a behavior has been achieved when a reliable functional 
relation between the behavior and some specified aspect of the 
environment has been demonstrated convincingly. Knowledge 
of functional relations enables practitioners trained in applied 
behavior analysis to reliably alter behavior in meaningful ways. 

An analysis of behavior “requires a believable demonstra- 
tion of the events that can be responsible for the occurrence or 
nonoccurrence of that behavior. An experimenter has achieved 
an analysis of a behavior when he can exercise control over it” 
(Baer, Wolf, & Risley, 1968, p. 94),? That behavior analysts seek 
and value the experimental isolation of a given environmental 
variable of which a behavior is shown to be a function is often 


misinterpreted as support for a simplistic conception of behav- 
ioral causality. The fact that a behavior varies as a function of a 
given variable does not preclude that behavior also varying as a 
function of other variables. Thus, Baer and colleagues described 
an experimental analysis as a convincing demonstration that a 
variable can be responsible for the observed behavior change. 
Although a complete analysis (i.e., understanding) of a behavior 
requires an accounting of all of its multiple causes, an applied 
(i.e., technologically useful) analysis has been accomplished when 
the investigator has isolated an environmental variable (or group of 
variables that operate together as a treatment package) that reliably 
produces socially significant behavior change. An applied analysis 
also requires the target behavior to be a function of an environ- 
mental event that can be practically and ethically manipulated. 

An experiment that shows convincingly that changes in 
the dependent variable are a function of the independent vari- 
able and not the result of uncontrolled or unknown variables is 
said to have a high degree of internal validity. A study with 
questionable internal validity yields no meaningful statements 
regarding functional relations between the variables exam- 
ined in the experiment. Nor can a study without convincing 
internal validity be used as the basis for statements claiming 
generality of the findings to other persons, settings, and/or 
behaviors. External validity refers to the degree to which a 
study’s results are generalizable to other subjects, settings, and/ 
or behaviors. Strategies for assessing and extending the external 
validity of experimentally demonstrated functional relations are 
discussed in Chapter 10. 

When planning an experiment and later when examining 
data from an ongoing study, investigators must always look out 
for threats to internal validity. Uncontrolled variables known 
or suspected to exert an influence on the dependent variable 
are called confounding variables. For example, suppose a 
researcher wants to analyze the effects of guided lecture notes 
on high school biology students’ scores on next-day quizzes. 
Just one of many potential confounding variables the researcher 
would need to take into account would be each student’s chang- 
ing level of interest in and background knowledge about the 
specific curriculum content (e.g., a student’s high score on a 
quiz following a lecture on sea life may be due to his prior 
knowledge about fishing, not the guided notes provided during 
that lecture). 

A primary factor in evaluating the internal validity of an 
experiment is the extent to which it eliminates or controls the effects 
of confounding variables while still investigating the research ques- 
tions of interest. Eliminating every possible source of uncontrolled 
variability in an experiment is an impossible ideal, but one all 
researchers should strive to attain. In reality, the goal of experi- 
mental design is eliminating as many uncontrolled variables as 
possible while holding constant the influence of all other variables 
except the independent variable, which is purposefully manipulated 
to determine its effects. 


Behavior: Defining Features and Assumptions 


that Guide Its Analysis 


Behavior is a difficult subject matter, not because it is inac- 
cessible, but because it is extremely complex. Since it is a 
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process, rather than a thing, it cannot easily be held still for 
observation. It is changing, fluid, evanescent, and for this 
reason it makes great technical demands upon the ingenuity 
and energy of the scientist. 


—B. F. Skinner (1953, p. 15) 


How a scientific discipline defines its subject matter exerts pro- 
found influence and imposes certain constraints on the experi- 
mental strategies that will be most effective in an understanding 
of it. Experimental methods in behavior analysis are guided 
by two defining features of behavior: (a) Behavior is an indi- 
vidual phenomenon and (b) behavior is dynamic and continu- 
ous; and two assumptions concerning its nature: (a) behavior 


is determined and (b) behavioral variability is extrinsic to the 
organism. 


Behavior Is an Individual Phenomenon 


If behavior is defined as a person’s interaction with the envi- 
ronment, it follows that a science seeking to discover general 
principles or laws that govern behavior must study the behavior 
of individuals. Groups of people do not behave; individual peo- 
ple do. Thus, the experimental strategy of behavior analysis uses 
within-subject (or single-case) methods of analysis. 

The average performance of a group of individuals often 
yields interesting and useful information and may, depending 
on the methods by which individuals were selected to be in the 
group, enable probability statements about the average perfor- 
mance within the larger population represented by the group. 
However, “group data” provide no information about the behay- 
ior of any individual or how any individual might perform in 
the future. For example, although administrators and taxpayers 
may be justifiably interested in the average increase in students’ 
reading comprehension from grade level to grade level, such 
information is of little use to the classroom teacher who must 
decide how to improve a given student’s comprehension skills. 

Nonetheless, learning how behavior—environment rela- 
tions work with many individuals is vital. A science of behavior 
contributes to a useful technology of behavior change only to 
the extent that it discovers functional relations with generality 
across individuals. The issue is how to achieve that general- 
ity. Behavior analysts have found that discovery of behavioral 
principles with generality across persons is best accomplished 
by replicating demonstrated functional relations with additional 
subjects. 


Behavior Is Dynamic and Continuous 


Just as behavior cannot take place in an environmental void, so 
must behavior occur in time. “Behavior is a continuous process, 
from birth through to death. The problem is that this stream of 
change cannot be stopped for scientific purposes” (Keenan & 
Dillenburger, 2000, p. 26). Measuring behavior as it changes 
over time is a fundamental feature of applied behavior anal- 
ysis. Therefore, single measures, or even multiple measures 
sporadically dispersed over time, cannot provide an adequate 
description of behavior change. Only continuous measurement 
yields a complete record of behavior as it occurs in context with 
its environmental influences. Because true continuous measure- 
ment is seldom feasible in applied settings, systematic repeated 


measurement of behavior (as described in Chapters 4 and 5) is 
the hallmark of applied behavior analysis. 


Behavior Is Determined 


As discussed in Chapter 1, all scientists assume that the universe 
is a lawful and orderly place and that natural phenomena occur 
in relation to other natural events. 


The touchstone of all scientific research is order. In 

the experimental analysis of behavior, the orderliness 

of relations between environmental variables and the 

subject’s behavior is at once the operating assumption 

upon which the experimenter proceeds, the observed 

fact that permits doing so, and the goal that continuously 

focuses experimental decisions. (Johnston & Pennypacker, 

1993a, p. 238) 

Behavior analysts consider behavior to be a natural 
phenomenon that, like all natural phenomena, is determined. 
Although determinism must always remain an assumption—it 
cannot be proven—it is an assumption with strong empirical 
support. 


Data gathered from all scientific fields indicate that deter- 
minism holds throughout nature. It has become clear that 
the law of determinism, that is, that all things are deter- 
mined, holds for the behavioral area also. . .. When looking 
at actual behavior we’ ve found that in situation 1, behavior 
is caused; in situation 2, behavior is caused; in situation 3, 
behavior is caused; . . . and in situation 1001, behavior is 
caused. Every time an experimenter introduces an indepen- 
dent variable that produces some behavior or some change 
in behavior, we have further empirical evidence that behav- 
ior is caused or deterministic. (Malott, General, & Snapper, 
19735 ppl (0175) 


Behavioral Variability ls Extrinsic to the Organism 


When repeated measures reveal inconsistent responding by a sub- 
ject when environmental conditions are held constant, the data 
are said to display variability. The experimental approach most 
commonly used in psychology and other social and behavioral 
sciences (e.g., education, sociology, political science) makes two 
assumptions about such variability: (a) Behavioral variability is 
an intrinsic characteristic of the organism, and (b) behavioral vari- 
ability is distributed randomly among individuals in any given 
population. These assumptions have critical methodological 
implications: (a) Attempting to control or investigate variability 
is a waste of time—it simply exists, it’s a given; and (b) by aver- 
aging the performance of individual subjects within large groups, 
the random nature of variability can be statistically controlled or 
canceled out. Both assumptions about behavioral variability are 
likely false (empirical evidence points in the opposite direction), 
and the methods they encourage are detrimental to a science of 
behavior. “Variables are not canceled statistically. They are sim- 
ply buried so their effects are not seen” (Sidman, 1960, p. 162).° 

Behavior analysts approach variability in their data quite 
differently. A fundamental assumption underlying the design 
and guiding the conduct of experiments in behavior analysis is 
that, rather than being an intrinsic characteristic of the organism, 
behavioral variability is the result of environmental influence: the 
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independent variable with which the investigator seeks to produce 
change, an uncontrolled aspect of the experiment itself, and/or an 
uncontrolled or unknown factor outside the experiment. 

The assumption of extrinsic variability yields the follow- 
ing methodological implication: Instead of averaging the per- 
formance of many subjects in an attempt to mask variability 
(and forfeiting the opportunity to understand and control it), 
behavior analysts experimentally manipulate factors suspected 
of causing the variability. Searching for the causal factors con- 
tributes to the scientific understanding of behavior, because 
experimentally demonstrating a source of variability suggests 
another functional relation. In some studies, “tracking down 
these answers may even turn out to be more rewarding than 
answering the original experimental question” (Johnston & 
Pennypacker, 1980, p. 226). 

From a purely scientific viewpoint, experimentally track- 
ing down sources of variability is always the preferred approach. 
However, the applied behavior analyst, with a problem to solve, 
must often take variability as it presents itself (Sidman, 1960). 
Sometimes the applied researcher has neither the time nor the 
resources to experimentally manipulate even suspected and 
likely sources of variability (e.g., a teacher who interacts with 
a student for only part of the day has no hope of controlling 
the many variables outside the classroom that influence the 
student’s behavior). In most settings, applied behavior analysts 
seek treatment variables robust enough to overcome variability 
induced by uncontrolled factors (Baer, 1977b). 


COMPONENTS OF APPLIED BEHAVIOR 
ANALYSIS EXPERIMENTS 


Nature to be commanded must be obeyed. . . . But, that coin 
has another face. Once obeyed, nature can be commanded. 
—B. F. Skinner (1956, p. 232) 


Experimentation is the scientist’s way of discovering nature’s 
rules. Discoveries that prove valid and reliable can contribute 
to a technology of effective behavior change. Experiments in 
applied behavior analysis include these essential components: 


* Research question 

¢ At least one participant (subject) 

* At least one behavior (dependent variable) 

* At least one setting 

* A system for measuring the behavior and ongoing visual 
analysis of the data 

* At least one intervention or treatment condition (indepen- 
dent variable) 

* Manipulation of the independent variable to determine its 
effects on the dependent variable (experimental design) 


Research Question 


We conduct experiments to find out something we do 
not know. 


—Murray Sidman (1960, p. 214) 


For the applied behavior analyst, Sidman’s “something we do 
not know” is cast in the form of a question about the existence 


and/or specific nature of a functional relation between mean- 
ingful improvement in socially significant behavior and one or 
more of its controlling variables. A research question specifies 
what the investigator wants the experiment to answer. Research 
questions in applied behavior analysis take many forms; most 
questions are one of four types: 


¢ Demonstration—to what extent will this intervention 
work (i.e., change the behavior of interest)? 

¢ Parametric—does more or less of the intervention work 
better? 

¢ Component—how effective is the intervention when var- 
ious components are added or subtracted? 

¢ Comparative—does one intervention work better than 
another? (adapted from Wolery, Lane, & Common, 2018) 


In some published reports of applied behavior analysis 
studies, the investigators state their research question(s) explic- 
itly, as in these examples: 


¢ The research questions were as follows: 1. What are the 
effects of a self-monitoring package with conferencing on 
the completion of spelling and math homework for stu- 
dents with disabilities in an inclusive general education 
classroom? 2. What are the effects of a self-monitoring 
package with conferencing on the accuracy of spelling 
and math homework for students with disabilities in an 
inclusive general education classroom? 3. If gains are 
made in completion and accuracy of spelling and math 
homework, will students maintain them 2 and 3 weeks 
after the intervention has ended? (Falkenberg & Barbetta, 
2013, pp. 191-192) 

* Four primary research questions were addressed: What 
are the effects of training middle school students with 
learning disabilities to recruit teacher attention in the spe- 
cial education classroom on (a) the number of recruiting 
responses they emit in the general education classroom, 
(b) the number of teacher praise statements received by 
the students in the general education classroom, (c) the 
number of instructional feedback statements received 
by the students in the general education classroom, and 
(d) the students’ academic productivity and accuracy 
in the general education classroom? (Alber, Heward, & 
Hippler, 1999, p. 255) 

* Experiment 1—What are the differences in responding 
under the “no programmed consequence,” “praise,” and 
“known reinforcer” conditions? To what extent will 3 
to 5 “pairing” sessions per day affect responding during 
“praise” sessions conducted at the beginning and end of 
each day? (Axe & Laprime, 2017, p. 330) 


More often, however, the research question is implicit 
within a statement of the study’s purpose. For example: 


* [T]he purpose of this study was to determine the effects 
of self-graphing on the quantity and quality of writing 
for students with high-incidence disabilities. (Stolz, Itoi, 
Konrad, & Alber-Morgan, 2008, p. 174) 

* [T]he purpose of this study was to evaluate the extent to 
which lag schedules of reinforcement produced responses 
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that were already in the existing repertoires of three chil- 
dren diagnosed with autism, as well as the extent to which 
lag schedules produced novel responses. (Contreras & 
Betz, 2016, p. 5) 

¢ [T]he purpose of this study was to examine the use of 
habit reversal to decrease filled pauses that occur during 
public speaking. (Mancuso & Miltenberger, 2016, p. 188) 

* [W]e sought to determine if teaching three children with 
autism to monitor their peers’ reading responses would lead 
to the acquisition of sight words. (Taylor, DeQuinzio, & 
Stine, 2012, p. 815) 

* The purpose of this study was to determine whether natu- 
rally occurring meals would affect performance adversely 
during postmeal sessions in which highly preferred food 
was used as reinforcement. (Zhou, Iwata, & Shore, 2002, 
pp. 411-412) 


Whether the research question is stated explicitly in the 
form of a question or implicit within a statement of purpose, all 


aspects of the experiment’s design and conduct should follow 
from it. 


A good design is one that answers the question convinc- 
ingly, and as such needs to be constructed in reaction to 
the question and then tested through arguments in that 
context (sometimes called “thinking through”), rather 
than imitated from a textbook. (Baer, Wolf, & Risley, 
1987, p. 319) 


Participant 


Experiments in applied behavior analysis are most often 
referred to as single-subject or single-case designs. These 
terms do not mean that applied behavior analysis studies 
are conducted with only one subject (though some are); 
they denote an experimental logic for detecting and analyz- 
ing functional relations between environmental variables 
and behavior change that employs each subject as her own 
control.* Repeated measures of each subject’s behavior are 
obtained as she is exposed to each condition of the study 
(e.g., the presence and absence of the independent variable). 
A subject is often exposed to each condition several times 
over the course of an experiment. Measures of the subject’s 
behavior during each phase of the study provide the basis for 
comparing the effects of experimental variables as they are 
presented or withdrawn in subsequent conditions. 

Although most applied behavior analysis studies involve 
more than one subject, each subject’s data are graphed and ana- 
lyzed separately.> Instead of using single-case or single-subject 
design to refer to an experiment in which each subject serves 
as his or her own control, some authors use more aptly descrip- 
tive terms such as within-subject design, intrasubject design, and 
repeated-measures design.® 

Sometimes the applied behavior analyst is interested in 
assessing the overall effect of a treatment variable within a group 
of subjects—for example, the number of homework assign- 
ments completed by members of a class of fifth-grade students. 
In such a case, the total number of assignments completed may 
be measured, graphed, and analyzed as a dependent variable. 


However, unless each student’s data are individually graphed 
and interpreted, no individual student’s behavior has been ana- 
lyzed, and the data for the group may not be representative of any 
individual subject. 

Use of a single participant, or a small number of partici- 
pants, each of whom is considered an intact experiment, stands in 
sharp contrast to the between-group designs traditionally used in 
psychology and the other social sciences that employ large num- 
bers of subjects. Proponents of between-group designs contend 
that using a large number of subjects controls for the variability 
discussed earlier and increases the generality (or external validity) 
of any findings to the population from which the subjects were 
selected. The advantages and disadvantages of an experimental 
approach based on within-subject comparisons of the behavior 
of individual subjects versus comparisons of the average perfor- 
mance of different groups of subjects is discussed in Chapter 10. 


Behavior: Dependent Variable 


The target behavior in an applied behavior analysis experiment, 
or more precisely a measurable dimensional quantity of that 
behavior (e.g., rate, duration), is called the dependent variable. 
It is so labeled because the experiment is designed precisely 
to determine whether the behavior is, in fact, dependent on 
(i.e., a function of) the independent variable(s) manipulated by the 
investigator. (Criteria and procedures for selecting and defining 
response classes that meet the applied requirements as dependent 
variables in applied behavior analysis are described in Chapter 3.) 
In some studies multiple behaviors are measured. One rea- 
son for measuring multiple behaviors is to provide data patterns 
that can serve as controls for evaluating and replicating the effects 
of an independent variable as it is applied sequentially to each of 
the behaviors.’ A second reason for multiple dependent measures 
is to assess the presence and extent of the independent variable’s 
effects on behaviors other than the response class to which it was 
applied directly. This strategy can reveal whether the independent 
variable has collateral effects—either desired or undesired—on 
other behaviors of interest. Such behaviors are referred to as sec- 
ondary dependent variables. The experimenter obtains regular 
measures of their rate of occurrence, though perhaps not as often 
as measures of the primary dependent variable are recorded. 
Still another reason for measuring multiple behaviors is 
to determine whether changes in the behavior of a person other 
than the subject occur during the course of an experiment and 
whether such changes might in turn explain observed changes 
in the subject’s behavior. This strategy is implemented primarily 
to assess the effects of a suspected confounding variable: The 
additional behavior or behaviors measured are not true depen- 
dent variables in the sense of undergoing analysis. For exam- 
ple, in a classic study analyzing the effects of the self-recording 
by a junior high school girl on her classroom study behavior, 
Broden, Hall, and Mitts (1971) observed and recorded the num- 
ber of times the girl’s teacher paid attention to her through- 
out the experiment. If the frequency of teacher attention had 
covaried with changes in the girl’s study behavior, a functional 
relation between self-recording and study behavior would not 
have been demonstrated. In that case, teacher attention would 
likely have been identified as a potential confounding variable, 
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and the focus of the investigation shifted to include efforts to 
experimentally control it (i.e., to hold teacher attention constant) 
or to systematically manipulate and analyze its effects. However, 
the data revealed no correlation between teacher attention and 
study behavior during the first four phases of the experiment, 
when concern was highest that teacher attention may have been 
a confounding variable. 


Setting 


Control the environment and you will see order in behavior. 
—B. F. Skinner (1967, p. 399) 


Functional relations are demonstrated when observed variations 
in behavior can be attributed to specific operations imposed 
on the environment. Experimental control is achieved when 
the systematic manipulation of some aspect of the subject’s 
environment (the independent variable) reliably produces a 
predictable change in behavior (the dependent variable). To 
make such an attribution, the investigator must, among other 
things, control two sets of environmental variables. First, the 
investigator must control the presence, absence, or value of the 
independent variable in every phase of the experiment. Sec- 
ond, the investigator must control, by holding constant, all other 
aspects of the experimental setting—extraneous variables— 
to prevent unplanned environmental variation. These two 
operations—precisely manipulating the independent variable 
and maintaining the constancy of every other relevant aspect 
of the experimental setting—define the second meaning of 
experimental control. 

In basic laboratory research, experimental space is 
designed and furnished to maximize experimental control. 
Lighting, temperature, and sound, for example, are all held con- 
stant, and programmed apparatus virtually guarantee the presen- 
tation of antecedent stimuli and the delivery of consequences 
as planned. Applied behavior analysts carry out research where 
socially important behaviors naturally occur—classrooms, 
homes, workplaces, and community settings. It is impossible to 
control every feature of an applied environment, and to add to 
the difficulty, subjects are typically in the experimental setting 
for a small portion of each day, bringing with them the influ- 
ences of events and contingencies operating in other settings. 

In spite of the complexity and ever-changing nature of 
applied settings, the behavior analyst must make every effort to 
hold constant all seemingly relevant aspects of the environment. 
When unplanned variations are noted, the investigator must wait 
out their effects or try to incorporate them into the design of the 
experiment. In any event, repeated measures of the subject’s 
behavior provide the barometer for assessing whether unplanned 
environmental changes are of concern. 

Applied studies are often conducted in more than one 
setting. Researchers sometimes use concurrent measures of the 
same behavior obtained across settings as controls for analyz- 
ing the effects of an independent variable that is sequentially 
applied to the behavior in each setting.® Data are also collected 
in multiple settings to assess the extent to which behavior 
changes observed in the primary setting have occurred in the 
other setting(s). 


Measurement System and Ongoing Visual 
Analysis 


Beginning students of behavior analysis sometimes believe the 
discipline is preoccupied with issues and procedures related to 
the observation and measurement of behavior. They want to get 
on with the analysis. Measuring behavior change over time is the 
principal feature of single-case experimental method. The results 
of any experiment can be presented and interpreted only in terms 
of what was measured, and the observation and recording proce- 
dures used in the study determine not only what was measured 
but also how well it was measured (i.e., how representative of the 
subject’s actual behavior is the estimate provided by the experi- 
mental data—all measurements of behavior, no matter how fre- 
quent and technically precise, are estimates of true values). 

Observation and recording procedures must be conducted 
in a standardized manner throughout each session of an experi- 
ment. Standardization involves every aspect of the measurement 
system, from the definition of the target behavior (dependent 
variable) to the scheduling of observations to the manner in 
which the raw data are transposed from recording sheets to ses- 
sion summary sheets to the way the data are graphed. As detailed 
in Chapter 5, unintended or imposed changes in measurement 
tactics can result in unwanted variability or confounded treat- 
ment effects. 

The previous chapter outlined advantages that accrue to 
the behavioral researchers and practitioners who maintain direct 
contact with the experimental data by ongoing visual inspection 
of graphic displays. Behavior analysts must recognize changes 
in level, trend, and degree of variability as these changes develop 
in the data. Because behavior is a continuous, dynamic phenom- 
enon, experiments designed to discover its controlling variables 
must enable the investigator to inspect and respond to the data 
continuously as the study progresses. Only in this way can the 
behavior analyst be ready to manipulate features of the environ- 
ment at the time and in the manner that will best reveal functional 
relations and minimize the effects of confounding variables. 


Intervention or Treatment: Independent 
Variable 


Behavior analysts seek functional relations between behavior 
change and environmental variables. The particular aspect of 
the environment that the experimenter manipulates to find out 
whether it affects the subject’s behavior is called the independent 
variable. Sometimes called the experimental variable, inter- 
vention, or treatment, this component of an experiment is 
called the independent variable because the researcher controls 
or manipulates it independent of the subject’s behavior or any 
other event. (As we will soon see, though, manipulating the 
independent variable without regard to measures of the depen- 
dent variable is unwise.) Whereas changes in the experimen- 
tal setting required to conduct the study (e.g., the presence 
of observers to measure behavior) are made with the goal of 
minimizing their effects on the dependent variable, “changes 
in the independent variable are arranged by the experimenter in 
order to maximize . . . its influence on responding” (Johnston & 
Pennypacker, 1980, p. 260). 
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Manipulation of the Independent Variable: 
Experimental Design 


Experimental design refers to the particular arrangement of con- 
ditions in a study so that meaningful comparisons of the effects 
of the presence, absence, or different values of the independent 
variable can be made. Independent variables can be introduced, 
withdrawn, increased or decreased in value, or combined across 
behaviors, settings, and/or subjects in an infinite number of ways.” 
However, with respect to the behavior of a given subject, at a 
given point in time in a given setting, an experimenter can make 
only two basic kinds of independent variable changes: introduce 
a new condition or reinstate an old condition. In essence, “Exper- 
imental designs are merely temporal arrangements of various new 
and old conditions across behaviors and settings in ways that pro- 
duce data that are convincing to the investigator and the audience” 
(Johnston & Pennypacker, 1980, p. 270). 

In the simplest case—from an analytic perspective, but 
not necessarily a practical point of view—an independent vari- 
able can be either present or absent during each time period or 
phase of a study. When the independent variable is in either 
of these conditions during a study, the experiment is termed a 
nonparametric study. In contrast, a parametric analysis seeks to 
discover the differential effects of a range of values of the inde- 
pendent variable. Lerman, Kelley, Vorndran, Kuhn, and LaRue 
(2002) conducted a parametric analysis when they assessed the 
effects of different reinforcer magnitudes (20, 60, or 300 sec 
of access to toys or escape from demands) on the duration of 
postreinforcement pause and resistance to extinction. Examples 
of different values of independent variables in other parametric 
studies: short (2 sec), progressive (2 sec to 20 sec), and long 
(20 sec) intertrial-interval durations during language training 
(Cariveau, Kodak, & Campbell, 2016); and teacher praise rates 
of 1, 2, or 8 per minute during small-group instruction (Kranak, 
Alber-Morgan, & Sawyer, 2017). 

When an investigator is interested in comparing the 
effects of two or more treatment alternatives, multiple inde- 
pendent variables become part of the experiment. For example, 
perhaps two separate treatments are evaluated as well as the 
effects of a third treatment, which is a combination of both vari- 
ables. However, even in studies with multiple independent vari- 
ables, the researcher must heed a simple but fundamental rule 
of experimentation: Change only one variable at a time. Failure 
to heed this rule eliminates the possibility of attributing any 
measured behavior change to a specific independent variable. 

If two or more variables are altered simultaneously and 
changes in the dependent variable are noted, no conclusions can 
be made with regard to the contribution of any one of the altered 
variables to the behavior change. If two variables changed 
together, both could have contributed equally to the resultant 
behavior change; one variable could have been solely, or mostly, 
responsible for the change; or one variable may have had a nega- 
tive or counterproductive effect, but the other independent vari- 
able was sufficiently strong to overcome this effect, resulting in 
a net gain. Any of these explanations or combinations may have 
accounted for the change. 

As stated previously, applied behavior analysts often con- 
duct their experiments in “noisy” environments where effective 


treatments are required for reasons related to personal safety 
or exigent circumstances. In such cases, applied behavior ana- 
lysts sometimes “package” multiple and well-documented 
and effective treatments, knowing that multiple independent 
variables are being introduced. As implied earlier, a pack- 
age intervention is one in which multiple independent vari- 
ables are being combined or bundled into one program (e.g.. 
token reinforncement + praise + self-recording + time-out). 
However, from the perspective of experimental analysis, the rule 
to change only one variable at a time still holds. When manipu- 
lating a treatment package, the experimenter must ensure that the 
entire package is presented or withdrawn each time a manipula- 
tion occurs. In this situation, it is important to understand that the 
entire package is being evaluated, not the discrete components 
that make up the package. If, at a later time, the analyst wishes to 
determine the relative contributions of each part of the package, 
she would need to carry out a component analysis. Chapters 8 and 
9 describe experimental tactics for component analyses. 

No off-the-shelf experimental designs are available for 
a given research problem. Investigators must avoid following 
textbook “designs” that (a) require a priori assumptions about 
the nature of the functional relation one seeks to investigate and 
(b) may be insensitive to unanticipated changes in behavior. 
Researchers should select and combine experimental tactics that 
best fit their research questions, while standing ever ready to 
“explore relevant variables by manipulating them in an impro- 
vised and rapidly changing design” (Skinner, 1966, p. 21). 

Simultaneous with, and to a large degree responsible 
for, the growth and success of applied behavior analysis have 
been the development and refinement of a powerful group of 
extremely flexible experimental tactics for analyzing behavior— 
environment relations. We describe the most widely used of 
these tactics in Chapters 8 and 9. However, to effectively select, 
modify, and combine these tactics into convincing experiments, 
behavior analysts must fully understand the experimental rea- 
soning, or logic, that provides the foundation for within-subject 
experimental comparisons. 


STEADY STATE STRATEGY AND BASELINE 
LOGIC 


Steady state responding—‘a pattern of responding that exhib- 
its relatively little variation in its measured dimensional quan- 
tities over a period of time” (Johnston & Pennypacker, 1993a, 
p. 199)—provides the basis for a powerful form of experimental 
reasoning called baseline logic. Baseline logic entails three ele- 
ments—prediction, verification, and replication—each of which 
depends on an overall experimental approach called steady state 
strategy. Steady state strategy entails exposing a subject to a 
given condition while trying to eliminate or control any extra- 
neous influences on the behavior and obtaining a stable pattern 
of responding before introducing the next condition. 


Nature and Function of Baseline Data 


Behavior analysts discover behavior—-environment relations by 
comparing data from repeated measures of a subject’s behay- 
ior under the different conditions of the experiment. The most 
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common method of evaluating the effects of a given variable is 
to impose it on an ongoing measure of behavior obtained in its 
absence. These original data serve as the baseline against which 
any observed changes in behavior when the independent vari- 
able is introduced can be compared. A baseline serves as a con- 
trol condition; baseline does not necessarily mean the absence 
of instruction or treatment, only the absence of the independent 
variable of experimental interest. 


Why Establish a Baseline? 


From a purely scientific or analytic perspective, the primary pur- 
pose for establishing a baseline level of responding is to use the 
subject’s performance in the absence of the independent variable 
as an objective basis for detecting the effects of the independent 
variable. Obtaining baseline data can yield several applied bene- 
fits. For one, systematic observation of the target behavior before 
a treatment variable is introduced provides the opportunity to 
look for and note environmental events that occur just before and 
just after the target behavior. Such empirically obtained descrip- 
tions of antecedent-behavior-consequent correlations are often 
invaluable in planning an effective intervention (see Chapter 25). 
For example, baseline observations revealing that a child’s dis- 
ruptive outbursts are consistently followed by parent or teacher 
attention can be used in designing an intervention of ignoring 
outbursts and contingent attention following desired behavior. 

Second, baseline data can provide valuable guidance in 
setting initial criteria for reinforcement, a particularly important 
step when a contingency is first put into effect (see Chapter 11). 
If the criteria are too high, the subject never comes into contact 
with the contingency; if they are too low, little or no improve- 
ment can be expected. 

From a practical perspective, a third reason for collect- 
ing baseline data concerns the merits of objective measurement 


versus subjective opinion. The results of systematic baseline 
measurement may convince the behavior analyst or significant 
others to reconsider the necessity and value of attempting to 
change the behavior. For example, a behavior targeted for inter- 
vention because of several recent and extreme instances is no 
longer targeted because baseline data show it is decreasing. Or, 
perhaps a behavior’s topography attracted undue attention from 
teachers or parents, but objective baseline measurement over 
several days reveals the behavior is not occurring at a rate or 
level that warrants intervention. 


Baseline Data Patterns 


Figure 7.1 shows examples of four data patterns sometimes gen- 
erated by baseline measurement. These hypothetical baselines 
represent only four examples of the wide variety of baseline 
data patterns experimenters and practitioners encounter. The 
potential combinations of different levels, trends, and degrees 
of variability are infinite. Nevertheless, in an effort to provide 
guidance to the beginning behavior analyst, some general state- 
ments will be given about the experimental decisions that might 
be warranted by the data patterns shown in Figure 7.1. 

Graph A shows a relatively stable baseline. The data pres- 
ent no evidence of an upward or downward trend, and all of the 
measures fall within a small range of values. A stable baseline 
provides the most desirable basis, or context, against which to 
look for effects of an independent variable. If changes in level, 
trend, and/or variability coincide with the introduction of an 
independent variable on a baseline as stable as that shown in 
Graph A, one can reasonably suspect that those changes may be 
related to the independent variable. 

The data in Graphs B and C represent an ascending 
baseline and a descending baseline, respectively. The data 
path in Graph B shows an increasing trend in the behavior over 
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time, whereas the data path in Graph C presents a decreas- 
ing trend. The applied behavior analyst must treat ascending 
and descending baseline data cautiously. Applied behavior 
analysts select dependent variables because they represent 
target behaviors that need to be changed. But ascending 
and descending baselines reveal behaviors in the process of 
changing. The effects of an independent variable introduced 
at this point are likely to be obscured or confounded by the 
variables responsible for the already-occurring change. But 
what if the applied investigator needs to change the behay- 
ior immediately? The applied perspective can help solve the 
dilemma. 

Whether a treatment variable should be introduced 
depends on whether the trending baseline data represent 
improving or deteriorating performance. When an ascending or 
a descending baseline represents behavior change in the ther- 
apeutically desired direction, the investigator should withhold 
treatment and continue to monitor the dependent variable under 
baseline conditions. When the behavior ceases to improve (as 
evidenced by stable responding) or begins to deteriorate, the 
independent variable can be applied. If the trend does not level 
off and the behavior continues to improve, the original problem 
may no longer be present, leaving no reason for introducing 
the treatment as planned (although the investigator might be 
motivated to isolate and analyze the variables responsible for 
the “spontaneous” improvement). Introducing an independent 
variable to an already-improving behavior makes it difficult, 
and often impossible, to claim any continued improvement as a 
function of the independent variable. 

An ascending or descending baseline that represents 
significantly deteriorating performance signals an immediate 
application of the independent variable. From an applied per- 
spective, the decision to intervene is obvious: The subject’s 
behavior is worsening, and a treatment designed to improve 
it should be introduced. An independent variable capable of 
effecting desired behavior change in spite of other variables 
“pushing” the behavior in the opposite direction is most likely 
a robust variable, one that will be a welcome addition to the 
behavior analyst’s list of effective treatments. The decision to 
introduce a treatment variable on a descending baseline is also a 
sound one from an analytic perspective, which will be discussed 
in the next section. 

Graph D in Figure 7.1 shows a highly unstable or 
variable baseline. The data in Graph D show just one of many 
possible patterns of unstable responding. The data points do not 
consistently fall within a narrow range of values, nor do they 
suggest any clear trend. Introducing the independent variable in 
the presence of such variability is unwise from an experimental 
standpoint. Variability is assumed to be the result of environ- 
mental variables, which in the case shown by Graph D seem to 
be operating in an uncontrolled fashion. Before the researcher 
can analyze the effects of an independent variable effectively, 
these uncontrolled sources of variability must be isolated and 
controlled. 

Stable baseline responding provides an index of the 
degree of experimental control the researcher has established. 
“If sufficiently stable responding cannot be obtained, the 
experimenter is in no position to add an independent variable of 


suspected but unknown influence. To do so would be to com- 
pound confusion and lead to further ignorance” (Johnston & 
Pennypacker, 1980, p. 229). 

Again, however, applied considerations must be bal- 
anced against purely scientific pursuits. The applied problem 
may be one that cannot wait to be solved (e.g., severe self- 
injurious behavior). Or, confounding variables in the subject’s 
environment and the setting(s) of the investigation may simply 
be beyond the experimenter’s control.!° In such situations, the 
independent variable is introduced with the hope of producing 
stable responding in its presence. Sidman (1960) agreed that 
“the behavioral engineer must ordinarily take variability as he 
finds it, and deal with it as an unavoidable fact of life” (p. 192). 


Prediction 


Prediction can be defined as “the anticipated outcome of 
a presently unknown or future measurement. It is the most 
elegant use of quantification upon which validation of all 
scientific and technological activity rests” (Johnston & 
Pennypacker, 1980, p. 120). Figure 7.2 shows a series of 
hypothetical measures representing a stable pattern of base- 
line responding. The consistency of the first five data points in 
the series encourages the prediction that—if no changes occur 
in the subject’s environment—subsequent measures will fall 
within the range of values obtained thus far. Indeed, a sixth 
measure is taken that gives credence to this prediction. The 
same prediction is made again, this time with more confi- 
dence, and another measure of behavior shows it to be correct. 
Throughout a baseline (or any other experimental condition), 
predictions are made and confirmed until the investigator has 
every reason to believe that the response measure will not 
change appreciably under the present conditions. Open data 
points in Figure 7.2 represent predicted measures of future 
responding. Given the stability of the obtained measures, few 
experienced scientists would quarrel with the prediction. 


Behavior 
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Figure 7.2 Solid data points represent actual measures of 
behavior that might be generated in a stable baseline; 
open data points within the box represent the level 
of responding predicted on the basis of the obtained 
measures, should the environmental conditions remain 
unchanged. 
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How many measures must be taken before an experi- 
menter or practitioner can predict future behavior with confi- 
dence? There are no criteria or pre-determined number of data 
points that ensure confident prediction. However, the following 
statements can serve as guidelines. 


¢ Baer and colleagues (1968) recommended continuing 
measurement until “stability is clear.” 

e All things being equal, many measures are better than a 
few; and the longer the period of time in which stable 
responding is obtained, the better the predictive power of 
those measures. 

¢ If the experimenter is not sure whether measurement has 
revealed stable responding, in all likelihood it has not, and 
more data should be collected before the independent vari- 
able is introduced. 
The investigator’s knowledge of the characteristics of the 
behavior being studied under constant conditions is invalu- 
able in deciding when to terminate baseline measurement 
and introduce the independent variable. That knowledge 
can be drawn from personal experience in obtaining stable 
baselines on similar response classes and from familiarity 
with patterns of baseline responding found in the pub- 
lished literature. 


Statements such as “collect baseline data for at least 
five sessions” or “obtain baseline measures for 2 consecutive 
weeks” are misguided or naive. Depending on the situation, five 
baseline data points or 2 weeks of baseline data may or may not 
provide a convincing picture of steady state responding. The 
question that must be addressed is this: Are the data sufficiently 
stable to provide a basis for comparison to measures collected 
from other conditions? This question can be answered only by 
ongoing prediction and confirmation with repeated measures 
obtained in an environment where all relevant conditions are 
held constant. 

Behavior analysts are often interested in analyzing func- 
tional relations between an instructional variable and the acqui- 
sition of new skills. In such situations it is sometimes assumed 
that baseline measures are zero. For example, one would expect 
repeated observations of a child who has never tied her shoes to 
yield a perfectly stable baseline of zero correct responses. How- 
ever, casual observations that have never shown a child to use 
a particular skill do not constitute a scientifically valid baseline 
and should not be used to justify any claims about the effects 
of instruction. It could be that, if given repeated opportunities 
to respond, the child would begin to emit the target behavior at 
a nonzero level. 

Practice effects refer to improvements in performance 
resulting from repeated opportunities to emit the behavior so 
that baseline measurements can be obtained. For example, 
attempting to obtain stable baseline data for students answer- 
ing arithmetic problems may result in improved performance 
simply because of the repeated practice inherent in the measure- 
ment process. Practice effects can confound a study, making it 
impossible to separate and account for the effects of practice and 
instruction on the student’s final performance. Repeated base- 
line measures should be used either to reveal the existence or to 
demonstrate the nonexistence of practice effects. When practice 


effects are suspected or found, baseline data collection should be 
continued until steady state responding is attained. 

The necessity to demonstrate a stable baseline and to 
control for practice effects empirically does not require applied 
behavior analysts to withhold needed treatment or intervention. 
Nothing is gained by collecting unduly long baselines of behay- 
iors that cannot reasonably be expected to be in the subject's 
repertoire. Many behaviors cannot be emitted unless the subject 
is competent in certain prerequisite behaviors. For example, there 
is no legitimate possibility of a child’s tying his shoes if he cur- 
rently does not pick up the laces, or of a student’s solving division 
problems if she cannot subtract and multiply. Obtaining extended 
baseline data in such cases is unnecessary pro forma measure- 
ment. Such measures do “not so much represent zero behavior 
as zero opportunity for behavior to occur, and there is no need to 
document at the level of well-measured data that behavior does 
not occur when it cannot” (Horner & Baer, 1978, p. 190). 

Fortunately, applied behavior analysts need neither abandon 
the use of steady state strategy nor repeatedly measure nonexis- 
tent behavior at the expense of a treatment condition. The multi- 
ple probe design, described in Chapter 9, embodies steady state 
logic to analyze functional relations between instruction and the 
acquisition of behaviors shown to be nonexistent in the subject’s 
repertoire prior to the introduction of the independent variable. 


Affirmation of the Consequent 


The predictive power of steady state responding enables the 
behavior analyst to employ a form of inductive logic known 
as affirmation of the consequent (Johnston & Pennypacker, 
1980). The reasoning behind affirmation of the consequent 
begins with a true antecedent—consequent (if-A-then-B) state- 
ment and proceeds as follows: 


1. If A is true, then B is true. 
2. B is found to be true. 
3. Therefore, A is true. 


The behavior analyst’ s version goes like this: 


1. If the independent variable is a controlling factor for the 
behavior (A), then the data obtained in the presence of 
the independent variable will show that the behavior has 
changed (B). 


2. When the independent variable is present, the data show 
that the behavior has changed (B is true). 


3. Therefore, the independent variable is a controlling vari- 
able for the behavior (therefore, A is true). 


Although the logic is flawed—other factors could be 
responsible for the truthfulness of A—a successful (iés).cone 
vincing) experiment affirms several if-A-then-B possibilities, 
each one reducing the likelihood that factors other than the 
independent variable are responsible for the observed changes 
in behavior. 

Figures 7.3 to 7.5 illustrate how prediction, verification, 
and replication are employed in a hypothetical experiment using 
a reversal design, one of the more powerful analytic tactics used 
by behavior analysts (see Chapter 8). The data in Figure 7.3 
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Baseline Treatment 
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Figure 7.3 Affirmation of the consequent supporting 
the possibility of a functional relation between the 
behavior and treatment variable. The measures obtained 
in the presence of the treatment variable differ from 
the predicted level of responding in the absence of the 
treatment variable (open data points within the boxed 
area). Solid data points represent an A-B design. 


show a successful affirmation of the consequent. Steady state 
responding during Baseline enabled a prediction that, if no 
changes were made in the environment, continued measurement 
would yield data similar to those represented by the open-circle 
data points within the boxed area. The experimenter also pre- 
dicted that (more precisely, questioned whether) the treatment 
condition would produce a change in the behavior. The inde- 
pendent variable was introduced, and repeated measures of the 
dependent variable showed that the behavior did indeed change 
(solid data points in Treatment condition). 

These data enable two comparisons, one real and one 
hypothetical. First, the difference between the obtained mea- 
sures in the presence of the independent variable and those 
obtained during baseline supports the prediction that treatment 
would change the behavior and represents the extent of a possi- 
ble effect of the independent variable. The second comparison is 
between the data obtained in the treatment condition and the pre- 
dicted measures had the treatment variable not been introduced 
(i.e., the open data points within the boxed area of Figure 7.3). 
This comparison represents the behavior analyst’s hypothetical 
approximation of the ideal, but impossible-to-achieve, experi- 
mental design: the simultaneous measurement and comparison 
of the behavior of an individual subject in both the presence and 
absence of the treatment variable (Risley, 1969). 

Although the data in Figure 7.3 affirm the initial antecedent— 
consequent statement—a change in the behavior was observed in 
the presence of the independent variable—asserting a functional 
relation between the treatment variable and behavior change is 
unwarranted. The experimenter has not yet ruled out the possibil- 
ity of other variables being responsible for the behavior change. 
Perhaps some factor within or outside the experimental setting 
occurred at the same time the independent variable was intro- 
duced, and that factor is responsible for the change in behavior. 

Although two-phase experiments consisting of a pre- 
treatment baseline condition followed by a treatment condition 


(i.e., an A-B design) enable neither verification of the predic- 
tion of continued responding at baseline levels nor replication 
of the effects of the independent variable, studies using A-B 
designs can nevertheless contribute important and useful find- 
ings to research and practice (e.g., Azrin & Wesolowski, 1974; 
Ghaemmaghami, Hanley, & Jessel, 2016; Krueger, Rapp, Ott, 
Lood, & Novotny, 2013; Reid, Parsons, Phillips, & Green, 1993). 

A firmer statement about the relation between the treat- 
ment and the behavior can be made at this point, however, if 
changes in the dependent variable are not observed in the pres- 
ence of the independent variable. If accurate measures of the 
dependent variable were obtained with a measurement system 
sensitive to changes in the behavior and the treatment condi- 
tion was implemented as planned, then no behavior change in 
the presence of the independent variable constitutes discon- 
firmation of the consequent (B was shown not to be true), and 
the independent variable may be eliminated as a controlling 
variable. However, eliminating a treatment from the ranks of 
controlling variables on the basis of no observed effects pre- 
supposes experimental control of the highest order (Johnston & 
Pennypacker, 1993a). 

However, the data set in Figure 7.3 shows that behavior 
changed in the presence of the independent variable, revealing 
a correlation between the independent variable and the behavior 
change. To what extent was the behavior change a function of the 
independent variable? To pursue this question, the behavior ana- 
lyst employs the next component of baseline logic: verification. 


Verification 


The experimenter can increase the probability that an observed 
change in behavior was functionally related to the introduc- 
tion of the independent variable by verifying the original pre- 
diction of unchanging baseline measures. Verification can be 
accomplished by demonstrating that the prior level of baseline 
responding would have remained unchanged had the indepen- 
dent variable not been introduced. If that can be demonstrated, 
this operation verifies the accuracy of the original prediction of 
continued stable baseline responding and reduces the probability 
that some uncontrolled (confounding) variable was responsi- 
ble for the observed change in behavior. Again, the reasoning 
behind affirmation of the consequent is the logic that underlies 
the experimental strategy. 

Figure 7.4 illustrates the verification of effect in our 
hypothetical experiment. When steady state responding was 
established in the presence of the independent variable, the 
investigator removed the treatment variable, thereby returning to 
the previous baseline conditions. This tactic allows the possibil- 
ity of affirming two different antecedent—consequent statements. 
The first statement and its affirmation follow this pattern: 


1. If the independent variable is a controlling factor for the 
behavior (A), then its removal will coincide with changes 
in the response measure (B). 


2. Removal of the independent variable is accompanied by 
changes in the behavior (B is true). 


3. Therefore, the independent variable controls responding 
(therefore, A is true). 
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Figure 7.4 Verification of a previously predicted 
level of baseline responding by withdrawal of 
the treatment variable. The measures obtained 
during Baseline 2 (solid data points within the 
shaded box) show a successful verification and 
a second affirmation of the consequent based 
on a comparison with the predicted level of 
responding (open dots in Baseline 2) in the 
continued presence of the treatment variable. 


Behavior 


The second statement and affirmation follows this pattern: 


1. If the original baseline condition controlled the behavior 
(A), then a return to baseline conditions will result in sim- 
ilar levels of responding (B). 


2. The baseline condition is reinstated, and levels of respond- 
ing similar to those obtained during the original baseline 
phase are observed (B is true). 


3. Therefore, the baseline condition controlled the behavior 
both then and now (therefore, A is true). 


The six measures within the shaded area obtained during 
Baseline 2 of our hypothetical experiment verify the predic- 
tion made for Baseline 1. The open data points in Baseline 2 
represent the predicted level of responding if the independent 
variable had not been removed. (The prediction component 
of baseline logic applies to steady state responding obtained 
during any phase of an experiment, baseline and treatment 
conditions alike.) The difference between the data actually 
obtained during Treatment (solid data points) and the data 
obtained during Baseline 2 (solid data points) affirms the 
first if-A-then-B statement: If the treatment is a controlling 
variable, then its removal will result in changes in behavior. 
The similarity between measures obtained during Baseline 
2 and those obtained during Baseline 1 confirms the second 
if-A-then-B statement: If baseline conditions controlled the 
behavior before, reinstating baseline conditions will result in 
similar levels of responding. 

As before, the observed changes in behavior associated 
with the introduction and withdrawal of the independent vari- 
able are subject to interpretations other than the existence of a 
functional relation between the two events. However, the case 
for a functional relation is becoming stronger. When the inde- 
pendent variable was applied, behavior change was observed; 
when the independent variable was withdrawn, behavior again 
changed and responding returned to baseline levels. To the 
extent that the experimenter has effectively controlled the pres- 
ence and absence of the independent variable and held constant 
all other variables in the experimental setting that may have 
influenced the behavior, a functional relation appears likely. 


Baseline 2 


= 


An important behavior change has been produced and reversed 
by the introduction and withdrawal of the independent vari- 
able. The process of verification reduces the likelihood that a 
variable other than the independent variable was responsible 
for the observed behavior changes. 

Does this two-step strategy of prediction and verification 
constitute sufficient demonstration of a functional relation? 
What if some uncontrolled variable covaried with the inde- 
pendent variable as it was presented and withdrawn and that 
uncontrolled variable was responsible for the observed changes 
in behavior? If such were the case, claiming a functional rela- 
tion between the target behavior and the independent variable 
would at best be wrong and at the worst perhaps end the search 
for the actual controlling variables whose identification and 
control would contribute to an effective and reliable technol- 
ogy of behavior change. 

The appropriately skeptical investigator (and practitioner/ 
research consumer) will also question the reliability of the 
obtained effect. Was the apparent functional relation a fleet- 
ing, One-time-only phenomenon, or will repeated applica- 
tions of the independent variable reliably (i.e., consistently) 
produce a similar pattern of behavior change? An effective 
(.e., convincing) experimental design yields data responsive 
to these important questions. To investigate uncertain reliabil- 
ity, the behavior analyst employs the final, and perhaps the 
most important, component of baseline logic and experimental 
design: replication. 


Baseline Treatment 


Time 


Replication 


Replication is the essence of believability. 
—Baer, Wolf, and Risley (1968, p. 95) 


Within the context of any given experiment, replication means 
repeating independent variable manipulations conducted pre- 
viously in the study and obtaining similar outcomes. !! Repli- 
cation within an experiment has two important purposes. First, 
replicating a previously observed behavior change reduces 
the probability that some factor other than the independent 
variable was responsible for the now twice-observed behavior 
change. Second, replication demonstrates the reliability of the 
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Baseline Treatment 1 Baseline 2 Treatment 2 


Behavior 


Figure 7.5 Replication of experimental 
effect accomplished by reintroducing the 


treatment variable. The measures obtained 
Warcciieysve ON yo during Treatment 2 (solid data points within 
the cross hatched box) enhance the case for 


a functional relation between the treatment 
Time variable and the target behavior. 


behavior change; it can be made to happen again. Together, most important of which are the accuracy and sensitivity of the 
these joint outcomes make replication the basis for inferences measurement system; the degree of control the experimenter 
that the independent variable is responsible for the observed maintained over all relevant variables; the duration of experi- 


behavior change. mental phases; the stability of responding within each phase; 
Figure 7.5 adds the component of replication to our hypo- and the speed, magnitude, and consistency of behavior change 

thetical experiment. After steady state responding was obtained between conditions. If each of these considerations is satisfied, 

during Baseline 2, the independent variable is reintroduced. then replication of effect becomes perhaps the most critical fac- 

To the extent the data obtained during the Treatment 2 phase tor in claiming a functional relation. 

(data points within the box shaded with cross-hatched lines) An independent variable can be manipulated in an effort to 


resemble the measures obtained during Treatment 1, replication replicate an effect many times within an experiment. The num- 
has occurred. Our hypothetical experiment has now produced ber of replications required to demonstrate a functional relation 
powerful evidence of a functional relation between the inde- convincingly is related to many considerations, including all 
pendent and dependent variables. Confidence in the assertion of those just enumerated, and to the existence of other similar 
of a functional relation rests on numerous factors, some of the experiments that have produced the same effects. 


SUMMARY 


Introduction 7. A correlation exists when two events systematically covary 
with one another. Predictions can be made about the prob- 
ability that one event will occur based on the occurrence of 
the other event. 


1. Measurement can show whether, when, and how much 
behavior changes, but measurement alone cannot reveal 
why the behavior changes occurred. 


2. A useful technology of behavior change depends upon Exper imental Control: The Path to and Goal of 
knowing that specific arrangements of environmental vari- Behavior Analysis 
ables will reliably produce desired forms of behavior change. 8. The greatest potential benefits of science are derived from 

3. An experimental analysis must be performed to determine the third, and highest, level of scientific understanding, 
how a given behavior functions in relation to specific envi- which comes from establishing experimental control. 
ronmental events. 9. Experimental control is achieved when a predictable 

; j f change in behavior (the dependent variable) is reliably 
Concepts and Assumptions Underlying the Analysis of produced by the systematic manipulation of some aspect 
Behavior of the environment (the independent variable). 

4, The overall goal of science is to achieve a understanding 10. An analysis of a behavior has been achieved when a reli- 
of the phenomena under study—socially HUD O REE yy tae able functional relation between the behavior and some 
ior, in the case of applied behavior analysis. specified aspect of the environment has been demonstrated 

5. Science produces understanding at three levels: descrip- convincingly. 
tion, prediction, and control. 11. A functional analysis does not eliminate the possibility 

6. Descriptive research yields a collection of facts about the that the behavior under investigation is also a function of 


observed events—facts that can be quantified and classified. other variables. 
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12. An experiment that shows convincingly that changes in 
behavior are a function of the independent variable and not 
the result of uncontrolled or unknown variables has inter- 
nal validity. 


13. External validity refers to the degree to which a study’s 
results are generalizable to other subjects, settings, and/ 
or behaviors. 


Behavior: Defining Features and Assumptions that 
Guide Its Analysis 


14. Confounding variables exert unknown or uncontrolled 
influences on the dependent variable. 


15. Because behavior is an individual phenomenon, the 
experimental strategy of behavior analysis features 
within-subject (or single-case) methods of analysis. 


16. Because behavior is a dynamic and continuous phenom- 
enon that occurs in and changes through time, measuring 
behavior as it changes over time is a fundamental feature 
of applied behavior analysis. 


17. The assumption of determinism guides experimental 
method in behavior analysis. 


18. Experimental methods in behavior analysis also derive 
from the assumption that behavioral variability is extrinsic 
to the organism, not an inherent trait; that is, variability is 
imposed by environmental variables. 


19. Behavior analysts strive to isolate and experimentally 
manipulate the environmental factors responsible for 
variability. 


Components of Applied Behavior Analysis 
Experiments 


20. A research question specifies what the investigator wants 
to learn from conducting the experiment; all aspects of an 
experiment’s design and conduct should follow from it. 


21. Experiments in applied behavior analysis are referred to 
as single-case research designs because the experimental 
logic for detecting and analyzing behavior change employs 
each subject as his or her own control. 


22. The dependent variable in an applied behavior analysis 
experiment is a measurable dimensional quantity of the 
target behavior. 


23. Some studies include multiple dependent variables for 
three major reasons: (a) To provide additional data paths 
that serve as controls for evaluating and replicating the 
effects of an independent variable as it is applied sequen- 
tially to each behavior, (b) to assess the presence and extent 
of the independent variable’s effects on behaviors other 
than the response class to which it was applied directly, 
and (c) to determine whether changes in the behavior of a 
person other than the subject occur during the course of an 
experiment and whether such changes might in turn explain 
observed changes in the subject’s behavior. 


24. In addition to precise manipulation of the independent 
variable, the behavior analyst must hold constant all other 
aspects of the experimental setting—extraneous variables— 
to prevent unplanned environmental variation. 


25. When unplanned events or variations occur in the exper- 
imental setting, the behavior analyst must either wait out 
their effects or incorporate them into the design of the 
experiment. 


26. Observation and measurement procedures must be conducted 
in a standardized manner throughout an experiment. 


27. Because behavior is a continuous and dynamic phenome- 
non, ongoing visual inspection of the data is necessary to 
identify changes in level, trend, and/or variability as they 
develop during the course of an experiment. 


28. Changes in the independent variable are made to maxi- 
mize its effect on the target behavior. 


29. The term experimental design refers to the sequence and 
form of independent variable manipulations in a study. 


30. Although an infinite number of experimental designs are 
possible as a result of the many ways independent vari- 
ables can be manipulated and combined, an experimenter 
can make only two basic kinds of independent variable 
changes: introduce a new condition, or reinstate an old 
condition. 


31. A parametric study compares the differential effects of dif- 
ferent values of the independent variable. 


32. The fundamental rule of experimentation: change only one 
variable at a time. 


33. Rather than follow rigid, pro forma experimental designs, 
behavior analysts should select experimental tactics suited 
to the original research questions, while standing ready to 
“explore relevant variables by manipulating them in an impro- 
vised and rapidly-changing design” (Skinner, 1966, p. 21). 


Steady State Strategy and Baseline Logic 


34. Steady state responding enables a form of inductive rea- 
soning called baseline logic. Baseline logic entails three 
elements: prediction, verification, and replication. 


35. The most common method for evaluating the effects of a 
given variable is to impose it on an ongoing measure of 
behavior obtained in its absence. These preintervention 
data serve as the baseline against which any observed 
changes in behavior when the independent variable is 
introduced can be compared. 


36. A baseline condition does not necessarily mean the 
absence of instruction or treatment per se, only the absence 
of a specific independent variable of experimental interest. 


37. In addition to the primary purpose of establishing a base- 
line as an objective basis for evaluating the effects of the 
independent variable, three other reasons for baseline data 
collection are: (a) Systematic observation of the target 
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behavior prior to intervention sometimes yields information 
about antecedent-behavior-consequent correlations that may 
be useful in planning an effective intervention; (b) baseline 
data can provide valuable guidance in setting initial criteria 
for reinforcement; and (c) baseline data may reveal that the 
behavior targeted for change does not warrant intervention. 


38. Four types of baseline data patterns are stable, ascending, 
descending, and variable. 


39. The independent variable should be introduced when sta- 
ble baseline responding has been achieved. 


40. The independent variable should not be introduced when 


an ascending or descending baseline indicates improving 
performance. 


41. The independent variable should be introduced when an 
ascending or a descending baseline indicates deteriorating 
performance. 


42. An independent variable should not be imposed on a 
highly variable, unstable baseline. 


Affirmation of the Consequent 


43. Prediction of future behavior can be made when repeated 
measures of behavior under relatively constant environ- 
mental conditions show little or no variation. 


44. Practice effects refer to improvements in performance 
resulting from opportunities to emit the behavior that must 
be provided to obtain repeated measures. 


45. Extended baseline measurement is not necessary for 
behaviors that have no logical opportunity to occur. 


46. The predictive power of steady state responding enables 
a form of inductive reasoning known as affirmation of the 
consequent. 


47. Although the logic of affirmation of the consequent is 
flawed (some other event may have caused the change 
in behavior), an effective experimental design confirms 
several if-A-then-B possibilities, each one reducing the 
likelihood that factors other than the independent variable 
are responsible for the observed changes in behavior. 


48. Verification is accomplished by demonstrating that the 
prior level of baseline responding would have remained 
unchanged had the independent variable not been 
introduced. 


49. Replication within an experiment means reproducing a 
previously observed behavior change by reintroducing the 
independent variable. Replication demonstrates the reli- 
ability of the behavior change and reduces the probability 
that some factor other than the independent variable was 
responsible for the now twice-observed behavior change. 
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KEY TERMS 
A-B design 


affirmation of the consequent 


experimental design 
external validity 

ascending baseline extraneous variable 
baseline independent variable 
baseline logic internal validity 
confounding variable parametric analysis 
dependent variable practice effects 
descending baseline prediction 


experimental control research question 


MULTIPLE-CHOICE QUESTIONS 


1. Which of the following is not an assumption underlying 
the analysis of behavior? 


a. Determinism 

b. Empiricism 

c. Philosophic doubt 
d. Pseudoscience 


Hint: (See “Concepts and Assumptions Underlying the 
Analysis of Behavior”) 
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a ae 


replication 

single-case designs 
stable baseline 

steady state responding 
steady state strategy 
variable baseline 


verification 


2. The assumption that the universe is a lawful and orderly place 
and that natural phenomena occur in relation to other events 
describes which assumption underlying the analysis of behavior? 


a. Empiricism 

b. Determinism 

c. Parsimony 

d. Philosophic doubt 


Hint: (See “Concepts and Assumptions Underlying the 
Analysis of Behavior’) 
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8. An experiment has a high degree of when the 
experiment shows convincingly that changes in behavior are 


3. The three levels of scientific understanding include: 


a. Empiricism, determinism, and philosophic doubt 

b. Verification, prediction, and replication 

c. Group designs, single-subject designs, and 
correlations 

d. Description, prediction, control 

Hint: (See “Concepts and Assumptions Underlying the 

Analysis of Behavior’) 


‘ science yields a collection of facts about the observed 
events—facts that can be quantified and classified. 


generalizable to other subjects, settings, and/or behaviors. 
a. Internal validity 

b. External validity 

c. Experimental control 

d. Prediction 


Hint: (See “Experimental Control: The Path to and Goal of 
Behavior Analysis”’) 


a function of the independent variable and not the result of 
uncontrolled or unknown variables. 


a. Internal validity 
b. External validity 
c. Replication 
d. Prediction 


Hint: (See “Experimental Control: The Path to and Goal of 
Behavior Analysis’’) 


a. Replicated 9. are variables known or suspected to exert an 
Hee seca uncontrolled influence on the dependent variable. 
. Experimenta 

P " fi a. Independent variables 
c. Predictive 
eee b. Investigated variables 

. Descriptive 
. Confounding variables 
Hint: (See “Concepts and Assumptions Underlying the pray on a ee 
Analysis of Behavior’) d. Specified variables 
; Hint: (See “Experimental Control: The Path to and Goal of 
. The highest level of scientific understanding results from a ( im acta os meorirer 
ésablishing: Behavior Analysis’’) 
a. Experimental control 10. Behavior analysts believe behavioral variability is = 
wards ot to the organism. 
deer a. Internal 
c. Replication 
. b. External 
d. Analysis ee 
Hint: (See “Experimental Control: The Path to and Goal of . ea e 
Behavior Analysis”) d. Confined 
e7 OeOciavior hichern arn ore en Hint: (See “Behavior: Defining Features and Assumptions 
a reliable functional relation between the behavior PEL ECA) 
and the independent variable has been demonstrated 11. is the “hallmark of applied behavior 
convincingly. analysis.” 
a. Application a. Internal and external validity 
b. Investigation b. Replication, prediction, and control 
c. Analysis c. Single-measurement 
d. Experimental design d. Systematic repeated measurement of behavior 
Hint: (See “Experimental Control: The Path to and Goal of Hint: (See “Behavior: Defining Features and Assumptions 
Behavior Analysis”’) that Guide Its Analysis’’) 
refers to the degree to which a study’s results are 123 is the interaction between an organism and its 


environment. 

a. Measurement 

b. Independent variable 
c. Behavior 

d. Science 


Hint: (See “Behavior: Defining Features and Assumptions 
that Guide Its Analysis’’) 


13. 


14. 


15. 


ESSAY-TYPE QUESTIONS 
1. 
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Which of the following is not considered to be an essential 
component of experiments in applied behavior analysis? 
a. At least one subject 

b. At least one setting 

c. At least one large group for statistical comparisons 

d. At least one independent variable 

Hint: (See “Components of Applied Behavior Analysis 
Experiments’) 

A well-planned experiment begins with 

a. A research question 

b. Selection of participants 

c. Selection of a setting 

d. Selection of an experimental design 

Hint: (See “Components of Applied Behavior Analysis 
Experiments’) 

As a general rule, the independent variable should be intro- 
duced when baseline responding has been achieved. 
a. Variable 

b. Replicable 

c. Stable 

d. Complete 

Hint: (See “Steady State Strategy and Baseline Logic’’) 
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16. 


17. 


When the data show no evidence of an upward or downward 
trend and all the measures fall within a small range of values, 
the data pattern is referred to as 


a. Ascending 

b. Descending 

c. Level 

d. Stable 

Hint: (See “Steady State Strategy and Baseline Logic”) 
The three elements of baseline logic include the 
following: 

a. Prediction, verification, and replication 

b. Ascending, descending, and stable 

c. Internal validity, external validity, and reliability 
d 


- Dependent variable, independent variable, 
baseline 


Hint: (See “Steady State Strategy and Baseline 
Logic’) 


Discuss the basic assumption scientists hold about the 
universe. 


Hint: (See “Behavior: Defining Features and Assumptions 
That Guide Its Analysis”) 


. Discuss how experimental control, the highest level of 


scientific understanding, is achieved in a single-subject 
research design. 

Hint: (See “Experimental Control: The Path to and Goal of 
Behavior Analysis’) 


. Discuss how behavior analysts approach variability in 


their data. 


Hint: (See “Behavior: Defining Features and Assumptions 
That Guide Its Analysis”) 


. List the essential components of experiments in behavior 


analysis. 
Hint: (See “Components of Applied Behavior Analysis 
Experiments”) 


. What provides the basis for comparison in single-subject 


experimental designs? 


Hint: (See “Components of Applied Behavior Analysis 
Experiments’’) 


. Discuss two reasons an experimenter might measure 


multiple behaviors within one single-subject study. 


Hint: (See “Components of Applied Behavior Analysis 
Experiments’’) 


. Discuss the importance of the following “fundamental 


rule of experimentation” as it relates to the demonstration 
of a functional relationship between a dependent and an 
independent variable: “Change only one variable at a time.” 


Hint: (See “Intervention or Treatment: Independent 
Variable”) 


. Discuss one scientific benefit and one applied benefit of 


establishing a baseline level of responding. 


Hint: (See “Steady State Strategy and Baseline Logic’) 
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9. Draw the following from the written description provided: 


NOTES 


it. 


a. Baseline: High level, stable, no trend 

b. Baseline: Low level, stable, no trend 

c. Baseline: Slightly increasing, high variability 
d. Baseline: Decreasing variable trend 


Hint: (See “Steady State Strategy and Baseline Logic’’) 


Behavior analysis has benefited immensely from two particularly note- 
worthy contributions to the literature on experimental method: Sidman’s 
Tactics of Scientific Research (1960) and Johnston and Pennypacker’s Strat- 
egies and Tactics of Behavioral Research (1980). Both books are essential 
reading and working references for any serious student or practitioner of 
behavior analysis. 

The researcher’s audience (journal editors, reviewers, readers, and prac- 
titioners) ultimately determines whether a claimed functional relation is 
believable or convincing. We examine the believability of research findings 
further in Chapter 10. 

Some investigators also use between-group designs to produce results they 
believe will have more external validity. Group comparison and within- 
subject experimental methods are compared in Chapter 10. 

It has become commonplace to refer to the person(s) whose behavior is 
the dependent variable in an experiment as a participant, instead of the 
more traditional term, subject. We use both terms in this text and urge 
readers to consider Sidman’s (2002) perspective on the issue: “[W]e are no 
longer permitted to call our subjects ‘subjects.’ The term is supposed to be 
dehumanizing, and so we are supposed to call them ‘participants.’ I think 
this is completely misguided. Experimenters, too, are participants in their 
experiments. What does making them non-participants do to our percep- 
tion of science and of scientists? Are experimenters merely robots who 
follow prescribed and unbreakable scientific rules? Are they supposed just 
to manipulate variables and coldly record the results of their manipula- 
tions? Separating them as nonparticipating manipulators and recorders of 
the behavior of participants really dehumanizes not only experimenters but, 
along with them, the whole scientific process.” (p. 9) 

A study by Rindfuss, Al-Attrash, Morrison, and Heward (1998) provides 
a good example of the extent to which the term single-subject research 
can be a misnomer. A reversal design was used to evaluate the effects of 


10. List and describe the three components of baseline logic 


10. 


ite 


(also known as “experimental reasoning’’) used in single- 
subject research designs. 


Hint: (See “Steady State Strategy and Baseline Logic’’) 


SESE SEE EARN NNN NII IIE ETE OO 


response cards on the quiz and exam scores of 85 students in 5 eighth-grade 
American history classes. Although a large group of students participated, 
the study actually consisted of 85 individual experiments; or 1 experiment 
and 84 replications! 

Iverson (2013) points out two good reasons why N = 1 design should be 
avoided as a synonym for single-case design: (a) N = 1 clearly means 
the use of only one subject; and (b) in statistics, N denotes the number of 
measures or data points collected, not the number of subjects. Additionally, 
single-case designs should not be confused with case studies, which do 
involve just one participant and may or may not entail an experimental 
design. 


. This is the distinguishing feature of the multiple baseline across behav- 


iors design, an experimental tactic used widely in applied behavior analy- 
sis. Multiple baseline designs are described in Chapter 9. 


. This analytic tactic is known as a multiple baseline across settings design. 


Multiple baseline designs are presented in Chapter 9. 

How many different experimental designs are there? Because an experi- 
ment’s design includes careful selection and consideration of each com- 
ponent discussed here (i.e., subject, setting, behavior, etc.), not counting 
the direct replication of experiments, one could say that there are as many 
experimental designs as there are experiments. 

The applied researcher must guard very carefully against assuming auto- 
matically that unwanted variability is a function of variables beyond his 
capability or resources to isolate and control, and thus fail to pursue the 
investigation of potentially important functional relations. 

Replication also refers to the repeating of experiments to determine the 
reliability of functional relations found in previous studies and the extent 
to which those findings can be extended to other subjects, settings, and/or 
behaviors (i.e., generality or external validity). The replication of experi- 
ments is examined in Chapter 10. 
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Reversal and Multielement Designs 


LEARNING OBJECTIVES 


Analyze the effects of a variety of independent variables using two types of experimental tactics 


widely used by applied behavior analysts: the reversal design and the alternating treatments design. 
¢ Select the appropriate experimental tactic based on the research question of interest and the 


appropriateness of the design. 


° Discuss how the reversal design and the alternating treatments design and their variations 
incorporate the elements of baseline logic (prediction, verification, and replication). 


e State and describe advantages and disadvantages in using the reversal design and alternating 


treatments design. 


¢ Identify practical and ethical considerations in using the reversal design and alternating treatments design. 


his chapter describes the reversal and multielement 

designs, two single-case research tactics used widely 

by applied behavior analysts. In a reversal design, the 
effects of introducing, withdrawing (or “reversing” the focus 
of), and reintroducing an independent variable are observed 
on the target behavior. In a multielement design, two or more 
experimental treatments are rapidly alternated (or presented 
simultaneously and the participant is able to select the treat- 
ment), and the differential effects on the target behavior noted. 
We explain how each design incorporates the three elements of 
steady state strategy—prediction, verification, and replication— 
and present examples illustrating the major variations of each. 
Considerations for selecting and using reversal and multiele- 
ment designs are also presented. 


REVERSAL DESIGN 


A reversal design entails repeated measures of behavior in a 
given setting that requires at least three consecutive phases: 
(a) an initial baseline phase in which the independent variable is 
absent, (b) an intervention phase during which the independent 
variable is introduced and remains in contact with the behavior, 
and (c) a return to baseline conditions accomplished by with- 
drawal of the independent variable. In the widely used notation 
system for describing experimental designs in applied behavior 
analysis, the capital letters A and B denote the first and second 
conditions, respectively, in a study. Baseline (A) data are collected 
until steady state responding is achieved. Next, an intervention (B) 
condition is applied that signifies the presence of a treatment— 
the independent variable. An experiment entailing one reversal is 
described as an A-B-A design. Although studies using an A-B-A 
design are reported in the literature (e.g., Armendariz & Umbriet, 
1999; Christle & Schuster, 2003; Geller, Paterson, & Talbott, 
1982: Jacobson, Bushell, & Risley, 1969; Raiff & Dallery, 2010), 


the A-B-A-B design is preferred because reintroducing the B con- 
dition enables replication of treatment effects, which strengthens 
the demonstration of experimental control (see Figure 8.1). 

The A-B-A-B reversal is the most straightforward and 
generally most powerful within-subject design for demonstrat- 
ing a functional relation between an environmental manipulation 
and a behavior. When a reversal design! reveals a functional 
relation, the data show how the behavior works. 


As explanations go, the one offered by the reversal design 
was not at all a bad one. In answer to the question, “How 
does this response work?” we could point out demonstrably 
that it worked like so [e.g., see Figure 8.1]. Of course, it 
might also work in other ways; but, we would wait until 
we had seen the appropriate graphs before agreeing to any 
other way. (Baer, 1975, p. 19, words in brackets added) 


Baer’s point must not be overlooked: Showing that a behay- 
ior “works” in a predictable and reliable way in the presence 
and absence of a given variable provides only one answer to the 
question, How does this behavior work? There may be (and quite 
likely are) other controlling variables for the targeted response 
class. Whether additional experimentation exploring those other 
possibilities is necessary or desirable depends on the social and 
scientific importance of obtaining a more complete analysis. 


Operation and Logic of the Reversal Design 


Risley (2005) described the rationale and operation of the rever- 
sal design as follows: 


The reversal or ABAB design that Wolf reinvented from 
Claude Bernard’s early examples in experimental medi- 
cine entailed establishing a baseline of repeated quantified 
observations sufficient to see a trend and forecast that trend 
into the near future (A); to then alter conditions and see 
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Behavior 


Figure 8.1 Graphic prototype of an A-B-A-B 
reversal design (hypothetical data). 


if the repeated observations become different than they 
were forecast to be (B); to then change back and see if the 
repeated observations return to confirm the original fore- 
cast (A); and, finally, to reintroduce the altered conditions 
and see if the repeated observations again become different 
than forecast (B). (pp. 280-281)? 


Because the reversal design was used in Chapter 7 to illus- 
trate baseline logic, a brief review of the roles of prediction, ver- 
ification, and replication in the reversal design will suffice here. 
Figure 8.2 shows the same data from Figure 8.1 with the addition 
of open data points representing predicted measures of behav- 
ior if conditions in the previous phase had remained unchanged. 
After a stable pattern of responding, or counter-therapeutic trend, 
is obtained in Baseline 1, the independent variable is introduced. 
In our hypothetical experiment, the measures obtained during 
Treatment 1, when compared with those from Baseline | and with 
the measures predicted by Baseline 1, show that behavior change 
occurred and that the change in behavior coincided with the inter- 
vention. After steady state responding is attained in Treatment 1, 
the independent variable is withdrawn and baseline conditions 
re-established. If the level of responding in Baseline 2 parallels 
or closely approximates the measures obtained during Baseline 
1, verification of the prediction made from Baseline 1 data is 
obtained. Stated otherwise, had the intervention not been intro- 
duced and had the initial baseline conditions remained in effect, 
the predicted data path would have appeared as shown in Base- 
line 2. When withdrawal of the independent variable results in a 
reversal of the behavior change associated with its introduction, 


Baseline 1 


Figure 8.2 Illustration of baseline logic in an 
A-B-A-B reversal design (hypothetical data). 
Prediction: Open data points (in clear boxes) 
represent predicted measures if conditions 
from previous phases remained in effect. 
Verification: Baseline 2 data (lightly shaded 
box) verify the prediction from Baseline 1. 
Replication: Treatment 2 data (cross-hatch 


Behavior 


Baseline 1 


Treatment 2 


PFT 


Treatment 1 Baseline 2 


ie eS 


Time 


a case builds that the intervention is responsible for the observed 
behavior change. If reintroduction of the independent variable 
in Treatment 2 reproduces the behavior change observed during 
Treatment 1, replication of effect has been achieved, and a func- 
tional relation demonstrated. Stated in other terms, had the inter- 
vention continued and had the second baseline condition not been 
introduced, the predicted data path of the treatment would have 
appeared as shown in Treatment 2. 

Lambert, Cartledge, Lo, and Heward (2006) used an 
A-B-A-B design to evaluate the effects of response cards on 
disruptive behavior and academic responding of students during 
math lessons in two fourth-grade classrooms. During the base- 
line (A) condition, single-student responding (SSR), the teacher 
called on one student who had raised a hand to answer the 
question. During the treatment (B) condition, response cards (RC), 
each student could respond to every teacher-posed question by 
writing an answer on a white laminated board. Figure 8.3 shows 
the results for disruptive behavior by four students targeted in 
one of the two classrooms. The data reveal a clear functional 
relation between the use of response cards and substantial reduc- 
tions in disruptive behavior. The A-B-A-B design revealed an 
equally clear functional relation between response cards and 
increased academic responding. 

In the 1960s and early 1970s, applied behavior analysts 
relied almost exclusively on the A-B-A-B reversal design. The 
straightforward A-B-A-B design played such a dominant role 
in the early years of applied behavior analysis that it came to 
symbolize the field (Baer, 1975). This was no doubt due, at least 
in part, to the reversal design’s potential to expose variables 
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Figure 1. Number of disruptive behaviors during single-student responding (SSR) and response card (RC) 
conditions for the target students in Classroom A. Breaks in data points represent student absences. 


for what they are—strong and reliable or weak and unstable. 
Another reason for the reversal design’s early dominance may 
have been that few alternative analytic tactics were available that 
effectively combined the intrasubject experimental elements of 
prediction, verification, and replication. 

Although the reversal design is just one of many experi- 
mental designs available to applied behavior analysts today, the 
simple, unadorned A-B-A-B design continues to play a major 
role in the behavior analysis literature (e.g., Beavers, Iwata, & 
Gregory, 2014; Bicard, Ervin, Bicard, & Baylot-Casey 2012; 
Hine, Ardoin, & Foster, 2015; Kamps, Conklin, & Wills, 2015; 
Krentz, Miltenberger, & Valbuena, 2016). Additional examples 
of the reversal design can be viewed in studies described in this 
text: Anderson and Long (2002, Figure 24.2); Austin and Bevan 
(2011, Figure 25.10); Cammilleri and Hanley (2005, Figure 
30.14); Cowdery, Iwata, and Pace (1990, Figure 25.6); Deaver, 
Miltenberger, and Stricker (2001, Figure 24.3); Hall et al. (1971, 
Figure 14.1); Kuhn, Lerman, Vorndran, and Addison (2006, 
Figure 23.13); and Lindberg, Iwata, Kahng, and DeLeon (1999, 
Figure 25.9). 
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Figure 8.3 Example of an A-B-A-B reversal 
design. 


Based on “Effects of Response Cards on Disruptive 
Behavior and Academic Responding by Fourth-Grade 
Urban Students” by M. C. Lambert, G. Cartledge, 

Y. Lo, and W. L. Heward, 2006, Journal of Positive 
Behavioral Interventions, 8, p. 93. Copyright 2006 by 
Sage Publishing. 
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Variations of the A-B-A-B Design 


Many applied behavior analysis studies use variations or exten- 
sions of the A-B-A-B design. 


Repeated Reversals 


Perhaps the most obvious variation of the A-B-A-B reversal 
design is a simple extension in which the independent variable 
is withdrawn and reintroduced once again (A-B-A-B-A-B). 
Each additional presentation and withdrawal that reproduces 
the previously observed effects on behavior increases the 
likelihood that the behavior changes result from manipulat- 
ing the independent variable. All other things being equal, an 
experiment that incorporates multiple reversals presents a more 
convincing and compelling demonstration of a functional rela- 
tion than does an experiment with one reversal (e.g., Krentz 
et al., 2016 [Figure 6.2]; Steege et al., 1990). That said, it is 
also possible to reach a point of redundancy beyond which the 
findings of a given analysis are no longer enhanced significantly 
by additional reversals. 
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B-A-B and B-A-B-A Designs 


The B-A-B design begins with the application of the indepen- 
dent variable: the treatment. After stable responding has been 
achieved during the initial treatment phase (B), the independent 
variable is withdrawn. If the behavior worsens in the absence 
of the independent variable (the A condition), the treatment 
variable is reintroduced in an attempt to recapture the level of 
responding obtained during the first treatment phase, which 
would verify the prediction based on the data path obtained 
during the initial treatment phase. 

Practitioners may prefer the B-A-B design to the A-B-A 
design in that the study ends with the treatment variable in 
effect. However, in terms of demonstrating a functional relation, 
the B-A-B design is the weaker of the two because it does not 
enable an assessment of the effects of the independent variable 
on the preintervention level of responding. The nonintervention 
(A) condition in a B-A-B design cannot verify a prediction of 
a previous nonexistent baseline. Practitioners can remedy this 
weakness by withdrawing and then reintroducing the inde- 
pendent variable, as in a B-A-B-A-B design (e.g., Dixon, 
Benedict, & Larson, 2001 [see Figure 25.1]). 

Because the B-A-B design provides no data to determine 
whether the measures of behavior taken during the A condition 
represent preintervention performance, sequence effects cannot 
be ruled out: The level of behavior observed during the A con- 
dition may have been influenced by the fact that the treatment 
condition preceded it. Nevertheless, there are exigent situations 
in which initial baseline data cannot be collected. For instance, 
the B-A-B design may be appropriate with target behaviors that 
result in physical harm or danger to the participant or to others. 


In such instances, withholding a possibly effective treatment 
until a stable pattern of baseline responding can be obtained may 
present an ethical problem. For example, Murphey, Ruprecht, 
Baggio, and Nunes (1979) used a B-A-B design to evaluate 
the effectiveness of mild punishment combined with reinforce- 
ment on the number of self-choking responses by a 24-year- 
old man with profound intellectual disabilities. After treatment 
was in effect for 24 sessions, it was withdrawn for 3 sessions, 
during which an immediate and large increase in self-choking 
was recorded (see Figure 8.4). Reintroduction of the treatment 
package reproduced behavior levels observed during the first 
treatment phase. The mean number of self-chokes during each 
phase of the B-A-B study was 22, 265, and 24, respectively. 
Despite the impressive reduction of behavior, the results 
of Murphey and colleagues’ (1979) B-A-B design study may 
have been enhanced by gathering and reporting objectively mea- 
sured data on the level of behavior prior to the first interven- 
tion. Presumably, they chose not to collect an initial baseline for 
ethical and practical reasons. Murphey and colleagues reported 
anecdotally that self-chokes averaged 434 per day immediately 
prior to their intervention when school staff had used a different 
procedure to reduce the self-injurious behavior. This anecdotal 
information increased the believability of the functional relation 
suggested by the experimental data from the B-A-B design. 
Other situations in which a B-A-B or B-A-B-A design 
might be warranted include (a) when a treatment is already in 
place (e.g., Marholin, Touchette, & Stuart, 1979; Pace & Troyer, 
2000), (b) when “baseline levels of performance are obvious 
because the behavior has never been performed (e.g., exercise 
for many of us, practicing a musical instrument, reading, eating 
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Figure 8.4 Example of a B-A-B reversal design. 50 
Based on “The Use of Mild Punishment in Combination with 
Reinforcement of Alternate Behaviors to Reduce the Self- 25 
Injurious Behavior of a Profoundly Retarded Individual” by 
R. J. Murphy, M. J. Ruprecht, P. Baggio, and D. L. Nunes, 0 
1979, AAESPH Review, 4, p. 191. Copyright 1979 by the 
AAESPH Review. 
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healthful foods)” (Kazdin, 2011, p. 134), (c) when the behay- 
ior analyst has limited time in which to demonstrate practical 
and socially significant results (e.g., Robinson et al., 1981), and 
(d) when the client or significant others want an intervention to 
begin immediately (e.g., Kazdin & Polster, 1973). 


Multiple Treatment Reversal Designs 


Experiments that use the reversal design to compare the effects 
of two or more experimental conditions to baseline and/or to one 
another are said to use a multiple treatment reversal design. 
The letters C, D, and so on, denote when additional conditions 
were in effect during the study, as in the following designs: A-B- 
A-C-B-C (Dickman, Bright, Montgomery, & Miguel, 2012); 
A-B-C-A-C (Kamps et al., 2015); A-B-C-A-C-B-C (Falcomata, 
Roane, Hovanetz, Kettering, & Keeney (2004); A-B-A-B-C- 
A-C-B (Hansen & Wills, 2014); and A-B-A-C-A-D-A-E-A-F 
(Wilson & Tunnard, 2014). 

These designs are considered variations of the reversal 
design because they embody the experimental method and logic 
of the reversal tactic: Responding in each phase provides base- 
line (or control condition) data for the subsequent phase (pre- 
diction), independent variables are withdrawn in an attempt to 
reproduce levels of behavior observed in a previous condition 
(verification), and each independent variable that contributes 
fully to the analysis is introduced at least twice (replication). 
Independent variables can be introduced, withdrawn, changed 
in value, combined, and otherwise manipulated to produce an 
endless variety of experimental designs. 

For example, Kennedy and Souza (1995) used an A-B- 
C-B-C-A-C-A-C design to analyze and compare the effects of 
two kinds of competing sources of stimulation on eye poking 
by a 19-year-old student with profound disabilities. Geoff had 
a 12-year history of poking his forefinger into his eyes during 
periods of inactivity, such as after lunch or while waiting for 
the bus. The two treatment conditions were music (B) and a 
video game (C). During the music condition, Geoff was given 
a Sony Walkman radio with headphones. The radio was tuned 
to a station that his teacher and family thought he preferred. 
Geoff had continuous access to the music during this condition, 
and he could remove the headphones at any time. During the 
video game condition, Geoff was given a small handheld video 
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game on which he could observe a variety of visual patterns and 
images on the screen with no sound. As with the music condi- 
tion, Geoff had continuous access to the video game and could 
discontinue using it at any time. 

Figure 8.5 shows the results of the study. Following an 
initial baseline phase (A) in which Geoff averaged 4 eye pokes 
per hour, the music condition (B) was introduced and eye pokes 
decreased to a mean of 2.8 per hour. The video game (C) was 
implemented next, and eye pokes decreased further to 1.1 per 
hour. Measures obtained during the next two phases—a reintro- 
duction of music (B) followed by a second phase of the video 
game (C)—replicated previous levels of responding under each 
condition. This B-C-B-C portion of the experiment revealed a 
functional relation between video game condition and lower fre- 
quency of eye pokes compared to music. The final five phases of 
the experiment (C-A-C-A-C) provided an experimental compar- 
ison of the video game and baseline (no-treatment) condition. 

In most instances, extended designs involving multiple 
independent variables are not preplanned. Instead of following 
a predetermined structure that dictates when and how experi- 
mental manipulations will be made, the behavior analyst makes 
design decisions based on ongoing assessment of the data. For 
example, Cummings and Carr (2005) used an A-B-C-A-D- 
BC-A-E-F-A-BF-BCDEF design to evaluate the effects of five 
interventions that resulted in a treatment package (the BCDEF 
condition) successful in eliminating the joint dislocation behav- 
ior of a child with autism. 


[A] single experiment may be viewed as a number of 
successive designs that are collectively necessary to clarify 
relations between independent and dependent variables. 
Thus, some design decisions might be made in response 

to the data unfolding as the investigation progresses. 

This sense of design encourages the experimenter to pursue 
in more dynamic fashion the solutions to problems of 
experimental control immediately upon their emergence. 
(Johnston & Pennypacker, 1980, pp. 250-251) 


Students of applied behavior analysis should not interpret 
this description of experimental design as a recommendation for 
a free-form approach to the manipulation of independent vari- 
ables. The researcher must always obey the rule of changing only 
one variable at a time and must understand the opportunities for 


Figure 8.5 Example of a multiple treatment reversal 
design (A-B-C-B-C-A-C-A-C). 

From “Functional Analysis and Treatment of Eye Poking,” by 

C. H. Kennedy and G. Souza, 1995, Journal of Applied Behavior 
Analysis, 28, p. 33. Copyright 1995 by the Society for the Experimental 
Analysis of Behavior, Inc. Reprinted by permission. 
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legitimate comparisons and the limitations that a given sequence 
of manipulations places on the conclusions that can be drawn 
from the results. 

Experiments using a reversal design to compare two or 
more treatments are vulnerable to confounding by sequence 
effects. Sequence effects are the effects on a subject’s behavior 
in a given condition that are influenced by the subject’s expe- 
rience with a prior condition. For example, the analyst must 
use Caution in interpreting results from the A-B-C-B-C design 
that is produced by the following fairly common sequence of 
events in practice: After baseline (A), an initial treatment (B) 
is implemented and little or no behavioral improvements are 
noted. A second treatment (C) is next tried, and the behavior 
improves. A reversal is then conducted by reintroducing the first 
treatment (B), followed by reinstatement of the second treatment 
(C) (e.g., Foxx & Shapiro, 1978). In this case, we can only speak 
knowingly about the effects of C when it follows B. Recapturing 
the original baseline levels of responding before introducing the 
second treatment condition (i.e., an A-B-A-C-A-C sequence) 
reduces the threat of sequence effects (or helps to expose them 
for what they are). 

An A-B-A-B-C-B-C design, for instance, enables direct 
comparisons of B to A and C to B, but not of C to A. An exper- 
imental design consisting of A-B-A-B-B+C-B-B+C (e.g., 
Jason & Liotta, 1982) permits an evaluation of the additive or 
interactive effects of B+C, but does not reveal the independent 
contribution of C. And in both of these examples, it is impos- 
sible to determine what effects, if any, C may have had on the 
behavior if it had been implemented prior to B. Manipulating 
each condition so that it precedes and follows every other con- 
dition in the experiment (e.g., A-B-A-B-C-B-C-A-C-A-C) is 
the only way to know for sure. However, manipulating multiple 


Figure 8.6 Reversal design using noncontingent 
reinforcement (NCR) as a control technique. 


From “Recent Examples of Behavior Modification in Pre-school 
Settings,” by D. M. Baer and M. M. Wolf, in Behavior Modification 
in Clinical Psychology, pp. 14-15, edited by C. Neuringer and 

J. L. Michael, 1970, Upper Saddle River, NJ: Prentice Hall. Copyright 
1970 by Prentice Hall. Adapted by permission. 
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conditions requires significant time and resources, and such 
extended designs become more susceptible to confounding by 
maturation and other historical variables not controlled by the 
experimenter. 


Noncontingent Reinforcement (NCR) 
Reversal Technique 


With interventions based on positive reinforcement, it can be 
hypothesized that observed improvements in behavior result 
from the participant’s feeling better about himself because of the 
enriched environment created by the reinforcement, not because 
specific instances of the target behavior have been immediately 
followed by contingent reinforcement. This hypothesis is most 
often advanced when interventions consisting of social reinforce- 
ment are involved. For example, a person may claim that it doesn’t 
matter how the teacher’s praise and attention were given; the stu- 
dent’s behavior improved because the praise and attention created 
a warm and supporting environment. If, however, the behavioral 
improvements observed during a contingent reinforcement con- 
dition are lost during a condition in which equal amounts of the 
same consequence are delivered independent of the occurrence of 
the target behavior, a functional relation between the reinforce- 
ment contingency and behavior change may be demonstrated. 

This experimental control tactic, called noncontingent 
reinforcement (NCR) reversal technique, can show that 
behavior change is the result of contingent reinforcement, not 
simply the presentation of or contact with the stimulus event 
(Thompson & [wata, 2005). A study by Baer and Wolf (1970a) 
on the effects of teachers’ social reinforcement on the coopera- 
tive play of a preschool child provides an excellent example of 
the NCR reversal technique (Figure 8.6). The authors described 
the use and purpose of the design as follows: 
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[The teachers first collected] baselines of cooperative and 
other related behaviors of the child, and of their own inter- 
action with the child. Ten days of observation indicated that 
the child spent about 50% of each day in proximity with 
other children (meaning within 3 feet of them indoors, or 

6 feet outdoors). Despite this frequent proximity, however, 
the child spent only about 2% of her day in cooperative 
play with these children. The teachers, it was found, inter- 
acted with this girl about 20% of the day, not all of it pleas- 
ant. The teachers, therefore, set up a period of intense social 
reinforcement, offered not for cooperative play but free 

of any response requirement at all: the teachers took turns 
standing near the girl, attending closely to her activities, 
offering her materials, and smiling and laughing with her in 
a happy and admiring manner. The results of 7 days of this 
noncontingent extravagance of social reinforcement were 
straightforward: the child’s cooperative play changed not at 
all, despite the fact that the other children of the group were 
greatly attracted to the scene, offering the child nearly dou- 
ble the chance to interact with them cooperatively. These 

7 days having produced no useful change, the teachers then 
began their planned reinforcement of cooperative behav- 
ior. .. . Contingent social reinforcement, used in amounts 
less than half that given during the noncontingent period, 
increased the child’s cooperative play from its usual 2% to 
a high of 40% in the course of 12 days of reinforcement. At 
that point, in the interests of certainty, the teachers discon- 
tinued contingent reinforcement in favor of noncontingent. 
In the course of 4 days, they lost virtually all of the coop- 
erative behavior they had gained during the reinforcement 
period of the study, the child showing about a 5% average 
of cooperative play over that period of time. Naturally, 

the study concluded with a return to the contingent use 

of social reinforcement, a recovery of desirable levels of 
cooperative play, and a gradual reduction of the teacher’s 
role in maintaining that behavior. (pp. 14-15) 


Using NCR as a control condition to demonstrate a func- 
tional relation is advantageous when it is not possible or appro- 
priate to eliminate completely the event or activity used as a 
contingent reinforcement. For example, Lattal (1969) employed 
NCR as acontrol condition to “reverse” the effects of swimming 
as reinforcement for tooth brushing by children in a summer 
camp. In the contingent reinforcement condition, the campers 
could go swimming only if they had brushed their teeth; in the 
NCR condition, swimming was available whether or not tooth 
brushing occurred. The campers brushed their teeth more often 
in the contingent reinforcement condition. 

The usual procedure is to deliver NCR on a fixed or 
variable time schedule independent of the subject’s behavior. 
A potential weakness of the NCR control procedure becomes 
apparent when high rate of the desired behavior has been pro- 
duced during the preceding contingent reinforcement phase. It is 
probable in such situations that at least some instances of NCR, 
delivered according to a predetermined time schedule, will 
follow occurrences of the target behavior closely in time, and 
thereby function as adventitious, or accidental, reinforcement 
(Thompson & Iwata, 2005). In fact, an intermittent schedule of 
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reinforcement might be created inadvertently that results in even 
higher levels of performance than those obtained under con- 
tingent reinforcement. (Intermittent schedules of reinforcement 
and their effects are described in Chapter 13). In such cases the 
investigator might consider using one of the two control tech- 
niques described next, both of which involve “reversing” the 
behavioral focus of the contingency.” 


DRO Reversal Technique 


One way to ensure that reinforcement will not immediately 
foliow the target behavior is to deliver reinforcement immedi- 
ately following the subject’s performance of any behavior other 
than the target behavior. With a DRO reversal technique, the 
control condition consists of delivering the event suspected of 
functioning as reinforcement following the emission of any 
behavior other than the target behavior (e.g., Baer, Peterson, & 
Sherman, 1967 Osborne, 1969; Poulson, 1983). For example, 
Reynolds and Risley (1968) used contingent teacher attention to 
increase the frequency of talking in a 4-year-old girl enrolled in 
a preschool program for disadvantaged children. After a period 
of teacher attention contingent on verbalization, in which the 
girl’s talking increased from a baseline average of 11% of the 
intervals observed to 75%, a DRO condition was implemented 
during which the teachers attended to the girl for any behavior 
except talking. During the 6 days of DRO, the girl’s verbal- 
ization dropped to 6%. Teacher attention was then delivered 
contingent on talking, and the girl’s verbalization “immediately 
increased to an average of 51%” (p. 259). 


DRI/DRA Reversal Technique 


During the control condition ina DRI/DRA reversal technique, 
occurrences of a specified behavior that is either incompatible 
with the target behavior (i.e., the two behaviors cannot possi- 
bly be emitted at the same time) or an alternative to the target 
behavior are immediately followed by the same consequence 
previously delivered as contingent reinforcement for the target 
behavior. Goetz and Baer’s (1973) investigation of the effects of 
teacher praise on preschool children’s creative play with building 
blocks illustrates the use of a DRI control condition. Figure 8.7 
shows the number of different block forms (e.g., arch, tower, 
roof, ramp) constructed by the three children who participated in 
the study. During baseline (data points indicated by the letter N), 
“the teacher sat by the child as she built with the blocks, watching 
closely but quietly, displaying neither criticism nor enthusiasm 
about any particular use of the blocks” (p. 212). During the next 
phase (the D data points), “the teacher remarked with interest, 
enthusiasm, and delight every time that the child placed and/or 
rearranged the blocks so as to create a form that had not appeared 
previously in that session’s construction(s). . . . “Oh, that’s very 
nice—that’s different!’” (p. 212). Then, after increasing form 
diversity was clearly established, instead of merely withdrawing 
verbal praise and returning to the initial baseline condition, the 
teacher provided descriptive praise only when the children had 
constructed the same forms (the S data points). “Thus, for the 
next two to four sessions, the teacher continued to display inter- 
est, enthusiasm, and delight, but only at those times when the 
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Figure 8.7 Reversal design using a DRI control technique. 


Based on “Social Control of Form Diversity and the Emergence of New 
Forms in Children’s Blockbuilding” by E. M. Goetz and D. M. Baer, 1973, 
Journal of Applied Behavior Analysis, 6, p. 213. Copyright 1973 by the 
Society for the Experimental Analysis of Behavior, Inc. 


child placed and/or rearranged a block so as to create a repetition 
of a form already apparent in that session’s construction(s). . . . 
Thus, no first usage of a form in a session was reinforced, but 
every second usage of that form and every usage thereafter 
within the session was. . . . ‘How nice—another arch!’” (p. 212). 
The final phase of the experiment entailed a return to descriptive 
praise for different forms. Results show that the form diversity 
of children’s block building was a function of teacher praise and 
comments. The DRI reversal tactic allowed Goetz and Baer to 
determine that it was not just the delivery of teacher praise and 
comment that resulted in more creative block building by the 
children; the praise and attention had to be contingent on differ- 
ent forms to produce increasing form diversity.* 


Considering the Appropriateness of 
the Reversal Design 


The primary advantage of the reversal design is its ability to 
provide a clear demonstration of the existence (or absence) of 
a functional relation between the independent and dependent 


variables. An investigator who reliably turns the target behavior 
on and off by presenting and withdrawing a specific variable 
makes a clear and convincing demonstration of experimental 
control. In addition, the reversal design enables quantification of 
the amount of behavior change over the preintervention level of 
responding. And the return to baseline provides information on 
the need to program for maintenance. Furthermore, a complete 
A-B-A-B design ends with the treatment condition in place.” 
In spite of its strengths as a tool for analysis, the reversal 
design entails some potential scientific and social disadvantages 
that should be considered prior to its use. The considerations are 
of two types: irreversibility, which affects the scientific utility 
of the design; and the social, educational, and ethical concerns 
related to withdrawing a seemingly effective intervention. 


Irreversibility: A Scientific Consideration 


A reversal design is not appropriate in evaluating the effects of a 
treatment variable that, by its very nature, cannot be withdrawn 
once it has been presented. Although independent variables 
involving reinforcement and punishment contingencies can 
be manipulated with some certainty—the experimenter either 
presents or withholds the contingency—an independent variable 
such as providing information or modeling, once presented, can- 
not simply be removed. For example, a reversal design would 
not be an effective element of an experiment investigating the 
effects of attending an in-service training workshop for teachers 
during which participants observed a master teacher use contin- 
gent praise and attention with students. After the participants 
have listened to the rationale for using contingent praise and 
attention and observed the master teacher model it, the expo- 
sure provided by that experience could not be withdrawn. Such 
interventions are said to be irreversible. 

Irreversibility of the dependent variable must also be 
considered in determining whether a reversal would be an 
effective analytic tactic. Behavioral irreversibility means 
that a level of behavior observed in an earlier phase cannot be 
reproduced even though the experimental conditions are the 
same as they were during the earlier phase (Sidman, 1960). 
Once improved, many target behaviors of interest to the 
applied behavior analyst remain at their newly enhanced level 
even when the intervention responsible for the behavior change 
is removed. From a clinical or educational standpoint, such a 
state of affairs is desirable: The behavior change is shown to be 
durable, persisting even in the absence of continued treatment. 
For example, Kuoch and Mirenda (2003) reported irreversibil- 
ity as a positive outcome of their study evaluating the effects 
of social stories on problem behavior. Problem behavior by 
three children with autism decreased from initial baseline lev- 
els during a social stories intervention and remained low when 
the intervention was withdrawn, a pattern of results the authors 
described as “short-term maintenance of behavioral reduction 
once the social stories were discontinued” (p. 224). However, 
irreversibility is a problem if demonstration of the independent 
variable’s role in the behavior change depends on verification 
by recapturing baseline levels of responding. 

For example, baseline observations might reveal very low, 
almost nonexisient, rates of talking and social interaction for 
a young child. An intervention consisting of teacher-delivered 


social reinforcement for talking and interacting could be imple- 
mented, and after some time the girl might talk to and interact 
with her peers at a frequency and in a manner similar to that of 
her classmates. The independent variable, teacher-delivered rein- 
forcement, could be terminated in an effort to recapture baseline 
rates of talking and interacting. But the girl might continue to 
talk to and interact with her classmates even though the inter- 
vention, which may have been responsible for the initial change 
in her behavior, is withdrawn. In this case a source of reinforce- 
ment uncontrolled by the experimenter—the girl’s classmates 
talking to and playing with her as a consequence of her increased 
talking and interacting with them—could maintain high rates of 
behavior after the teacher-delivered reinforcement is no longer 
provided. In such instances of irreversibility, an A-B-A-B design 
would fail to reveal a functional relation between the indepen- 
dent variable and the target behavior. 

Nonetheless, one of the major objectives of applied 
behavior analysis is establishing socially important behav- 
ior through experimental treatments so that the behavior will 
contact natural “communities of reinforcement” to maintain 
behavioral improvements in the absence of treatment (Baer & 
Wolf, 1970b). When irreversibility is suspected or apparent, in 
addition to considering DRO or DRI/DRA conditions as con- 
trol techniques, investigators can consider other experimental 
tactics, most notably the multiple baseline designs described 
in Chapter 9. A multiple treatment reversal design requires a 
reversible dependent variable. If the initial treatment condition 
in a multiple treatment reversal design changes an irreversible 
behavior, the opportunity to assess and compare the effects of 
any other treatment conditions is lost. 


Withdrawing an Effective Intervention: 
A Social, Educational, and Ethical Consideration 


Although it can yield an unambiguous demonstration of exper- 
imental control, withdrawing a seemingly effective interven- 
tion to evaluate its role in behavior change presents a legitimate 
cause for concern. One must question the appropriateness of any 
procedure that allows (indeed, seeks) an improved behavior to 
deteriorate to baseline levels of responding. Various concerns 
have been voiced over this fundamental feature of the reversal 
design. Although considerable overlap exists among the con- 
cerns, they can be classified as having primarily a social, edu- 
cational, or ethical basis. 


Social Concerns. Applied behavior analysis is, by definition, 
a social enterprise. Behaviors are selected, defined, observed, 
measured, and modified by and for people. Sometimes the peo- 
ple involved in an applied behavior analysis—administrators, 
teachers, parents, and participants—object to the withdrawal of 
an intervention they associate with desirable behavior change. 
Even though a reversal may provide the most unqualified picture 
of the behavior—environment relation under study, it may not 
be the analytic tactic of choice because key participants do not 
want the intervention to be withdrawn. When a reversal design 
offers the best experimental approach scientifically and poses no 
ethical problems, the behavior analyst may choose to explain the 
operation and purpose of the tactic to those who do not favor it. 
But it is unwise to attempt a reversal without the full support of 
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the people involved, especially those who will be responsible 
for withdrawing the intervention. Without their cooperation the 
procedural integrity of the experiment could easily be compro- 
mised. For example, people who are against the withdrawal of 
treatment might sabotage the return to baseline conditions by 
implementing the intervention, or at least those parts of it they 
consider the most important. 


Educational and Clinical Issues. Educational or clinical issues 
concerning the reversal design are often raised in terms of 
instructional time lost during the reversal phases, as well as the 
possibility that the behavioral improvements observed during 
intervention may not be recaptured when treatment is resumed 
after a return to baseline conditions. Prolonged reversals are 
neither justifiable nor necessary (Stolz, 1978). If preinterven- 
tion levels of responding are reached quickly, reversal phases 
can be quite short in duration. Sometimes only three or four 
sessions are needed to show that initial baseline rates have been 
reproduced (e.g., Ashbaugh & Peck, 1998; Cowdery et al., 1990 
[Figure 25.6]; Hall et al., 1971, Figure 14.1). Two or three brief 
reversals can provide an extremely convincing demonstration 
of experimental control. Concern that the improved levels of 
behavior will not return when the treatment variable is reintro- 
duced, while understandable, has not been supported by empir- 
ical evidence. Hundreds of published studies have shown that 
behavior acquired under a given set of environmental conditions 
can be reacquired rapidly during subsequent reapplication of 
those conditions. 


Ethical Concerns. A serious ethical concern must be addressed 
when the use of a reversal design is considered for evaluating a 
treatment for self-injurious or dangerous behaviors. With mild 
self-injurious or aggressive behaviors, short reversal phases con- 
sisting of one or two baseline probes can sometimes provide 
the empirical evidence needed to reveal a functional relation 
(e.g., Akers, Higbee, Pollard, Pellegrino, & Gerencser, 2016; 
Kelley, Jarvie, Middlebrook, McNeer, & Drabman, 1984; Luce, 
Delquadri, & Hall, 1980; Murphey et al., 1979 [Figure 8.4]). 
For example, in their study evaluating a treatment for elope- 
ment (i.e., running away from supervision) by a child with 
attention-deficit hyperactivity disorder (ADHD), Kodak, Grow, 
and Northup (2004) returned to baseline conditions for a single 
session (see Figure 8.8). 

Nonetheless, with some behaviors it may be determined 
that withdrawing an intervention associated with improvement 
for even a few one-session probes would be unethical. In such 
cases experimental designs that do not rely on the reversal tactic 
must be used. 


MULTIELEMENT DESIGN 


An important and frequently asked question by teachers, ther- 
apists, and others who are responsible for changing behavior is 
this: Which of these treatments will be most effective with this 
student or client? In many situations, the research literature, the 
analyst’s experience, and/or logical extensions of the principles 
of behavior point to several possible interventions. Determining 
which of several possible treatments or combination of treat- 
ments will produce the greatest improvement in behavior is a 
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primary task for applied behavior analysts. As described earlier, 
although a multiple treatment reversal design (e.g., A-B-C-B-C) 
can be used to compare the effects of two or more treatments, 
such designs have some inherent limitations. Because the dif- 
ferent treatments in a multiple treatment reversal design are 
implemented during separate phases that occur in a particular 
order, the design is particularly vulnerable to confounding by 
sequence effects (e.g., Treatment C may have produced its effect 
only because it followed Treatment B). A second disadvantage 
of using the reversal tactic to compare multiple treatments is 
the extended time required to demonstrate differential effects. 
Most behaviors targeted for change by teachers and therapists 
are selected because those behaviors need immediate improve- 
ment. An experimental design that will quickly reveal the 
most effective treatment among several possible approaches is 
important for the applied behavior analyst. The multielement 
design (also called alternating treatments design) is an exper- 
imentally sound and efficient method for comparing the effects 
of two or more treatments. 


Operation and Logic of the Multielement Design 


The multielement design is characterized by the rapid alterna- 
tion of two or more distinct treatments (i.e., independent vari- 
ables) while their effects on the target behavior (i.e., dependent 
variable) are measured. In contrast to the reversal design, in 
which experimental manipulations are made after steady state 
responding is achieved in a given phase of an experiment, the 
different interventions in a multielement design are manipu- 
lated independent of the level of responding. To aid the sub- 
ject’s discriminating which treatment condition is in effect at 
any given time, a distinct stimulus (e.g., a sign, verbal instruc- 
tions, different-colored worksheets) is often associated with 
each treatment. Barlow and Hayes (1979) introduced the term 
alternating treatments design and described it this way: 


The data are plotted separately for each intervention to 
provide a ready visual representation of the effects of each 
treatment. Because confounding factors such as time of 
administration have been neutralized (presumably) by 
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counterbalancing, and because the two treatments are read- 
ily discriminable by subjects through instructions or other 
discriminative stimuli, differences in the individual plots of 
behavior change corresponding with each treatment should 
be attributable to the treatment itself, allowing a direct 
comparison between two (or more) treatments. (p. 200) 


Figure 8.9 shows a graphic prototype of an alternat- 
ing treatments design comparing the effects of two treat- 
ments, A and B, on some response measure. In an alternating 
treatments design, the different treatments can be alternated 
in a variety of ways. For example, the treatments might be 
(a) alternated across daily sessions, one treatment in effect each 
day; (b) administered in separate sessions occurring within the 
same day; or (c) implemented each during a portion of the same 
session. Counterbalancing the days of the week, times of day, 
sequence in which the different treatments occur (e.g., first or 
second each day), persons delivering the different treatments, 
and so forth, reduces the probability that any observed dif- 
ferences in behavior are the result of variables other than the 
treatments themselves. For example, assume that Treatments 
A and B in Figure 8.9 were each administered for a single 
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Figure 8.9 Graphic prototype of a multielement design 
comparing the differential effects of two treatments (A and B). 


30-minute session each day, with the daily sequence of the two 
treatments determined by a coin flip. 

The data points in Figure 8.9 are plotted on the horizontal 
axis to reflect the actual sequence of treatments each day. Thus, 
the horizontal axis is labeled Sessions, and each consecutive pair 
of sessions occurred on a single day. Some published reports of 
experiments using an alternating treatments design in which two 
or more treatments were presented each day (or session) plot the 
measures obtained during each treatment above the same point 
on the horizontal axis, thus implying that the treatments were 
administered simultaneously. This practice masks the temporal 
order of events and has the unfortunate consequence of mak- 
ing it difficult for the researcher or reader to discover potential 
sequence effects. 

The three components of steady state strategy—prediction, 
verification, and replication—are found in the alternating treat- 
ments design. However, each component is not readily identified 
with a separate phase of the design. In an alternating treatments 
design, each successive data point for a specific treatment 
plays all three roles: It provides (a) a basis for the prediction of 
future levels of responding under that treatment, (b) potential 
verification of the previous prediction of performance under that 
treatment, and (c) the opportunity for replication of previous 
effects produced by that treatment. 

To see this logic unfold, the reader should place a piece 
of paper over all the data points in Figure 8.9 except those 
for the first five sessions of each treatment. The visible por- 
tions of the data paths provide the basis for predicting future 
performance under each respective treatment. Moving the 
paper to the right reveals the two data points for the next day, 
each of which provides a degree of verification of the previ- 
ous predictions. As more data are recorded, the predictions of 
given levels of responding within each treatment are further 
strengthened by continued verification (if those additional 
data conform to the same level and/or trend as their predeces- 
sors). Replication occurs each time Treatment A is reinstated 
and measurement reveals responding similar to previous 
Treatment A measures and different from those obtained when 
Treatment B is in effect. Likewise, another mini-replication 
is achieved each time a reintroduction of Treatment B results 
in measures similar to previous Treatment B measures and 
different from Treatment A levels of responding. A consistent 
sequence of verification and replication is evidence of exper- 
imental control and strengthens the investigator’s confidence 
of a functional relation between the two treatments and dif- 
ferent levels of responding. 

The presence and degree of experimental control in a 
multielement design is determined by visual inspection of the 
differences between (or among) the data paths representing the 
different treatments. Inspecting multielement design data should 
include visual analysis of mean shift in level, trend, and vari- 
ability throughout the comparison phase(s) of the experiment 
(Diller, Barry, & Gelino, 2016). Experimental control is defined 
in this instance as objective, believable evidence that different 
levels of responding (and perhaps degrees of trends and vari- 
ability) are predictably and reliably produced by the presence 
of the different treatments. When the data paths for two treat- 
ments show no overlap with each other and either stable levels or 
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opposing trends, a clear demonstration of experimental control 
has been made. Such is the case in Figure 8.9, in which there 
is no overlap of data paths and the picture of differential effects 
is clear. When some overlap of data paths occurs, a degree of 
experimental control over the target behavior can still be demon- 
strated if the majority of data points for a given treatment fall 
outside the range of values of the majority of data points for the 
contrasting treatment. 

The extent of any differential effects produced by 
two treatments is determined by the vertical distance—or 
fractionation—between their respective data paths and quanti- 
fied by the vertical axis scale. The greater the vertical distance, 
the greater the differential effects of the two treatments on the 
response measure. It is possible for experimental control to be 
shown between two treatments but for the amount of behavior 
change to be socially insignificant. For instance, experimental 
control may be demonstrated for a treatment that reduces a 
person’s severe self-injurious behavior from 10 occurrences 
per hour to 2 per hour, but the participant is still engaged in 
self-mutilation. However, if the vertical axis is scaled mean- 
ingfully, the greater the separation of data paths on the vertical 
axis, the higher the likelihood that the difference represents a 
socially significant effect. 

Data from an experiment that compared the effects of two 
types of group-contingent rewards on the spelling accuracy of 
fourth-grade underachievers (Morgan, 1978) illustrate how the 
alternating treatments design reveals experimental control and the 
quantification of differential effects. The six children in the study 
were divided into two equally skilled teams of three on the basis 
of pretest scores. Each day during the study the students took a 
five-word spelling test. The students received a list of the words 
the day before, and a 5-minute study period was provided just 
prior to the test. Three different conditions were used in the alter- 
nating treatments design: (a) no game, in which the spelling tests 
were graded immediately and returned to the students, and the 
next scheduled activity in the school day began; (b) game, in 
which test papers were graded immediately, and each member 
of the team who had attained the highest total score received 
a Certificate of Achievement and was allowed to stand up and 
cheer; and (c) game plus, consisting of the same procedure as 
the game condition, plus each student on the winning team also 
received a small trinket (e.g., a sticker or pencil). 

The results for Student 3 (see Figure 8.10) show that 
experimental control over spelling accuracy was obtained 
between the no-game condition and both the game and the 
game-plus conditions. Only the first two no-game data points 
overlap the lower range of scores obtained during the game 
or the game-plus conditions. However, the data paths for the 
game and game-plus conditions overlap completely and con- 
tinuously throughout the study, revealing no difference in spell- 
ing accuracy between the two treatments. The vertical distance 
between the data paths represents the amount of improvement in 
spelling accuracy between the no-game condition and the game 
and the game-plus conditions. The mean difference between 
the two game conditions and the no-game condition was two 
words per test. Whether such a difference represents a signifi- 
cant improvement is an educational question, not a mathematical 
or Statistical one, but most educators and parents would agree 
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Figure 8.10 A multielement design comparing the 
effects of three different treatments on the spelling 
accuracy of a fourth-grade student. 

Based on Comparison of Two “Good Behavior Game” Group 
Contingencies on the Spelling Accuracy of Fourth-Grade Students 
by Q. E. Morgan, 1978, unpublished master’s thesis, The Ohio 
State University. 


Number of Words 
Spelled Correctly 


that an increase of two words spelled correctly out of five is 
socially significant, especially if that gain can be sustained from 
week to week. The cumulative effect over a 180-day school 
year would be impressive. There was virtually no difference in 
Student 3’s spelling performance between the game and game- 
plus conditions. However, even a larger mean difference would 
not have contributed to the conclusions of the study because 
of the lack of experimental control between the game and the 
game-plus treatments. 

Student 6 earned consistently higher spelling scores in 
the game-plus condition than he did in the game or no-game 
conditions (see Figure 8.11). Experimental control was demon- 
strated between the game-plus and the other two treatments for 
Student 6, but not between the no-game and game conditions. 
Again, the difference in responding between treatments is quan- 
tified by the vertical distance between the data paths. In this case 
there was a mean difference of 1.55 correctly spelled words per 
test between the game-plus and no-game conditions. 

Figures 8.10 and 8.11 illustrate two other important points 
about the multielement design. First, the two graphs show how 
the multielement design enables a quick comparison of inter- 
ventions. After 20 sessions the teacher had sufficient empirical 
evidence for selecting the most effective consequences for each 
student. If only two conditions had been compared, even fewer 
sessions may have been required to identify the most effective 
intervention. Second, these data underscore the importance of 
evaluating treatment effects at the level of the individual subject. 


Figure 8.11 A multielement design comparing the 
effects of three different treatments on the spelling 
accuracy of a fourth-grade student. 


Based on Comparison of Two “Good Behavior Game” Group 
Contingencies on the Spelling Accuracy of Fourth-Grade Students 
by Q. E. Morgan, 1978, unpublished master’s thesis, The Ohio 
State University. 
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All six children spelled more words correctly under one or both 
of the game conditions than they did under the no-game con- 
dition. However, Student 3’s spelling accuracy was equally 
enhanced by either the game or the game-plus contingency, 
whereas Student 6’s spelling scores improved only when a tan- 
gible reward was available. 


Variations of the Multielement Design 


The multielement design can be used to compare one or more 
treatments to a no-treatment or baseline condition, assess the rel- 
ative contributions of individual components of a package inter- 
vention, and perform parametric investigations in which different 
values of an independent variable are alternated to determine 
differential effects on behavior change. The following sections 
describe the most common variations of the multielement design. 


Single-phase Multielement Design 


A single-phase experiment in which the effects of two or more 
treatment conditions are compared is the most basic application of 
the multielement design (e.g., Barbetta, Heron, & Heward, 1993; 
McNeish, Heron, & Okyere, 1992; Morton, Heward, & Alber, 
1998). For example, Brady and Kubina (2010) compared the 
multiplication fact fluency of three elementary-age students with 
attention-deficit hyperactivity disorder (ADHD) under two prac- 
tice procedures: whole time practice trials and endurance building 
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practice trials. Each procedure lasted 2.5 minutes and consisted 
of students writing as many answers to single-digit multiplica- 
tion math fact sheet assessments as they could for a combined 
total of 2 minutes and receiving corrective feedback from the 
experimenter for a combined total of 30 seconds. The procedures 
differed in how time for writing answers and corrective feedback 
was distributed. The whole time practice condition consisted of 
one |-minute practice trial, 30 seconds of corrective feedback, and 
a 1-minute multiplication math fact sheet assessment. The endur- 
ance building practice condition consisted of three 20-second 
practice trials interspersed with 10-second corrective feedback, 
and a I!-minute multiplication math fact sheet assessment. The 
two practice conditions were alternated daily. 

Figure 8.12 shows the results on the daily multiplication 
math fact sheet assessments for one of the three participants. 
Brady and Kubina (2010) reported that as participants’ “per- 
formance levels for the three 20 second intervals improved so 
too did their ability to perform over the longer time interval 
of | minute” (p. 90). Although practice time equaled 1 minute 
in both conditions, the students practiced 30% more problems 
during the endurance building practice trials condition than they 
did in the whole time practice trial condition. 

A no-treatment condition is often incorporated into the 
design as one of the treatments to be compared. For example, 
the no-game condition in the Morgan (1978) study served as a 
no-treatment control condition against which the students’ spelling 
scores in the game and game-plus conditions were compared (see 
Figures 8.10 and 8.11). Including a no-treatment control condi- 
tion as one of the conditions in a multielement design provides 
valuable information on any differences in responding under the 
intervention treatment(s) and no treatment. However, the measures 
obtained during the no-treatment control condition should not be 
considered representative of an unknown baseline level of respond- 
ing. It may be that the measures obtained in the no-treatment 
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condition represent only the level of behavior under a no-treat- 
ment condition when it is interspersed within an ongoing series of 
treatment condition(s), and do not represent the level of behavior 
that existed before the alternating treatments design was begun. 

The single-phase multielement design is the primary 
experimental tool for conducting functional analyses of problem 
behavior (Chapter 27). Figure 6.5 and Figures 27.7, 27.8, 25.10, 
and 27.11 in this text show examples of functional analyses 
using multielement designs. 


Two-phase Design: Baseline and Comparison 


Investigators often use a two-phase experimental design in which 
baseline measures are collected until a stable level of responding 
or counter-therapeutic trend is obtained prior to the alternating 
treatments phase (e.g., Kranak, Alber-Morgan, & Sawyer, 2017; 
Martens, Lochner, & Kelly, 1992 [see Figure 13.6]; Neef, 
Iwata, & Page, 1980 [see Figure 6.12]). The baseline condition 
may be continued during the treatments comparison phase as a 
no-treatment control condition. 

Wilder, Atwell, and Wine (2006) used a multielement 
design with an initial baseline phase to evaluate the relative 
effectiveness of three levels of treatment integrity on compli- 
ance by typically developing preschool children with instruc- 
tions not usually complied with by the children: “Give me the 
[snack item],” “Put the toy away,” and “Come here.” Each 
session the therapist presented one of the three instructions 10 
times at a rate of once per minute. Compliance, defined as doing 
what the therapist instructed within 10 seconds, resulted in brief 
praise from the therapist throughout the study. During base- 
line, noncompliance produced no response from the therapist. 
During the alternating treatments phase, the therapist responded 
to noncompliance by implementing a three-step prompting pro- 
cedure for 100% of trials for one instruction (e.g. “Give me the 
[snack item]’’), 50% of trials for another instruction (e.g., “Put 


Figure 8.12 A multielement design 
comparing two treatment conditions. 
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the toy away”), and 0% of trials for the third instruction (e.g., 
“Come here”). The 0% integrity condition was identical to base- 
line and served as a no-treatment control condition. 

Figure 8.13 shows the results for the two children who 
participated in the study. Each child complied most often with 
the instruction for which the therapist implemented the prompt- 
ing intervention with 100% integrity, complied at a higher than 
baseline level with the instruction associated with 50% treatment 
integrity, and seldom or never complied with the instruction 
associated with 0% integrity (i.e., baseline conditions). By asso- 
ciating the 100% integrity condition with a different instruction 
for each child, the experimenters strengthened the replication of 
the functional relation between the 100% integrity condition and 
the highest level of compliance. 

Sindelar, Rosenberg, and Wilson (1985) described a vari- 
ation of the multielement design for comparing the efficiency 
of instructional procedures they called the adapted alternating 
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Figure 1. Percentage of trials with compliance across baseline and three-step 
prompting integrity levels (i.e., 100%, 50%, and 0%). 


Figure 8.13 Multielement design with an initial baseline. 


Based on “The Effects of Varying Levels of Treatment Integrity on Child 
Compliance During Treatment with a Three-Step Prompting Proce- 
dure” by D. A., Wilder, J. Atwell, and B. Wine, 2006, Journal of Applied 
Behavior Analysis, 39, p. 372. Copyright 2006 by the Society for the 
Experimental Analysis of Behavior, Inc. 


treatments design. The comparison phase of the design features 
the alternating application of two (usually) or more different teach- 
ing methods, each method applied to different but equivalent sets 
of instructional items. All items are topographically different mem- 
bers of the same response or skill class, such as reading printed 
words, defining vocabulary terms, spelling words, answering 
math problems, and stating history facts. Sindelar and colleagues 
described the design’s operation and purpose as follows: 


A baseline in which the equivalence of performance on the 
two sets is demonstrated is followed by an experimental 
condition in which acquisition of one set taught by one 
method is compared to the acquisition of the second set 
taught by the second method. Differences are demonstrated 
when acquisition of one set is more rapid than acquisition 
of the other, and the effect is consistent across subjects, set- 
tings, or behaviors. (p. 70) 


Viadescu and Kodak (2013) used an adapted alternat- 
ing treatments design to analyze the extent and speed with 
which preschool children with autism would acquire second- 
ary learning targets as a function of presenting information 
on secondary learning targets during the antecedent or conse- 
quence portion of learning trials for primary targets. Pretests 
determined an individualized pool of unknown targets—nam- 
ing items on picture cards or responding to fill-in-the-blank 
statements (“The opposite of hot is ____ ?””) for each par- 
ticipant. The targets were assigned to different sets, each set 
representing an experimental condition (i.e., teaching method) 
distinguished by whether and when information on secondary 
targets was presented. The experimenters equated the sets by 
considering the number of syllables contained in responses to 
the targets and putting targets that sounded similar in different 
sets. In the antecedent condition, the experimenter presented 
information on a secondary target (e.g., held up a picture and 
said, “This is a seal.”), waited 3 seconds, and then presented 
the primary target (e.g., held up a picture of a lion and asked, 
“What is it?’””). In the consequence condition, the experimenter 
presented information on a secondary target in the same way 
immediately after delivering reinforcement for the participant’s 
response to the primary target. An error correction procedure 
ensured that primary target trials ended with a correct response 
by the participant. In the secondary-targets-only condition, the 
experimenter presented information on secondary targets in 
the absence of teaching primary targets. In the primary-targets- 
only condition, no secondary target information was provided. 
Participants’ acquisition of secondary-target information was 
measured by probes, conducted every one to three training ses- 
sions in other conditions. Participants received no feedback or 
reinforcement for responses to secondary target presentations 
in any condition. 

Figure 8.14 shows the results for one of the four partici- 
pants. Rick acquired the primary targets in the antecedent, con- 
sequence, and primary-targets-only conditions (top panel) in 
five, six, and five sessions, respectively. He correctly echoed the 
secondary targets on 100% of opportunities during the anteced- 
ent, consequence, and secondary-targets-only conditions. Probes 
showed that Rick mastered the secondary targets during the 
training of primary targets, and he did so in a similar number of 
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Figure 8.14 Adapted alternating treatments design. 


Based on “Increasing Instructional Efficiency by Presenting Additional 
Stimuli in Learning Trials for Children with Autism Spectrum Disorders” 
by J. C. Viadescu and T. M. Kodak, 2013, Journal of Applied Behavior 
Analysis, 46, p. 812. Copyright 2013 by the Society for the Experimental 
Analysis of Behavior, Inc. 


sessions as the primary targets (bottom panel). Like Rick, two of 


the three other children in the study mastered secondary targets 
without explicit instruction. That is, when information on sec- 
ondary targets was presented in the antecedent or consequence 
condition, the children learned double the number of targets in 
a similar amount of training time compared to the condition in 
which only the primary targets were presented. 


Three-phase Design: Baseline, Comparison, 
and Best Treatment 


Another widely used variation of the alternating treatments design 
consists of three sequential phases: an initial baseline phase, a 
second phase comparing two or more alternating treatments 
(one of which may be the baseline control condition), and a final 
phase in which only the most effective treatment is administered 
(e.g., Heckaman, Alber, Hooper, & Heward, 1998; Mechling, 
2006; Ollendick, Matson, Esvelt-Dawson, & Shapiro, 1980). 
Tincani (2004) used such a design to investigate the rela- 
tive effectiveness of sign language and picture exchange training 
on the acquisition of mands (requests for preferred items) by two 
children with autism.° A related research question was whether 
a relation existed between students’ pre-existing motor imitation 
skills and their abilities to learn mands through sign language or 
by picture exchange. Two assessments were conducted for each 
student prior to baseline. A stimulus preference assessment was 
conducted to identify a list of 10 to 12 preferred items (e.g., 
drinks, edibles, toys), and each student’s ability to imitate 27 
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hand, arm, and finger movements similar to those required for 
sign language was assessed.’ 

The purpose of baseline was to ensure that the participants 
were not able to request preferred items with picture exchange, si gn 
language, or speech prior to training. Baseline trials consisted of 
giving the student 10 to 20 seconds of noncontingent access to a 
preferred item, removing the item briefly, and then placing it out of 
the student’s reach. A laminated 2-inch-by-2-inch picture of the item 
was placed in front of the student. If the student placed the picture 
symbol in the experimenter’s hand, signed the name of the item, or 
spoke the name of the item within 10 seconds, the experimenter pro- 
vided access to the item. If not, the item was removed and the next 
item on the list was presented. Following a three-session baseline, 
during which neither participant emitted an independent mand in 
any modality, the alternating treatments phase was begun. 

The sign language training procedures were adapted from 
Sundberg and Partington’s (1998) Teaching Language to Children 
with Autism or Other Developmental Disabilities. The simplest 
sign from American Sign Language for each item was taught. 
Procedures used in the Picture Exchange Communication System 
(PECS) training condition were adapted from Bondy and Frost’s 
(2002) The Picture Exchange Communication System Training 
Manual. In both conditions, training on each preferred item con- 
tinued for five to seven trials per session, or until the participant 
showed no interest in the item. At that time, training then began 
on the next item and continued until all 10 or 12 items on the par- 
ticipant’s list of preferred items had been presented. During the 
study’s final phase, each participant received either sign language 
or PECS training only, depending on which method had been 
most successful during the alternating treatments phase. 

The percentage of independent mands by the two stu- 
dents throughout the study is shown in Figures 8.15 (Jennifer) 
and 8.16 (Carl). Picture exchange training was clearly more 
effective than sign language for Jennifer. Jennifer demonstrated 
weak motor imitation skills in the pre-baseline assessment, cor- 
rectly imitating 20% of the motor movements attempted in the 
pre-baseline imitation assessment. After a slight modification in 
sign language training procedures was implemented to eliminate 
Carl’s prompt dependency, he emitted independent mands more 
often during sign language training than with picture exchange 
training. Carl’s pre-existing motor imitation skills were better 
than Jennifer’s. He imitated correctly 43% of the attempted 
motor movements in the pre-baseline imitation assessment. 

This study highlights the importance of individual analy- 
ses and exploring the possible influence of variables not manip- 
ulated during the study. In discussing the study’s results, Tincani 
(2004) noted that 


For learners without hand-motor imitation skills, including 
many children with autism, PECS training may be more 
appropriate, at least in terms of initial mand acquisition. 
Jennifer had weak hand-motor imitation skills prior to 
intervention and learned picture exchange more rapidly 
than sign language. For learners who have moderate 
hand-motor imitation skills, sign language training may be 
equally, if not more, appropriate. Carl had moderate hand- 
motor imitation skills prior to intervention and learned sign 
language more rapidly than picture exchange. (p. 160) 
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Figure 8.15 Example of a multielement 
design with initial baseline and a final 
best-treatment-only condition. 
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Children with Autism” by M. Tincani, 2004, Focus 
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Three-phase Design: Baseline, Comparison, and 
Participant’s Choice 


Another three-phase variation of the multielement design 
entails an initial baseline, a comparison phase in which two or 
more treatment conditions (one of which may be the baseline 
control condition) are alternated, and a final “choice” phase 
in which only the treatment selected by each participant is 
implemented. 

Jowett Hirst, Dozier, and Payne (2016) used this sequence 
of phases in two experiments assessing (a) the relative effects of 
the differential reinforcement of alternative behavior (DRA) and 
response cost (RC) procedures on the on-task behavior of 
typically developing preschool children and (b) the children’s 
preferences for the procedures.® In Experiment I, conducted in 
the context of small-group activities, both DRA and RC 
increased on-task behavior for the majority of children and five 
of six children chose RC more often than DRA. Prior to each 
session in Experiment II, conducted in the context of individual 
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work tasks (e.g., worksheets with printed letters and shapes for 
tracing), the experimenter explained the rules and contingencies 
that would be in effect: 


Baseline. “Today you get the white board, and there are 
no tokens. When we start, you can either work on trac- 
ing or play with toys. If you are working (i.e., tracing), 
nothing will happen. If you are not working, nothing will 
happen.” (p. 338) 

Differential reinforcement of alternative behavior. 
“Today you get the green board, and it doesn’t have 
any tokens on it. When we start, you can either work on 
tracing or play with toys. If you are working, you will 
get a token; if you are not working, you will not geta 
token. At the end, you can trade your tokens for prizes 
and snacks. If you don’t have any tokens, you don’t get 
anything.” (pp. 338) 

Response cost. “Today you get the red board, and it 
has 10 tokens on it. When we start, you can either work on 


tracing or play with toys. If you are working, you will keep 

your tokens; if you are not working, you will lose tokens. 

At the end,. ..” [same as DRA instructions] (p. 338) 
Choice. The experimenter placed the stimuli 

(i.e., poster and token boards) associated with each type of 

condition (i.e., baseline, RC, and DRA) near the participant 

and reminded him or her of the contingencies associated 

with each set of materials... . [T]he experimenter 

asked the participant to pick (by pointing to or touching 

a set of materials) which session he or she wanted to 

do. (pp. 338-339) 


On 10 occasions (at intervals ranging from 15 to 45 seconds) 
during each 5-minute session, except for baseline sessions, the 
experimenter delivered (DRA) or removed (RC) a token per the 
condition in effect. Figure 8.17 shows the results for 6 of the 
10 children who participated in the multielement comparison 
of DRA and RC in Experiment II. All children displayed mod- 
erate and variable or extremely low levels of on-task behavior 
in baseline. As in Experiment I, both DRA and RC resulted in 
roughly equal levels of increased on-task behavior during the 
comparison phase (Adam was the lone exception), and those 
levels were maintained during the choice phase. Two partici- 
pants selected DRA for each session of the choice phase, two 
selected RC exclusively, and six switched their selections. 


Concurrent Chains (or Schedule) Design 


A design in which participants choose between two or more con- 
currently available treatments is termed a concurrent schedule 
or concurrent chains design.” Participants are presented with 
two or more response options; each option is associated with a 
distinctive discriminative stimulus and leads to a different set 
of treatment procedures. Hanley, Piazza, Fisher, and Maglieri 
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(2005) used a concurrent chains design to assess clients’ pref- 
erence for problem behavior treatments. Participants were two 
children who had been admitted to an inpatient unit that special- 
ized in treating severe problem behavior. A functional analysis 
demonstrated that both children engaged in problem behavior 
(hitting, biting, head butting, throwing objects at people) most 
often in the attention condition (the therapist provided brief 
reprimand following each occurrence of problem behavior). 
The researchers then evaluated (with reversal and multielement 
designs) the effects of several function-based treatments for each 
participant. Functional communication training (FCT) was mod- 
erately effective in reducing problem behavior for both children, 
and FCT plus punishment was more effective in reducing prob- 
lem behavior for both children than FCT alone. 

With FCT, each child was trained to emit an alterna- 
tive response (saying “attention please” or “excuse me” for 
Betty; handing the therapist a card on which the word “play” 
was printed for Jay). Emitting the alternative behavior resulted 
in 20 seconds of attention from the therapist. Occurrences 
of problem behavior were followed by no differential conse- 
quence (i.e., extinction). Procedures for FCT plus punishment 
were identical to FCT except that each occurrence of problem 
behavior involved “ta 30-s hands-down procedure for Jay (the 
therapist stood behind Jay and held his hands to his sides) and 
a 30-s hands-down and visual-screen procedure for Betty (the 
therapist stood behind the child and placed one arm around the 
child’s arms while placing the other hand over the child’s eyes).” 
(Hanley et al., 2005, p. 56). 

The researchers then used a concurrent-chains strategy 
to evaluate each child’s relative preference for various treat- 
ments. Prior to entering the therapy room during each session 
of this analysis, each child was presented with three different- 
colored microswitches, each switch corresponding to one of 
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three treatment procedures: FCT, FCT with punishment, and 
punishment only. Pressing one of the switches (the initial links 
in the chained schedule) resulted in a 2-minute period inside the 
therapy room (terminal links in the chain) during which the child 
experienced the contingencies corresponding to the switch he or 
she had selected. The FCT and FCT plus punishment conditions 
were carried out in the same manner as the treatment evaluation. 
When the punishment-only condition was selected, problem 
behavior resulted in a 30-seconds hands-down procedure for Jay 
or a 30-seconds hands-down and visual-screen procedure for 
Betty. Prior to using switch pressing to evaluate treatment pref- 
erence, each child was prompted to push the different switches 
and was exposed to the different contingencies associated with 
each switch (four exposure sessions with Jay, two with Betty). 

During treatment preference sessions, after each two- 
link chain (switch-press followed by 2 minutes of the selected 
contingencies for problem behavior), the child then exited the 
therapy room and was repositioned in front of the switches. The 
procedure was repeated until 20 minutes had elapsed for Jay; 
or until 10 initial-link responses were recorded for Betty. Han- 
ley and colleagues (2005) described the results of the treatment 
preference evaluation (see Figure 8.18): 


Jay allocated the majority (79.3%) of his switch presses to 
the FCT plus punishment intervention. A preference for 
FCT plus punishment emerged during Betty’s evaluation, 
in that all switch presses were allocated to FCT plus pun- 
ishment in the final five sessions. These data provide direct 
evidence that the function-based treatments involving pun- 
ishment contingencies were more preferred (reinforcing) 
than similar function-based treatments involving extinction 
or interventions consisting of punishment alone. (p. 61) 


Readers should not view the variations of the multielement 
design described here as a catalog of possible and appropriate 
applications of this experimental tactic. In countless other ways, 
a multielement/alternating treatments tactic can be incorporated 
within an experimental design employing one or more reversal 
to baseline phases, multiple baselines, and changing criterion 
designs (e.g., Cariveau, Kodak, & Campbell, 2016; Coon & 
Miguel, 2012; Saini, Gregory, Uran, & Fantetti, 2015). 


Advantages of the Multielement Design 


The multielement design offers numerous advantages for eval- 
uating and comparing two or more independent variables. 
Most of the benefits cited here were described by Ulman and 
Sulzer-Azaroff (1975), who first brought the rationale and possi- 
bilities of the multielement design to the attention of the applied 
behavior analysis community. 


Treatment Withdrawal Not Required 


A major advantage of the alternating treatments design is that 
it does not require the investigator to withdraw a seemingly 
effective treatment to demonstrate a functional relation. Revers- 
ing behavioral improvements raises ethical issues that can be 
avoided with the alternating treatments design. Regardless of 
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Figure 8.18 Concurrent chains design, participant’s 
choice phase. 


Based on “On the effectiveness of and preference for punishment and 
extinction components of function-based interventions” by G. P. Hanley, 
C. C. Piazza, W. W. Fisher, and K. A. Maglieri, 2005, Journal of Applied 
Behavior Analysis, 38, p. 60. Copyright 2005 by the Society for the Experi- 
mental Analysis of Behavior, Inc. Reprinted by permission. 


ethical concerns, however, administrators and teachers may 
be more likely to accept a multielement design over a reversal 
design even when one of the alternating treatments is a no-treat- 
ment control condition. “It would appear that a return to baseline 
conditions every other day or every third day is not as disagree- 
able to a teacher as is first establishing a high level of desirable 
behavior for a prolonged period, and then reinstating the baseline 
behaviors” (Ulman & Sulzer-Azaroff, 1975, p. 385). 


Allows Rapid Comparison 


The experimental comparison of two or more treatments can 
often be made quickly with the multielement design. The abil- 
ity of this design to rapidly demonstrate differential effects is a 
major reason it is the basic experimental tactic used in functional 
analysis of problem behavior (see Chapter 27). 

When the effects of different treatments become apparent 
early in a multielement design, the investigator can then switch 
to programming only the most effective treatment. The effi- 
ciency of the multielement design can leave a researcher with 
meaningful data even when an experiment must be terminated 
early. A reversal or multiple baseline design, in contrast, must 
be carried through to completion to show a functional relation. 

Researchers and practitioners must guard against judging 
that a functional relation exists between outcomes associated 
with two or more conditions in a multielement design when no 


such relation exists. Such false positives are highly likely when 
only one or two measures are taken per condition. For example, 
Kahng and Iwata (1999) found that false positives were obtained 
more often in brief multielement designs than in multielement 
designs with three or more sessions in each condition. 

Equally important is discovering ineffective treatments. 
For example, multielement designs enabled researchers to 
demonstrate that colored overlays did not improve reading flu- 
ency of children diagnosed with dyslexia (Freeze Denton & 
Meindle, 2016); and that weighted vests did not increase appro- 
priate in-seat behaviors of elementary-age students with autism 
and severe to profound intellectual disabilities (Cox, Gast, 
Luscre, & Ayers, 2009). 


Minimizes Irreversibility Problem 


Some behaviors, even though they have been brought about or 
modified by an intervention, do not return to baseline levels 
when the intervention is withdrawn and thereby resist analysis 
with an A-B-A-B design. However, rapidly alternating treatment 
and no-treatment (baseline) conditions may reveal differences 
in responding between the two conditions, especially early in 
an experiment before responding in the no-treatment condition 
begins to approximate the level of responding in the treatment 
condition. 


Minimizes Sequence Effects 


A properly conducted multielement design minimizes the extent 
to which sequence effects confound the experiment’s results. 
Sequence effects pose a threat to the internal validity of any 
experiment, but especially to studies involving multiple treat- 
ments. The concern over sequence effects can be summed up by 
this simple question: Would the results have been the same had 
the sequence of treatments been different? Sequence effects can 
be extremely difficult to control in experiments using reversal or 
multiple baseline tactics (see Chapter 9) to compare two or more 
independent variables because each experimental condition 
must remain in effect for a fairly long period, thereby produc- 
ing a specific sequence of events. However, in a multielement 
design, independent variables can be rapidly alternated in a 
random fashion that produces no particular sequence. Also, 
each treatment is in effect for short periods, reducing the like- 
lihood of carryover effects (O’Brien, 1968). The multielement 
design’s ability to minimize sequence makes it a powerful tool 
for achieving complex behavior analyses. 


Accommodates Unstable Data 


Determining functional behavior—environment relations in 
the presence of unstable data presents a serious problem for 
the applied behavior analyst. Using steady state respond- 
ing to predict, verify, and replicate behavioral changes is 
the foundation of experimental reasoning in behavior anal- 
ysis (Sidman, 1960). However, obtaining stable baseline 
responding with many socially important behaviors of inter- 
est to applied behavior analysts is extremely difficult. Merely 
providing a subject with repeated opportunities to emit a 
target response can result in gradually improved performance. 
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Although practice effects are worthy of empirical investiga- 
tion because of their applied and scientific importance, the 
unstable baselines they create pose problems for the analysis 
of intervention variables. The changing levels of task diffi- 
culty inherent in moving through a curriculum of progres- 
sively more complex material also make obtaining steady 
state responding for many academic behaviors difficult. 

Because the different treatment conditions are alternated 
rapidly in a multielement design, because each condition is pre- 
sented many times throughout each time period encompassed by 
the study, and because no single condition is present for any con- 
siderable length of time, it can be presumed that any effects of 
practice, change in task difficulty, maturation, or other historical 
variables are equally represented in each treatment condition and 
therefore will not differentially affect any one condition more 
or less than the others. For example, even though each of two 
data paths representing a student’s reading performance under 
two different teaching procedures shows variable and ascending 
trends that might be due to practice effects and uneven curric- 
ulum materials, any consistent separation and vertical distance 
between the data paths can be attributed to differences in the 
teaching procedures. 


Reveals Generalization of Effects 


By alternating various conditions of interest, an experimenter 
can continually assess the degree of generalization of behavior 
change from an effective treatment to other conditions of inter- 
est. For example, by alternating different therapists in the final 
phase of their study of pica behavior, Singh and Winton (1985) 
demonstrated the extent to which the overcorrection treatment 
was effective when presented by different persons. 


Intervention Can Begin Immediately 


Although determining the preintervention level of responding 
is generally preferred, the clinical necessity of immediately 
attempting to change some behaviors precludes repeated mea- 
surement in the absence of intervention. When necessary, a 
multielement design can begin without an initial baseline phase. 


Considering the Appropriateness of the 
Multielement Design 


The advantages of the multielement design are significant. 
As with any experimental tactic, however, the multielement 
design presents certain disadvantages and leaves unanswered 
certain questions that can be addressed only by additional 
experimentation. 


Multiple Treatment Interference 


The fundamental feature of the multielement design is the rapid 
alternation of two or more independent variables irrespective 
of the behavioral measures obtained under each treatment. 
Although the rapid alternation minimizes sequence effects and 
reduces the time required to compare treatments, it raises the 
important question of whether the effects observed under any 
of the alternated treatments would be the same if each treatment 
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were implemented alone. Multiple treatment interference 
refers to the confounding effects of one treatment on a subject’s 
behavior being influenced by the effects of another treatment 
administered in the same study. 

Multiple treatment interference must always be suspected 
in the multielement design. However, by following the alternat- 
ing treatments phase with a phase in which only one treatment 
is in effect, the experimenter can assess the effects of that treat- 
ment when administered in isolation. 


Unnatural Nature of Rapidly Alternating Treatments 


The rapid back-and-forth switching of treatments does not reflect 
the typical manner in which clinical and educational interventions 
are applied. From an instructional perspective, rapid switching of 
treatments can be viewed as artificial and undesirable. In most 
instances, however, the quick comparison of treatments offered 
by the multielement design compensates for its contrived nature. 
The concern of whether participants might suffer detrimental 
effects from the rapid alternation of conditions is an empirical 
question that can be determined only by experimentation. Also, 
it is helpful for practitioners to remember that one purpose of the 
multielement design is to identify an effective intervention as 
quickly as possible so that the participant does not have to endure 
ineffective instructional approaches or treatments that would 
delay progress toward educational or clinical goals. On balance, 
the advantages of rapidly switching treatments to identify effica- 
cious interventions will usually outweigh any undesirable effects 
that such manipulation may cause. 


Limited Capacity 

Although the multielement design enables an elegant, scientifi- 
cally sound method for comparing the differential effects of two 
or more treatments, it is not an open-ended design with which an 
unlimited number of treatments can be compared. Although mul- 
tielement designs with up to five conditions have been reported 
(e.g., Didden, Prinsen, & Sigafoos, 2000), in most situations a 
maximum of four different conditions (one of which may be 
a no-treatment control condition) can be compared effectively 
within a given phase of an alternating treatments design, and in 
many instances only two different treatments can be accommo- 
dated. To separate the effects of each treatment condition from 
effects that may be caused by aspects of the alternating treatments 
design, each treatment must be carefully counterbalanced across 
all potentially relevant aspects of its administration (e.g., time of 
day, order of presentation, settings, therapists). In many applied 
settings, the logistics of counterbalancing more than two or three 
treatments would be overly cumbersome and require too many 
sessions to complete the experiment. Also, too many compet- 
ing treatments might hamper the subject’s ability to discriminate 
between treatments, thereby reducing the design’s effectiveness. 


Selection of Treatments 


Theoretically, although a multielement design can be used to 
compare the effects of any two discrete treatments, in reality the 
design is more limited. To enhance the probability of discrimina- 
tion between conditions, and thereby the possibility of obtaining 
reliable, measurable differences in behavior, the treatments should 
embody significant differences from one to the other. For example, 
an investigator using a multielement design to study the effects of 
group size on students’ academic performance during instruction 
might include conditions of 4-student, 10-student, and 20-student 
groups. Alternating conditions of 6-, 7-, and 8-student groups is 
less likely to reveal differences between group size and perfor- 
mance. However, a treatment condition should not be selected for 
inclusion in a multielement design only because it might yield a 
data path easily differentiated from that of another condition. 

The applied in applied behavior analysis encompasses the 
nature of treatment conditions as well as the nature of behav- 
iors investigated (Wolf, 1978). An important consideration in 
selecting treatment conditions should be the extent to which they 
represent current practices or practices that could conceivably be 
implemented. Although an experiment comparing the effects of 5, 
10, and 20 minutes of math homework per school night on math 
achievement might be useful, a study comparing the effects of 5 
minutes, 10 minutes, and 3 hours of math homework per night 
would not be. Even if such a study found 3 hours of nightly math 
homework effective in raising students’ achievement in math, few 
teachers, parents, administrators, or students would carry out a 
program of 3 hours of nightly homework for a single content area. 

Another consideration is that some interventions may 
not produce socially significant behavior change unless and 
until they have been implemented consistently for a continuous 
period of time. 


When a multielement baseline design is employed, overlap- 
ping data do not necessarily rule out the possible efficacy 
of an experimental procedure. The session-by-session 
alternation of conditions might obscure effects that could 
be observed if the same condition was presented during 
several consecutive sessions. It is therefore possible that a 
given treatment may prove to be effective with a reversal or 
multiple baseline design, but not with a multielement base- 
line design. (UIman & Sulzer-Azaroff, 1975, p. 382) 


The suspicion that a given treatment may be effective if 
presented in isolation for an extended period is an empirical 
question that can be explored through experimentation. At one 
level, if extended application of a single treatment results in 
behavioral improvement, the practitioner might be satisfied, and 
no further action would be needed. However, the practitioner-re- 
searcher who is interested in determining experimental control 
might return to a multielement design and compare the perfor- 
mance of the single treatment with that of another intervention. 
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SUMMARY 


Reversal Design 


1. The reversal tactic (A-B-A) entails repeated measures 


. 


of behavior in a given setting during three consecutive 
phases: (a) baseline (independent variable absent), (b) 
treatment (independent variable present), and (c) return to 
baseline (independent variable absent). 


Reintroducing the independent variable in the form of an 
A-B-A-B design strengthens the reversal design tremen- 
dously. The A-B-A-B design is the most straightforward 
and generally most powerful intrasubject design for 
demonstrating functional relations. 


Variations of the A-B-A-B Design 
3. 


Extending the A-B-A-B design with repeated reversals 
may provide a more convincing demonstration of a func- 
tional relation than a design with one reversal. 


. The B-A-B reversal design can be used with target behav- 


iors for which an initial baseline phase is inappropriate or 
not possible for ethical or practical reasons. 


. Multiple treatment reversal designs (e.g., A-B-A-B-C-B-C, 


A-B-C-A-C-B-C) use the reversal tactic to compare the 
effects of two or more experimental conditions to baseline 
and/or to one another. 


. Multiple treatment reversal designs are particularly sus- 


ceptible to confounding by sequence effects. 


. The noncontingent reinforcement (NCR) reversal tech- 


nique enables the isolation and analysis of the contingent 
aspect of reinforcement. 


. Reversal techniques incorporating differential reinforce- 


ment control conditions (DRO and DRI/DRA) can also be 
used to analyze the effects of contingent reinforcement. 


Considering the Appropriateness of the Reversal Design 


9. Experimental designs featuring the reversal tactic are inef- 


fective in evaluating the effects of treatment variables that, 
by their very nature, cannot be withdrawn once presented 
(e.g., instruction, modeling). 


10. Once changed during treatment, some behaviors will 


not reverse to baseline levels when the independent vari- 
able has been withdrawn. Such behavioral irreversibility 
precludes effective use of the reversal design. 


11. Withdrawing a seemingly effective treatment variable to 


provide scientific verification of its function in changing 
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behavior may raise legitimate social, educational, and 
ethical concerns. 


12. Brief reversal phases, sometimes even one-session base- 


line probes, may be sufficient to demonstrate believable 
experimental control. 


Multielement Design 


13. The multielement design (also called alternating 
treatments design) compares two or more treatments 
(1.e., independent variables) while their effects on the 
target behavior (i.e., dependent variable) are measured. 


14. In a multielement design, each successive data point 
for a specific treatment plays three roles: (a) a basis for 
the prediction of future levels of responding under that 
treatment, (b) potential verification of the previous pre- 
diction of performance under that treatment, and (c) the 
opportunity for replication of previous effects produced 
by that treatment. 


15. Experimental control is demonstrated in a multielement 
design when the data paths for two different treatments 
show little or no overlap. 


16. The extent of differential effects produced by two treatments 
is determined by the vertical distance between their respec- 
tive data paths and quantified by the vertical axis scale. 


Variations of the Multielement Design 
17. Common variations of the multielement (alternating treat- 
ments) design include the following: 


¢ Single-phase design alternating two or more conditions 
(one of which may be a no-treatment control condition) 


¢ Two-phase design: initial baseline and comparison 
phase 


¢ Three-phase design: initial baseline, alternating 
treatments, and best treatment only 


¢ Three-phase design: initial baseline, alternating 
treatments, and participant’s choice 


¢ Concurrent chains (or schedule) design 


Advantages of the Multielement Design 

18. Advantages of the multielement (alternating treatments) 
design include the following: 
¢ Does not require treatment withdrawal. 


¢ Rapidly compares the relative effectiveness of 
treatments. 
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¢ Minimizes the irreversibility problem. 20. 


¢ Minimizes sequence effects. 
¢ Can be used with unstable data patterns. 


* Can assess generalization of effects. 


21. 


¢ Intervention can begin immediately. 


Considering the Appropriateness of the 22. 


Multielement Design 


19. Multielement designs are susceptible to multiple treatment 
interference. However, by following the alternating treat- 23. 
ments phase with a phase in which only one treatment is 
administered, the experimenter can assess the effects of 
that treatment in isolation. 


The rapid back-and-forth switching of treatment 
conditions does not typify clinical or education 
practice and may be viewed as artificial and 
undesirable. 


An alternating treatments phase is usually limited to a 
maximum of four different treatment conditions. 


Multielement designs are most effective in revealing the 
differential effects of treatment conditions that differ 
significantly from one another. 


Multielement designs are not effective for assess- 
ing the effects of independent variables that change 
behavior only when implemented over a continuous 
period of time. 
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A-B-A design DRI/DRA reversal technique noncontingent reinforcement (NCR) 
A-B-A-B design DRO reversal technique reversal technique 

adapted alternating treatments design irreversibility reversal design 

alternating treatments design multielement design sequence effects 

B-A-B design multiple treatment interference withdrawal design 

concurrent chains (schedule) design multiple treatment reversal design 


MULTIPLE-CHOICE QUESTIONS 


1. All things being equal, an experiment that incorporates 
reversal(s) presents a more convincing and 
compelling demonstration of experimental control than an 
experiment with reversal(s). 


a. Multiple, one 4. 


b. One, four 

c. Several, multiple 

d. Five, seven 

Hint: (See “Reversal Design”) 


2. Which of the following is considered to be a reason for 
using a B-A-B design instead of an A-B-A-B design? 


a. Irreversibility 5. 


b. Unlimited time in which to demonstrate practical and 
social significance 


c. Sequence effects 
d. When treatment is already in place 
Hint: (See “Reversal Design”) 
3. This design is considered the most “straightforward” and 


generally the most powerful within-subject design for 
demonstrating a functional relationship. 


a. Alternating treatments design 
b. Multiple baseline design 
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c. AB design 
d. A-B-A-B reversal design 
Hint: (See “Reversal Design”) 
For most investigations, it is suggested that you only 
change variable(s) at a time. 
a. One ; 
b. Two 
c. Three 
d. Four 
Hint: (See “Reversal Design”) 
control conditions might be used as a variation 
of the reversal design when it is not possible or when it is 


deemed inappropriate to completely eliminate an event or 
activity as a reinforcer. 


a. Difference 

b. Noncontingent reinforcement 
c. Sequenced 

d. Temporal 

Hint: (See “Reversal Design”) 
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6. This experimental tactic entails repeated measures of c. A-B-A-B reversal design 
behavior in a given setting that require at least three 


d. Single-subject design 
consecutive phases: baseline, introduction of the i : : 


independent variable, and a return to baseline. Hint: (See “Multielement Design”) 
a. Alternating treatments design 9. Aninvestigatorwho_——_turns the target behavior 
b. Multiple baseline design on and off by presenting and withdrawing a specific 


variable makes a clear and convincing demonstration of 


c. Repeated multiple measures design experimental control 


d. ABA reversal design a. Conditionally 


b. Haphazardly 


Te means that a level of behavior observed in c. Covertly 
an earlier phase cannot be reproduced even though d. Reliably 
experimental conditions return to preintervention conditions. 


Hint: (See “Reversal Design’) 


Hint: (See “Reversal Design’’) 
a. Sequence effects 


10. This term is used to describe the effects on a subject’s 
behavior in a given condition that may be a result of the 
subject’s experience with a prior condition. 


b. Irreversibility 

c. Order effects 

d. Reliability 

Hint: (See “Reversal Design’’) 


a. Sequence 
b. Irreversible 


8. This design is predicated on the behavioral principle of c. Temporal 
stimulus discrimination and allows for the comparison of d.. Variable 


two or more different treatments. i : myles 
, Hint: (See “Multielement Design’) 
a. Multiple baseline design 


b. Alternating treatments design 
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ESSAY-TYPE QUESTIONS 


1. Describe the experimental tactic shown in the figure. 3. State one disadvantage in the use of a reversal design and 
Discuss how the design allows for the detection of a in the use of an alternating treatments design in applied 
functional relationship between the dependent and settings. 
independent variables of interest. Hint: (See “Reversal Design” and ““Multielement Design’’) 

1 4. Summarize how the reversal design incorporates the 
‘oe elements of baseline logic (prediction, verification, and 
8 replication). 
6 Hint: (See “Multielement Design’’) 
; 5. State one ethical consideration in the use of the reversal 
design as an experimental tactic in an applied school 
2 setting. 
Hint: (See “Multielement Design”) 
Sa jade rs 


Hint: (See “Reversal Design’) 


2. You are interested in comparing the effects of two 
distinctly different schedules of reinforcement on students’ 
academic engagement. Select an appropriate experimental 
tactic and describe its implementation. 


Hint: (See ““Multielement Design”) 
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Some authors use the term withdrawal design to describe experiments 
based on an A-B-A-B analysis and reserve the term reversal design for stud- 
ies in which the behavioral focus of the treatment variable is reversed (or 
switched to another behavior), as in the DRO and DRI/DRA reversal tech- 
niques described later in this chapter. However, reversal design, as behav- 
ior analysts most often use the term, encompasses both withdrawals and 
reversals of the independent variable, signifying the investigator’s attempt to 
demonstrate “behavioral reversibility” (Baer, Wolf, & Risley, 1968; Thomp- 
son & Iwata, 2005). Also, withdrawal design is sometimes used to describe 
an experiment in which the treatment variables are sequentially or partially 
withdrawn after their effects have been analyzed in an effort to promote 
maintenance of the target behavior (Rusch & Kazdin, 1981). 

Risley (1997, 2005) credits Montrose Wolf with designing the first exper- 
iments using the reversal and multiple baseline designs. “The research 
methods that Wolf pioneered in these studies were groundbreaking. That 
methodology came to define applied behavior analysis” (pp. 280-281). 


. Strictly speaking, using NCR as an experimental control technique to demon- 


strate that the contingent application of reinforcement is requisite to its effec- 
tiveness is not a separate variation of the A-B-A reversal design. Technically, 
the NCR reversal technique, as well as the differential reinforcement of other 
behavior (DRO) and differential reinforcement of incompatible behavior 
(DRI)/differential reinforcement of alternative behavior (DRA) reversal tech- 
niques described next, is a multiple treatment design. For example, the Baer 


eo 


and Wolf (1970a) study of social reinforcement shown in Figure 8.6 used an 
A-B-C-B-C design, with B representing the NCR conditions and C represent- 
ing the contingent reinforcement conditions. 

The extent to which the increased diversity of the children’s block building 
can be attributed to the attention and praise (“That’s nice’’) or the descrip- 
tive feedback (‘‘. . . that’s different’) in the teacher’s comments cannot be 
determined from this study because social attention and descriptive feed- 
back were delivered as a package. 

Additional manipulations in the form of the partial or sequential withdrawal 
of intervention components are made when it is necessary or desirable for 
the behavior to continue at its improved level in the absence of the complete 
intervention (cf., Rusch & Kazdin, 1981). 

The mand is one of six types of elementary verbal operants identified by 
Skinner (1957). Chapter 18 describes Skinner’s analysis of verbal behavior 
and its importance to applied behavior analysis. 

Stimulus preference assessment procedures are described in Chapter 11. 
DRA and response cost are described in Chapters 25 and 15, respectively. 
Some published studies using a concurrent schedule design are described by 
their authors as using a simultaneous treatment design. Barlow and Hayes 
(1979), who introduced the term simultaneous schedule design, could find 
only one true example in the applied literature: a study by Browning (1967) 
in which three techniques for reducing the bragging of a 10-year-old boy 
were compared. 
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Multiple Baseline and Changing 


Criterion Designs 


LEARNING OBJECTIVES 


¢ Describe the multiple baseline technique. 


e Describe the differences between types of multiple baseline designs. 


¢ Describe the changing criterion design technique. 


¢ Explain how to systematically manipulate independent variables to analyze effects on treatment. 
¢ Identify and explain the practical and ethical considerations in using various experimental designs. 


his chapter describes two additional experimental tactics 

for analyzing behavior—environment relations: the multi- 

ple baseline design and the changing criterion design. In 
a multiple baseline design, after collecting initial baseline data 
simultaneously across two or more behaviors, settings, or people, 
the behavior analyst then applies the treatment variable sequen- 
tially across these behaviors, settings, or people and notes the 
effects. The changing criterion design is used to analyze improve- 
ments in behavior as a function of stepwise, incremental criterion 
changes in the level of responding required for reinforcement. 
In both designs, experimental control and a functional relation 
are demonstrated when the behaviors change from a steady state 
baseline to a new steady state after the introduction of the inde- 
pendent variable is applied, or a new criterion established. 


MULTIPLE BASELINE DESIGN 


The multiple baseline design is the most widely used experimen- 
tal design for evaluating treatment effects in applied behavior 
analysis. It is a highly flexible tactic that enables researchers 
and practitioners to analyze the effects of an independent vari- 
able across multiple behaviors, settings, and/or subjects with- 
out having to withdraw the treatment variable to verify that the 
improvements in behavior were a direct result of the applica- 
tion of the treatment. As you recall from Chapter 8, the reversal 
design by its very nature requires that the independent variable 
be withdrawn to verify the prediction established in baseline. 
This is not so with the multiple baseline design. 


Operation and Logic of the Multiple 
Baseline Design 


Baer, Wolf, and Risley (1968) first described the multiple 
baseline design in the applied behavior analysis literature. They 
presented the multiple baseline design as an alternative to the 
reversal design for two situations: (a) when the target behavior 
is likely to be irreversible (e.g., once changed by the treatment 


variable, the behavior comes into contact with naturally existing 
contingencies of reinforcement or other sources of control that 
maintain the new level of performance) or (b) when it is unde- 
sirable, impractical, or unethical to withdraw a seemingly effec- 
tive treatment condition. Additionally, some interventions, such 
as training or instructions, cannot be withdrawn. Terminating an 
ongoing instructional program does not re-establish conditions 
that existed prior to training; the participant “has been taught.” 

Figure 9.1 illustrates Baer and colleagues’ explanation of 
the basic operation of the multiple baseline design. 


In the multiple baseline technique, a number of responses 
are identified and measured over time to provide baselines 
against which changes can be evaluated. With these base- 
lines established, the experimenter then applies an experi- 
mental variable to one of the behaviors, produces a change 
in it, and perhaps notes little or no change in the other base- 
lines. If so, rather than reversing the just-produced change, 
he instead applies the experimental variable to one of the 
other, as yet unchanged, responses. If it changes at that 
point, evidence is accruing that the experimental variable is 
indeed effective, and that the prior change was not simply 
a matter of coincidence. The variable then may be applied 
to still another response, and so on. The experimenter is 
attempting to show that he has a reliable experimental vari- 
able, in that each behavior changes maximally only when 
the experimental variable is applied to it. (p. 94) 


The multiple baseline design takes three primary forms: 


The multiple baseline across behaviors design, consisting 

of two or more different behaviors of the same subject. 

¢ The multiple baseline across settings design, consisting 
of the same behavior of the same subject in two or more 
different settings, situations, or time periods. 

¢ The multiple baseline across subjects design, consisting 

of the same behavior of two or more different participants 

(or groups). 
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Behavior 1 


Response Measure 
Behavior 2 


Baseline 


Treatment 


Behavior 3 


Figure 9.1 Graphic prototype of a multiple 
baseline design. 


Although only one of the multiple baseline design’s basic 
forms is called an “across behaviors” design, all multiple base- 
line designs involve the time-lagged application of a treatment 
variable across technically different (meaning independent) 
behaviors. That is, in the multiple baseline across settings 
design, even though the subject’s performance of the same target 
behavior is measured in two or more settings, each behavior— 
setting combination is conceptualized and treated as a different 
behavior for analysis. Similarly, in a multiple baseline across 
subjects design, each subject-behavior combination functions 
as a different behavior in the operation of the design. 

Figure 9.2 shows the same data set displayed in 
Figure 9.1, with the addition of data points representing pre- 
dicted measures if baseline conditions were not changed and 
shaded areas illustrating how the three elements of baseline 
logic—prediction, verification, and replication—are operation- 
alized in the multiple baseline design.'! When stable baseline 
responding has been achieved for Behavior 1, a prediction is 
made that if the environment were held constant, continued mea- 
surement would reveal similar levels of responding. When the 
researcher’s confidence in such a prediction is justifiably high, 
the independent variable is applied to Behavior 1. The open data 
points in the treatment phase for Behavior | represent the pre- 
dicted level of responding. The solid data points show the actual 


5 10 15 20 25 30 
Sessions 


measures obtained for Behavior | during the treatment condi- 
tion. These data show a discrepancy with the predicted level of 
responding if no changes had been made in the environment, 
thereby suggesting that the treatment may be responsible for the 
change in behavior. The data collected for Behavior | in a multi- 
ple baseline design serve the same functions as the data collected 
during the first two phases of an A-B-A-B reversal design. 

Continued baseline measures of the other behaviors in 
the experiment offer the possibility of verifying the prediction 
made for Behavior 1. In a multiple baseline design, verification 
of a predicted level of responding for one behavior (or tier) is 
obtained if little or no change is observed in the data paths of 
the behaviors (tiers) that are still exposed to the conditions 
under which the prediction was made. In Figure 9.2 those por- 
tions of the baseline condition data paths for Behaviors 2 and 
3 within the shaded boxes verify the prediction for Behavior 
1. At this point in the experiment, two inferences can be made: 
(a) The prediction that Behavior | would not change in a con- 
stant environment is valid because the environment was held 
constant for Behaviors 2 and 3 and their levels of responding 
remained unchanged; and (b) the observed changes in Behav- 
ior | were brought about by the independent variable because 
only Behavior { was exposed to the independent variable and 
only Behavior ! changed. 


Baseline Treatment 


Behavior 1 
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Figure 9.2 Graphic prototype of a multi- 
ple baseline design with shading added 
to show elements of baseline logic. Open 
data points represent predicted measures 
if baseline conditions were unchanged. 
Baseline data points for Behaviors 2 and 3 
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Behavior 3 


within shaded areas verify the prediction 
made for Behavior 1. Behavior 3 baseline 
data within Bracket A verify the prediction 


made for Behavior 2. Data obtained during 
the treatment condition for Behaviors 
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In a multiple baseline design, the independent variable’s 
function in changing a given behavior is inferred by the lack of 
change in untreated behaviors. However, verification of func- 
tion is not demonstrated directly as it is with the reversal design, 
thereby making the multiple baseline design an inherently weaker 
tactic (i.e., less convincing from the perspective of experimental 
control) for revealing a functional relation between the indepen- 
dent variable and a target behavior. However, the multiple baseline 
design compensates somewhat for this weakness by providing 
the opportunity to verify or refute a series of similar predictions. 
Not only is the prediction for Behavior | in Figure 9.2 verified by 
continued stable baselines for Behaviors 2 and 3, but the bracketed 
portion of the baseline data for Behavior 3 also serves as verifica- 
tion of the prediction made for Behavior 2. 

When the level of responding for Behavior | under the 
treatment condition has stabilized or reached a predetermined 
performance criterion, the independent variable is then applied 
to Behavior 2. If Behavior 2 changes in a manner similar to the 
changes observed for Behavior 1, replication of the independent 
variable’s effect has been achieved (shown by the data path shaded 
with cross-hatching). After Behavior 2 has stabilized or reached a 
predetermined performance criterion, the independent variable is 
applied to Behavior 3 to see whether the effect will be replicated. 
The independent variable may be applied to additional behaviors in 


2 and 3 (cross-hatched shading) provide 
replications of the experimental effect. 


a similar manner until a convincing demonstration of the functional 
relation has been established (or rejected) and all of the behaviors 
targeted for improvement have received treatment. 

As with verification, replication of the independent variable’s 
specific effect on each behavior in a multiple baseline design is 
not manipulated directly. Instead, the generality of the independent 
variable’s effect across the behaviors constituting the experiment 
is demonstrated by applying it to a series of behaviors. Assuming 
accurate measurement and proper experimental control of relevant 
variables (i.e., the only environmental factor that changes during the 
course of the experiment should be the presence—or absence—of 
the independent variable), each time a behavior changes when, and 
only when, the independent variable is introduced, confidence in 
the existence of a functional relation increases. 

How many different behaviors, settings, or subjects must a 
multiple baseline design include to provide a believable demon- 
stration of a functional relation? Baer, Wolf, and Risley (1968) 
suggested that the number of replications needed in any design 
is ultimately a matter to be decided by the consumers of the 
research. In this sense, an experiment using a multiple baseline 
design must contain the minimum number of replications nec- 
essary to convince those who will be asked to respond to the 
experiment and to the researcher’s claims (e.g., teachers, admin- 
istrators, parents, funding sources, journal editors). A two-tier 
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multiple baseline design is a complete experiment and can 
provide strong support for the effectiveness of the independent 
variable (e.g., Grossi & Heward, 1998 [see Figure 29.3]; Harding, 
Wacker, Berg, Rick, & Lee, 2004; Lindberg, Iwata, Roscoe, 
Worsdell, & Hanley, 2003 [see Figure 26.2]; McCord, Iwata, 
Galensky, Ellingson, & Thomson, 2001 [see Figure 6.6]; May, 
Downs, Marchant, & Dymond, 2016; Newstrom, McLaughlin, 
& Sweeney, 1999 [see Figure 28.13]; Pennington & McComas, 
2017; Test, Spooner, Keul, & Grossi, 1990 [see Figure 23.10]). 
McClannahan, McGee, MacDuff, and Krantz (1990) conducted 
a multiple baseline design study in which the independent vari- 
able was sequentially implemented in an eight-tier design across 
12 participants. Multiple baseline designs of three to five tiers are 
most common. When the effects of the independent variable are 
substantial and reliably replicated, a three- or four-tier multiple 
baseline design provides a convincing demonstration of exper- 
imental effect. Suffice it to say that the more replications one 
conducts, the more convincing the demonstration will be. 

Some of the earliest examples of the multiple baseline 
design in the applied behavior analysis literature were studies 
by Risley and Hart (1968); Barrish, Saunders, and Wolf (1969); 
Barton, Guess, Garcia, and Baer (1970); Panyan, Boozer, and 
Morris (1970); and Schwarz and Hawkins (1970). Some of 
the pioneering applications of the multiple baseline technique 
are not readily apparent with casual examination: The authors 
may not have identified the experimental design as a multiple 
baseline design (e.g., Schwarz & Hawkins, 1970), and/or the 
now-common practice of stacking the tiers of a multiple base- 
line design one on the other so that all of the data can be dis- 
played graphically in the same figure was not always used (e.g., 
Maloney & Hopkins, 1973; McAllister, Stachowiak, Baer, & 
Conderman, 1969; Schwarz & Hawkins, 1970). 

In 1970, Vance Hall, Connie Cristler, Sharon Cranston, 
and Bonnie Tucker published a paper that described three exper- 
iments, each an example of one of the three basic forms of the 
multiple baseline design: across behaviors, across settings, and 
across subjects. Hall and colleagues’ paper was important not 
only because it provided excellent illustrations that today still 
serve as models of the multiple baseline design, but also because 
the studies were carried out by teachers and parents, indicating 
that practitioners “can carry out important and significant studies 
in natural settings using resources available to them” (p. 255). 


Multiple Baseline Across Behaviors Design 


The multiple baseline across behaviors design begins with 
the concurrent measurement of two or more behaviors of each 
participant in the study. After steady state responding has been 
obtained under baseline conditions, the investigator applies the 
independent variable to one of the behaviors and maintains base- 
line conditions for the other behavior(s). When steady state or 
criterion-level performance has been reached for the first behav- 
ior, the independent variable is applied to the next behavior, and 
so on (e.g., Davenport, Alber-Morgan, Clancy, & Kranak, 2017: 
Downs, Miltenberger, Biedronski, & Witherspoon 2015; Gena, 
Krantz, McClannahan, & Poulson, 1996; Harding et al., 2004; 
Higgins, Williams, & McLaughlin, 2001 [see Figure 28.1]). 
Ward and Carnes (2002) used a multiple baseline across 
behaviors design to evaluate the effects of self-set goals and public 


posting on the execution of three skills by five linebackers on a 
college football team: (a) reads, in which the linebacker positions 
himself to cover a specified area on the field on a pass play or from 
the line of scrimmage on a run; (b) drops, in which the linebacker 
moves to the correct position depending on the offensive team’s 
alignment; and (c) tackles. A video camera recorded the players’ 
movements during all practice sessions and games. Data were col- 
lected for the first 10 opportunities each player had with each skill. 
Reads and drops were recorded as correct if the player moved to 
the zone identified in the coaches’ playbook; tackles were scored 
as correct if the offensive ball carrier was stopped. 

Following baseline, each player met with one of the 
researchers, who described the player’s mean baseline perfor- 
mance for a given skill. Players were asked to set a goal for 
their performances during practice sessions; no goals were set 
for games. The correct performances during baseline for all five 
players ranged from 60 to 80%, and all players set goals of 90% 
correct performance. The players were informed that their perfor- 
mance in each day’s practice would be posted on a chart prior to 
the next practice session. A Y (yes) or an N (no) was placed next 
to each player’s name to indicate whether he had met his goal. A 
player’s performance was posted on the chart only for the skill(s) 
in intervention. The chart was mounted on a wall in the locker 
room where all players on the team could see it. The head coach 
explained the purpose of the chart to other players on the team. 
Players’ performances during games were not posted on the chart. 

The results for one of the players, John, are shown in Figure 9.3. 
John met or exceeded his goal of 90% correct performance during 
all practices for each of the three skills. Additionally, his improved 
performance generalized to games. The same pattern of results 
was obtained for each of the other four players in the study, illus- 
trating that the multiple baseline across behaviors design is a sin- 
gle-subject experimental strategy in which each subject serves as 
his own control. Each player constituted a complete experiment, 
replicated in this case with four other participants. 


Multiple Baseline Across Settings Design 


In the multiple baseline across settings design, a single behay- 
ior of a person (or group) is targeted in two or more different 
settings or conditions (e.g., locations, times of day). After stable 
responding has been demonstrated under baseline conditions, 
the independent variable is introduced in one of the settings 
while baseline conditions remain in effect in the other settings. 
When maximum behavior change or criterion-level performance 
has been achieved in the first setting, the independent variable 
is applied in the second setting, and so on. 

Roane, Kelly, and Fisher (2003) employed a multiple 
baseline across settings design to evaluate the effects of a treat- 
ment designed to reduce the rate at which an 8-year-old boy put 
inedible objects in his mouth. Jason, who had been diagnosed 
with autism, cerebral palsy, and moderate intellectual disabili- 
ties, had a history of putting objects such as toys, cloth, paper, 
tree bark, plants, and dirt into his mouth. Data on Jason’s mouth- 
ing were obtained concurrently in a classroom, a playroom, and 
outdoors—three settings that contained a variety of inedible 
objects and where caretakers had reported Jason’s mouthing to 
be problematic. Observers in each setting unobtrusively tallied 
the number of times Jason inserted an inedible object past the 
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plane of his lips during 10-minute sessions. The researchers 
reported that Jason’s object mouthing usually consisted of a 
series of discrete episodes, rather than an extended, continuous 
event, and that he often placed multiple objects (inedible objects 
and food) in his mouth simultaneously. 

Roane and colleagues (2003) described the baseline and 
treatment conditions as follows: 


The baseline condition was developed based on the 
functional analysis results, which showed that mouthing 
was maintained by automatic reinforcement and occurred 
independent of social consequences. During baseline, a 
therapist was present (approximately 1.5 to 3m from Jason), 
but all occurrences of mouthing were ignored (i.e., no social 
consequences were arranged for mouthing, and Jason was 
allowed to place items in his mouth). No food items were 
available during baseline. The treatment condition was 
identical to baseline except that Jason had continuous access 
to foods that had been previously identified to compete 

with the occurrence of object mouthing: chewing gum, 


behaviors design showing percentage 
of correct reads, drops, and tackles by a 
college football player during practices 
and games. 


Based on “Effects of Posting Self-Set Goals 
on Collegiate Football Players’ Skill Execution 
During Practice and Games” by P. Ward and 


M. Carnes, 2002, Journal of Applied Behavior 


40 Analysis, 35, p. 5. Copyright 2002 by the Society 
for the Experimental Analysis of Behavior, Inc. 


marshmallows, and hard candy. Jason wore a fanny pack 
containing these items around his waist. (pp. 580-581 y 


Figure 9.4 shows the staggered treatment sequence and 
results in each setting. During baseline, Jason mouthed objects 
at mean rates of 0.9, 1.1, and 1.2 responses per minute in the 
classroom, a playroom, and outdoor settings, respectively. Intro- 
duction of the fanny pack with food in each setting produced an 
immediate drop to a zero or near zero rate of mouthing. During 
treatment, Jason put items of food from the fanny pack into his 
mouth at mean rates of 0.01, 0.01, and 0.07 responses per minute 
in the classroom, a playroom, and outdoor settings, respectively. 
The multiple baseline across settings design revealed a func- 
tional relation between the treatment and a marked reduction in 
the frequency of Jason’s object mouthing. No measures obtained 
during the treatment condition were as high as the lowest mea- 
sures in baseline. During 22 of 27 treatment sessions across the 
three settings, Jason put no inedible objects in his mouth. 

As was done in the Roane and colleagues’ (2003) study, 
data for the different tiers in a multiple baseline across settings 
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Responses per Minute (Mouthing) 


Figure 9.4 A multiple baseline across settings 

design showing the number of object mouth- 1 
ing responses per minute during baseline and 

treatment conditions. 

Based on “The Effects of Noncontingent Access to Food on 

the Rate of Object Mouthing across Three Settings” by 

H. S. Roane, M. L. Kelly, and W. W. Fisher, 2003, Journal of 0 
Applied Behavior Analysis, 36, p. 581. Copyright 2003 by 

the Society for the Experimental Analysis of Behavior, Inc. 


design are typically obtained in different physical environments 
(e.g., Cushing & Kennedy, 1997; Dalton, Martella, & Marchand- 
Martella, 1999). However, the different “settings” in a multiple 
baseline across settings design may exist in the same physical 
location and be differentiated from one another by different con- 
tingencies in effect, the presence or absence of certain people, 
and/or different times of the day. For example, in a study by 
Parker and colleagues (1984), the presence or absence of other 
people in the training room constituted the different settings 
(environments) in which the effects of the independent vari- 
able were evaluated. The attention, demand, and no-attention 
conditions (i.e., contingencies in effect) defined the different 
settings in a multiple baseline design study by Kennedy, Meyer, 
Knowles, and Shukla (2000; see Figure 6.4). The afternoon and 
the morning portions of the school day functioned as different 
settings in the multiple baseline across settings design used by 
Dunlap, Kern-Dunlap, Clarke, and Robbins (1991) to analyze 
the effects of curricular revisions on a student’s disruptive and 
off-task behaviors. 

In some studies using a multiple baseline across settings 
design, the participants are varied, changing, and perhaps even 
unknown to the researchers. For example, Van Houten and 
Malenfant (2004) used a multiple baseline design across two 
crosswalks on busy streets to evaluate the effects of an inten- 
sive driver enforcement program on the percentage of drivers 
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yielding to pedestrians and the number of motor vehicle—pedes- 
trian conflicts. Watson (1996) used a multiple baseline design 
across men’s restrooms on a college campus to assess the effec- 
tiveness of posting-signs in reducing bathroom graffiti. 


Multiple Baseline Across Subjects Design 


In the multiple baseline across subjects design, one target behav- 
ior is selected for two or more subjects in the same setting. After 
steady state responding has been achieved under baseline con- 
ditions, the independent variable is applied to one subject while 
baseline conditions remain in effect for the-other subjects. When 
stable or criterion-level responding has been attained for the first 
subject, the independent variable is applied to another subject, and 
so on. The multiple baseline across subjects design is the most 
widely used of all three forms of the design, in part because prac- 
titioners are commonly confronted by multiple clients or students 
needing to learn the same skill or eliminate the same problem 
behavior (e.g., Craft, Alber, & Heward, 1998 [see Figure 30.10]; 
Kahng, Iwata, DeLeon, & Wallace, 2000 [See Figure 26,1]; 
Killu, Sainato, Davis, Ospelt, & Paul, 1998 [see Figure 26.3]; 
Kladopoulos & McComas, 2001 [see Figure 6.3]). Sometimes a 
multiple baseline design is conducted across “groups” of partici- 
pants (e.g., Dixon & Holcomb, 2000 [see Figure 13.8]; Lewis, 
Powers, Kelk, & Newcomer, 2002 [see Figure 28.7]; White & 
Bailey, 1990 [see Figure 15.3]). 


Scherrer and Wilder (2008) used a multiple baseline across 
subjects design to evaluate the effects of training of safe tray carry- 
ing by cocktail servers. Participants were three servers chosen ran- 
domly from the full-time staff of a bar, whose “managers reported 
that servers had complained of sore muscles and joints, particularly 
after a busy shift” (p. 132). The dependent variable was the percent- 
age of eight safe tray-carrying behaviors (e.g., carrying position for 
amount of weight, finger and thumb positions, wrist position, tray 
distance from body) performed by each server when carrying trays 
to customers. During evening shifts, 3 to 4 days per week over the 
8-week study, trained observers sitting at tables not assigned to the 
participants recorded safe tray-carrying behaviors on checklists they 
concealed. All three servers, when asked at the end of the study if 
they had been aware of data collectors recording their behavior, said 
they were unaware of the observers. 

The independent variable was a single training session 
that began with a three-step procedure for each of the eight safe 
tray-carrying behaviors: (1) The trainer explained the correct 
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position or technique, (2) the trainer modeled the correct posi- 
tion or technique, and (3) the participant described the correct 
position or technique and then demonstrated it. A series of trials 
were followed in which the server had to demonstrate each cor- 
rect position or technique four times. All three participants met 
the mastery criteria in 30 to 50 minutes. 

Figure 9.5 shows the results of the study. During base- 
line the three participants performed the safe carrying posi- 
tions and techniques at means of 40% (Sara), 41% (Mike), and 
49% (Tanya). Each participant’s performance was variable, 
and no server attained as high as 70% safe performance on 
any baseline session. Although just three baseline measures 
were obtained for Sara, the sharp downward trend in the data 
indicated she should receive safety training first. Safe carry- 
ing behaviors by each participant increased immediately after 
training and remained at high and stable levels for the remain- 
der of the study (means of 96% for Sara, 93% for Mike, and 
96% for Tanya). 


1 
1 
1 
1 
1 
1 
\ 
\ 
' 
' 
1 
i 
t 
1 
1 
1 
i] 
t 
' 
' 
1 
0 5 = 
1 
1 
\ 
1 
' 
1 
ORNS, Re Oe RT eee ery 


100 


% of Behaviours performed safely 


0 5 at ee La : ar ee | Tra 


100 
90 
80 
70 
60 
50 
40 
30 
20 
10 


Figure 9.5 A multiple baseline across 
subjects design showing the percentage of 
safe tray-carrying positions and techniques 
by cocktail servers during baseline and 
post safety training conditions. 

From “Training to Increase Safe Tray Carrying 
Among Cocktail Servers,” by M. D. Scherrer and 

D. A. Wilder, 2008, Journal of Applied Behavior 
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Analysis, 41, p. 134. Copyright 2008 by the Society 
for the Experimental Analysis of Behavior, Inc. 
Reprinted by permission. 
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The multiple baseline is not a true single-case design because 
each subject does not serve as his or her own control. Instead, ver- 
ification of predictions based on the baseline data for each subject 
must be inferred from the relatively unchanging baseline measures 
of other subjects who are still in baseline, and replication of effects 
must be inferred from changes in the behavior of other subjects 
when they come into contact with the independent variable. This is 
both a weakness and a potential advantage of the multiple baseline 
across subjects design, and is discussed later in the chapter. 


Variations of the Multiple Baseline Design 


Three variations of the multiple baseline design are the multiple 
probe design, the delayed multiple baseline design, and the non- 
concurrent multiple baseline across participants design. 


Multiple Probe Design 


The multiple probe design enables the behavior analyst to extend 
the operation and logic of the multiple baseline tactic to behav- 
iors or situations in which concurrent measurement of all behav- 
iors constituting the design is unnecessary, potentially reactive, 
impractical, or too costly. The multiple probe design was first 


described by Horner and Baer (1978) as a method for analyzing 
the relation between an independent variable and the acquisition 
of successive approximations or task sequence. In contrast to the 
standard multiple baseline design—in which data are collected 
simultaneously throughout the baseline phase for each behav- 
ior, setting, or subject in the experiment—in the multiple probe 
design, intermittent measures, or probes, provide the basis for 
determining whether behavior change has occurred prior to inter- 
vention. According to Horner and Baer, when applied to a chain 
or sequence of related behaviors to be learned, the multiple probe 
design provides answers to four questions: (a) What is the initial 
level of performance on each step (behavior) in the sequence? 
(b) What happens when sequential opportunities to perform each 
step in the sequence are provided prior to training on that step? 
(c) What happens to each step as training is applied? and (d) What 
happens to the performance of untrained steps in the sequence as 
criterion-level performance is reached on the preceding steps? 
Figure 9.6 shows a graphic prototype of the multiple probe 
design. Although researchers have developed many variations of 
the multiple probe technique, the basic design has three key fea- 
tures: (a) An initial probe determines the subject’s level of perfor- 
mance on each behavior in the sequence; (b) a series of baseline 


Baseline Instruction Follow-Up 
100- oO oO Oo a oa 
5 
a ; 
= 1 
& 
0! oO 
100 
™N 
S 
a 
ches. 
{<) 
faa} 
os 
o 
a 
SiMedt8 
o 100- 
vu 
® 
a 
~ 
S 
Fs 
whew 
fc) 
faa} 
go 
1005 
a 
Ss 
> 
co 
Ss 
(3) 
co 
0 
— <A — — 
1 10 20 30 40 50 74 94 
Sessions 


Figure 9.6 Graphic prototype of a multiple probe design. Square data points represent results of 
probe sessions in which the entire sequence or set of behaviors (1-4) is tested. 


measures is obtained on each step prior to training on that step; 
and (c) after criterion-level performance is reached on any train- 
ing step, a probe of each step in the sequence is obtained to deter- 
mine whether performance has changed in any other steps. 

Jimenez, Browder, Spooner, and Dibiase (2003) used a 
multiple probe design to analyze the effects of a peer-mediated 
instructional procedure on the acquisition of science knowledge by 
middle school students with moderate intellectual disability. Par- 
ticipants included five target students ages 11 to 14 with IQs rang- 
ing from 34 to 55 and six 11-year-old peer tutors from the same 
inclusive classroom of 26 students. Throughout the study, the gen- 
eral education teacher introduced each science inquiry lesson to 
the entire class, using the first three letters of a KWHL chart (K = 
what do you Know?; W = What do you want to know?; H = 
How will you find out?; L = what did you Learn?) to prompt 
student responses and thinking. Students then broke into groups of 
four or five, each group including one target student and one peer 
tutor, and participated in a hands-on experiment or online activity 
and reported what they saw. 
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The primary dependent variable was the number of correct 
responses by the target students on experimenter-administered 
assessment probes conducted after each science lesson. Each probe 
consisted of eight response opportunities: stating two printed sci- 
ence vocabulary words, naming two science pictures (e.g., student 
shown three flash cards with science vocabulary words or pictures 
and told, “find kinetic energy”), matching two word/picture com- 
binations, and completing two concept statements (e.g., “Can you 
find the word that completes this statement? is the energy 
of motion.” [p. 305]). Three different picture sets were created for 
each vocabulary word. Jn vivo probes, conducted at least once per 
each unit of instruction, consisted of the target student’s perfor- 
mance on the first peer-presented trial for each science response. 

During intervention, the peer tutors (1) used a constant 
time-delay procedure to teach the target students to use the 
KWHL chart as the teacher led the class to complete their 
charts, and (2) embedded three constant time-delay learning tri- 
als for each science response during the small-group activity.° 
Figure 9.7 shows the results for one of the target students. 
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Figure 9.7 A multiple probe design across three curriculum units showing the number 
of correct responses on probes of science knowledge by a student with moderate 
intellectual disability as a function of peer-mediated embedded instruction. 

From “Inclusive Inquiry Science Using Peer-Mediated Embedded Instruction for Students with Moderate 
Intellectual Disabilities,” by B. A. Jimenez, D. M. Browder, F. Spooner, & D. Dibiase, 2012, Exceptional 
Children, 78, p. 310. Copyright 2012 by the Council for Exceptional Children. Reprinted by permission. 
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During baseline, Jade averaged 1.75 to 2 correct responses per 
probe across the three units. Her performance improved during 
intervention to a mean of 6 to 7.8 correct responses per probe. 
Jade’s performance on maintenance probes remained well above 
baseline levels for two of the three units. The other four target 
students obtained similar results, suggesting a functional rela- 
tion between peer-mediated embedded instruction and acquisi- 
tion of science knowledge by students with moderate intellectual 
disability. 

The multiple probe design is particularly appropriate for 
evaluating the effects of instruction on skill sequences in which 
it is highly unlikely that the subject can improve performance 
on later steps in the sequence without acquiring the prior steps. 
For example, the repeated measurement of a student’s accuracy 
in solving division problems when the student possesses no 
skills in addition, subtraction, and multiplication would add 
little to an analysis. Horner and Baer (1978) made this point 
exceedingly well: 


The inevitable zero scores on the division baseline have 
no real meaning: division could be nothing else than 
zero (or chance, depending on the test format), and there 
is no real point in measuring it. Such measures are 

pro forma: they fill out the picture of a multiple base- 
line, true, but in an illusory way. They do not so much 
represent zero behavior as zero opportunity for the 
behavior to occur, and there is no need to document at 
the level of well-measured data that behavior does not 
occur when it cannot. (p. 190) 


Thus, the multiple probe design avoids the necessity 
of ritualistic baseline data collection when the performance 
of any component of a chain or sequence is impossible or 
unlikely before acquisition of its preceding components. In 
addition to the two uses already mentioned—analysis of the 
effects of instruction on complex skill sequences and reduc- 
tion in the amount of baseline measurement for behaviors that 
have no plausible opportunity to occur—the multiple probe 
technique is an effective experimental strategy for situations 
in which extended baseline measurement may prove reactive, 
impractical, or costly. The repeated measurement of a skill 
under nontreatment conditions can prove aversive to some stu- 
dents; and extinction, boredom, or other undesirable responses 
can occur. In his discussion of multiple baseline designs, Cuvo 
(1979) suggested that researchers should recognize that “there 
is a trade-off between repeatedly administering the dependent 
measure to establish a stable baseline on one hand and risking 
impaired performance by subjecting participants to a poten- 
tially punishing experience on the other hand” (pp. 222-223). 
Furthermore, complete assessment of all skills in a sequence 
may require too much time that could otherwise be spent on 
instruction. 

Other examples of the multiple probe design can be 
found in Aldi et al. (2016; see Figure 21.2 in this text) Arntzen, 
Halstadtrd, and Halstadtrg (2003); Coleman-Martin and Wolff 
Heller (2004); Lambert and colleagues (2016); Mechling, 
Ayres, Purrazzella, and Purrazzella (2012); Werts, Caldwell, 
and Wolery (2016; see Figure 23.9 in this text) and Yanardag, 
Akmanoglu, and Yilmaz (2013). 


Delayed Multiple Baseline Design 


The delayed multiple baseline technique can be used when a 
planned reversal design is no longer possible or proves ineffec- 
tive; it can also add additional tiers to an already operational 
multiple baseline design, as would be the case if new subjects 
were added to an ongoing study. The delayed multiple base- 
line design is an experimental tactic in which an initial baseline 
and intervention are begun, and subsequent baselines are added 
in a staggered or delayed fashion. Figure 9.8 shows a graphic 
prototype of the delayed multiple baseline design. The design 
employs the same experimental reasoning as a full-scale multi- 
ple baseline design, with the exception that data from baselines 
begun after the independent variable has been applied to pre- 
vious behaviors, settings, or subjects cannot be used to verify 
predictions based on earlier tiers of the design. In Figure 9.8 
baseline measurement of Behaviors 2 and 3 was begun early 
enough for those data to be used to verify the prediction made 
for Behavior 1. The final four baseline data points for Behavior 
3 also verify the prediction for Behavior 2. However, baseline 
measurement of Behavior 4 began after the independent vari- 
able had been applied to each of the previous behaviors, thus 
limiting its role in the design to an additional demonstration of 
replication. 

A delayed multiple baseline design may allow the behav- 
ior analyst to conduct research in certain environments in which 
other experimental tactics cannot be implemented. Heward 
(1978) suggested three-such situations. 


A Reversal Design Is No Longer Desirable or Possible. In 
applied settings the research environment may shift, negating 
the use of a previously planned reversal design. Such shifts may 
involve changes in the subject’s environment that make the tar- 
get behavior no longer likely to reverse to baseline levels, or 
changes in the behavior of parents, teachers, administrators, the 
subject/client, or the behavior analyst that, for any number of 
reasons, make a previously planned reversal design no longer 
desirable or possible. If other behaviors, settings, or subjects 
are appropriate for application of the independent variable, the 
behavior analyst could use a delayed multiple baseline technique 
and still pursue evidence of a functional relation. 


Limited Resources, Ethical Concerns, or Practical Difficulties 
Preclude a Full-scale Multiple Baseline Design. This situation 
occurs when the behavior analyst only controls resources suf- 
ficient to initially record and intervene with one behavior, set- 
ting, or subject, and another research strategy is inappropriate. 
It may be that as a result of the first intervention, more resources 
become available for gathering additional baselines. This might 
occur following the improvement of certain behaviors whose 
pretreatment topography and/or rate required an inordinate 
expenditure of staff resources. Or, it could be that a reluctant 
administrator, after seeing the successful results of the first inter- 
vention, provides the resources necessary for additional analysis. 
Ethical concerns may preclude extended baseline measurement 
of some behaviors (e.g., Linscheid, Iwata, Ricketts, Williams, & 
Griffin, 1990). Also under this heading would fall the “practical 
difficulties” cited by Hobbs and Holt (1976) as a reason for 
delaying baseiine measurement in one of three settings. 


Baseline 


Chapter 9 * Multiple Baseline and Changing Criterion Designs 233 


Treatment 
: [ee 
2 
> 
oO 
near 
wo 
> cee S/N ire 
| | =| if | l 
Cu ee <a 
— 
2 
> 
2 
g aa wi yacfone 
wn 
Ls) 
Ss 
a el | | (ee? | 
< 
° 
a 
wn 
Psa ioe) [tiem 
= 
2 
> 
oO 
oe 
(<b) 
ne A Ot eS 
Slit | 
e) ware 
S 
> 
oO 
Is 
(<3) 
faa) 


5 10 15 20 
Sessions 


“New” Behaviors, Settings, or Subjects Become Available. A 
delayed multiple baseline technique might become a useful ana- 
lytic tactic due to changes in the environment (e.g., the subject 
begins to emit another behavior appropriate for intervention 
with the experimental variable, the subject begins to emit the 
original target behavior in another setting, or additional subjects 
displaying the same target behavior become available [e.g., 
Halld6rsdottir, Valdimarsdottir, & Arntzen, 2017; Young & Daly, 
2016, Figures 1 and 3]). Todd, Horner, and Sugai (1999) imple- 
mented a self-management intervention during Class Period B at 
the teacher’s request because the teacher had noted an immediate 
and “dramatic change” in the student’s performance when the 
intervention began in praise in Class Period A (see Figure 29.6). 

Researchers have used delayed multiple baseline designs 
to evaluate a wide variety of interventions (e.g., Baer, Williams, 
Osnes, & Stokes, 1984; Jones, Fremouw, & @arplesin1977; 
Linscheid et al., 1990; Risley & Hart, 1968; Schepis, Reid, 
Behrmann, & Sutton, 1998; McKenzie, Smith, Simmons, & 
Soderlund, 2008, [see Figure 14.3]; Towery, Parsons, & Reid, 
2014; White & Bailey, 1990 [see Figure 1524 Poche, Brou- 
wer, and Swearingen (1981) used a delayed multiple baseline 
design to evaluate the effects of a training program designed to 
prevent children from being abducted by adults. Three typically 


25 Figure 9.8 Graphic prototype of a 
delayed multiple baseline design. 


developing preschool children were selected as subjects because, 
during a screening test, each readily agreed to leave with an adult 
stranger. The dependent variable was the level of appropriateness 
of self-protective responses emitted by each child when an adult 
suspect approached the child and attempted to lure her away with 
a simple lure (“Would you like to go for a walk?”’), an authorita- 
tive lure (“Your teacher said it was all right for you to come with 
me.”), or an incentive lure (“I’ve got a nice surprise in my car. 
Would you like to come with me and see it?”). 

Each session began with the child’s teacher bringing the 
child outdoors and then pretending to have to return to the build- 
ing for some reason. The adult suspect (a confederate of the 
experimenters but unknown to the child) then approached the 
child and offered one of the lures. The confederate also served 
as observer, scoring the child’s response on a 0 to 6 scale, with 
a score of 6 representing the desired response (saying, “No, I 
have to go ask my teacher” and moving at least 20 feet away 
from the suspect within 3 seconds) and a score of 0 indicating 
that the child moved some distance away from the school build- 
ing with the suspect. Training consisted of modeling, behavioral 
rehearsal, and social reinforcement for correct responses. 

Figure 9.9 shows the results of the training program. 
During baseline, all three children responded to the lures with 
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Figure 9.9 A delayed multiple baseline design 
showing the level of appropriateness of self- 
protective responses during baseline, training, 
and generality probes in school and community 
settings. Closed symbols indicate data gathered 
near the school; open symbols, in a location away 
from the school. 

Based on “Teaching Self-Protection to Young Children,” by 
C. Poche, R. Brouwer, and M. Swearingen, 1981, Journal of 
Applied Behavior Analysis, 14, p. 174. Copyright 1981 by the 
Society for the Experimental Analysis of Behavior, Inc. 
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safety ratings of 0 or 1. All three children mastered correct 
responses to the incentive lure in one to three training sessions, 
with one or two more sessions required for each child to master 
correct responses to the other two lures. Overall, training took 
approximately 90 minutes per child distributed over five or six 
sessions. All three children responded correctly when the lures 
were administered in generalization probes on sidewalk loca- 
tions 150 to 400 feet from the school. 

Although each baseline in this study was of equal length 
(i.e., had an equal number of data points), contradicting the gen- 
eral rule that the baselines in a multiple baseline design should 
vary significantly in length, there are two good reasons that 
Poche and colleagues began training when they did with each 
subject. First, the nearly total stability of the baseline perfor- 
mance of each child provided an ample basis for evaluating the 
training program (the only exception to complete susceptibility 
to the adult suspect’s lures occurred when Stan stayed near the 
suspect instead of actually going away with him on his fourth 
baseline observation). Second, and more important, the nature 
of the target behavior required that it be taught to each child as 
soon as possible. Although continuing baseline measurement for 
varying lengths across the different tiers of any multiple baseline 
design is good practice from a purely experimental viewpoint, 
the ethics of such a practice in this instance would be highly 
questionable, given the potential danger of exposing the children 
to adult lures repeatedly while withholding training. 

The delayed multiple baseline design entails several lim- 
itations. First, from an applied standpoint the design is not a 
good choice if the behavior analyst must wait too long to modify 
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important behaviors, although this potential problem must be 
considered in all multiple baseline designs. Second, there is a 
tendency for the delayed baseline phases to contain fewer data 
points than found in a standard multiple baseline design, in 
which all baselines are begun simultaneously, resulting in base- 
line phases of considerable and varying length. Long baselines, 
if stable, provide the predictive power that permits convincing 
demonstrations of experimental control. Behavior analysts using 
any form of multiple baseline design must ensure that all base- 
lines, regardless of when they begin, are of sufficient and varied 
length to provide believable bases for comparing experimental 
effects. A third limitation of the delayed multiple baseline design 
is that it can mask the interdependence of dependent variables. 


The strength of any multiple baseline design is that little 

or no change is noticed in the other, as yet untreated, 
behaviors until, and only until, the experimenter applies 

the independent variable. In a delayed multiple baseline 
design, the “delayed baseline” data gathered for subsequent 
behaviors may represent changed performance due to 

the experimental manipulation of other behaviors in the 
design and, therefore, may not be representative of the 

true, preexperimental operant level... . In such instances, 
the delayed multiple baseline might result in a “false 
negative,” and the researcher may erroneously conclude 
that the intervention was not effective on the subsequent 
behavior(s), when in reality the lack of simultaneous 
baseline data did not permit the discovery that the 
behaviors covaried. This is a major weakness of the delayed 


multiple baseline design and makes it a research tactic of 
second choice whenever a full-scale multiple baseline can 
be employed. However, this limitation can and should be 
combated whenever possible by beginning subsequent 
baselines at least several sessions prior to intervention on 
previous baselines. (Heward, 1978, pp. 8-9) 


Both the multiple probe design and the delayed multiple 
baseline design offer the applied behavior analyst alternative 
tactics for pursuing a multiple baseline analysis when extended 
baseline measurement is unnecessary, impractical, too costly, or 
unavailable. Perhaps the most useful application of the delayed 
multiple baseline technique is in adding tiers to an already oper- 
ational multiple baseline design. Whenever a delayed baseline 
can be supplemented by probes taken earlier in the course of the 
study, experimental control is strengthened. As a general rule, 
the more baseline data, the better. 


Nonconcurrent Multiple Baseline Across 
Participants Design 


The nonconcurrent multiple baseline across participants 
design consists of a related series of A-B (baseline-intervention) 
sequences conducted across subjects at different points in time. 

Watson and Workman (1981) proposed the nonconcur- 
rent multiple baseline across participants design as a method 
for experimentally assessing treatment effects when concurrent 
measurement across subjects is not possible. They wrote, “The 
requirement [of a standard multiple baseline across subjects 
design] that observations be taken concurrently clearly poses 
problems for researchers in applied settings (e.g., schools, men- 
tal health centers), since clients with the same target behavior 
may only infrequently be referred at the same point in time” 
(p. 257, words in brackets added). The nonconcurrent multi- 
ple baseline across participants design is used frequently in 
applied behavior analysis (e.g., Garcia, Dukes, Brady, Scott, & 
Wilson, 2016; Greer, Neibert & Dozier, 2016 [see Figure 6.7]; 
Gunby, Carr, & LeBlanc, 2010; Kelley & Miltenberger, 2016; 
Speelman, Whiting, & Dixon, 2015). 

“The nonconcurrent MB design is essentially a series of 
A-B replications in which the length of each baseline phase 
differs. The participant’s graphs are then vertically aligned and 
visually inspected as one would a concurrent MB design” (Carr, 
2005, p. 220). Although conjoining the graphs of two or more 
A-B designs with a doglegged dashed line produces a figure that 
“looks like” a multiple baseline design, doing so may suggest a 
greater degree of experimental control than is warranted (Harris 
& Jenson, 1985). 

The nonconcurrent multiple baseline across participants 
design is inherently weaker than other multiple baseline design 
variations. Although the design entails two of the three elements 
of baseline logic—prediction and replication—the absence of 
concurrent baseline measures precludes the verification of inter- 
vention effects. We recommend labeling and scaling the hori- 
zontal axes of graphs of nonconcurrent multiple baseline design 
studies in a manner clearly depicting the actual time periods 
when data were obtained for each A-B sequence with respect 
to the others (e.g., see Carr, 2005; Harvey, May, & Kennedy, 
2004, Figure 2). 
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Assumptions and Guidelines for Using 
Multiple Baseline Designs 


Like all experimental tactics, the multiple baseline design entails 
certain assumptions about how the behavior-environment rela- 
tions under investigation function, even though discovering the 
existence and operation of those relations is the very reason 
for conducting the research. In this sense, the design of behav- 
ioral experiments resembles an empirical guessing game—the 
experimenter guesses; the data answer. The investigator makes 
assumptions, hypotheses in the informal sense, about behay- 
ior and its relation to controlling variables and then constructs 
experiments designed to produce data capable of verifying or 
refuting those conjectures.” 

Because verification and replication in the multiple base- 
line design depend on what happens, or does not happen, to other 
behaviors as a result of the sequential application of the inde- 
pendent variable, the experimenter must be particularly careful 
to plan and carry out the design in a manner that will afford 
the greatest degree of confidence in any relations suggested by 
the data. Although the multiple baseline design appears decep- 
tively simple, its successful application requires much more than 
selecting two or more behaviors, settings, or subjects, collecting 
some baseline data, and then introducing a treatment condition 
to one behavior after the other. We suggest the following guide- 
lines for designing and conducting experiments using multiple 
baseline designs. 


Select Independent, yet Functionally Similar, 
Baselines 


Demonstration of a functional relation in a multiple baseline 
design depends on two occurrences: (a) the behavior(s) still 
in baseline showing no significant change in level, trend, or 
variability while the behavior(s) in contact with the independent 
variable changes; and (b) each behavior changes when, and only 
when, the independent variable has been applied to it. Thus, the 
experimenter must make two, at times seemingly contradictory, 
assumptions about the behaviors targeted for analysis in a multi- 
ple baseline design. The assumptions are that the behaviors are 
functionally independent of one another (the behaviors will not 
covary with one another), and yet the behaviors share enough 
similarity that each will change when the same independent 
variable is applied to it. An error in either assumption can result 
in a failure to demonstrate a functional relation. 

For example, let us suppose that the independent variable 
is introduced with the first behavior, and changes in level and/ 
or trend are noted, but the other behaviors still in baseline also 
change. Do the changes in the still-in-baseline behaviors mean 
that an uncontrolled variable is responsible for the changes 
in all of the behaviors and that the independent variable is an 
ineffective treatment? Or do the simultaneous changes in the 
untreated behaviors mean that the changes in the first behavior 
were affected by the independent variable and have generalized 
to the other behaviors? Or, let us suppose instead that the first 
behavior changes when the independent variable is introduced, 
but subsequent behaviors do not change when the independent 
variable is applied. Does this failure to replicate mean that a 
factor other than the independent variable was responsible for 
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the change observed in the first behavior? Or does it mean only 
that the subsequent behaviors do not operate as a function of 
the experimental variable, leaving open the possibility that the 
change noted in the first behavior was affected by the indepen- 
dent variable? 

Answers to these questions can and should be pursued 
by further experimental manipulations. In both kinds of fail- 
ure to demonstrate experimental control, the multiple baseline 
design does not rule out the possibility of a functional relation 
between the independent variable and the behavior(s) that did 
change when the variable was applied. In the first instance, the 
failure to demonstrate experimental control with the originally 
planned design is offset by the opportunity to investigate and 
possibly isolate the variable robust enough to change multiple 
behaviors simultaneously. Discovery of variables that reliably 
produce generalized changes across behaviors, settings, and/or 
subjects is a major goal of applied behavior analysis; and if 
the experimenter is confident all other relevant variables were 
held constant before, during, and after the observed behavior 
changes, the original independent variable is the first candidate 
for further investigation. 

In the second situation, failure to replicate changes from 
one behavior to another, the experimenter can pursue the possi- 
bility of a functional relation between the independent variable 
and the first behavior, perhaps using a reversal technique, and 
seek to discover later an effective intervention for the behavior(s) 
that did not change. Another possibility is to drop the original 
independent variable altogether and search for another treatment 
that might be effective with all of the targeted behaviors. 


Select Concurrent and Plausibly Related 
Multiple Baselines 


In an effort to ensure the functional independence of behaviors 
in a multiple baseline design, experimenters should not select 
response classes or settings so unrelated to one another as to 
offer no plausible means of comparison. For the ongoing base- 
line measurement of one behavior to provide the strongest basis 
for verifying the prediction of another behavior that has been 
exposed to an independent variable, two conditions must be met: 
(a) The two behaviors must be measured concurrently, and (b) all 
of the relevant variables that influence one behavior must have an 
opportunity to influence the other behavior. Studies that employ 
a multiple baseline approach across subjects and settings often 
stretch the logic of the design beyond its capabilities. For exam- 
ple, using the stable baseline measures of one child’s compli- 
ance with parental requests as the basis for verifying the effect of 
intervention on the compliance behavior of another child living 
with another family is questionable practice. The sets of variables 
influencing the two children are surely differentiated by more than 
the presence or absence of the experimental variable. 


There are some important limits to designating multiple 
behavior/setting combinations that are intended to function 
as part of the same experiment. In order for the use of mul- 
tiple behaviors and settings to be part of the same design 
and thus augment experimental reasoning, the general 
experimental conditions under which the two responses 
(whether two from one subject or one from each of two 


subjects) are emitted and measured must be ongoing con- 
currently. . .. Exposure [to the independent variable] does 
not have to be simultaneous for the different behavior/set- 
ting combinations, [but] it must be the identical treatment 
conditions along with the associated extraneous variables 
that impinge on the two responses and/or settings. This is 
because the conditions imposed on one behavior/setting 
combination must have the opportunity of influencing 

the other behavior/setting combination at the same time, 
regardless of the condition that actually prevails for the 
second. . . . It follows that using responses of two subjects 
each responding in different settings would not meet the 
requirement that there be a coincident opportunity for 
detecting the treatment effect. A treatment condition [as 
well as the myriad other variables possibly responsible 
for changes in the behavior of one subject] could not then 
come into contact with the responding of the other sub- 
ject, because the second subject’s responding would be 
occurring in an entirely different location. . . . Generally, 
the greater the plausibility that the two responses would 
be affected by the single treatment [and all other relevant 
variables], the more powerful is the demonstration of 
experimental control evidenced by data showing a change 
in only one behavior. (Johnston & Pennypacker, 1980, 

pp. 276-278) 


The requirements of concurrency and plausible influ- 
ence must be met for the verification element of baseline logic 
to operate in a multiple baseline design. However, replication 
of effect is demonstrated each time a baseline steady state is 
changed by the introduction of the independent variable, more 
or less regardless of where or when the variable is applied. Such 
nonconcurrent and/or unrelated baselines can provide valuable 
data on the generality of a treatment’s effectiveness. See the pre- 
vious discussion of the nonconcurrent multiple baseline design. 

This discussion should not be interpreted to mean that 
a valid (i.e., logically complete) multiple baseline design can- 
not be conducted across different subjects, each responding 
in different settings. Numerous studies using mixed multiple 
baselines across subjects, response classes, and/or settings have 
contributed to the development of an effective technology of 
behavior change (e.g., Dixon et al., 1998; Durand, 1999 [see 
Figure 26.4]; Ryan, Ormond, Imwold, & Rotunda, 2002). 

Let us consider an experiment designed to analyze the 
effects of a particular teacher training intervention, perhaps a 
workshop on using tactics to increase each student’s opportunity 
to respond during group instruction. Concurrent measurement 
is begun on the frequency of student response opportunities 
in the classrooms of the teachers who are participating in the 
study. After stable baselines have been established, the work- 
shop is presented first to one teacher (or group of teachers) and 
eventually, in staggered multiple baseline fashion, to all of the 
teachers. 

In this example, even though the different subjects 
(teachers) are all behaving in different environments (different 
classrooms), comparison of their baseline conditions is exper- 
imentally sound because the variables likely to influence their 
teaching styles operate in the larger, shared environment in 


which they all behave (the school and teaching community). 
Nevertheless, whenever experiments are proposed or published 
that involve different subjects responding in different settings, 
researchers and consumers should view the baseline compari- 
sons with a critical eye toward their logical relation to one other. 


Do Not Apply the Independent Variable 
to the Next Behavior Too Soon 


To reiterate, for verification to occur in a multiple baseline 
design, it must be established clearly that as the independent 
variable is applied to one behavior and change is noted, little 
or no change is observed in the other, as-yet-untreated behav- 
iors. The potential for a powerful demonstration of experimental 
control has been destroyed in many studies because the inde- 
pendent variable was applied to subsequent behaviors too soon. 
Although the operational requirement of sequential application 
in the multiple baseline tactic is met by introduction of the inde- 
pendent variable even in adjacent time intervals, the experimen- 
tal reasoning afforded by such closely spaced manipulations is 
minimal. 


The influence of unknown, concomitant, extraneous 
variables that might be present could still be substantial, 
even a day or two later. This problem can be avoided by 
demonstrating continued stability in responding for the 
second behavior/setting combination during and after the 
introduction of the treatment for the first combination until 
a sufficient period of time has elapsed to detect any effect 
on the second combination that might appear. (Johnston & 
Pennypacker, 1980, p. 283) 


Vary Significantly the Lengths of Multiple Baselines 


Generally, the more the baseline phases in a multiple baseline 
design differ in the number of measures from one another, the 
stronger the design will be. Baselines with significantly differ- 
ent numbers of data points allow the unambiguous conclusion 
(assuming an effective treatment variable) that each behavior 
not only changes when the independent variable is applied, but 
also that each behavior does not change until the independent 
variable has been applied. If the different baselines are of the 
same or similar length, the possibility exists that changes noted 
when the independent variable is introduced are the result of a 
confounding variable, such as practice or reactivity to observa- 
tion and measurement, and not a function of the experimental 
variable. 


Those effects . . . called practice, adaptation, warm-up, 
self-analysis, etc.; whatever they may be and whatever they 
may be called, the multiple baseline design controls for 
them by systematically varying the length of time (sessions, 
days, weeks) in which they occur prior to the introduction 
of the training package. . . . Such control is essential, and 
when the design consists of only two baselines, then the 
number of data points in each prior to experimental inter- 
vention should differ as radically as possible, at least by a 
factor of 2. I cannot see not systematically varying lengths 
of baselines prior to intervention, and varying them as 
much as possible/practical. Failure to do that . . . weakens 
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the design too much for credibility. (D. M. Baer, personal 
communication June 2, 1978) 


Intervene on the Most Stable Baseline First 


In an ideal multiple baseline design, the independent variable 
is not applied to any of the behaviors until steady state respond- 
ing has been achieved for each. However, applied behavior 
analysts are often denied the option of delaying treatment just 
to increase the strength of an experimental analysis. When 
intervention must begin before stability is evident across each 
tier of the design, the independent variable should be applied 
to the behavior, setting, or subject showing the most stable 
baseline data path (or counter-therapeutic trend, as with Sara in 
Figure 9.5). For example, if a study is designed to evaluate 
the effects of a teaching procedure on the rate of math com- 
putation of four students and no a priori reason exists for 
teaching the students in any particular sequence, instruction 
should begin with the student showing the most stable base- 
line. However, this recommendation should be followed only 
when the majority of the baselines in the design show reason- 
able stability. 

Sequential applications of the independent variable should 
be made in the order of greatest stability at the time of each sub- 
sequent application. Again, however, the realities of the applied 
world must be heeded. The social significance of changing a 
particular behavior must sometimes take precedence over the 
desire to meet the requirements of experimental design. 


Considering the Appropriateness of Multiple 
Baseline Designs 


The multiple baseline design offers significant advantages, 
which no doubt have accounted for its widespread use by 
researchers and practitioners. Those advantages, however, must 
be weighed against the limitations and weaknesses of the design 
to determine its appropriateness in any given situation. 


Advantages of the Multiple Baseline Design 


Probably the most important advantage of the multiple baseline 
design is that it does not require withdrawing a seemingly effec- 
tive treatment to demonstrate experimental control. This is a 
critical consideration for target behaviors that are self-injurious 
or dangerous to others. This feature of the multiple baseline 
design also makes it an appropriate method for evaluating the 
effects of independent variables that cannot, by their nature, be 
withdrawn and for investigating target behaviors that are likely 
or that prove to be irreversible (e.g., Duker & van Lent, 1991). 
Additionally, because the multiple baseline design does not 
necessitate a reversal of treatment gains to baseline levels, par- 
ents, teachers, or administrators may accept it more readily as a 
method of demonstrating the effects of an intervention. 

The requirement of the multiple baseline design to 
sequentially apply the independent variable across multiple 
behaviors, settings, or subjects complements the usual practice 
of many practitioners whose goal is to develop multiple behavior 
changes. Teachers are charged with helping multiple students 
learn multiple skills to be used in multiple settings. Likewise, 
clinicians typically need to help their clients improve more than 
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one response class and emit more adaptive behaviors in several 
settings. The multiple baseline design is ideally suited to the 
evaluation of the progressive, multiple behavior changes sought 
by many practitioners in applied settings. 

Because the multiple baseline design involves concurrent 
measurement of two or more behaviors, settings, or subjects, it is 
useful in assessing the occurrence of generalization of behavior 
change. The simultaneous monitoring of several behaviors gives 
the behavior analyst the opportunity to determine their covari- 
ation as a result of manipulations of the independent variable. 
Although changes in behaviors still under baseline conditions 
eliminate the ability of the multiple baseline design to demon- 
strate experimental control, such changes reveal the possibility 
that the independent variable is capable of producing behavioral 
improvements with desirable generality, thereby suggesting an 
additional set of research questions and analytic tactics (e.g., 
Odom, Hoyson, Jamieson, & Strain, 1985). 

Finally, the multiple baseline design has the advantage 
of being relatively easy to conceptualize, thereby offering an 
effective experimental tactic for teachers and parents who are 
not trained formally in research methodology. 


Limitations of the Multiple Baseline Design 


The multiple baseline design presents at least three scientific 
limitations or considerations. First, a multiple baseline design 
may not allow a demonstration of experimental control even 
though a functional relation exists between the independent 
variable and the behaviors to which it is applied. Changes 
in behaviors still under baseline conditions and similar to 
concurrent changes in a behavior in the treatment condition 
preclude the demonstration of a functional relation within the 
original design. Second, from one perspective, the multiple 
baseline design is a weaker method for showing experimental 
control than the reversal design. This is because verification 
of the baseline prediction made for each behavior within a 
multiple baseline design is not directly demonstrated with 
that behavior, but must be inferred from the lack of change 
in other behaviors. This weakness of the multiple baseline 
design, however, should be weighed against the design’s 
advantage of providing multiple replications across different 
behaviors, settings, or subjects. Third, the multiple baseline 
design provides more information about the effectiveness of 
the treatment variable than it does about the function of any 
particular target behavior. 


Consistently [the] multiple baseline is less an experimental 
analysis of the response than of the technique used to alter 
the response. In the reversal design, the response is made 
to work again and again; in the multiple-baseline design, it 
is primarily the technique that works again and again, and 
the responses either work once each [if different responses 
are used] or else a single response works once each per 
setting or once each per subject. Repetitive working of the 
same response in the same subject or the same setting is not 
displayed. But, while repetitive working of the response is 
foregone, repetitive and diverse working of the experimen- 
tal technique is maximized, as it would not be in the rever- 
sal design. (Baer, 1975, p. 22) 


Two important applied considerations that must be evalu- 
ated in determining the appropriateness of the multiple baseline 
design are the time and resources required for its implementation. 
First, because the treatment variable cannot be applied to subse- 
quent behaviors, settings, or subjects until its effects have been 
observed on previous behaviors, settings, or subjects, the multiple 
baseline design requires that intervention be withheld for some 
behaviors, settings, or subjects, perhaps for a long time. This delay 
raises practical and ethical concerns. Treatment cannot be delayed 
for some behaviors; their importance makes delaying treatment 
impractical. And as Stolz (1978) pointed out, “If the intervention 
is generally acknowledged to be effective, denying it simply to 
achieve a multiple-baseline design might be unethical” (p. 33). 
Second, the resources needed for the concurrent measurement of 
multiple behaviors must be considered. Use of a multiple baseline 
design can be particularly costly when behavior must be observed 
and measured in several settings. However, when the use of inter- 
mittent probes during baseline can be justified in lieu of continu- 
ous measurement (Horner & Baer, 1978), the cost of concurrently 
measuring multiple behaviors can be reduced. 


CHANGING CRITERION DESIGN 


The changing criterion design can be used to evaluate the effects 
of a treatment that is applied in a graduated or stepwise fashion 
to a single target behavior. The changing criterion design was 
first described in the applied behavior analysis literature in two 
papers coauthored by Vance Hall (Hall & Fox, 1977; Hartmann 
& Hall, 1976). 


Operation and Logic of the Changing 
Criterion Design 


The reader can refer to Figure 9.10 before and after read- 
ing Hartmann and Hall’s (1976) description of the changing 
criterion design. 


The design requires initial baseline observations on a single 
target behavior. This baseline phase is followed by imple- 
mentation of a treatment program in each of a series of 
treatment phases. Each treatment phase is associated with 

a step-wise change in criterion rate for the target behavior. 
Thus, each phase of the design provides a baseline for 

the following phase. When the rate of the target behavior 
changes with each stepwise change in the criterion, ther- 
apeutic change is replicated and experimental control is 
demonstrated. (p. 527) 


The operation of two elements of baseline logic—predic- 
tion and replication—is clear in the changing criterion design. 
When stable responding is attained within each phase of the 
design, a prediction of future responding is made. Replication 
occurs each time the level of behavior changes in a systematic 
way when the criterion is changed. Verification of the predic- 
tions based on each phase is not so obvious in this design but can 
be approached in two ways. First, varying the lengths of phases 
systematically enables a form of self-evident verification. The 
prediction is made that the level of responding will not change 
if the criterion is not changed. When the criterion is not changed 
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and stable responding continues, the prediction is verified. When 
it can be shown within the design that levels of responding do 
not change unless the criterion is changed, regardless of the 
varied lengths of phases, experimental control is evident. Hall 
and Fox (1977) suggested another possibility for verification: 
“The experimenter may return to a former criterion and if the 
behavior conforms to this criterion level there is also a cogent 
argument for a high degree of behavioral control” (p. 154). 
Such a reversed criterion is shown in the next-to-last phase of 
Figure 9.10. Although returning to an earlier criterion level 
requires a brief interruption of the steady improvement in behav- 
ior, the reversal tactic strengthens the analysis considerably and 
should be included in the changing criterion design unless other 
factors indicate its inappropriateness. 

One way to conceptualize the changing criterion 
design is as a variation of the multiple baseline design. Both 
Hartmann and Hall (1976, p. 530) and Hall and Fox (1977, 
p. 164) replotted data from changing criterion design experi- 
ments in a multiple baseline format, with each tier of the mul- 
tiple baseline showing the occurrence or nonoccurrence of the 
target behavior at one of the criterion levels used in the experi- 
ment. A vertical condition change line doglegs through the tiers, 
indicating when the criterion for reinforcement was raised to 
the level represented by each tier. By graphing whether the tar- 
get behavior was emitted during each session at or above the 
level represented on each tier both before and after the change 
in criterion to that level, a kind of multiple baseline analysis 
is revealed. However, the strength of the multiple baseline 
argument is not convincing because the “different” behaviors 
represented by each tier are not independent of one another. 
For example, if a target behavior is emitted 10 times in a given 
session, all of the tiers representing criteria below 10 responses 
would have to show that the behavior occurred, and all of the 
tiers representing criteria of 11 or more would have to show no 
occurrence of the behavior, or zero responding. The majority 
of the tiers that would appear to show verification and replica- 
tion of effect, in fact, could only show these results because of 
the events plotted on another tier. A multiple baseline design 
provides a convincing demonstration of experimental control 
because the measures obtained for each behavior in the design 


a changing criterion design. 


are a function of the controlling variables for that behavior, not 
artifacts of the measurement of another behavior. Thus, recasting 
the data from a changing criterion design into a many-tiered 
multiple baseline format will often result in a biased picture in 
favor of experimental control. 

Even though the multiple baseline design is not com- 
pletely analogous, the changing criterion design can be concep- 
tualized as a method of analyzing the increasing (or decreasing) 
performance of a target behavior. As Sidman (1960) pointed out, 
“Tt is possible to make reinforcement contingent upon a speci- 
fied value of some aspect of behavior, and to treat that value as a 
response class in its own right” (p. 391). The changing criterion 
design can be an effective tactic for showing the repeated pro- 
duction of new rates of behavior as a function of manipulations 
of the independent variable (i.e., criterion changes). 

Cook, Rapp, and Schulze (2015) used a changing criterion 
design to evaluate the effects of differential negative reinforce- 
ment of other behavior (DNRO) on the duration with which 
an 8-year-old boy with autism wore a medical alert bracelet. 
Jonah’s parents were worried about their son’s history of get- 
ting separated from caregivers in public places and purchased a 
medical alert bracelet that contained their contact information 
and Jonah’s diagnosis.° Placing the bracelet on Jonah’s wrist 
evoked problem behavior in the form of yelling, demanding the 
bracelet be removed, flopping on the floor, hitting people, biting 
his hand, and trying to remove the bracelet by pulling on it and 
shaking his wrist. Baseline consisted of 122 escape trials over 
4 days. Each trial began with a therapist saying, “Jonah, it’s time 
to wear your bracelet now,” and placing the bracelet on Jonah’s 
wrist. Contingent on problem behavior, the therapist immedi- 
ately removed the bracelet for 30 seconds. When the escape 
period ended, the next trial began. 

Each trial during the DNRO condition began with a thera- 
pist telling Jonah, “It’s time to wear your bracelet now. If you can 
wait, I will take it off’ (p. 903). The therapist placed the bracelet 
on Jonah’s wrist and set a timer for the predetermined interval. If 
Jonah complied, the therapist simultaneously provided brief praise 
and removed the bracelet for the escape period. When the escape 
period ended, the next trial began. If Jonah was noncompliant, the 
therapist placed her hand over the bracelet to block his attempts to 


240 Part3 ¢ Evaluating and Analyzing Behavior Change 


remove it, described what Jonah needed to do to remove the brace- 
let (e.g., “Try to wait again with no yelling”), and reset the time 
to the current criterion wait time. The DNRO intervention began 
with a 5-second criterion for compliance time. Each time Jonah 
met criterion for five consecutive trials, the criterion was raised. 

Jonah’s mean latency to noncompliance during the escape 
baseline condition was 6 seconds (range 0 to 75 seconds) (see 
Figure 9.11; only the last 10 escape trials are plotted on the 
graph). Cook and colleagues (2015) described the sequence and 
magnitude of criterion changes and results: 


We introduced DNRO with a 5-s criterion and gradually 
increased the criterion to 60 s, which required 98 trials over 
3 days. We then implemented DNRO with a 2-min criterion 
and extended the criterion to 45 min (middle panel); this 
increase required 64 trials over 5 days. We extended the 
duration that Jonah wore the bracelet from | hr to 7 hr in 

37 trials over 12 days (bottom panel). Subsequently, 


Seconds compliant 


Minutes compliant 


Figure 9.11 A changing criterion design show- 
ing how long an 8-year-old boy with autism 
complied with wearing a medical alert brace- 
let during baseline and differential negative 
reinforcement of other behavior (DNRO). 

From “Differential Negative Reinforcement of Other 
Behavior to Increase Wearing of a Medical Bracelet,” by 
J. L. Cook, J. T. Rapp, & K. A. Schulze, 2015, Journal of 0 
Applied Behavior Analysis, 48, p. 904. Copyright 2015 by 

the Society for the Experimental Analysis of Behavior, Inc. 

Reprinted by permission. 
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therapists extended the intervention to Jonah’s home and 
school (data available from the second author). 

Thereafter, apart from brief periods to allow for 
cleaning, Jonah reportedly wore the bracelet continuously 
for the next 2 years. (p. 905) 


This study illustrates very well the changing criterion 
design’s flexibility: the 21 criterion changes of magnitudes ranged 
from 5 seconds to 4 hours. Although one or more reversals to a 
previously attained criterion level may have provided a more con- 
vincing demonstration of experimental control (a limitation noted 
by the authors), the practical and ethical considerations of doing 
so would be questionable. As always, the applied behavior analyst 
must balance experimental concerns with the need to improve 
behavior in the most effective, efficient, ethical manner. 

McDougall (2005) proposed a range-bound changing 
criterion design in which each intervention sub-phase includes 
both a lower and an upper criterion within which the participant 
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is expected to perform. McDougall suggested that behavioral 
“[iJmprovements as well as demonstrations of experimental 
control, are demonstrated definitively when the target behav- 
ior resides within, or conforms to, the pre-specified range” 
(p. 130). The author reported a study illustrating the design in 
which an overweight adult applied goal setting and self-graph- 
ing to increase minutes running each day (see Figure 9.12). 
After a 19-week baseline in which the participant ran on just 
3 days, he established an initial aim of running an average of 
20 minutes a day for 6 of 7 days of the week. He also created 
upper and lower ranges of + 10% of 20 minutes to serve as 
dual criteria for the minimum (18) and maximum (22) number 
of minutes he should run on any given day. When this aim was 
met, the participant increased his aim to 40 + 4 minutes per 
day, then to 60 + 6 minutes per day, and so on to a maximum 
of 100 + 20 minutes per day, with one sub-phase reversing 
to lower criteria. 

Compared to other single-case designs used in applied 
behavior analysis, relatively few studies using pure changing 
criterion designs have been reported (e.g., Allen & Evans, 2001; 
DeLuca & Holborn, 1992 [see Figure 13.2]; Foxx & Rubinoff, 
1979; Kurti & Dallery, 2013; Wilson & Gratz, 2016). Some 
researchers have employed a changing criterion tactic as an 
analytic element within a larger experimental design (e.g., Mar- 
tella, Leonard, Marchand-Martella, & Agran, 1993; Schleien, 
Wehman, & Kiernan, 1981). 


Guidelines for Using the Changing 
Criterion Design 


Proper implementation of the changing criterion design 
requires the careful manipulation of three design factors: 
length of phases, magnitude of criterion changes, and number 
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of criterion changes. A survey of behavioral research litera- 
ture published between 1971 and 2013 yielded 106 articles in 
57 different journals reporting studies incorporating a chang- 
ing criterion design (Klein, Houlihan, Vincent, & Panahon, 
2017). The authors reported that just 25% of the graphs they 
reviewed met all necessary requirements for adequate exper- 
imental control. “Specifically, most studies lacked variation 
in phase magnitude (and did not compensate with a rever- 
sal), lacked variation in phase length, or implemented two or 
fewer changes in criterion; while some restricted responding 
so that criterions were forcibly met and or could not exceed 
the criterion” (p. 60). 


Length of Phases 


Because each phase in the changing criterion design serves 
as a baseline for comparing changes in responding mea- 
sured in the next phase, each phase must be long enough to 
achieve stable responding. “Each treatment phase must be 
long enough to allow the rate of the target behavior to resta- 
bilize at a new and changed rate; it is stability after change 
has been achieved, and before introduction of the next change 
in criterion, that is crucial to producing a convincing demon- 
stration of control” (Hartmann & Hall, 1976, p. 531). Target 
behaviors that are slower to change therefore require longer 
phases. 

The length of phases in a changing criterion design 
should vary considerably to increase the design’s validity. 
For experimental control to be evident in a changing crite- 
rion design, the target behavior not only must change to the 
level required by each new criterion in a predictable (prefer- 
ably immediate) fashion, but also must conform to the new 
criterion for as long as it is in effect. When the target behavior 
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Figure 9.12 Example of a range-bound changing criterion design. 
From “The Range-bound Changing Criterion Design,” by D. McDougall, 2005, Behavioral Interventions, 20, p. 132. 
Copyright 2005 by John Wiley & Sons. Reprinted by permission. 
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closely follows successively more demanding criteria that are 
held in place for varied periods, the likelihood is reduced that 
the observed changes in behavior are a function of factors 
other than the independent variable (e.g., maturation, prac- 
tice effects). In most situations, the investigator should not 
set a predetermined number of sessions for which each crite- 
rion level will remain in effect. It is best to let the data guide 
ongoing decisions, whether to extend the length of a current 
criterion phase or introduce a new criterion. 


Magnitude of Criterion Changes 


Varying the size of the criterion changes enables a more 
convincing demonstration of experimental control. When 
changes in the target behavior occur not only at the time a 
new criterion is implemented but also to the level specified 
by the new criterion, the probability of a functional relation 
is strengthened. In general, a target behavior’s immediate 
change to meet a large criterion change is more impressive 
than a behavior change in response to a small criterion change. 
However, two problems arise if criterion changes are too 
large. First, setting aside practical considerations, and speak- 
ing from an experimental design standpoint only, large crite- 
rion changes may not permit inclusion of a sufficient number 
of changes in the design (the third design factor) because the 
terminal level of performance is reached sooner. The second 
problem is from an applied view: Criterion changes cannot 
be so large that they conflict with good instructional practice. 
Criterion changes must be large enough to be detectable, but 
not so large as to be unachievable. Therefore, the variability 
of the data in each phase must be considered in determining 
the size of criterion changes. Smaller criterion changes can 
be employed with very stable levels of responding, whereas 
larger criterion changes are required to demonstrate behavior 
change in the presence of variability. 

When using a changing criterion design, behavior 
analysts must guard against imposing artificial ceilings (or 
floors) on the levels of responding that are possible in each 
phase. An obvious mistake of this sort would be to give a 
student only five math problems to complete when the cri- 
terion for reinforcement is five. Although the student could 
complete fewer than five problems, the possibility of exceed- 
ing the criterion has been eliminated, resulting perhaps in an 
impressive-looking graph, but one that is badly affected by 
poor experimental procedure. 


Number of Criterion Changes 


As a general rule, the more times the target behavior changes 
to meet new criteria, the more convincing the demonstration of 
experimental control. For example, eight criterion changes, one 
of which was a reversal to a previous level, were implemented 
in the changing design illustrated in Figure 9.10, and Cook et al. 
(2015) introduced 22 criterion changes in their study shown 
in Figure 9.11. Grey, Healy, Leader, and Hayes (2009) imple- 
mented 28 criterion changes in a study lasting 122 sessions. 


In both of these cases, a sufficient number of criterion changes 
occurred to demonstrate experimental control. The experimenter 
cannot, however, simply add any desired number of criterion 
changes to the design. Kratochwill and colleagues (2017) rec- 
ommended a minimum of three criterion changes. The number 
of criterion changes that are possible within a changing criterion 
design is interrelated with the length of phases and the magni- 
tude of criterion changes. Longer phases mean that the time 
necessary to complete the analysis increases; with a limited time 
to complete the study, the greater the number of phases, the 
shorter each phase can be. 


Considering the Appropriateness 
of the Changing Criterion Design 


The changing criterion design is a useful addition to the behavior 
analyst’s set of tactics for evaluating systematic behavior 
change. Like the multiple baseline design, the changing crite- 
rion design does not require that behavioral improvements be 
reversed. However, partial reversals to earlier levels of perfor- 
mance enhance the design’s capability to demonstrate experi- 
mental control. Unlike the multiple baseline design, only one 
target behavior is required. 

Several characteristics of the changing criterion design 
limit its effective range of applications. The design can be used 
only with target behaviors aiready in the subject’s repertoire and 
that lend themselves to stepwise modification. However, this is 
not as severe a limitation as it might seem. For example, students 
perform many academic skills to some degree, but not at useful 
rates. Many of these skills (e.g., solving math problems, reading) 
are appropriate for analysis with a changing criterion design. 
Allowing students to progress as efficiently as possible while 
meeting the design requirements of changing criterion analysis 
can be especially difficult. As Tawney and Gast (1984) noted, 
“the challenge of identifying criterion levels that will permit the 
demonstration of experimental control without impeding opti- 
mal learning rates” (p. 298) is problematic with all changing 
criterion designs. 

Although the changing criterion design is sometimes sug- 
gested as an experimental tactic for analyzing the effects of shap- 
ing programs, it is not appropriate for this purpose. In shaping, 
a new behavior that initially is not in the person’s repertoire is 
developed by reinforcing responses that meet a gradually chang- 
ing criterion, called successive approximations, toward the termi- 
nal behavior (see Chapter 22). However, the changing response 
criteria employed in shaping are topographical in nature, requir- 
ing different forms of behavior at each new level. The multiple 
probe design is more appropriate for analyzing a shaping pro- 
gram because each new response criterion (successive approxi- 
mation) represents a different response class whose frequency of 
occurrence is not wholly dependent on the frequency of behav- 
iors meeting other criteria in the shaping program. Conversely, 
the changing criterion design is best suited for evaluating the 
effects of instructional techniques on stepwise changes in the 
rate, accuracy, duration, or latency of a single target behavior. 


SUMMARY 


Multiple Baseline Design 


1. 


In a multiple baseline design, simultaneous baseline mea- 
surement is begun on two or more behaviors. After stable 
baseline responding has been achieved, the independent 
variable is applied to one of the behaviors while baseline 
conditions remain in effect for the other behavior(s). After 
maximum change has been noted in the first behavior, the 
independent variable is applied in sequential fashion to the 
other behaviors in the design. 


. Experimental control is demonstrated in a multiple base- 


line design by each behavior changing when, and only 
when, the independent variable is applied. 


. The multiple baseline design takes three basic forms: (a) a mul- 


tiple baseline across behaviors design consisting of two or 
more different behaviors of the same subject; (b) a multiple 
baseline across settings design consisting of the same behav- 
ior of the same subject in two or more different settings; and 
(c) a multiple baseline across subjects design consisting of 
the same behavior of two or more different participants. 


Variations of the Multiple Baseline Design 


4. 


The multiple probe design is effective for evaluating the 
effects of instruction on skill sequences in which it is 
highly unlikely that the participant’s performance on later 
steps in the sequence can improve without instruction 

or mastery of the earlier steps in the chain. The multiple 
probe design is also appropriate for situations in which 
prolonged baseline measurement may prove reactive, 
impractical, or too costly. 


. Ina multiple probe design, intermittent measures, or 


probes, are taken on all of the behaviors in the design at 
the outset of the experiment. Thereafter, probes are taken 
each time the subject masters one of the behaviors or skills 
in the sequence. Just prior to instruction on each behay- 
ior, a series of baseline measures is taken until stability is 
achieved. 


. The delayed multiple baseline design provides an analytic 


tactic in situations in which (a) a planned reversal design 
is no longer desirable or possible; (b) limited resources 
preclude a full-scale multiple baseline design; or (c) anew 
behavior, setting, or subject appropriate for an ongoing 
multiple baseline analysis becomes available. 


. Ina delayed multiple baseline design, baseline measure- 


ment of subsequent behaviors is begun sometime after 
baseline measurement began on earlier behaviors in the 
design. Only baselines begun while earlier behaviors in the 
design are still under baseline conditions can be used to 
verify predictions made for the earlier behaviors. 


. Limitations of the delayed multiple baseline design 


include the following: (a) The behavior analyst may have 
to wait too long to modify certain behaviors; (b) baseline 
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10. 


phases may contain too few data points, and (c) baselines 
begun after the independent variable has been applied to 
earlier behaviors in the design can mask the interdepen- 
dence (covariation) of behaviors. 


. The nonconcurrent multiple baseline across participants 


design consists of a series of A-B (baseline-intervention) 
sequences conducted across subjects at different points 
in time. 


The nonconcurrent multiple baseline across participants 
design is inherently weaker than other multiple base- 
line design variations. The nonconcurrent design entails 
two of three elements of baseline logic—prediction and 
replication—but the absence of concurrent baseline mea- 
sures does not enable verification of intervention effects. 


Assumptions and Guidelines for Using Multiple 
Baseline Designs 


11. 


12. 


13: 


14. 


15. 


16. 


Behaviors that are part of multiple baseline designs should 
be functionally independent (i.e., they do not covary) and 
share a reasonable likelihood that each will change when 
the independent variable is applied to it. 


Behaviors in a multiple baseline design should be mea- 
sured concurrently and have an equal opportunity of being 
influenced by the same set of relevant variables. 


In a multiple baseline design, the independent variable 
should not be applied to the next behavior until the pre- 
vious behavior has changed maximally and a sufficient 
period of time has elapsed to detect any effects on behav- 
iors still in baseline conditions. 


The length of the baseline phases for the different behav- 
iors making up a multiple baseline design should vary 
significantly. 


All other things being equal, the independent variable 
should be applied first to the behavior showing the most 
stable level or counter-therapeutic trend of baseline. 


Conducting a reversal phase in one or more tiers of a mul- 
tiple baseline design can strengthen the demonstration of a 
functional relation. 


Considering the Appropriateness 
of Multiple Baseline Designs 


17. 


Advantages of the multiple baseline design include the 

fact that (a) it does not require withdrawing a seemingly 
effective treatment; (b) sequential implementation of the 
independent variable parallels the practice of many teachers 
and clinicians whose task is to change multiple behaviors 
in different settings and/or subjects; (c) the concurrent mea- 
surement of multiple behaviors allows direct monitoring 

of generalization of behavior change; and (d) the design is 
relatively easy to conceptualize and implement. 
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18. Limitations of the multiple baseline design include 
the fact that (a) if two or more behaviors in the design 
covary, the multiple baseline design may not demon- 
strate a functional relation even though one exists; 
(b) because verification must be inferred from the lack 
of change in other behaviors, the multiple baseline 
design is inherently weaker than the reversal design in 
showing experimental control between the independent 
variable and a given behavior; (c) the multiple base- 
line design is more an evaluation of the independent 
variable’s general effectiveness than an analysis of the 
behaviors involved in the design; and (d) conducting a 
multiple baseline design experiment requires consider- 
able time and resources. 


Changing Criterion Design 


19. The changing criterion design can be used to evaluate 
the effects of a treatment on the gradual or stepwise 
improvement of a behavior already in the subject’s 
repertoire. 


20. After stable baseline responding has been achieved, the 
first treatment phase is begun, in which reinforcement is 
usually contingent on the subject performing at a specified 
level (criterion). The design entails a series of treatment 
phases, each requiring an improved level of performance 
over the previous phase. Experimental control is demon- 
strated when the subject’s behavior closely conforms to 
the changing criteria. 


21. In a range-bound changing criterion design, each 
intervention sub-phase includes both a lower and an 
upper criterion within which the subject is expected 
to perform. 


22. Three features combine to determine the potential for a 
changing criterion design to demonstrate experimental 
control: (a) the length of phases, (b) the magnitude of cri- 
terion changes, and (c) the number of criterion changes. 
The believability of the changing criterion design is 
enhanced if a previous criterion is reinstated and the sub- 
ject’s behavior reverses to the level previously observed 
under that criterion. 


Considering the Appropriateness 
of the Changing Criterion Design 


23. Advantages of the changing criterion design are (a) it does 
not require a withdrawal or reversal of a seemingly effec- 
tive treatment; and (b) it enables an experimental analysis 
within the context of a gradually improving behavior, thus 
complementing the goals of practitioners. 


24. Limitations of the changing criterion design are (a) the 
target behavior must already be in the subject’s repertoire, 
and (b) incorporating the necessary features of the design 
may impede optimal learning rates. 


changing criterion design multiple baseline across settings design 


delayed multiple baseline design multiple baseline across subjects 


multiple baseline across behaviors design 


design multiple baseline design 
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multiple probe design 
nonconcurrent multiple baseline 
across participants design 


range-bound changing criterion design 


MULTIPLE-CHOICE QUESTIONS 


1. Simultaneous measurement of two or more behaviors of 
a single participant describes which type of experimental 
design? 


a. Multiple baseline across behaviors 
b. Multiple baseline across settings 

c. Multiple baseline across subjects 

d. Multiple baseline across probes 
Hint: (See “Multiple Baseline Design”’) 


2. In a multiple baseline design, the functional relationship 
between the independent variable and the change in the 
subject’s behavior is determined by: 


a. Systematic introduction and withdrawal across phases 
b. Observation of unchanged behaviors in other tiers 

¢c. Observation of behavior change in the treated tier 

d. Replication of behavior change with each new criterion 
Hint: (See “Multiple Baseline Design’’) 


3. Which experimental design is most appropriate for evalu- 
ating the effect of video modeling on the acquisition of the 


skills for preparing a meal by adults with developmental 
disabilities? 


a. Multiple baseline across behaviors 

b. Changing criterion 

c. Multiple probe 

d. Delayed multiple baseline 

Hint: (See “Variations of the Multiple Baseline Design’) 
. All of the following are appropriate situations to use a 
delayed multiple baseline except: 

a. A planned reversal design is no longer possible 


b. The researcher has limited resources to conduct data 
collection 


c. A reversal design is too challenging to implement 


d. Additional subjects become available after the start of 
the study 


Hint: (See “Variations of the Multiple Baseline Design”) 
. All of the following are advantages of the multiple 
baseline design except: 

a. Easy to conceptualize 

b. Lack of treatment withdrawal 

c. Mimics common practice of practitioners 

d. Takes less time to implement 

Hint: (See “Assumptions and Guidelines for Using 
Multiple Baseline Designs’) 

. All of the following are limitations of the multiple 
baseline design except: 


a. It may not reveal the functional relationship, even if 
one exists 


b. Verification of predicted behavior change must be 
inferred from other behaviors. 


c. The behavior under study must be within the subject’s 
repertoire already 


d. It is weaker than the reversal design 


Hint: (See “Assumptions and Guidelines for Using 
Multiple Baseline Designs”) 


Ie 


10. 
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When an intervention that is known to be effective is 
withheld from a subject for achieving a multiple base- 
line design, this presents a(n) 
researchers. 


challenge for 


a. Practical 

b. Ethical 

c. Experimental 
d. Logical 


Hint: (See “Assumptions and Guidelines for Using 
Multiple Baseline Designs”) 


. One method of addressing the practical concern of costly 


measurement when using a multiple baseline design across 
settings is to: 


a. Include only one setting in the study 

b. Reduce the number of baseline data points 
c. Use intermittent probes during baseline 
d 


. Use a changing criterion design instead of the multiple 
baseline 


Hint: (See “Assumptions and Guidelines for Using 
Multiple Baseline Designs’) 


. When a subject’s behavior changes each time a new 


criterion is introduced, this is an example of 

a. Prediction 

b. Function 

c. Replication 

d. Systematic manipulation 

Hint: (See “Changing Criterion Design’) 

Which of the following is not a design factor to be 
considered when using a changing criterion design? 
a. Length of phase 

b. Magnitude of behavior change 

c. Number of criterion changes 

d. Verification of behavior change 


Hint: (See “Guidelines for Using the Changing Criterion 
Design’’) 
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ESSAY-TYPE QUESTIONS 
1. Explain how a researcher uses prediction and verification 4. Describe the five guidelines for using a multiple baseline 
of behavior change in a multiple baseline across behaviors design. 
design. Hint: (See “Assumptions and Guidelines for Using 


Hint: (See “Multiple Baseline Design”) Multiple Baseline Designs”) 


Describe a situation in which the most ethical experimental 


2. Identify the three types of multiple baseline designs and 5 
design would be the multiple probe design. 


give an example of when each is appropriate. 


Hint: (See “Multiple Baseline Design”) Hint: (See “Variations of the Multiple Baseline Design”) 


3. Briefly describe how prediction and replication are 
demonstrated in the changing criterion design. 


Hint: (See “Operation and Logic of the Changing Criterion 
Design’) 
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NOTES 


it 


i 


Although most of the graphic displays created or selected for this text as 
examples of experimental design tactics show data plotted on noncumulative 
vertical axes, the reader is reminded that repeated measurement data 
collected within any type of experimental design can be plotted with a variety 
of graphic displays. For example, Lalli, Zanolli, and Wohn (1994) and 
Mueller, Moore, Doggett, and Tingstrom (2000) used cumulative graphs 
to display the data they collected in multiple baseline design experiments; 
and Raiff, Jarvis, and Dallery (2016) presented the results of a multiple 
baseline design experiment with a graphic display consisting of a 
horizontal sequence of boxes in different shades to illustrate participants’ 
performances during baseline and treatment conditions. Students of applied 
behavior analysis should be careful not to confuse the different techniques 
for graphically displaying data with tactics for experimental analysis. 
Functional analysis and automatic reinforcement are described in Chapters 
11 and 27, respectively. 

Time delay, a procedure for transferring stimulus control from a response 
prompt (e.g., teacher says “apple” while pointing to an apple) to a natural 
stimulus (e.g., an apple), is described in Chapter 17. 
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4. In most of these studies, the experimental designs were described as 


multiple probe designs. However, the graphic displays show that baselines 
were added in a time-delayed fashion. 


. Hypothesis, as we are using the term here, should not be confused with 


formal hypothesis testing models that use inferential statistics to confirm 
or reject a hypothesis deduced from a theory. As Johnston and Penny- 
packer (2009) pointed out, “Researchers do not need to state hypotheses if 
they are asking a question about nature. When the experimental question 
simply asks about the relation between independent and dependent 
variables, there is no scientific reason to make a prediction about what 
will be learned from the data. Whatever is learned will describe something 
about that relationship that was presumably not known before. Whether 
it matches anyone’s expectations has nothing to do with its truth or 
importance (Sidman).” (p. 50) 


. Negative reinforcement and differential reinforcement of other behavior 


(DRO) are detailed in Chapters 12 and 25, respectively. 
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Planning and Evaluating Applied Behavior 


Analysis Research 


LEARNING OBJECTIVES 


e Explain the importance of the individual subject in behavioral research. 
¢ Explain the importance of flexibility when designing experiments. 
¢ Identify and design systems for evaluating internal validity. 


¢ Identify and design systems for evaluating socia! validity. 


e Identify and design systems for monitoring external validity. 
¢ Explain the importance of evaluating applied behavior analysis research. 


for selecting and defining target behaviors, detailed 

strategies for designing and implementing measurement 
systems, presented guidelines for displaying and interpreting 
behavioral data, and described experimental tactics for reveal- 
ing whether observed changes in behavior can be attributed 
to an intervention. This chapter supplements the information 
described thus far by examining additional questions and con- 
siderations that should be addressed when designing, replicating, 
and evaluating behavioral research. We begin by reviewing the 
central role of the individual subject in behavioral research, and 
follow with discussions of the value of flexibility in experimen- 
tal design, how researchers using single-subject and between- 
groups designs pursue internal and external validity, and the 
importance and means of assessing the applied value of treat- 
ments and the behavior changes they bring about. The chapter 
concludes with suggestions for evaluating the believability and 
usefulness of research in applied behavior analysis. 


Pers chapters outlined considerations and procedures 


IMPORTANCE OF THE INDIVIDUAL SUBJECT 
IN BEHAVIOR ANALYSIS RESEARCH 


The research methods of any science must match the defining char- 
acteristics of that science’s subject matter. Behavior analysis—a 
science devoted to discovering and understanding the con- 
trolling variables of behavior—defines its subject matter as the 
activity of living organisms, a dynamic phenomenon that occurs 
at the level of the individual organism. It follows that behavior 
analysis research methods feature direct and repeated measures 
of the behavior of individual organisms (the only place, by defi- 
nition, where behavior takes place). This focus on the behavior 
of individual subjects has enabled applied behavior analysts to 
discover and refine effective assessment tools and interventions 
for a wide range of socially significant behavior. 

To further explain the importance this focus on the indi- 
vidual subject or client holds for applied behavior analysis, we 


contrast it with a research model that compares data represent- 
ing the aggregate measures of different groups of subjects. This 
between-groups approach to experimental design has predomi- 
nated “behavioral research” in psychology, education, and other 
social sciences for decades. 


Brief Outline of a Between-groups Experiment 
The basic format of a between-groups experiment is as follows. 


1. A pool of subjects (e.g., 60 first-grade nonreaders) is 
selected randomly from the population (e.g., all first-grade 
nonreaders in a school district) relevant to the research 
question (e.g., Will the XYZ intensive phonics program 
improve first-grade nonreaders’ ability to decode unpre- 
dictable text?). 


2. The subjects are divided randomly into two groups: the 
experimental group and the control group. 


3. An initial measure (pretest) of the dependent variable 
(e.g., score on a test of decoding skills) is obtained for all 
subjects in the study, the individual pretest scores for the 
subjects in each group are combined, and the mean and 
standard deviation are calculated for each group’s perfor- 
mance on the pretest. 


4. Subjects in the experimental group are then exposed to the 
independent variable (e.g., 6 weeks of the XYZ program), 
which is not provided to subjects in the control group. 


5. After the treatment program has been completed, a 
posttest measure of the dependent variable is obtained for 
all subjects, and the mean and standard deviation posttest 
scores for each group are computed. ! 

6. The researcher compares any changes in each group’s 
scores from pretest to posttest, applying various sta- 
tistical tests to the data that enable inferences regard- 
ing the likelihood that any differences between the two 
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groups’ performances can be attributed to the indepen- 
dent variable. For example, assuming the mean pretest 
scores for the experimental and control groups were 
similar, and the posttest revealed an improved score for 
the experimental group but not for the control group, 
statistical analyses would indicate the mathematical 
probability that the difference was due to chance. When 
a Statistical test limits the role of chance to a predeter- 
mined acceptable degree—usually 5% (1 chance in 20) 
or 1% (1 chance in 100)—the researcher concludes that 
the independent variable was responsible for changes to 
the dependent variable (e.g., the experimental group’s 
improvement from pretest to posttest).” 


Researchers who compare measures of groups of sub- 
jects in this way do so for two primary reasons, each of 
which was introduced in Chapter 7. First, the assumption 
that averaging the measures of many subjects’ performance 
controls for intersubject variability enables the belief that any 
changes in performance are brought about by the independent 
variable. A second rationale for large groups of subjects is the 
notion that increasing the number of subjects increases the 
study’s external validity. That is, a treatment variable found 
effective with the subjects in the experimental group will also 
be effective with other subjects in the population from which 
the sample subjects were selected. 

In the next section we comment on the first reason for the 
use of groups of subjects—that doing so controls for intersubject 
variability. We identify four fundamental concerns with typical 
between-groups designs that bear heavily on experimental rea- 
soning. The assumption of increased generality of findings is 
discussed later in this chapter in the section on replication.° 


Group Data May Not Represent the Performance 
of Individual Subjects 


Knowing that the average performance of a group of subjects 
changed may reveal nothing about the performance of individual 
subjects. The average performance of subjects in the experimen- 
tal group may improve, while the performance of some subjects 
in the group stayed the same, and the performance of others 
deteriorated. It is even possible for the majority of subjects to 
show no improvement, for some subjects to get worse, and for a 
few subjects to improve sufficiently to yield an overall average 
improvement. 


Response Measure 


Figure 10.1 Hypothetical data showing 
that the mean performance of a group of 
subjects may not represent the behavior of 


@ Subject A 
O Subject B 
X Group Mean 


Between-groups studies can show that a treatment is gen- 
erally effective, that no treatment works with everyone, that 
people respond differently to the same treatment, and so on. 
However, the fact that the average performance of a group of 
subjects improved when they were exposed to a treatment is 
insufficient reason to adopt it. The factors responsible for one 
subject’s improvement and another’s lack of improvement must 
be discovered. To be most useful, a treatment must be under- 
stood at the level at which people contact it and are affected by 
it: the individual level. 

The two graphs in Figure 10.1 illustrate some of the faulty 
conclusions possible when interpreting a study on the basis of 
group mean scores. Each graph presents hypothetical data for 
a two-subject group. The group data showing no change from 
pretest to posttest in both graphs suggest that the independent 
variable had no effect on the subjects’ behavior. However, the 
left-hand graph in Figure 10.1 shows that Subject A’s perfor- 
mance improved from pretest to posttest while Subject B’s 
behavior deteriorated over the same period.* The right-hand 
graph shows identical pre- and posttest measures for Subjects 
C and D, while repeated measures between the pretest and 
posttest reveals significant variability within and between the 
two subjects. 


Group Data Mask Variability 


A second problem associated with the mean performance of a 
group of subjects is that it hides variability in the data. Even 
if repeated measures of Subject C’s and Subject D’s behavior 
between the pre- and posttest had been conducted as shown in 
Figure 10.1, a researcher who relied on the group’s mean per- 
formance as the primary indicator of behavior change would 
be ignorant of the variability that occurred within and between 
subjects. 

When repeated measurement reveals significant levels of 
variability, an experimental search with the goal of identifying 
and controlling the factors responsible for the variability is in 
order. The widespread belief that the effects of uncontrolled 
variables in a study can be controlled by statistical manipulation 
of the dependent variable is faulty. 


Statistical control is never a substitute for experimental 
control. ... The only way to determine whether or not 
uncontrolled variables are influencing the data is to inspect 
the data at the finest available level of decomposition, 


@ Subject C 
© Subject D 
X Group Mean 


Response Measure 


individual subjects. 


Pretest 


Posttest 
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usually point-by-point for each individual subject. No pur- 
pose 1s served by combining the data statistically to obscure 
such effects. (Johnston & Pennypacker, 1980, p. 371) 


Attempting to “cancel out” variability through statistical 
manipulation neither eliminates it nor controls the variables 
responsible for it. And the researcher who attributes the effects 
of unknown or uncontrolled variables to chance forgoes the 
opportunity to identify and analyze important variables. In his 
monumental work Tactics of Scientific Research, Sidman (1960) 
dealt repeatedly and forcefully with this critical issue. 


To some experimenters, chance is simply a name for the 
combined effects of uncontrolled variables. If such vari- 
ables are, in fact, controllable, then chance in this sense 

is simply an excuse for sloppy experimentation, and no 
further comment is required. If the uncontrolled variables 
are actually unknown, then chance is, as Boring (1941) 

has pointed out, a synonym for ignorance. . . . One of the 
most discouraging and at the same time challenging aspects 
of behavioral science is the sensitivity of behavior to a 
tremendous array of variables. . . . But variables are not 
canceled statistically. They are simply buried so that their 
effects cannot be seen. The rationale for statistical immo- 
bilization of unwanted variables is based on the assumed 
random nature of such variables. . . . Not only is the 
assumption of randomness with respect to the uncontrolled 
variables an untested one but it is also highly improbable. 
There are few, if any, random phenomena in the behavioral 
world. (pp. 45, 162—163) 


Sidman (1960) also commented on an experimenter’s use 
of statistics in an attempt to deal with troublesome sequence 
effects. 


He has a neat trick up his sleeve. By averaging together the 
data for both subjects under Condition A, and again under 
Condition B, he “cancels out” the order effect, and com- 
pletely bypasses the problem of irreversibility. By a simple 
arithmetical operation, two subjects have become one, and 
a variable has been eliminated. 

It has not, in fact, gone anywhere. Numbers may be 
made to disappear by adding and subtracting them from 
each other. Five apples minus three apples are two apples. 
The numbers are easily changed by a few strokes of the 
pen, but some eating has to be done before the apples 
themselves will vanish. (p. 250) 


The “eating” that must be done to control the effects of 
any variable can be accomplished in only two ways: (a) holding 
the variable constant throughout the experiment, or (b) isolating 
the suspected factor as an independent variable and manipulat- 
ing its presence, absence, and/or value during the experiment. 


Group Data Do Not Represent Real 
Behavioral Processes 


Skinner (1938) contended that researchers must demonstrate 
behavior—environment relations at the level of the individual 
organism or risk discovering synthetic phenomena that repre- 
sent mathematical, not behavioral, processes. Sidman (1960) 


described the challenges faced by an investigator interested in 
the relation between the number of times a behavior is reinforced 
during initial acquisition and responding during a subsequent 
extinction condition. A single-case approach would entail 
presenting an individual subject with a series of acquisition- 
extinction events, each acquisition condition consisting of a 
different number of reinforcements. The subject’s performance 
during each successive extinction phase would be confounded by 
the subject’s experience with all previous reinforcement-then- 
extinction events. Behavior during extinction would decrease to 
zero more rapidly as a function of that cumulative experience, 
rather than the number of reinforcements obtained in the acqui- 
sition phase immediately prior to each extinction test. 

A between-groups design solution to the problem that no 
individual subject can be exposed to differing amounts of rein- 
forcement during an initial acquisition phase would be to assign 
multiple subjects to different groups, expose subjects in each 
group to different values of the independent variable (e.g., 5, 10, 
and 20 reinforcements), and compare the mean rate of respond- 
ing by subjects in each group during a subsequent extinction 
condition. Such a design may reveal a relation between num- 
ber of reinforcements during initial acquisition and responding 
during extinction, but... 


The function does not represent a behavioral process. The 
use of separate groups destroys the continuity of cause 

and effect that characterizes an irreversible behavioral 
process. . . . If it proves impossible to obtain an uncontam- 
inated relation between the number of reinforcements and 
resistance to extinction in a single subject, because of the 
fact that successive extinctions interact with each other, 
then the “pure” relation simply does not exist. The solution 
is... to direct our research toward the study of behavior as 
it actually exists .. . [and] not be deceived into concluding 
that the group type of experiment in any way provides a 
more adequately controlled or more generalizable substitute 
for individual data. (Sidman, 1960, p. 53) 


Acknowledging that studies comparing group data 
describe “some kind of order in the universe” and may form the 
basis of a science, Sidman (1960) noted they: 


cannot, however, be a science of individual behavior 
except of the crudest sort. . . . It is a science of aver- 
aged behavior of individuals who are linked together 
only by the averaging process itself. Where this science 
fits in the scheme of natural phenomena is a matter 

of conjecture. My own feeling is that it belongs to 

the actuarial statistician, and not the investigator of 
behavior processes. (pp. 274-275) 


Between-groups Designs Lack Intrasubject 
Replication 


A fourth weakness of the between-groups research model is that 
the power of replicating effects within and across individual 
subjects is lost. One of the great strengths of single-case exper- 
imental designs is the convincing demonstration of functional 
relations made possible by replication of treatment effects within 
the designs. Although multiple subjects typically participate in 
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single-case studies and investigators often describe and display 
the data for all subjects as a group, data from individual sub- 
jects are the basis for interpreting experimental effects. Applied 
behavior analysts are wise to heed Johnston and Pennypacker’s 
(1980) admonition: “An effect that emerges only after individual 
data have been combined is probably artifactual and not repre- 
sentative of any real behavioral processes” (p. 257). 

This discussion should not be interpreted to mean 
that the overall performance of groups of subjects cannot, or 
should not, be studied with the research strategies and tactics 
of applied behavior analysis. Improving overall performance 
of a group is socially significant in many applied situations 
(Donaldson, Fisher, & Kahng, 2017; Friman, Finney, Rapoff, 
& Christophersen, 1985; Luyben, 2009). Brothers, Krantz, and 
McClannahan (1994) used a multiple baseline across three set- 
tings to evaluate an intervention that increased the number of 
pounds of office paper recycled by 25 school staff members. 
Cope, Allred, and Morsell (1991) employed a multiple treatment 
reversal design to reveal which of three experimental conditions 
most effectively reduced illegal parking by drivers of 558 vehi- 
cles in spaces designated for individuals with disabilities at a 
supermarket. 

Still, it is important to remember that group data may 
not represent the performance of individual participants, and 
vice versa. Lloyd, Eberhardt, and Drake (1996) compared the 
effects of group versus individual reinforcement contingencies 
within the context of collaborative group study conditions on 
quiz scores by students in a Spanish language class. The results 
showed that the group contingencies resulted in higher mean 
quiz scores for the class as a whole compared to the individ- 
ual contingencies condition. However, overall benefits at the 
class level were mitigated by differential results for individ- 
ual students. When group results do not represent individual 
performances, researchers should supplement group data 
with individual results, ideally in the form of graphic displays 
(e.g., Lloyd et al., 1996; Ryan & Hemmes, 2005). 

In some instances, the behavior analyst may not be able 
to control potential subjects’ access to the experimental setting 
and contingencies or even be able to identify who the subjects 
are. The dependent variable must then consist of responses 
emitted by individuals who enter the experimental setting. This 
approach is used frequently in community-based behavior ana- 
lysis research. For example, group data have been collected and 
analyzed on such dependent variables as recycling on a univer- 
sity campus (O’Conner, Lerman, & Fritz, 2010), car pooling by 
university students (Jacobs, Fairbanks, Poche, & Bailey, 1982), 
motorists yielding at pedestrian crosswalks (Crowley-Koch, Van 
Houten, & Lim, 2011), the use of child safety belts in shopping 
carts (Barker, Bailey, & Lee, 2004), and reducing graffiti on 
restroom walls (Watson, 1996). 


IMPORTANCE OF FLEXIBILITY 
IN EXPERIMENTAL DESIGN 


On one level, an effective experimental design is any sequence 
of independent variable manipulations that produces data that 
are interesting and convincing to the researcher and the audi- 
ence. In this context design is appropriate as a verb as well as a 


noun; the effective behavioral researcher must actively design 
each experiment so that each achieves its own unique design. 
The prototype designs presented in Chapters 8 and 9 are exam- 
ples of analytic tactics that afford a form of experimental rea- 
soning and control that has proven effective in advancing the 
scientific understanding of a wide range of phenomena of inter- 
est to applied behavior analysts. 

An investigator should not fall into the trap of believing that 
a certain type of research question, treatment, behavior, or set- 
ting indicates which analytic tactic(s) should be employed. While 
the experimental designs in published studies exploring similar 
research questions should be mined for methodological ideas, no 
ready-made experimental designs await selection, nor is there a 
set of rules that must be followed. Sidman (1960) was adamant 
in his warning regarding the undesirable effects of researchers’ 
believing in the existence of a given set of rules for experimental 
design. 


The examples may be accepted as constituting a set of 
rules that must be followed in the design of experiments. 

I cannot emphasize too strongly that this would be disas- 
trous. I could make the trite statement that every rule has 

its exception, but this is not strong enough. Nor is the more 
relaxed statement that the rules of experimental design are 
flexible, to be employed only where appropriate. The fact is 
that there are no rules of experimental design. (p. 214) 


We agree with Sidman. The student of applied behavior 
analysis should not be led to believe that any of the analytic 
tactics described in Chapters 8 and 9 constitute experimental 
designs per se.> Still, we believe that it is useful to present the 
most commonly used analytic tactics in design form for two 
reasons. First, the vast majority of studies that have advanced the 
knowledge base of applied behavior analysis have used exper- 
imental designs featuring one or more of these analytic tactics. 
Second, we believe examining specific examples of isolated 
experimental tactics and their application is an important step in 
learning the assumptions and strategic principles that guide the 
selection and arrangement of analytic tactics into experimental 
designs that effectively and convincingly address the research 
questions at hand. 


Experimental Designs Combining 
Analytic Tactics 


An experimental design that combines analytic tactics may allow 
a more convincing demonstration of experimental control than a 
design using a single analytic tactic. For example, withdrawing 
the treatment variable (a return to baseline) and then reapplying 
it within one or more tiers in a multiple baseline design may 
enable the researcher to determine the existence of a functional 
relation between the independent variable and each behavior, 
setting, or subject of the multiple baseline, in addition to ana- 
lyzing the effectiveness of the independent variable across the 
tiers (e.g., Barker et al., 2004; Donaldson et al., 2017; Heward & 
Eachus, 1979; Miller & Kelley, 1994 [see Figure 28.14]; Zhou, 
Goff, & Iwata, 2000). 

Whereas a design with a single analytic tactic may func- 
tion like a belt that effectively holds up a pair of jeans, a design 
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that combines analytic tactics is akin to adding a pair of sus- 
penders that more firmly answers the research or adds a bit 
of stylistic flair. However, beginning researchers should not 
assume that designs with multiple tactics are inherently better 
than single-tactic designs. A design featuring multiple analytic 
tactics may be weaker than a single-tactic design, if the addi- 
tional analytic tactics overshadow the powerful answer provided 
by data with a simple, yet elegant single-tactic design. 
Experimental designs combining multiple baseline, rever- 
sal, and/or multielement tactics can provide the basis for com- 
paring the effects of two or more independent variables. Haring 
and Kennedy (1990) used multiple baseline across settings and 
reversal tactics in an experiment comparing the effectiveness of 
time-out and differential reinforcement of other behavior (DRO) 
on the occurrence of problem behaviors by two secondary stu- 
dents with severe disabilities (see Figure 10.2).° Sandra and 
Raff each frequently engaged in repetitive, stereotypic problem 


behaviors (e.g., body rocking, loud vocalizations, hand flapping, 
spitting) that interfered with classroom and community activ- 
ities. In addition to assessing the effects of the time-out and 
DRO interventions against a no-treatment baseline condition, 
the design also enabled the researchers to conduct two compar- 
isons of the relative effects of each treatment during an instruc- 
tional task and leisure context. The design enabled Haring and 
Kennedy to discover that the time-out and DRO interventions 
produced different outcomes depending on the activity con- 
text in which they were applied. For both students, DRO was 
more effective than time-out in suppressing problem behavior 
in the task context; the opposite results were obtained in the 
leisure context, where time-out suppressed problem behavior 
and DRO proved ineffective. 

Experimenters have also incorporated alternating treat- 
ments into experimental designs containing multiple baseline 
elements. For example, Ahearn, Kerwin, Eicher, Shantz, and 
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Swearingin (1996; see Figure 12.5) evaluated the relative 
effects of two treatments for food refusal in an alternating 
treatments design implemented in a multiple baseline across 
subjects format. Likewise, Mcgee Krantz, and McClannahan 
(1985) evaluated the effects of several procedures for teach- 
ing language to children with autism with an experimental 
design that incorporated alternating treatments within a mul- 
tiple baseline across behaviors component that was, in turn, 
nested within an overall multiple baseline across subjects 
format. Zanolli and Daggett (1998) investigated the effects 
of reinforcement rate on the spontaneous social initiations of 
socially withdrawn preschoolers with an experimental design 
consisting of multiple baseline, alternating treatments, and 
reversal tactics. 

Sisson and Barrett (1984) incorporated multiple probe, 
multielement, and a multiple baseline across behaviors tactics 
in a design comparing the effects of two language-training 
procedures (see Figure 10.3). The design enabled the inves- 
tigators to discover the superiority of the total communica- 
tion method for these two children, as well as the fact that 
direct application of the treatment was required for learning 
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to occur on specific sentences. Results for a third subject 
revealed a functional relation of the same form and direction 
as that found for the two children whose results are shown 
in Figure 10.3, but one not so strongly in favor of the total 
communication procedure. 


Component Analyses 


When a behavioral intervention consists of multiple compo- 
nents, it is termed a treatment package. An effective toilet 
training program, for example, might entail replacing diapers 
with underwear or training pants, a schedule of sitting on the 
toilet for various durations, increased fluid intake, reinforce- 
ment for proper eliminations, and contingencies for accidents 
(Greer, Neibert, & Dozier, 2016 [see Figure 6.7]). Multicom- 
ponent interventions are common for treating complex feed- 
ing disorders. Treatment packages for food packing (holding 
or pocketing food in the oral cavity without swallowing) 
have included combinations of the following components: 
presenting highly preferred food on the spoon behind a bite 
of previously packed food; removing packed food and plac- 
ing it on the child’s tongue; a liquid chaser; inserting food 
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into the child’s mouth with a spoon, then flipping the spoon 
180° and dragging the bow] of the spoon toward the lips to 
deposit the food on the child’s tongue; removing packed food 
with a spoon; facilitating swallowing with a light touch on the 
tongue’s posterior; response cost for packing or expulsion; 
and various forms of positive reinforcement (e.g., preferred 
toy) and negative reinforcement (e.g., break from feeding) for 
swallows (Levin, Volkert, & Piazza, 2014; Sharp, Odom, & 
Jaquess, 2012). 

Practitioners and researchers create, implement, and 
evaluate treatment packages for many reasons. Here are sey- 
eral: A practitioner may believe two or more research-based 
interventions will be more effective than any of the component 
interventions in isolation. Interventions consisting of multiple 
components may be effective across a wider range of settings or 
participant characteristics. An intervention known to be mildly 
effective in isolation may, when implemented in conjunction 
with other research-based interventions, yield an additive, val- 
ue-enhancing effect. In an effort to change a behavior that has 
resisted numerous interventions, a practitioner might combine 
some or all of the failed interventions, perhaps adding new ones, 
into a treatment package. 

When desired behavior change coincides with or follows 
implementation of a treatment package, an analysis of whether 
or not a functional relation exists between the package and 
behavioral improvement is in order. Other important questions 
can then be pursued: Must every component of a treatment pack- 
age be implemented to achieve clinically meaningful behavior 
change? If not, which components are necessary? Is one com- 
ponent, or a subset of components, sufficient to bring about 
behavior change? How does the presence or absence of each 
component impact overall effectiveness? 

A component analysis is any experiment designed to 
identify the active elements of a treatment package, the relative 
contributions of different components in a treatment package, 
and/or the necessity and sufficiency of treatment components. 
The effectiveness of a treatment package requires the presence 
of all necessary components. A sufficient component, or subset of 
components, ensures treatment effectiveness in the absence of 
any other components. 

In their review of component analyses using single- 
subject experimental designs, Ward-Horner and Sturmey (2010) 
described the potential outcomes of component analyses using 
reversal designs of a treatment package consisting of two com- 
ponents (Y and Z). In their hypothetical examples shown in 
Figure 10.4, Component Y is praise and Component Z is a pri- 
mary reinforcer. 


Outcome 1 suggests that Y is necessary and sufficient, 
because Y is effective whether or not Z is present. 
Outcome 2 suggests an additive effect because the sum of 
the components produced behavior changes greater than 
either component alone. 

Outcome 3 illustrates a multiplicative effect because neither 
Y nor Z is effective in isolation, but the combination of these 
components produces substantial behavior change. 

Outcome 4 indicates components Y and Z are sufficient 
but neither component is necessary, whereas Outcomes 2 


and 3 indicate that neither Component Y nor Component 
Z is sufficient but both are necessary. 

Outcome 5 is challenging to interpretations: (a) the possi- 
bility that presenting the components together establishes 
relations between or among themselves that influence 
subsequent effects when presented alone; (b) perhaps Y is 
effective only because of its prior presentation with Z; 

or (c) Y may be effective only when preceded by Z, but 

Y may be ineffective when it precedes Z. (Adapted from 
Ward-Horner & Sturmey, 2010, p. 687) 


Ward-Horner and Sturmey (2010) described two methods 
for conducting component analyses. In a drop-out component 
analysis, the investigator presents the treatment package and 
then systematically removes components. If the treatment’s 
effectiveness wanes when a component is removed, the 
researcher has identified a necessary component. 


The main advantage of this approach is that the target 
behavior will improve immediately, within the first or 
second experimental phase, and that the subsequent 
removal of components may provide information on 

the components necessary to maintain treatment goals 
(L. J. Cooper et al., 1995). One disadvantage is that the 
behavioral effects of combining components might mask 
the effectiveness of individual components, which makes 
it difficult to determine the necessity and sufficiency of 
components. (p. 687) 


A drop-out analysis by Wacker and colleagues (1990) 
revealed that both components of a training package consist- 
ing of functional communication training and consequences for 
inappropriate behavior were necessary for three individuals with 
severe disabilities and behavior problems. 

An add-in component analysis assesses components 
individually or in combination before the complete treat- 
ment package is presented. The add-in method can identify 
sufficient components. Ward-Horner and Sturmey (2010) 
noted that sequence and floor or ceiling effects may mask 
the effects of components added in toward the end of the 
analysis. 

After data from a reversal design demonstrated the effec- 
tiveness of a treatment package of noncontingent reinforcement, 
response cost, and response blocking for wearing prescription 
glasses by three individuals with intellectual disabilities, DeLeon 
and colleagues (2008) used add-in or drop-out and component 
analyses to determine which components were sufficient and/or 
necessary for each participant. 

Our intent in describing several experiments that com- 
bined analytic tactics or analyzed the relative effects of treatment 
components is not to offer any of these studies as model designs. 
They are presented instead as illustrations of the infinite number 
of experimental designs that are possible by arranging different 
combinations and sequences of independent variable manipu- 
lations. In every instance the most effective (i.e., convincing) 
experimental designs are those that use an ongoing evaluation 
of data from individual subjects as the basis for employing the 
three elements of baseline logic—prediction, verification, and 
replication. 
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Figure 10.4 Potential outcomes of component analyses using reversal designs for a two- 


component treatment package. 


From “Component Analyses Using Single-Subject Experimental Designs: A Review,” by J. Ward-Horner 
and P. Sturmey, 2010, Journal of Applied Behavior Analysis, 43, p. 690. Copyright 2010 by the Society for 
the Experimental Analysis of Behavior, Inc. Reprinted by permission. 


INTERNAL VALIDITY: CONTROLLING 
POTENTIAL SOURCES OF CONFOUNDING IN 
EXPERIMENTAL DESIGN 


An experiment is interesting and convincing, and yields the most 
useful information for application, when it provides an unam- 
biguous demonstration that the independent variable was solely 
responsible for the observed behavior change. Experiments that 


demonstrate a clear functional relation have a high degree of 
internal validity. An experimental design’s strength is deter- 
mined by the extent to which it (a) demonstrates a reliable 
effect (i.e., repeated manipulation of the independent variable 
produces a consistent pattern of behavior) and (b) eliminates or 
reduces the likelihood that factors other than the independent 
variable produced the behavior change (i.e., controls for con- 
founding variables). 
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Implicit in the term experimental control, which is often 
used to signify a researcher’s ability to reliably produce a 
specified behavior change by manipulating an independent 
variable, is the idea that the researcher controls the sub- 
jJect’s behavior. However, “control of behavior” is inaccurate 
because the experimenter can control only some aspect of the 
subject’s environment. Therefore, the level of experimental 
control obtained by a researcher refers to the extent to which 
she controls all relevant variables in a given experiment. The 
researcher exerts this control within the context of an experi- 
mental design that, even though carefully planned at the outset, 
takes its ultimate form from the researcher’s ongoing examina- 
tion and response to the data. 

As noted in Chapter 7, an effective experimental 
design simultaneously reveals a reliable functional relation 
between independent and dependent variables (if one exists) 
and minimizes the likelihood that the observed behavior 
changes are the result of unknown or uncontrolled variables. 
An experiment has high internal validity when changes in the 
dependent variable are demonstrated to be a function only of 
the independent variable. When planning an experiment and 
later when examining the data from an ongoing study, the 
investigator must always be alert to threats to internal valid- 
ity. Uncontrolled factors known or suspected to have exerted 
influence on the dependent variable are called confounding 
variables (see Chapter 7). Much of a researcher’s efforts 
during the course of a study are aimed at identifying and con- 
trolling confounding variables. 

The attainment of steady state responding is the primary 
means by which applied behavior analysts assess the degree of 
experimental control. Separating the effects of the independent 
variable from the effects of a potentially confounding variable 
requires clear, empirical evidence that the potentially confounding 
variable is no longer present, has been held constant across exper- 
imental conditions, or has been isolated for manipulation as an 
independent variable. Any experiment can be affected by a nearly 
unlimited number of potential confounds; and as with every aspect 
of experimental design, there are no set rules for identifying and 
controlling confounding variables to which researchers can turn. 
However, common and likely sources of confounding can be iden- 
tified as well as tactics that can be considered to control them. 
Confounding variables can be viewed as related primarily to one 
of four elements of an experiment: subject, setting, measurement 
of the dependent variable, and independent variable. 


Subject Confounds 


A variety of subject variables can confound the results of 
a study. Maturation, which refers to changes that take place 
in a subject over the course of an experiment, is a potential 
confounding variable. For example, a subject’s improved per- 
formance during the later phases of a study may result from 
physical growth or the acquisition of academic, social, or other 
behaviors and may be unrelated to manipulations of the inde- 
pendent variable. Experimental designs that incorporate rapidly 
changing conditions or multiple introductions and withdrawals 
of the independent variable over time usually control for matu- 
ration effectively. 


In most applied behavior analysis research, a subject is 
in the experimental setting and contacting the contingencies 
implemented by the investigator for only a portion of the day. 
As in any study, the assumption is made that a subject’s behavior 
during each session will be primarily a function of the exper- 
imental conditions in effect. In reality, however, a subject’s 
behavior may also be influenced by events that occur outside the 
experiment. For example, suppose that the rate of contributions 
to a class discussion is the dependent variable. Now suppose that 
a student who has been contributing to discussions at a high rate 
gets into a fight in the lunchroom just prior to a session and then 
emits substantially fewer contributions compared to his level of 
responding in previous sessions. This change in the student’s 
behavior may, or may not, be a result of the lunchroom fight. If 
the lunchroom fight coincided with a change in the independent 
variable, it would be especially difficult to detect or separate 
any effects of the experimental conditions from those of the 
extra-experimental event. 

Although the researcher may be aware of some events 
that are likely causes of variability during a study, many other 
potential confounds go undetected. Repeated measurement is 
both the control for and the means to detect the presence and 
effects of such variables. The uncontrolled variables responsi- 
ble for a subject’s having a “bad day” or an unusually “good 
day” are particularly troublesome in research designs with 
few and/or widely spaced measurements of the dependent 
variable. This is one of the major weaknesses of using data 
from pretest-posttest comparisons to evaluate the effects of a 
treatment program. 

Because between-groups experiments are predicated on 
subjects’ similarity in relevant characteristics (e.g., gender, age, 
ethnicity, cultural and linguistic background, current skills and 
background knowledge), they are vulnerable to confounding by 
differences among subjects. Concern that the characteristics of 
one or more subjects may confound an experiment’s results is 
generally not an issue in the single-case experiments. First, a 
person should participate in a study because she will benefit if 
the target behavior is changed successfully. Second, a subject’s 
idiosyncratic characteristics cannot confound a study using a 
true intrasubject experimental design. With the exception of the 
multiple baseline across subjects analysis design, each partici- 
pant in a single-case study serves as her own control, a strategy 
that guarantees identically matched subjects in all experimental 
conditions because those subjects are the same person. Third, 
the external validity of the findings of a single-subject analysis 
is not dependent on the extent to which the subject(s) shares cer- 
tain characteristics with others. The extent to which a functional 
relation applies to other subjects is established by replicating the 
experiment with different subjects (replication is discussed later 
in this chapter). 


Setting Confounds 


Most applied behavior analysis studies are conducted in natural 
settings where a host of variables are beyond the investiga- 
tors’ control. Studies in natural settings are more prone to con- 
founding by uncontrolled events than are studies conducted 
in laboratories where extraneous variables can be eliminated 
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or held consistent. Even so, the applied experimenter is not 
without resources to mitigate the detrimental effects of setting 
confounds. For instance, when the applied researcher observes 
that an uncontrolled event has coincided with changes in the data, 
he should hold all possible aspects of the experiment constant 
until repeated measurement again reveals stable responding. If 
the unplanned event appears to have had a robust effect on the 
target behavior, or is otherwise of interest to the investigator, 
and is amenable to experimental manipulation, the investigator 
should treat the unplanned event as an independent variable and 
explore its possible effects experimentally. 

Applied researchers concerned about setting confounds 
must also be on the lookout for “bootleg” reinforcement within 
and outside the experimental situation. A good example of how 
a setting confound operates occurs when, unbeknownst to the 
experimenter, subjects have access to the same items or events 
to be used as putative reinforcers in the study. In such a case, 
the potential effectiveness of those consequences as reinforcers 
may be diminished considerably. 


Measurement Confounds 


Chapters 4 and 5 discussed factors that should be considered 
in designing an accurate and nonreactive measurement system. 
Still, numerous sources of confounding may exist within a well- 
planned measurement system. For instance, data might be con- 
founded by observer drift, the influence of the experimenter’ s 
behavior on observers, and/or observer bias. Although admit- 
tedly difficult to accomplish in applied settings where observers 
often see the independent variable being implemented, keeping 
observers naive to the conditions and expected outcomes of an 
experiment reduces the potential of confounding by observer 
bias. On a related note, when observers score permanent 
products, the products should not contain identifying marks that 
indicate who produced each product and under what experimen- 
tal conditions it was produced. Having observers score papers 
from baseline and treatment conditions in randomized order 
reduces the likelihood that observer drift or bias will confound 
the data within one treatment condition phase. (This procedure 
is more suitable to controlling for drift or bias by observers con- 
ducting postexperiment accuracy or interobserver agreement 
[IOA] assessments. ) 

Unless a completely unobtrusive measurement system is 
devised (e.g., a covert system using one-way mirrors, or obser- 
vations conducted at some distance from the subject), reactivity 
to the measurement procedure must always be considered as a 
possible confound. To offset this possible confound, the exper- 
imenter must maintain baseline conditions long enough for any 
reactive effects to run their course and stable responding to be 
obtained. If reactivity to measurement produces undesirable 
effects (e.g., aggressive behavior, cessation of productivity) and 
a more unobtrusive measurement procedure cannot be devised, 
intermittent probes should be considered. Practice, adaptation, 
and warm-up effects can also confound measures, especially 
during the initial stages of baseline. Again, the proper proce- 
dure is to continue baseline conditions until stable responding 
is obtained or variability is reduced to minimal levels. Inter- 
mittent probes should not be used for baseline measurement of 


behaviors for which practice effects would be expected. This is 
because, if the target behavior is susceptible to practice effects, 
those effects will occur during the intervention condition when 
more frequent measures are conducted, thereby confounding 
any effects of the independent variable. 


Independent Variable Confounds 


Most independent variables are multifaceted; that is, there is 
usually more to a treatment condition than the specific vari- 
able of interest to the investigator. For example, the effects 
of a token economy on students’ academic productivity may 
be confounded by variables such as the personal relationship 
between the students and the teacher who delivers the tokens, 
social interactions associated with delivering and exchanging 
tokens, the expectation of teacher and students that perfor- 
mance will improve when the token system is implemented, 
and so on. If the intent is to analyze the effects of token rein- 
forcement per se, these potentially confounding variables must 
be controlled. 

Schwarz and Hawkins (1970) provided a good example 
of a control procedure for ruling out an aspect associated with 
a treatment as responsible for behavior change. The researchers 
evaluated the effects of token reinforcement on three maladap- 
tive behaviors of an elementary student who was described as 
severely withdrawn. During treatment, the therapist and the girl 
met each day after school and viewed a videotape that had been 
made earlier that day of the student’s classroom behavior. The 
therapist also administered tokens contingent on the girl’s vid- 
eotaped behavior displaying progressively fewer occurrences of 
maladaptive behaviors. 

Schwarz and Hawkins recognized an independent variable 
confound potentially lurking in the design of their study. They 
reasoned that if improvement occurred as a function of the treat- 
ment, the question would remain whether the student’s behavior 
had improved because the therapist-delivered positive attention 
and rewards improved her self-concept, which in turn changed 
her maladaptive behaviors in the classroom, which were symp- 
tomatic of her poor self-concept. In that case, Schwarz and Haw- 
kins could not be certain that the contingent tokens played an 
important role in changing the behavior. Anticipating this possi- 
ble confound, Schwarz and Hawkins controlled for it in a simple 
and direct way. Following baseline, they implemented a condi- 
tion in which the therapist met with the girl each day after school 
and provided her with social attention and token reinforcement 
contingent on improvements in handwriting. During this control 
phase, the three target behaviors—face touching, slouching, and 
low voice volume—showed no change, thereby increasing their 
confidence in a conclusion that the girl’s ultimate behavioral 
improvements during the subsequent intervention phases were 
due to the token reinforcement. 

When medical researchers design experiments to test the 
effects of a drug, they use a technique called a placebo control 
to separate effects that may be produced by a subject’s perceived 
expectations of improvement because of taking the drug apart 
from the effects actually produced by the drug. In the typical 
between-groups design, the subjects in the experimental group 
receive the real drug, and subjects in the control group receive 
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placebo pills. Placebo pills contain an inert substance, but they 
look, feel, and taste exactly like the pills containing the real drug 
being tested. 

Applied behavior analysts have also employed placebo 
controls in single-subject experiments. For example, in their 
study evaluating a pharmacological treatment of the impulsivity 
by students with attention-deficit hyperactivity disorder 
(ADHD), Neef, Bicard, Endo, Coury, and Aman (2015) had a 
pharmacist prepare placebos and medications in identical gelatin 
capsules in 1-week supplies for each child. Neither the students 
nor the observers knew if a child had taken the medication or 
had taken the placebos. When neither the subject(s) nor the 
observers know whether the independent variable is present or 
absent from session to session, this type of control procedure 
is called a double-blind control. A double-blind control 
procedure eliminates confounding by subject expectations, 
parent and teacher expectations, differential treatment by others, 
and observer bias. 


Treatment Integrity/Procedural Fidelity 


Researchers must ensure that the independent variable is imple- 
mented exactly as planned and that no other unplanned variables 
are administered inadvertently along with the planned treatment. 
Treatment integrity refers to the extent to which the indepen- 
dent variable is implemented as planned. Although often used 
interchangeably with treatment integrity, procedural fidelity 
more accurately refers to the extent to which procedures in all 
conditions of an experiment, including baseline, are correctly 
implemented (Ledford & Gast, 2014). 

Low treatment integrity invites a major source of con- 
founding into an experiment, making it difficult, if not impos- 
sible, to interpret the results with confidence. Data from an 
experiment in which the independent variable was administered 
improperly, applied inconsistently, conducted piecemeal, and/or 
delivered in overdose or underdose form often lead to conclu- 
sions that—depending on the results obtained—trepresent either 
a false positive (claiming a functional relation when no such 
relation exists) or a false negative (failing to detect a functional 
relation when one actually does exist). If a functional relation 
is apparent from the analysis of the data, one cannot be sure 
whether the treatment variable as described by the experimenter 
was responsible or whether the effects were a function of extra- 
neous, uncontrolled elements of the intervention as it was actu- 
ally applied. However, it may be equally erroneous to interpret 
the failure to produce significant behavior change as evidence 
that the independent variable is ineffective. In other words, had 
the independent variable been implemented as planned, it might 
have been effective. 

Numerous threats to treatment integrity exist in applied 
settings (Billingsley, White, & Munson, 1980; Ledford & Gast, 
2014: Peterson, Homer, & Wonderlich, 1982). Experimenter 
bias can influence the researcher to administer the independent 
variable in such a way that it enjoys an unfair advantage over 
baseline or comparative conditions. Treatment drift occurs 
when the application of the independent variable differs from 
the way it was applied at the study’s outset. Treatment drift can 
occur when the complexity of the independent variable makes it 


difficult for practitioners to implement all elements consistently 
over the course of an experiment. Contingencies influencing 
the behavior of those responsible for implementing the indepen- 
dent variable can also produce treatment drift. For example, a 
practitioner might implement only those aspects of a procedure 
she favors and implement the full intervention only when the 
investigator is present. 

The following discussion and examples of methods for 
ensuring and measuring treatment integrity also apply to ensur- 
ing consistency of procedures during baseline and nontreatment 
comparison conditions (i.e., procedural fidelity). 


Precise Operational Definition. Achieving a high level of treat- 
ment integrity begins with developing a complete and precise 
operational definition of the treatment procedures. Besides pro- 
viding the basis for training the persons who will implement an 
intervention and judging the level of treatment integrity attained, 
operational definitions of treatment conditions are requisite for 
meeting the technological dimension of applied behavior ana- 
lysis (Baer, Wolf, & Risley, 1968). An investigator’s failure to 
provide explicit operational definitions of the treatment variable 
hampers the dissemination and proper use of the intervention 
by practitioners, and makes it difficult for other researchers to 
replicate and ultimately validate the findings. 

Gresham, Gansle, and Noell (1993) recommended that 
descriptions of the independent variable be judged by the 
same standards of explicitness used in assessing the quality of 
definitions of the dependent variable. That is, they should be 
clear, concise, unambiguous, and objective. More specifically, 
Gresham and colleagues also suggested that treatments be oper- 
ationally defined in each of four dimensions: verbal, physical, 
spatial, and temporal. They used Mace, Gresham, Ivancic, and 
O’Brien’s (1986) definition of a time-out procedure as an exam- 
ple of an operational definition of an independent variable. 


(a) Immediately following the occurrence of a target behay- 
ior (temporal dimension), (b) the therapist said “No, go to 
time-out” (verbal dimension), (c) led the child by the arm 
to a prepositioned time-out chair (physical dimension), and 
(d) seated the child facing the corner (spatial dimension). 
(e) If the child’s buttocks were raised from the time-out 
chair or if the child’s head was turned more than 45° (spa- 
tial dimension), the therapist used the least amount of force 
necessary to guide compliance with the time-out procedure 
(physical dimension). (f) At the end of 2 min (temporal 
dimension), the therapist turned the time-out chair 45° from 
the corner (physical and spatial dimensions) and walked 
away (physical dimension). (pp. 261-262) 


Simplify, Standardize, and Automate. Simplifying and stan- 
dardizing the independent variable and providing criterion- 
based training and practice for the people who will be responsible 
for implementing it enhance treatment integrity. Treatments 
that are simple, precise, and brief, and that require relatively 
little effort, are more likely to be delivered with consistency 
than those that are not. Simple, easy-to-implement techniques 
also have a higher probability of being accepted and used by 
practitioners than those that are not, and thus possess a certain 
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degree of self-evident social validity. Simplicity is, of course, 
a relative concern, not a mandate; effecting change in some 
socially important behaviors may require the application of 
intense, complex interventions over a long period and may 
involve many people. Baer (1987) made this point succinctly 
when he stated: 


Long problems are simply those in which the task 
analysis requires a series of many behavior changes, 
perhaps in many people, and although each of them is 
relatively easy and quick, the series of them requires 
not so much effort as time, and so it is not arduous but 
merely tedious. (pp. 336-337) 


Neither researchers nor practitioners need be thwarted 
or dismayed by complex interventions; they do, however, need 
to realize the treatment integrity implications. All things being 
equal, a simple and brief treatment will probably be applied 
more accurately and consistently than will a complex and 
extended one. 

To ensure the consistent implementation of the indepen- 
dent variable, experimenters should standardize as many of its 
aspects as cost and practicality allow. They can accomplish the 
standardization of treatment in a variety of ways. When a treat- 
ment requires a complex and/or extended sequence of behav- 
iors, a script for the person administering it may improve the 
accuracy and consistency with which the independent variable 
is applied. For example, Heron, Heward, Cooke, and Hill (1983) 
used a scripted lesson plan to ensure that a classwide peer tutor 
training program was implemented consistently across groups 
of children. 

If automating the intervention will not compromise it in 
any way, researchers might consider “canning” the independent 
variable so that an automated device can be used for its deliv- 
ery. Although a tutor training video in Heron and colleagues’ 
(1983) study would have eliminated any potential confounding 
caused by the teacher’s slightly different presentations of the 
lesson from group to group and across sets of tutoring skills, 
using a canned presentation would also have eliminated desired 
interactive and personal aspects of the training program. Some 
treatment variables are well suited to automated presentation in 
that automation neither limits the desirability of the treatment 
nor seriously reduces its social validity in terms of acceptability 
or practicability (e.g., use of video programs to model residential 
energy conservation). 


Training, Practice, and Feedback. Training and practice in 
properly implementing the independent variable provides the 
person(s) who will be responsible for conducting the treatment 
or experimental sessions with the necessary skills and knowledge 
to carry out the treatment. It would be a mistake for the researcher 
to assume that a person’s general competence and experience in 
the experimental setting (e.g., a classroom) guarantee correct and 
consistent application of an independent variable in that setting 
(e.g., implementing a peer-mediated tutoring program). 

As stated earlier, scripts detailing treatment procedures 
and cue cards or other devices that remind and prompt people 
through steps of an intervention can be helpful. Researchers 
should not, however, assume that providing a procedural manual 


or script to intervention agents will ensure a high degree of treat- 
ment integrity. Mueller and colleagues (2003) found that a com- 
bination of verbal instructions, modeling, and/or rehearsal was 
required for parents to implement pediatric feeding protocols 
with a high level of treatment integrity. 

Howard and DiGennaro Reed (2014) found that after 
written instructions produced little or no improvement in 
trainers’ implementation of a dog obedience training proce- 
dure with shelter volunteers, a single viewing of a video that 
included correct and incorrect examples of the routine “pro- 
duced marked improvement” in the correct implementation of 
the training procedure. Nevertheless, a brief one-on-one train- 
ing session consisting of modeling, descriptive praise, and 
feedback until 100% integrity was obtained in a role-play was 
required for trainers to reach satisfactory levels of treatment 
integrity (means of 88%, 97%, and 93% for three trainers over 
6 to 14 sessions) (Figure 10.5). 

Rating the integrity with which video models implement 
a treatment can also improve the fidelity with which the raters 
implement the treatment. After an initial 30-min training session 
in which trainer-reviewed written protocol of functional analysis 
(FA) procedures produced moderate levels of treatment fidel- 
ity (below 85% accuracy) by 17 of 20 graduate students, Field, 
Frieder, McGee, Peterson, and Duinkerken (2015) had the stu- 
dents rate the fidelity with which video models implemented the 
FA procedures. The treatment fidelity of 16 of the 17 students 
who participated in the observation and rating of video models 
increased further. In a final condition, 7 participants whose exe- 
cution of FA procedures was still below mastery observed and 
rated a 2-min video of themselves conducting the intervention in 
a previous session. The self-observation and rating resulted in 5 
of the 7 students achieving further increases in treatment integrity. 

Various prompting tactics might promote treatment 
integrity. In a parametric analysis of teacher praise rates on 
the on-task behavior of elementary students with autism, the 
teacher wore a MotivAider®, which vibrated at preset intervals 
of 1, 2, and 8 per minute to prompt praise statements (Kranak, 
Aiber-Morgan, & Sawyer, 2017). 

Craig (2010) reported an interesting method for assessing 
the treatment integrity of a self-monitoring and self-administering 
intervention on reduced nail biting. Tokens were exchanged for 
evening leisure time contingent on no occurrences of nail biting 
during daily, 1-hour intervals. Craig described the procedure as 
follows: 


I recorded the time of each occurrence of nail biting 

and the time of each token reinforcer delivery. Using a 
cellular camera phone, I took digital photographs (with 
timestamps) of the condition of my fingernails at the begin- 
ning of each DRO interval, after each token delivery, and 
after each occurrence of nail biting (see Figure 1). During 
token exchange periods each evening, my cohabitant 
compared the times of the occurrences of nail biting and 
of token deliveries to the timestamps of each photograph. 
If there was no noticeable difference in the condition of 
my fingernails between the photograph from the begin- 
ning of an interval and the photograph taken upon token 
delivery, token delivery was recorded as “correct.” Tokens 
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Figure 10.5 Graphic display 
illustrating treatment integrity 
data and dog compliance. 


Based on “Training shelter volunteers to 
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Figure 1. 


associated with incorrect deliveries would have been 
removed; incorrect token delivery did not, however, occur 
during the course of this study. (p. 39) 


Performance feedback and self-monitoring have also been 
shown to improve the integrity with which practitioners and 
parents implement behavior support plans and explicit teach- 
ing techniques (e.g., Codding, Feinberg, Dunn, & Pace, 2005; 
Pinkelman & Horner, 2016; Plavnick, Ferreri, & Maupin, 2010; 
Sarokoff & Strumey, 2004; Sanetti, Luiselli, & Handler, 2007). 


Assessing and Reporting Treatment Integrity. Simplification, 
standardization, training, and practice help increase the degree 
of treatment integrity, but do not guarantee it. When any reason- 
able doubt exists about the correct and consistent application of 
the independent variable, investigators should provide data on the 
accuracy and reliability of the independent variable. Procedural 
fidelity data reveal the extent to which the actual implementation 
of all experimental conditions over the course of a study matches 
their descriptions in the method section of a resear ch report.’ 
Even though effective control of the independent vari- 
able is requisite to an internally valid experiment, applied 


Trainer integrity of discrete-trial obedience training for sit (DTOT-s) and dog compliance with sit instruction. 


0 F. D. DiGennaro Reed, 2014, Journal of 
Applied Behavior Analysis, 47, p. 351. 

Copyright 2014 by the Society for the 

Experimental Analysis of Behavior, Inc. 


behavior analysts have not always made sufficient efforts to 
ensure treatment integrity. Two reviews of articles published 
in the Journal of Applied Behavior Analysis (JABA) from 
1968 to 1990 found that the majority of authors did not report 
data assessing the degree to which the independent variable 
was properly and consistently applied (Gresham et al., 1993; 
Peterson et al., 1982). Peterson and colleagues noted that a 
“curious double standard” had developed in applied behav- 
ior analysis in which data on the interobserver agreement of 
dependent variable measures were required for publication, 
but such data were seldom provided or required for the inde- 
pendent variable. 

Although more recent reviews of treatment integrity data 
in articles published in JABA and other behavioral journals 
have found some improvement, the overall results are disap- 
pointing (e.g., Andzik, Cannella-Malone, & Sigafoos, 2016; 
Ledford & Wolery, 2013; Wheeler, Baggett, Fox, & Blevins, 
2006). For example, McIntyre, Gresham, DiGennaro, and Reed 
(2007) reported that only 30% of 152 studies of school-based 
interventions published in JABA from 1991 to 2005 provided 
treatment integrity data. These authors judged 69 (45%) of 
those studies “to be at high risk for treatment inaccuracies in 
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that information on the implementation of treatments or the 
assessment of independent variables was not included but should 
have been” (p. 664) per guidelines for monitoring or measuring 
treatment (Peterson et al., 1982). 

Treatment integrity data have value beyond their crit- 
ical role in helping reviewers, practitioners, and consumers 
judge the believability of a study’s results. Increasingly, data 
documenting that the independent variable was implemented 
as described is criterion for inclusion in systematic reviews 
of literature conducted by professional and government 
organizations. Treatment integrity is one of five dimensions 
by which the National Autism Center (NAC; 2015) judges the 
scientific merit of a study (the others are research design, mea- 
surement of the dependent variable, eligibility and description 
of participants, and measure of generalization of treatment 
effects). For a study to achieve the highest score on the NAC’s 
Scientific Merit Rating Scale, treatment integrity must be 
measured during a minimum of 25% of sessions, at or above 
80% accuracy, with IOA of =80%. An expert panel tasked 
by the U.S. Department of Education to establish parameters 
for considering fidelity of intervention implementation in 
single-case design studies noted that “Because interventions 
in single-case studies are applied over time, continuous mea- 
surement of implementation is a relevant consideration” (What 
Works Clearinghouse, 2018, p. A-6). 

Methods for assessing and increasing the accuracy 
and believability of measures of the dependent variable (see 
Chapter 5) are fully applicable to the collection of proce- 
dural fidelity data. Importantly, observation and recording of 
the independent variable provide the experimenter with data 
indicating whether calibration of the treatment agent is nec- 
essary (i.e., bringing the intervention agent’s behavior into 
agreement with the true value of the independent variable). 


Video Clip # — 1-1AL 


Rater Initials: 


Observation and calibration give the researcher an ongoing 
ability to use retraining and practice to ensure a high level of 
treatment integrity over the course of an experiment. 

Figure 10.6 shows the data collection form trained observ- 
ers used to collect treatment integrity data in a study evaluating 
the effects of different qualities and durations of reinforcement 
for problem behavior, compliance, and communication within a 
treatment package for escape-maintained problem behavior (Van 
Norman, 2005). The observers viewed videotapes of randomly 
selected sessions representing approximately one third to one 
half of all sessions in each condition and phase of the study. The 
percentage of treatment integrity for each condition was calcu- 
lated by dividing the number of steps the experimenter completed 
correctly during a session by the total number of steps completed. 

Graphic displays of treatment integrity data may help 
researchers and consumers of a study judge the intervention’s 
effectiveness (e.g., Kranak et al., 2017). Detailed information 
on assessing, promoting, and reporting procedural fidelity 
can be found in a special issue of the Journal of Behavioral 
Education (DiGennaro Reed & Codding, 2014) and Ledford and 
Gast (2014). 

This overview of sources of potential confounding vari- 
ables is, of necessity, incomplete. A complete inventory of all 
possible threats to the internal validity of experimental research 
would be well beyond the scope of this text. And presenting such 
a list might suggest that a researcher need only control for the 
variables listed and not worry about anything else. In truth, 
the list of potential confounds is unique to every experiment. 
The effective researcher is the one who questions and probes the 
influence of as many relevant variables as possible. No exper- 
imental design can control all potential confounds; the chal- 
lenge is to reduce, eliminate, or identify the influence of as 
many potentially confounding variables as possible. 
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Phase 1/A (SD) 


Procedural Steps 


The instructor delivers a task prompt at the beginning of the session, e.g., “it's time 
work" or similar. 


to 


If the participant does not respond, the instructor represents the choice by replacing 
materials or restating the contingencies. 


materials. 


The break card (or similar) is presented simultaneously (within 3 seconds) with work task 


Following work choice (touching materials associated with work) 
a, Removes the task materials 

b. Presents a timer with green colored cue card 

c. Provides access to high preference items 

d. Engages in play with the participant for 1 minute 


Following break requests 
a. Removes the task materials 


b. Presents timer with yellow cue card 


c. Provides access to moderately preferred tangible items and neutral 
commenting for 30 seconds 


Following problem behavior within 10 s the instructor 

a. Removes the task/play materials 

b. Presents a timer with red cue card 

c. Provides no attention or tangible items for 10 seconds 


Which side was the break card (or similar) presented ( 
L = left) for each choice presentation. 


the participant's R = right or 


Figure 10.6 Example of a form used to record treatment 


integrity data. 


Adapted from The Effects of Functional Communication Training, Choice Making, and an Adjusting Work Schedule on Problem Behavior 
Maintained by Negative Reinforcement, by R. K. Van Norman, 2005, p. 204. Unpublished doctoral dissertation. Columbus, OH: The Ohio 


State University. Used by permission. 
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SOCIAL VALIDITY: ASSESSING THE APPLIED 
VALUE OF BEHAVIOR CHANGES AND THE 
TREATMENTS THAT ACCOMPLISH THEM 


In his landmark article “Social Validity: The Case for Subjective 
Measurement or How Applied Behavior Analysis Is Finding Its 
Heart,” Montrose Wolf (1978) proposed the then “radical con- 
cept that clients (including parents and guardians of dependent 
people, and even those whose taxes support social programs) 
must understand and admire the goals, outcomes, and methods 
of an intervention” (Risley, 2005, p. 284). Wolf recommended 
that the social validity of an applied behavior analysis study be 
assessed in three ways: the social significance of the behavior 
change goals, the appropriateness of the intervention, and the 
social importance of the results. 


Validating the Social Importance of Behavior 
Change Goals 


The social validity of behavior change goals begins with a clear 
description of those goals. 


To assess the social importance of goals, the researcher 
must be precise about the goals of the behavior change 
effort at the levels of (a) the broad social goal (e.g., 
improved parenting, enhanced social skills, improved 
cardiovascular health, increased independence), (b) the 
categories of behavior hypothesized to be related to the 
broad goal (e.g., parenting—providing instructional 
feedback, using time-out, etc.), and/or (c) the responses that 
comprise the behavioral category of interest (e.g., using 
time-out—directing the child to a location away from other 
people, instructing the child to “sit out” for a specified 
duration, etc.). Social validation may be conducted for any 
of these levels of goals. (Fawcett, 1991, pp. 235-236) 


Experts can inform the selection of socially valid target 
behaviors. Kelley and Miltenberger (2016), for example, iden- 
tified the task analyses of correct rider positions for horseback- 
riding skills “by consulting with an expert instructor as well as 
comparing many photos of expert riders considered to have good 
position” (p. 141). In their studies on the effects of behavioral 
teaching strategies on performance by high school football play- 
ers, Harrison and Pyles (2013) and Stokes, Luiselli, and Reed 
(2010) targeted tackling skills recommended by the American 
Football Coaches Association (1995). In another study teaching 
football skills to high school athletes, Stokes, Luiselli, Reed, 
and Fleming (2010) validated their 10-step task analysis for 
pass-blocking by consulting with experienced offensive linemen 
coaches (see Figure 21.5). 

Persons who use the prospective skill in natural environ- 
ments can help researchers identify socially valid target behav- 
iors. In their study teaching sequences of basketball skills to 
an adolescent with disabilities, Lambert and colleagues (2016) 
asked “groups of adults who we observed playing games of ‘pick 
up’ basketball . . . to rate the utility of each sequence during a 
typical game of ‘pick up’ basketball” (p. 206). 

Van Houten (1979) suggested two basic approaches for 
determining socially valid goals: (a) assess the performance of 


persons considered competent and (b) experimentally manip- 
ulate different levels of performance to determine empirically 
which produces optimal results. The performance of typi- 
cal performers can be used to identify and validate behavior 
change goals and target levels of performance. To arrive at a 
socially valid performance criterion for a social skills train- 
ing program for two adults with disabilities who worked in 
a restaurant, Grossi, Kimball, and Heward (1994) observed 
four restaurant employees without disabilities over a period 
of 2 weeks to determine the extent to which they acknowl- 
edged verbal initiations from coworkers. Results from these 
observations revealed that the employees without disabilities 
acknowledged an average of 90% of initiations directed toward 
them. This level of performance was selected as the goal for 
the two target employees in the study. 

Warren, Rogers-Warren, and Baer (1976) provide a good 
example of testing the effects of different levels of performance 
to determine socially valid outcomes. The researchers assessed 
the effect of different frequencies of children’s offers to share 
play materials with their peers on the peers’ reactions to those 
offers. They found that peers accepted offers to share most con- 
sistently when those offers were made at a middle frequency— 
that is, not too frequently, not too seldom. 


Validating the Social Acceptability of 
Interventions 


Several scales and questionnaires for obtaining consumers’ opin- 
ions of the acceptability of behavioral interventions have been 
developed. The /ntervention Rating Profile (IRP-15) is a 15-item 
Likert-type scale for assessing teachers’ perceptions of the accept- 
ability of classroom interventions (Martens, Witt, Elliott, & 
Darveaux, 1985). The Behavior Intervention Rating Scale, a 
revision and extension of the IRP-15, includes nine additional 
items on treatment effectiveness (Elliot & Brock Treuting, 1991). 
The Treatment Acceptability Rating Form (TARF) consists of 20 
questions with which parents rate the acceptability of behavioral 
treatments (Reimers & Wacker, 1988). For a review of these and 
six other treatment acceptability instruments, see Carter (2007). 

Figure 10.7 shows the experimenter-modified version of 
the TARE Van Norman (2005) used to obtain treatment accept- 
ability information from each participant’s parents, teachers, 
therapists, and behavior support staff. Although some of the 
people whose opinions were being sought had witnessed or 
watched a video of the intervention being used with the student, 
the following description of the intervention was read to each 
consumer before he or she was asked to answer each of the 
questions: 


First we conducted an assessment to find out what moti- 
vated Zachary to engage in challenging behavior(s) such 
as throwing materials, hitting people, and dropping to 
the floor. We found that Zachary engaged in challenging 
behavior(s), at least in part, in order to escape or avoid task 
demands. 

Next, we taught Zachary to ask for a break as a 
replacement behavior for challenging behavior by using 
physical prompting and attaching the response of asking for 
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Figure 10.7 Examples of questions adapted from the Treatment Acceptability Rating Form—Revised 
(Reimers and Wacker, 1988) to obtain consumers’ opinions of the acceptability of intervention 
procedures used to treat challenging behaviors of secondary students with severe disabilities. 


Treatment Acceptability Rating Form—Revised (TARF-R) 
. How clear is your understanding of the suggested procedures? 


Not at Neutral Very 
all clear clear 


. How acceptable do you find the strategies to be regarding your concerns about the identified learner? 


Not at all Neutral Very 
acceptable acceptable 


. How willing are you to implement the suggested procedures as you heard them described? 


Not at Neutral Very 
all willing willing 


. Given the learner's behavior issues, how reasonable do you find the suggested procedures? 


Not at all Neutral Very 
reasonable reasonable 


. How costly will it be to implement these strategies? 


Not at Neutral 
all costly 


. How disruptive will it be to your classroom to implement the suggested procedures? 


Not at all Neutral Very 
disruptive disruptive 


. How affordable are these procedures? 


Not at all Neutral Very 
affordable affordable 


. How much do you like the proposed procedures? 


Do not like Neutral Like them 
them at all . very much 


17. How much discomfort is your learner likely to experience as a result of these procedures? 


No discomfort Neutral Very much 
at all discomfort 


19. How willing would you be to change your classroom routine to implement these procedures? 


Not at Neutral Very willing 
all willing 


20. How well will carrying out these procedures fit into your classroom routine? 


Not at Neutral Very well 
all well 


From The Effects of Functional Communication Training, Choice Making, and an Adjusting Work Schedule on Problem Behavior 
Maintained by Negative Reinforcement, by R. K. Van Norman, 2005, pp. 248-256. Unpublished doctoral dissertation. Colum- 
bus, OH: The Ohio State University. Used by permission. 
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a break to access to a highly preferred item, lots of atten- 
tion, and a long duration break (3 min). 

Then we gave Zachary the choice to ask for work by 
simply touching the work materials (essentially engaging in 
the first step of the task) and getting access to highly preferred 
items, attention, and long duration break (1 min) or asking for 
a break and getting access to moderately preferred items for a 
shorter duration break (30 sec). At any time during this proce- 
dure if Zachary engaged in problem behavior he was given a 
10 s break with no attention and no activities/items. 

Finally, we continued to give Zachary the choice to 
ask for work, a break or engage in problem behavior, how- 
ever now we required Zachary to comply with a greater 
number of task-related instructions before he was given 
access to the highly preferred activities, attention, and a 
1 min break. Each session we increased the number of 
task-related instructions that were given and needed to be 
complied with before access to the highly preferred break. 

Physical prompting was only used during the initial 
phase to teach Zachary new responses, specifically how 
to ask for a break and how to ask for work. Otherwise 
Zachary was making all independent choices as they were 
presented. (p. 247) 


Some investigators present graphic displays of treatment 
acceptability data. Kurti and Dallery (2013) administered a 


treatment acceptability questionnaire to 12 adults who partic- 
ipated in Internet-based contingency management intervention 
to increase walking. Participants ranked 10 items about the 
intervention (e.g., easy to use, fun, I would recommend to oth- 
ers) using a visual analogue scale ranging from 0 (not at all) to 
100 (extremely). The researchers used a graph to present the 
treatment acceptability results, showing the mean, range, and 
individual participant ratings for each item (see Figure 10.8). 

Preference for interventions can be assessed directly by 
letting practitioners or clients choose which of multiple treat- 
ments to implement or receive. After training four parents and 
one teacher to implement each of three reinforcement-based 
interventions for problem behavior to 90% treatment integrity, 
Gabor, Fritz, Roath, Rothe, and Gourley (2016) provided a 
sequence of choice sessions in which each caregiver selected 
which intervention to implement. Jowett Hirst, Dozier, and 
Payne (2016) assessed preschool children’s preference for token 
economy procedures based on differential reinforcement or 
response cost with a “choice” phase in which only the treatment 
selected by each participant was implemented (see Figure 8.17 
and accompanying discussion). 

Hanley and colleagues have used concurrent chains exper- 
imental designs (see Chapter 8) to assess clients’ treatment pref- 
erences (e.g., Hanley, Piazza, Fisher, Contrucci, & Maglieri, 
1997; Luczynski, & Hanley, 2009; Heal, Hanley, & Layer, 2009). 
After reversal and multielement designs revealed that functional 
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Figure 2. Experiment 1 (gray bars) and Experiment 2 (dark gray bars) participants’ responses 
Lea te questionnaire, where bars represent the mean rating for each item (1 =/ would recommend the Internet-based 
walking intervention to others, 2 =| liked using the Fitbit to monitor my progress, 3 =| liked seeing my progress on the graph, 
4 =| liked earning vouchers, 5 = Earning vouchers helped me increase my activity levels). 


Figure 10.8 Example of graphic reporting treatment acceptability data. 


Based on “Internet-based Contingency Management Increases Walking in Sedentary Adults” by A. N. Kurti and 
J. Dallery, 2013, Journal of Applied Behavior Analysis, 46, p. 574. Copyright 2013 by the Society for the Experi- 


mental Analysis of Behavior, Inc. 
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communication training (FCT) with punishment was the most 
effective of three functional-based treatments in reducing two 
children’s severe problem behavior, the children’s choices during 
the concurrent chains condition showed they each preferred func- 
tional communication training (FCT) with punishment over FCT 
alone, or punishment only (Hanley, Piazza, Fisher, & Maglieri, 
2005; see Figure 8.18 and accompanying discussion). 

The social acceptability of a proposed intervention can 
be judged by the extent it meets the standards of best practice 
(Peters & Heron, 1993) and the ethical, legal, and professional 
standards of relevant learned and professional societies. In gen- 
eral, these standards require the use of evidence-based, and least 
restrictive practices, full disclosure to all participants, informed 
consent, and the right to discontinue treatments that the partici- 
pants believe are not beneficial or therapeutic. 


Validating the Social Importance of 
Behavior Changes 


Applied behavior analysts assess the social validity of out- 
comes with a variety of methods: (a) have consumers rate the 
social validity of participants’ performance, (b) ask experts to 


evaluate participants’ performance, (c) compare participants’ 
performance to that of a normative sample, (d) assess with 
standardized assessment instruments, and (e) test participants’ 
performance in the natural environment. 


Consumer Opinion 


The most frequently used method for assessing social validity is 
to ask consumers, including subjects or clients whenever possi- 
ble, whether they thought behavior changes occurred during the 
study or program, and if so, whether they thought those behav- 
ior changes were important and valuable. Figure 10.9 shows 
the questionnaire Van Norman (2005) used to obtain opinions 
of consumers (i.e., the subjects’ parents, teachers, and instruc- 
tional aides; school administrators; behavior support staff; an 
occupational therapist; a school psychologist; and a psychol- 
ogy aide) on the social validity of results of an intervention 
designed to reduce challenging behavior maintained by escape. 
Van Norman created a series of 5-minute video clips from 
randomly selected before-intervention and after-intervention 
sessions and placed the clips in random order on a CD. The 
social validity evaluators did not know whether each clip 


Figure 10.9 Form for obtaining consumer opinions of the social validity of results of an 
intervention used to treat challenging behavior of secondary students with severe disabilities. 


Social Validity of Results Questionnaire 


Directions: 


Video Clip #_ 


Please view the video and then circle one of the five choices that best describes the extent to which 
you agree or disagree with each of the three statements below. 


1. The student is engaged in academic or vocational work tasks, sitting appropriately (bottom in 


Seat), and attending to the teacher or materials. 


1 2 


Strongly Disagree Disagree 


Undecided 


4 5 


Agree Strongly Agree 


2. The student is engaged in challenging behavior and not attending to the teacher or materials. 


1 2 


Strongly Disagree Disagree 


Undecided 


4 5 


Agree Strongly Agree 


3. The student appears to have a positive affect (e.g., smiling, laughing). 


1 Ze 


Strongly Disagree Disagree 


Undecided 


Comments about the student's behavior in the video clip: 


4 5 


Agree Strongly Agree 


General comments about this student's behavior: 


Name (optional): 


Relation to the learner in video (optional): 


Based on “The Effects of Functional Communication Trainin 
on Problem Behavior Maintained by Negative Reinforcemen 
doctoral dissertation. Columbus, OH: The Ohio State University. 


g, Choice Making, and an Adjusting Work Schedule 
t" by R. K. Van Norman, 2005, p. 252. Unpublished 
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represented a before- or after-intervention session. After view- 


ing each clip, the consumer completed the questionnaire shown 
in Figure 10.9. 


Expert Evaluation 


Experts can be called on to judge the social validity of some 
behavior changes. For example, as one measure of the social 
validity of changes in the unaided notetaking skills of high 
school students with learning disabilities in social studies lec- 
tures as a result of having been exposed to teacher-prepared 
guided notes, White (1991) asked 16 secondary social studies 
teachers to rate the students’ baseline and postintervention lec- 
ture notes on three dimensions: (1) accuracy and completeness 
compared to lecture content; (2) usefulness for study for tests 
over the lecture content; and (3) how the notes compared to 
those taken by typical general education students. (The teachers 
did not know whether each set of notes they were rating was 
from baseline or the postintervention condition.) 

Howard and DiGennaro Reed (2014) recruited 186 certi- 
fied dog trainers from across the United States to evaluate the 
social validity of behavior changes in dog shelter volunteers and 
the dogs they were paired with during the study. The research- 
ers compiled brief baseline (before training) and intervention 
(after training) videos of one of the volunteer+dog dyads. The 
experts watched the videos online, and using a 9-item Survey- 
Monkey questionnaire, the experts rated the appropriateness and 
effectiveness of the dog obedience training procedures used by 
the shelter volunteer before and after training and whether the 
before- or after training dog would be more adoptable, better in 
a home with children, and a better dog for first-time pet owners. 

To assess the social validity of changes in participants’ 
horseback-riding skills, Kelley and Miltenberger (2016) showed 
two expert riders and instructors three videos from the end of 
baseline and three videos from the end of the intervention for 
each participant. The experts rated each video with a Likert-type 
scale (1 = strongly disagree, 9 = strongly agree) according to 
whether the rider was in the correct position based on the riding 
style target for the rider shown in the video. The experts who 
viewed the videos in random order were blind to whether each 
video was from baseline or intervention. 


Normative Comparison 


Van den Pol and colleagues (1981) used the performance of a 
sample of typical fast-food restaurant customers to assess the 
social validity of the posttraining performance of the young 
adults with disabilities whom they had taught to order and pay 
for a meal without assistance. The researchers observed 10 ran- 
domly selected, typical customers who ordered and ate a meal 
in fast-food restaurants and recorded the accuracy with which 
these customers performed each step of a 22-step task analysis. 
The students’ performance at the follow-up probe equaled or 
exceeded that of the customers in the normative sample in all 
but 4 of 22 specific skills. 

Using normative samples to assess the social validity of 
behavior change is not limited to posttreatment comparisons. Com- 
paring subjects’ behavior to ongoing probes of the behavior of a 
normative sample provides a formative assessment measure of how 


much improvement has been made and how much is still needed. 
Grossi and Heward (1998) used the typical rate at which employees 
without disabilities performed restaurant tasks (e.g., scrubbing pots 
and pans, loading a dishwasher, mopping and sweeping floors) 
as a socially valid performance standard for restaurant trainees 
with developmental disabilities (see Figure 29.3). 

In their study evaluating behavioral coaching strategies 
on the offensive line pass-blocking skills of high school football 
athletes, Stokes, Luiselli, Reed, and Fleming (2010) recorded 
the three highest-rated starting offensive linemen from video- 
taped games from the previous season. The mean correct per- 
formance on the 10-step task analysis by the three lineman was 
80% (range, 70% to 90%). The researchers “adopted a range of 
70% to 90% as our acceptable performance criterion” (p. 466). 

Rhode, Morgan, and Young (1983) provide an excellent 
example of formative social validity assessment. In their study, 
token reinforcement and self-evaluation procedures were used 
to improve the classroom behavior of six students with behavior 
disorders. The overall goal of the study was to help the students 
improve their appropriate classroom behavior (e.g., following 
classroom rules, completing teacher-assigned tasks, volunteering 
relevant responses) and decrease inappropriate behavior (e.g., 
talking out, noncompliance, aggression) so that they would be 
accepted and successful in regular (general education) class- 
rooms. At least once per day throughout the course of the 17-week 
study, the researchers randomly selected classmates in the regular 
classrooms for observation. The same observation codes and pro- 
cedures that were used to measure the six target students’ behavior 
were used to obtain the normative sample data. 

Figure 10.10 shows the mean and range of the six stu- 
dents’ appropriate behavior during each condition and phase of 
the study compared to the normative sample. (Individual graphs 
showing the percentage of appropriate behavior in the resource 
and regular classroom of all six subjects in each of nearly 90 
sessions were also included in Rhode and colleagues’ article.) 
During baseline, the six boys’ levels of appropriate behav- 
ior were well below those of their peers without disabilities. 
During Phase I of the study, in which the subjects learned to self- 
evaluate, their behavior in the resource room improved to a 
level matching that of their regular classroom peers. However, 
when the subjects were in the regular classroom during Phase 
I, their behavior compared poorly with that of the other stu- 
dents in the normative sample. As Phase II progressed, which 
involved various strategies for generalization and maintenance 
of the treatment gains, the mean level of appropriate behavior by 
the six students matched that of their peers without disabilities, 
and variability among the six students decreased (except for one 
subject who exhibited no appropriate behavior on one session in 
the next-to-last condition). 


Standardized Tests 


Although standardized tests provide indirect measures of the 
behaviors typically targeted by applied behavior analysts, poli- 
cymakers and the public view scores from such tests as important 
indicators of performance. Improvements in intelligence quotient 
(IQ) test scores by children who participated in the pioneering 
studies by Lovaas and colleagues (Lovaas, 1987; McEachin, 
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and K. R. Young, 1983, Journal of Applied Behavior 
Analysis, 16, p. 184. Copyright 1984 by the Society for 
the Experimental Analysis of Behavior, Inc. 


Smith, & Lovaas, 1993; Smith, Eikeseth, Klevstrand, & Lovaas, 
1997) were key to the recognition and eventual acceptance of 
early intensive behavior intervention for children with autism as 
an evidence-based treatment (e.g., Maine Administrators of Ser- 
vices for Children with Disabilities, 2000; Myers, Johnson, & 
the American Academy of Pediatrics Council on Children with 
Disabilities, 2007; New York State Department of Health Early 
Intervention Program, 1999). Subsequent research reporting 
improvements in scores on standardized tests of adaptive behav- 
ior (e.g., Vineland Adaptive Behavior Scales, Sparrow, Balla, & 


Cicchetti, 2005) and child development (e.g., Bayley Scales of 


Infant and Toddler Development, Bayley, 2005) in addition to 
IQ scores has further strengthened the awareness and widespread 
acceptance of ABA-informed education and treatment for individ- 
uals with autism (e.g., Eikeseth, 2009; Eldevik et al., 2009, 2010; 
Howard, Sparkman, Cohen, Green, & Stanislaw, 2005). 

Scores on tests of academic achievement can also attest 
to the social validity of behavior change. In their study evalu- 
ating the effects of Headsprout Comprehension® on the reading 
comprehension of six elementary students with high-incidence 
disabilities, Cullen, Alber-Morgan, Schnell, and Wheaton 
(2014) measured students’ performance on comprehension 
passages based on the Ohio Achievement Assessment, a “high- 
stakes” test, and on AIMSweb Maze probes. AIMSweb Maze 
is a norm-referenced curriculum-based measurement (CBM) of 
reading comprehension (Shinn & Shinn, 2002). 


Experimental Conditions 


Behavior analysts have created tests that can be used to 
assess the social validity of some behavior change program out- 
comes. Iwata, Pace, Kissel, Nau, and Farber (1990) developed 
the Self-Injury Trauma Scale (SITS) to enable researchers and 
therapists to measure the number, type, severity, and location of 
injuries produced by self-injurious behavior. The SITS yields a 
Number Index and Severity Index with scores of 0 to 5, and an 
estimate of current risk. Although the data collected in a treat- 
ment program may show significant decreases in the behaviors 
that produce self-injury (e.g., eye poking, face slapping, head 
banging), the social significance of the treatment must be vali- 
dated by evidence of reduced injury. Iwata and colleagues wrote: 


... the social relevance of the behavior lies in its trau- 
matic outcome. The measurement of physical injuries 
prior to treatment can establish the fact that a client or 
subject actually displays behavior warranting serious 
attention. .. . Conversely, injury measurement following 
treatment can corroborate observed changes in behavior 
because reduction of an injury-producing response below a 
certain level should be reflected in the eventual disappear- 
ance of observable trauma. In both of these instances, data 
on injuries provide a means of assessing social validity. 
(pp. 99-100) 


Twohig and Woods (2001) used the SITS to validate the 
outcomes of a habit-reversal treatment for the chronic skin 
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picking of two typically developing adult males. Both men 
reported that they had engaged in skin picking since childhood, 
digging the fingernails into the ends of a finger and pulling or 
scraping the skin, which sometimes caused bleeding, scarring, 
and infections. Two observers independently rated pretreatment, 
posttreatment, and follow-up photographs of the two men’s 
hands with the S/TS. The Number Index (NI) and Severity Index 
(SI) SITS scores on the pretreatment photographs for both men 
were | and 2, respectively, indicating one to four injuries on 
either hand and distinct but superficial breaks in the skin. NI 
and SI scores of 0 on the posttreatment photos for both men 
indicated no apparent injuries. On the follow-up photos taken 
4 months after treatment had ended, both men had SITS NI and 
SI scores of 1, indicating red or irritated skin. 


Real-World Test 


Perhaps the most socially valid way to assess a learner’s newly 
acquired behavior is to put it to an authentic test in the natural 
environment. For instance, the validity of what three adolescents 
with learning disabilities had learned about road signs and traffic 
laws was validated when they passed the Ohio Department of 
Motor Vehicles test and earned their temporary driver’s permits 
(Test & Heward, 1983). In a similar way, the social validity of 
the cooking skills being learned by three secondary students 
with developmental disabilities and visual impairments was 
tested frequently when their friends arrived at the end of probe 
sessions to share the food they had just prepared (Trask-Tyler, 
Grossi, & Heward, 1994). In addition to providing authentic 
assessments of social validity, real-world tests put the learner’s 
repertoire in contact with naturally occurring contingencies of 
reinforcement, which may promote maintenance and general- 
ization of the newly acquired behaviors. 

Observing desired behavior change in real-world settings 
may not equate with socially valid improvements in a client’s 
repertoire. This mismatch is most likely to occur when (a) the 
target behavior differs from the behavior of ultimate value in the 
natural environment, or (b) the behavior changes are observed 
during situations in the natural environment that differ from sit- 
uations when the client’s performance of the behavior is most 
crucial. Stokes, Luiselli, Reed, and Fleming’s (2010) study on 
teaching pass-blocking skills to high school football players pro- 
vides an example of the first kind of mismatch. The researchers 
found that participants’ performance of a 10-step task analysis 
for pass-blocking by offensive linemen improved from practice 
drills to games (see Figure 23.5), but they acknowledged that 
“the relation between the tackling of a ball carrier in practice 
and games and performance of the 10 skills remains unknown 
because we did not measure successful tackles during practice 
or games” (p. 512). 

Directly observing participants’ reliable performance 
of the behavior of ultimate value in real-world settings is no 
guarantee of socially valid behavior change. If children’s ability 
to use in the natural environment what they had learned in a 
classroom-based program on safe street crossing behavior was 
assessed only during days or times when traffic was very light, 
the results may give a false and overly optimistic impression 
of the children’s ability to safely cross during heavy traffic at 
rush hour. 


A tremendous strength of single-case research designs is 
transparency: The dependent and independent variables must be 
described sufficiently to allow implementation by others and to 
make the data demonstrating behavior changes directly avail- 
able for anyone to examine. This transparency enables scientific 
reviewers, practitioners, consumers, and policymakers to judge 
for themselves the social validity of behavior changes.” 


EXTERNAL VALIDITY: REPLICATING 
EXPERIMENTS TO DETERMINE THE 
GENERALITY OF RESEARCH FINDINGS 


External validity refers to the degree to which a functional 
relation found reliable and socially valid in a given experiment 
operates under different conditions. An intervention that works 
only within a circumscribed set of conditions and proves inef- 
fective when any aspect in the original experiment is altered 
makes a limited contribution to the development of a reliable 
and useful technology of behavior change. When a carefully 
controlled experiment has shown that a particular treatment pro- 
duces consistent and socially significant improvements in the 
target behavior of a given subject, a series of important ques- 
tions should then be asked: Will the treatment be as effective if 
it is applied to other behaviors? Will the procedure continue to 
work if it is changed in some way (e.g., implemented at a dif- 
ferent time of day, by another person, on a different schedule)? 
Will it work in a setting different from the original experiment? 
Will it work with participants of different ages, backgrounds, 
and repertoires? Questions about external validity are neither 
abstract nor rhetorical; they are empirical questions that if 
judged important can be addressed by empirical methods. 

A functional relation with external validity, or generality, 
will continue to operate under a variety of conditions. External 
validity is a matter of degree, not an all-or-nothing property. 
A functional relation that cannot be reproduced under any con- 
ditions other than the exact set of original variables (including 
the original subject) possesses no external validity. At the other 
end of the continuum, a procedure that is effective at any time, 
under any conditions, in any setting, with any behavior, and for 
any subject has complete generality (a most improbable situa- 
tion). Most functional relations fall somewhere between the two 
ends of this continuum, and those found to have higher degrees 
of generality make the greater contribution to applied behavior 
analysis. Investigators who use between-groups research meth- 
ods pursue external validity quite differently than do investiga- 
tors who use within-subjects research methods. 


External Validity and Between-groups 
Research Design 


As stated previously, researchers who use between-groups 
experimental designs claim two advantages for using large 
groups of subjects. In addition to the assumption that aggregat- 
ing the data from individual subjects controls for intersubject 
variability, !° group design researchers assume that including 
many subjects in an experiment increases the external validity 
of the results. On the surface this assumption is perfectly logical, 
and it is true when viewed at the proper level of extrapolation. 
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The more subjects with which a functional relation has been 
demonstrated, the greater the likelihood it will also be effective 
with untested subjects who share similar characteristics. And in 
fact, demonstrating a functional relation with various subjects 
in different settings is exactly how applied behavior analysts 
document external validity. 

The investigator who claims that the findings of a 
between-groups study possess generality to other individuals 
in the population from which the experimental subjects were 
chosen violates a fundamental premise of the between-groups 
method and ignores a defining characteristic of behavior. The 
proper inference about the results of a groups-design study is 
from the sample to the population, not from the sample to the 
individual. The random sampling methods in a well-executed 
between-groups study are followed to ensure that the study’s 
participants exemplify a heterogeneous sample of all relevant 
characteristics found in the population from which they were 
selected. The better the sample represents the population from 
which it is drawn, the less meaningful are the results for any 
individual subject. 

A second problem inherent in extending the results of 
a between-groups study to other people (and unless care is 
taken, sometimes even to a subject who participated in the 
study, as was illustrated in Figure 10.1) is that a between- 
groups experiment does not demonstrate a functional relation 
between the behavior of any subject and some aspect of his or 
her environment. From the perspective of behavior analysis, 
there is nothing in the results of a between-groups experiment 
that can have external validity; there is nothing to generalize. 
Johnston and Pennypacker (2009) made this point repeatedly 
and well. 


The researcher’s primary objective is to obtain data that 
truly represent the relationship between the independent 
variable and the dependent variable. If this is not accom- 
plished, nothing else matters. Only when the findings are 
“true” does the question of the meaningfulness under other 
circumstances become relevant. (p. 247) 

Between groups designs tend to put the cart before 
the horse. The tactics of exposing different levels of the 
independent variable to different groups, insuring that 
these groups contain a large number of participants, and 
treating their responses collectively provide comparisons 
that describe no member of any group. By failing to focus 
on individuals with careful attention to experiment control, 
these traditional methods greatly decrease the chances of 
discovering orderly relations in the first place. This makes 
the question of generality across individuals moot. (p. 346) 


Between-groups designs and statistical inference have 
long dominated research in psychology, education, and other 
social sciences. Despite its long-standing dominance, the extent 
to which this research tradition has contributed to an effective 
technology of behavior change is highly questionable (Baer, 
1977; Branch, 2014; Michael, 1974). The field of education is a 
telling example of the inability of groups-desi gn research to pro- 
vide data that lead to improved practice (Greer, 1983: Heward & 
Cooper, 1992). Classroom instruction and curricular decisions 
are often influenced more by fad, fashion, and ideology than 


by the cumulative knowledge and understanding provided by 
rigorous and sustained experimental analysis of the variables of 
which learning is a function (Heron, Tincani, Peterson, & Miller, 
2005; Heward, 2005; Kauffman, 2011; Zane, Weiss, Blanco, 
Otte, & Southwick, 2015). 

Between-groups experimentation is inappropriate 
for answering the questions of primary interest to applied 
behavior analysts—empirical questions that can be pursued 
only by analysis of repeated measures of individual behavior 
under all relevant conditions. We agree with Johnston and 
Pennypacker (1993b): 


We find the reasoning underlying all such procedures 
alien to both the subject matter and the goals of a 
natural science of behavior and regard the utility of 
group comparisons as extremely limited, no matter 

how elegant the mathematical treatment of data they 
afford. . . . [group comparison experimentation con- 
stitutes] a process of scientific inquiry that is almost 
totally inverted; instead of using questions about natural 
phenomena to guide decisions about experimental 
design, models of design are allowed to dictate both the 
form and content of the questions asked. Not only is this 
antithetical to the established role of experimentation 

in science, the types of questions allowed by groups 
comparison designs are largely inappropriate or irrele- 
vant to gaining an understanding of the determinants of 
behavior. (pp. 94-95) 


Our discussion of the limitations of between-groups 
designs for behavior analysis should not be confused with a 
position that group designs and statistical inference have no 
value as research methods for seeking scientific knowledge 
about the world. On the contrary, between-groups designs 
and statistical inference are highly effective tools for seeking 
answers to the kinds of questions for which they were devised. 
Properly designed and well-executed between-groups experi- 
ments provide answers with a specific degree of confidence (i.e., 
probability) to questions central to many large-scale evaluations. 
For example, a government body is less interested in the effects 
of a new regulation on any individual person (and perhaps even 
less interested in whether a functional relation exists between 
the regulation and that person’s behavior) than it is in the prob- 
ability that the behavior of a predictable percentage of the pop- 
ulation will be affected by the regulation (see Box 10.1). The 
former concern is behavioral and best pursued with single-case 
experimental designs that encompass the methods to address it. 
The latter concern is actuarial; random sampling and statistical 
inference of between-groups designs are appropriate tools for 
addressing it. 


External Validity and Applied Behavior Analysis 


Behavior analysts assess, establish, and specify the exter- 
nal validity, or scientific generality (Branch & Pennypacker, 
2013), of single-case research findings by replicating experi- 
ments. Replication in this context means repeating a previous 
experiment.!! Sidman (1960) described two major types of 
replication—direct and systematic. 
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BOX 10.1 


Between-Groups Research and Applied Behavior Analysis 


Between-groups research designs dominate the larger world 
of behavioral and social sciences. In studies using a between- 
groups design called the random control trial (RCT), each 
participant is randomly assigned to the treatment group or the 
no-treatment (control) group. A “double-blind” RCT elimi- 
nates potential bias by keeping membership in the two groups 
hidden from investigators and participants. Various statistical 
tools are used to determine if pre- and posttreatment measures 
of the dependent variable differ between the groups. Some 
policymaking bodies and researchers consider RCTs the gold 
standard for experimental evidence (e.g., U.S. Department of 
Education, 2003; Interactive Autism Network, 2019). 

Can RCTs and other types of between-groups designs 
help advance applied behavior analysis, a science whose 
basic principles and practices for changing behaviors have 
been discovered, refined, and demonstrated effective by 
single-case experiments? In our view, between-groups 
research designs can support applied behavior analysis in 
two principal ways. 

First, results from group studies showing positive effects 
of behavioral treatments may lead to increased awareness of 
ABA-informed interventions by policymakers, government 
officials, education and healthcare administrators who fund 
treatment initiatives, and comsumers who might advocate for 
evidence-based treatments. “A positive outcome of a group 
study may not make behavior analysts know more about con- 
trol of an individual client’s behavior, yet such an outcome 
may nonetheless make more people know about behavior 
analysis” (Iverson, 2013, p. 26). Although hundreds of single- 
subject studies had shown the positive effects of early intensive 


Direct Replication 


In a direct replication, the researcher makes every effort to 
duplicate exactly the conditions of an earlier experiment. If the 
same subject is used in a direct replication, the study 1s an intra- 
subject direct replication. Intrasubject replication within exper- 
iments is a defining characteristic of applied behavior analysis 
research and the primary tactic for establishing the existence 
and reliability of a functional relation. An intersubject direct 
replication maintains every aspect of a previous experiment 
except that different, although similar (e.g., same age, similar 
repertoires), subjects are involved. Intersubject replication is the 
primary method for determining the extent to which research 
findings have generality across subjects. 

The many uncontrolled variables in natural settings make 
the direct replication of experiments outside the laboratory 
extremely difficult. Nevertheless, intersubject replication is 
the rule rather than the exception in applied behavior analysis. 
Although numerous single-case studies involve just one sub- 
ject (e.g., Ahearn, 2003; Dixon & Falcomata, 2004; Fahmie, 
Iwata, & Mead, 2016; Tarbox, Williams, & Friman, 2004), the 
vast majority of published studies in applied behavior analysis 


behavioral intervention for children with autism, widespread 
recognition of the effectiveness of behavioral treatment for 
autism resulted, in large measure, from group comparison 
studies (see Eldevik et al., 2009, 2010; Smith, Groen, & Wynn, 
2000). In the public health sector, between-groups studies 
in the United Kingdom demonstrating that the Food Dudes 
program—a behavioral intervention package that includes peer 
modeling, video recording, rewards for eating healthy foods, 
and a parent packet—increased school children’s consumption 
of fruits and vegetables contributed to the Irish Government 
making the program available to all primary schools in Ireland 
(Horne et al., 2004, 2009). 

Second, disseminating between-groups studies evaluat- 
ing behavioral interventions in journals and at conferences that 
do not typically feature single-case design research increases 
awareness of treatments derived from behavior analysis by 
scientists in other disciplines, which may lead to collaboration 
and expansion of research agendas. When behavior analysts 
publish between-group studies in nonbehavioral journals, as 
has been the case with transportation safety (Van Houten, 
Retting, Farmer, Van Houten, & Malenfant, 2000), health care 
regimens (Raiff, Barry, Ridenour, & Jitnarin, 2016), smoking 
cessation (Dallery et al., 2017), and drug addiction (Madden, 
Petry, Badger, & Bickel, 1997), a wider audience learns how 
the principles and procedures of applied behavior analysis can 
improve the human condition. 

For additional discussion and examples of how group 
studies can contribute to the science and practice of applied 
behavior analysis, see Critchfield and Kollins (2001), Critch- 
field and Reed (2017), and Hanley (2017). 


include direct intersubject replications. This is because each 
subject can be considered an intact experiment. For example, 
a behavior analysis study in which the independent variable is 
manipulated in exactly the same way for six subjects in the same 
setting yields five intersubject replications. 


Systematic Replication 


The direct replication of experiments demonstrates the reliabil- 
ity of a functional relation, but the generality of that finding to 
other conditions can be established only by a series of experi- 
ments in which the conditions of interest are purposefully and 
systematically varied. In a systematic replication the researcher 
purposefully varies one or more aspects of a previous experi- 
ment. When a systematic replication successfully reproduces 
the results of previous research, it not only demonstrates the 
reliability of the earlier findings but also adds to the external 
validity of the earlier findings by showing that the same effect 
can be obtained under different conditions. In a systematic rep- 
lication, any aspect of a previous experiment can be altered: 
subjects, setting, administration of the independent variable, 
target behaviors. 
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Because systematic replication can provide new knowl- 
edge about the variables under investigation, it offers greater 
potential rewards than direct replication. Although systematic 
replication entails some risk, Sidman (1960) described it as a 
gamble well worth taking. 


If systematic replication fails, the original experiment will 
still have to be redone, else there is no way of determining 
whether the failure to replicate stemmed from the introduc- 
tion of new variables in the second experiment, or whether 
the control of relevant factors was inadequate in the first 
experiment. 

On the other hand, if systematic replication succeeds, 
the pay-off is handsome. Not only is the reliability of the 
original finding increased, but also its generality with 
respect to other organisms and to other experimental pro- 
cedures is greatly enhanced. Furthermore, additional data 
are now available which could not have been obtained by a 
simple repetition of the first experiment. (pp. 111-112) 


Sidman went on to explain that economic husbandry of 
limited resources must also play an important role in the scien- 
tist’s determination of how a research program should proceed. 
Direct replication of a long and costly experiment can provide 
data on only the reliability of a functional relation, whereas sys- 
tematic replication can provide information on the reliability and 
generality of the phenomena under investigation. 

The external validity of results of between-groups research 
is viewed as an inherent characteristic of a given experiment, 
as something that can be directly assessed by examining the 
methods used to conduct the study (e.g., sampling procedures, 
Statistical tests). But as Birnbrauer (1981) pointed out, exter- 
nal validity is not something a single study has, but rather the 
product of many studies. External validity can be pursued only 
through the active process of systematic replication. 


Generality is established, or more likely limited, by accu- 
mulating studies that are internally valid and by placing the 
results into a systematic context, i.e., seeking out the prin- 
ciples and parameters that particular procedures appear to 
be enunciating. The most informative studies ask how can 
an earlier positive result be repeated in the present circum- 
stances, with the present problem? (p. 122) 


Systematic replications make up much of the applied 
behavior analysis literature. Indeed, one could argue persua- 
sively that any applied behavior analysis study is a systematic 
replication of at least some aspect of an earlier experiment. 
Even when the authors have not pointed it out, virtually every 
published experiment reveals significant procedural similarity 
with previous experiments. However, as we are using the term 
here, systematic replication refers to concerted and directed 
efforts to establish and specify the generality of a functional 
relation. For example, Hamlet, Axelrod, and Kuerschner (1984) 
found a functional relation between demanded eye contact 
(e.g., “[Name], turn around”) and compliance with adult instruc- 
tions in two 11-year-old school children. Included in the same 
published report were the results of six replications conducted 
by the same researchers over a period of 1 year with nine stu- 
dents aged 2 to 21 years. Similar results were reproduced in 


eight of the nine replication subjects. Although some might con- 
sider this an example of direct intersubject replication, Hamlet 
and colleagues’ replications were conducted in various settings 
(i.e., classrooms, homes, institutions) and therefore should be 
viewed as a series of systematic replications that demonstrated 
not only the reliability of the results but also considerable gener- 
ality across subjects of different ages in different settings. 

Systematic replications across subjects sometimes reveal 
different patterns of effects, which the researcher might then 
study as a function of specific subject characteristics or con- 
textual variables. For example, Hagopian, Fisher, Sullivan, 
Acquisto, and LeBlanc (1998) reported the results of a series 
of systematic replications with 21 inpatient cases of functional 
communication training with and without extinction and pun- 
ishment.!? Lerman, Iwata, Shore, and DeLeon (1997) found 
that thinning FR | schedules of punishment (in which each 
response is punished) to intermittent punishment produced dif- 
ferent effects on the self-injurious behavior of five adults with 
profound intellectual disabilities. 

Some systematic replications are attempts to reproduce 
the results reported by another researcher in a different situ- 
ation or context. For example, Saigh and Umar (1983) suc- 
cessfully reproduced in a Sudanese classroom the positive 
results originally reported with the Good Behavior Game in 
an American classroom (Barrish, Saunders & Wolf, 1969: see 
Chapter 28). Saigh and Umar reported that a “considerable 
degree of support for the cross-cultural utility of the game was 
established” (p. 343). Subsequent systematic replications of the 
Good Behavior Game in Northern Ireland (Lynch & Keenan, 
2018) and the Netherlands (van Lier, Muthen, van der Sar, & 
Crijnen, 2004) lend additional support for the cross-cultural gen- 
erality of the intervention’s effectiveness. 

Researchers sometimes report multiple experiments, each 
experiment serving as a systematic replication investigating the 
variables influencing a given functional relation. For example, 
Fisher and colleagues (1993) conducted four studies designed to 
explore the effectiveness of functional communication training 
(FCT) with and without extinction and punishment. 

Systematic replication is evident when a research team 
pursues a consistent line of related studies over time. Exam- 
ples of this approach to replication can be found in Neef and 
colleagues’ experiments on the impulsivity of students with 
attention-deficit hyperactivity disorder (ADHD) (e.g., Bicard & 
Neef, 2002; Neef, Bicard, & Endo, 2001; Neef, Bicard, Endo, 
Coury, & Aman, 2005; Neef, Markel, et al., 2005); Miltenberger 
and colleagues’ line of research on teaching safety skills to 
children (e.g., Himle, Miltenberger, Gatheridge, & Flessner, 
2004; Johnson et al., 2006; Knudson et al., 2009; Miltenberger, 
2008; Miltenberger et al., 2004; Miltenherger et al., 2005); 
and Van Houten and colleagues’ studies investigating variables 
affecting motorists’ behavior and pedestrian safety spanning 
four decades (e.g., Huitema, Van Houten, & Manal, 2014; 
Lebbon, Austin, Van Houten, & Malenfant, 2007; Van Houten, 
Malenfant, Van Houten, & Retting, 1997; Van Houten, Nau, & 
Marini, 1980). 

The systematic replications necessary to extend a sig- 
nificant line of research often require the independent efforts 
of investigators at different sites who build upon one another’s 
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work. When independent teams of researchers at different geo- 
graphical locations report similar findings, the net result is a 
body of knowledge with significant scientific integrity and 
technological value. This collective effort speeds and enhances 
the refinement and rigorous testing of interventions that is nec- 
essary to the development and refinement of evidence-based 
practices (Hitchcock, Kratochwill, & Chezan, 2015; Horner & 
Kratochwill, 2012). 

An example of independent research teams at various 
sites reporting systematic replications across several decades is 
a body of studies analyzing the effects of response cards on stu- 
dents’ academic engagement, learning, and deportment during 
group instruction. !* Investigators have reported a similar pattern 
of results—increased participation during instruction, improved 
retention of lesson content, and/or reductions in off-task and 
disruptive behaviors—with response cards used with a wide 
range of students (general education students, special education 
students, English as a second language [ESL] learners), curric- 
ulum content (e.g., math, science, social studies, spelling), and 
instructional settings (e.g., preschool, elementary, middle, sec- 
ondary, and college classrooms) (e.g., Armendariz & Umbreit, 
1999; Bondy & Tincani, 2018; Cakiroglu, 2014; Cavanaugh, 
Heward, & Donelson, 1996; Davis & O’ Neill, 2004; Gardner, 
Heward, & Grossi, 1994; Hott & Brigham, 2018; Kellum, Carr, 
& Dozier, 2001; Lambert, Cartledge, Lo, & Heward, 2006; 
Marmolejo, Wilder, & Bradley, 2004; Wood, Mabry, Kretlow, 
Lo, & Galloway, 2009). 

Recognizing the crucial role replications play in the con- 
tinued development of applied behavior analysis, the editorial 
board of the Journal of Applied Behavior Analysis in 2017 insti- 
tuted a new category of articles labeled Replications. 


It will be expected that the more direct the replication, the 
shorter the article. More space, but not much more space, will 
be needed to describe systematic replications that duplicate 
procedures and their effects under an ever broadening set 

of conditions. A lengthy article may indeed be published in 
Replications if, for example, authors duplicate procedures, 
initially fail to replicate results, but then either demonstrate 
alternative controlling variables or isolate the conditions 
under which a functional relation occurs and the condi- 

tions under which it does not occur. (Hanley, 2017, p. 4) 


Planning, conducting, and presenting the results of an 
applied behavior analysis study requires a great deal of time, 
energy, and expertise. Increasingly, organizations representing 
scientific bodies, practitioner groups, and consumers are seeking 
systematic presentation of single-case studies. The What Works 
Clearinghouse, a program within the U.S. Department of Educa- 
tion’s Institute of Education Sciences, may have been the first to 
develop rubrics for evaluating and determining whether single- 
case studies would be included in research syntheses that they 
published. Maggin, Briesh, Chafouleas, Ferguson, and Clark 
(2014) compared five rubrics for identifying empirically sup- 
ported practices with single-case research. The consensus report 
from a group of 26 researchers offers guidance to researchers 
about what to include when submitting a research paper for 
journal publication using single-case experimental designs (Tate 
et al., 2016a, 2016b). 


EVALUATING APPLIED BEHAVIOR 
ANALYSIS RESEARCH 


Thus far we have identified a considerable number of require- 
ments for good applied behavior analysis. Our purpose now is 
to summarize those requirements in a sequence of questions one 
might ask in evaluating the quality of research in applied behav- 
ior analysis. Those questions can be organized under four major 
categories: internal validity, social validity, external validity, 
and scientific and theoretical significance. 


Internal Validity 


To determine whether an analysis of behavior has been made, 
the reader of an applied behavior analysis study must decide 
whether a functional relation has been demonstrated. This deci- 
sion requires a close examination of the measurement system, 
the experimental design, and the degree to which the researcher 
controlled potential confounds, as well as a careful visual ana- 
lysis and interpretation of the data. 


Definition and Measurement of the 

Dependent Variable 

The initial step in evaluating internal validity is to decide 
whether to accept the data as valid and accurate measures of 
the target behavior over the course of the experiment. Some of 
the important issues to be considered in this decision are cap- 
tured by the questions shown in Figure 10.11. 


Graphic Display 

If the data are accepted as a valid and accurate representation 
of the dependent variable over the course of the experiment, 
the reader should next assess the extent of stability of the target 
behavior during each phase of the study. Before evaluating the 
stability of the data paths, however, the reader should examine 
the graphic display for any sources of distortion (e.g., scaling of 
axes, distortions of time on the horizontal axis; see Chapter 6). 
The researcher or consumer who suspects that any element of 
the graph may encourage interpretations unwarranted by the data 
should replot the data using a new set of appropriately scaled 
axes. In an assessment of the stability of the dependent variable 
within the different phases of an experiment, the length of the 
phase or condition must be considered as well as the presence of 
trends in the data path. The reader should ask whether the con- 
ditions in effect during each phase were conducive to practice 
effects. If so, were these effects allowed to play themselves out 
before experimental variables were manipulated? 


Meaningfulness of Baseline Conditions 


The representativeness or fairness of the baseline conditions as 
the basis for evaluating subsequent performance in the presence 
of the independent variable should be assessed. In other words, 
were the baseline conditions meaningful in relation to the target 
behavior, setting, and research questions addressed by the exper- 
iment? For example, consider two experiments by Miller, Hall, 
and Heward (1995) that evaluated the effects of two procedures 
for conducting 1-minute time trials during a daily 10-minute 
practice session on the rate and accuracy with which students 


272 Part3 * Evaluating and Analyzing Behavior Change 


Figure 10.11 Questions that should be asked when evaluating the definition and measurement of the dependent 


variable in an applied behayios apelyas et 


Was the dependent variable precisely, completely, ae Sr iicictemt defined? 


Were examples and nonexamples of the target behavior provided, if doing so would enhance clarity? 
Were the most relevant, measurable dimensions of the target behavior specified (e.g., rate, duration)? 


Were important concomitant behaviors also measured? 


Were the observation and recording procedures appropriate for the target behavior? 
Did the measurement provide valid (i.e., meaningful) data for the problem or research question addressed? 


Was the measurement scale broad and sensitive enough to capture socially significant changes in the target behavior. 


Have the authors provided sufficient information on the training and calibration of observers? 


What procedures were used to assess and ensure the accuracy of measurement? 
Were interobserver agreement (IOA) assessments reported at the levels at which the study’s results are presented (e.g., by 


subject and experimental condition)? 


Were observation sessions scheduled during the times, activities, and places most relevant to the problem or research 


question? 


Did observations occur often enough and closely enough in time to provide a convincing estimate of behavior change over 


time? 


Were any contingencies operating in the study that may have influenced the observers’ behavior? 


Was there any expectation or indication that the dependent variable may have been reactive to the measurement system? If so, 


were procedures taken to assess and/or control reactivity? 


Were appropriate accuracy and/or reliability assessments of the data reported? 


answered math problems. Throughout all conditions and phases 
of both experiments, the students were instructed to answer as 
many problems as they could and they received feedback on 
their performance. Throughout both experiments, students’ 
worksheets were marked and scored as follows: 


Experimenters marked each student’s worksheets by putting 
an *X° next to incorrect answers. The number of correct 
answers over the total number of problems attempted was 
marked at the top of the first worksheet along with a positive 
comment to encourage the students to keep trying. If a stu- 
dent’s score was lower than his or her highest previous score, 
comments such as “Keep trying, Sally!” “Work faster!” or 
“Keep working on it!” were written. Whenever a student 
achieved his or her highest score to date, comments such as 
“Great job, Jimmy! This is your best ever!” were written on 
the packet. In the event a student equaled her highest previous 
score, “You tied your best score!” was written on the packet. 


At the beginning of each session, scored and marked 
worksheets from the prior day’s session were returned to the 
students. Each session during the 10-minute continuous work 
period condition that functioned as the baseline condition began 
with the classroom teachers saying to the students: “I want you 
to work hard and try to do your best. Answer as many problems 
as you can. Don’t worry if you do not answer all of the problems. 
There are more problems in the packet than anyone can do. Just 
try your best” (p. 326). 

The initial baseline (A) phase was followed by the two 
time-trial conditions (B and C) in an A-B-A-B-C-B-C design. 
Results for students in both classrooms showed a clear functional 


relation between both of the time-trial conditions and increased 
correct rate and accuracy over the baseline condition. However, 
if the classroom teachers had not instructed and reminded the 
students to do their best and to answer as many problems as they 
could prior to each baseline session, and if the students had not 
received feedback on their worksheets, the improved performance 
during the time-trial conditions would have been suspect. Even 
if a clear functional relation had been demonstrated against such 
baseline conditions, applied researchers and consumers could, 
and should, question the importance of such results. Maybe the 
children simply did not know they were expected to work fast. 
Perhaps the students would have solved problems in the baseline 
condition at the same high rates that they did in the time-trial 
conditions if they had been told to “go fast” and received feed- 
back on their performance, praise for their improvements, and 
encouragement to answer more problems. By including the daily 
instruction to work hard and answer as many problems as they 
could and by returning the worksheets to the students as compo- 
nents of the baseline condition, Miller and colleagues obtained 
meaningful data paths during baseline against which to test and 
compare the effects of the two time-trial conditions. 


Experimental Design 


The experimental design should be examined to determine the 
type of experimental reasoning it affords. What elements of 
the design enable prediction, verification, and replication? Is the 
design appropriate for the research questions addressed by the 
study? Does the design effectively control for confounding vari- 
ables? Does the design provide the basis for component and/or 
parametric analyses if such questions are warranted? 
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Brossart, Vannest, Davis, and Patience (2014) introduced 
the term design analysis for assessing single-case experimental 
designs for threats to internal validity and distinguished design 
analysis from visual analysis. 


A design analysis examines the structure of the experiment 
to see if it is valid for asserting a functional relationship, 
whereas a visual analysis looks at means, trends, imme- 
diacy and consistency of change in assessing type and 
amount of behaviour change. This visual analysis may 
inform a determination about a functional relationship, but 
not in isolation from evaluation of the design. (p. 468) 


Visual Analysis and Interpretation 


Although statistical methods for evaluating behavioral data and 
determining the existence of functional relations in single-subject 
experimental designs have been proposed for more than 40 years 
(e.g., Gentile, Roden, & Klein, 1972; Hartmann, 1974; Jones, 
Vaught, & Weinrott, 1977; Manolov & Solanas, 2013; Pfadt & 
Wheeler, 1995; Solomon, Howard & Stein, 2015), visual inspec- 
tion remains the most commonly used, and we believe the most 
appropriate, method for interpreting data in applied behavior 
analysis. Chapter 6 describes procedures and guidelines for visual 
analysis of level, trend, and variability behavioral data within and 
between experimental conditions. We will briefly present four 
factors that favor visual analysis in applied behavior analysis. 

First, applied behavior analysts have little interest in 
knowing that a behavior change is a statistically significant 
outcome of an intervention. Applied behavior analysts are con- 
cerned with producing socially significant behavior changes. If 
a test of statistical inference is required to determine whether an 
individual client or participant’s behavior has changed enough 
to improve his or her quality of life, the answer is no. Or, as Don 
Baer noted, “If a problem has been solved, you can see that; 
if you must test for statistical significance, you do not have a 
solution” (Baer, 1977, p. 171). 

Second, visual analysis is well suited for identifying vari- 
ables that produce strong, large, and reliable effects, which con- 
tribute to an effective, robust technology of behavior change. On 
the other hand, powerful tests of statistical analysis can detect 
the slightest possible correlation between the independent and 
dependent variables, which may lead to the inclusion of weak, 
unreliable variables in the technology. 

Two types of errors are possible when determining experi- 
mental effect (see Figure 10.12). A Type I error (also called a false 
positive) is made when the researcher concludes that the indepen- 
dent variable had an effect on the dependent variable, when in truth 
no such relation exists in nature. A Type II error (also called a false 
negative) is the opposite of a Type I error. In this case, the researcher 
concludes that an independent variable did not have an effect on 
the dependent variable, when in truth it did. '4 Tdeally, a researcher 
using well-reasoned experimental tactics coupled with a sound 
experimental design and buttressed by appropriate methods of data 
analysis will conclude correctly that a functional relation between 
the independent and dependent variables exists (or does not exist). 

Baer (1977) pointed out that the behavior analyst’s reliance 
on visual inspection to determine experimental effects results in 
a low incidence of Type I errors but increases the commission 


Functional Relation Exists in Nature 
Yes No 


Type | Error 
(false positive) 


Yes 


Correct Conclusion 


Type Il Error 


(false negative) Correct Conclusion 


Researcher Concludes 
Functional Relation Exists 
No 


Figure 10.12 Ideally, an experimental design and methods 
of data analysis help a researcher conclude correctly 

that a functional relation between the independent and 
dependent variables exists (or does not exist) when in 

fact such a relation does (or does not) exist in nature. 
Concluding that the results of an experiment reveal 

a functional relation when no such relation exists in 
nature is a Type | error. Conversely, concluding that 

that independent variable did not have an effect on the 
dependent variable when such a relation did occur is a Type 
Il error. 


of Type II errors. The researcher who relies on tests of statisti- 
cal significance to determine experimental effects makes many 
more Type I errors than the behavior analyst, but misses few, if 
any, variables that might produce some effect. 


Scientists who commit relatively many Type | errors are 
bound to memorize very long lists of variables that are 
supposed to affect diverse behaviors, some predictable 
portion of which are not variables at all. By contrast, sci- 
entists who commit very few Type | errors have relatively 
short lists of variables to remember. Furthermore, and 
much more important, it is usually only the very robust, 
uniformly effective variables that will make their list. 
Those who will risk Type | errors more often will uncover 
a host of weak variables. Unquestionably, they will know 
more, although some of that more is wrong, and much of 
it is tricky. . .. Those who keep their probability of Type 

2 errors low do not often reject an actually functional 
variable, relative to those whose Type 2 error probability 
is higher. Again, unquestionably, the practitioner with 

the lower probability of Type 2 errors will know more; 
but again, the nature of that more is seen often in its 
weakness, inconsistency of function, or its tight special- 
ization. .. . Individual-subject-design practitioners . . . nec- 
essarily fall into very low probabilities of Type | errors and 
very high probabilities of Type 2 errors, relative to their 
group-paradigm colleagues. As a result, they learn about 
fewer variables, but these variables are typically more 
powerful, general, dependable, and—very important— 
sometimes actionable. These are exactly the variables 

on which a technology of behavior might be built. 

(Baer, 1977, pp. 170-171) 
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A third problem with using statistical methods to deter- 
mine the existence of functional relations in behavioral data 
occurs with borderline data sets containing significant amounts 
of variability. Such data sets should motivate a researcher to 
engage in additional experimentation in an effort to achieve 
more consistent experimental control and to discover the factors 
causing the variability. The researcher who forgoes the addi- 
tional experimentation in favor of accepting the results of a test 
of statistical significance as evidence of a functional relation 
risks leaving important findings in the realm of the unknown. 
Statistical manipulation of data, no matter how elegant or 
sophisticated, will never replace manipulation of the variables 
of which behavior is a function. 


The situation where a significance test might seem helpful 
is typically one involving sufficient uncontrolled variabil- 
ity in the dependent variable that neither the experimenter 
nor his readers can be sure that there is an interpretable 
relationship. This is evidence that the relevant behavior is 
not under good experimental control, a situation calling for 
more effective experimentation, not a more complex judg- 
mental aid. (Michael, 1974, p. 650) 


Fourth, most statistical tests of significance involve certain 
assumptions can be applied only to data sets that conform to 
predetermined criteria (Solomon et al., 2015). If those statistical 
methods for determining experimental effects were to become 
highly valued in applied behavior analysis, researchers might 
begin to design experiments so that such tests could be com- 
puted. The resultant loss of flexibility in experimental design 
would be counterproductive to the continued development of 
behavioral science. 

Visual analysis is not without limitations. Poor inter- 
rater agreement on whether various data patterns demonstrate 
experimental control has long been a concern (e.g., Franklin, 
Gorman, Beasley, & Allison, 1996; Matyas & Greenwood, 1990; 
Ottenbacher, 1993). In one of the first studies assessing inter- 
rater agreement with visual analysis, DeProspero and Cohen 
(1979) asked 108 editorial board members and guest reviewers 
of JABA and Journal of the Experimental Analysis of Behavior 
(JEAB) to evaluate hypothetical data displayed in nine ABAB 
graphs on how convincingly (using a scale of 0 [low] to 100 
[high]) each graph demonstrated experimental control. Except 
for the most obvious graphs, those with “ideal patterns,” inter- 
rater agreement was relatively low (a 0.61 Pearson correlation 
coefficient). Diller, Barry, and Gelino (2016) asked 19 JABA and 
JEAB editorial board members and 90 Board Certified Behav- 
ior Analysts (BCBAs) to evaluate the extent to which pairs of 
hypothetical data paths displayed in 18 multielement design 
graphs demonstrated experimental control. These researchers 
also reported inconsistent agreement among reviewers. In con- 
trast, Kahng and colleagues (2010) found high interrater agree- 
ment among 45 editorial board members and associate editors 
of JABA from 2002 to 2004 on the extent to which and whether 
(yes or no) hypothetical data in 36 ABAB graphs demonstrated 
experimental control. 

A limitation of these studies examining interrater agree- 
ment with visual analysis is that participants were provided 


no context for the data. Without knowledge of crucial con- 
textual factors, an applied behavior analyst cannot interpret 
graphed data properly. Who was the subject? What behavior 
was measured? How was it measured? What were the behav- 
ior change goals? What was the treatment variable? How, 
when, where, and by whom was treatment implemented? 
What scales of behavior change and time do the vertical and 
horizontal axes represent? 

Various structured criteria, formal decision rules, and 
visual aids for interpretation of single-case data have been 
proposed (e.g. Fisch, 1998; Fisher, Kelley, & Lomas, 2003; 
Hagopian et al., 1997; Pfadt, Cohen, Sudhalter, Romanczyk, & 
Wheeler, 1992). Although multiple raters using the same set 
of rules or test of statistical aids will—if they apply the rules 
or compute the math correctly—achieve 100% agreement, that 
may not necessarily be a desirable outcome. What might be 
meaningful behavior change in one participant-treatment-setting 
context may not be in another. 

Visual analysis of graphed data has served applied behav- 
ior analysis well. Behavior analytic researchers and practitioners 
will continue to rely on it as the primary means of interpreting 
behavior change and identifying experimental control. However, 
statistical analysis of behavioral data can support and promote 
ABA research in several ways (see Box 10.2), and we encourage 
behavior analysts to follow the development of statistical tools. 


Social Validity 


The reader of a published study in applied behavior analysis 
should judge the social significance of the target behavior, the 
appropriateness of the procedures, and the social importance of 
the outcomes (Wolf, 1978). 

Chapter 3 detailed many considerations that should guide 
the applied behavior analyst’s selection of target behaviors. The 
social validity of the dependent variable should be assessed in 
light of those factors. Ultimately, all of the issues and consider- 
ations relative to target behavior selection point to one question: 
Will an increase (or decrease) in the measured dimension of this 
behavior improve the person’s life directly or indirectly? 

The independent variable should be evaluated not only in 
terms of its effects on the dependent variable, but also in terms 
of its acceptability, practicality, and cost. Regardless of their 
effectiveness, treatments perceived by practitioners, parents, 
and/or clients as unacceptable or undesirable for whatever rea- 
son are unlikely to be used. Consequently, such treatments will 
never have the chance to contribute to a technology of behavior 
change. The same can be said of independent variables that 
are extremely complex and thus difficult to learn, teach, and 
apply. Similarly, treatment procedures that require excessive 
time and/or money to implement have less social validity than 
do procedures that can be applied quickly and inexpensively. 

Even though behavior change is clearly visible ona graphic 
display, it may not represent a socially valid improvement for 
the participant and/or significant others in his environment. 
In evaluating the results of an applied behavior analysis study, 
the reader should ask questions such as these: Is the participant 
(or significant others in the participant’s life) better off now that 
the behavior has changed? Will this new levei of performance 
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BOX 10.2 


Supplementing Visual Analysis with Statistical Assessments 


When evaluating the effects of behavioral interventions, no 
amount of statistical computation can replace experimental 
control of variables and visual analysis of the data. However, 
visual analysis of data from single-case designs can be supple- 
mented with statistical assessment. Behavioral researchers are 
using statistical tools with increasing frequency to calculate 
a standard metric of effect and to calculate the overall effect 
of multiple studies investigating a common research question. 


Effect Size: Quantifying the Magnitude 
of Treatment Impact 


Effect size, a statistical measure of how much a treatment 
variable influenced a dependent variable, is widely used in 
group design studies. One type of effect size calculated with 
Cohen’s d formula ranges from —1.00 to + 1.00. Cohen (1988) 
described effect sizes of +0.2 as small, +0.5 as medium, and 
+0.8 as large. A negative effect size indicates that a treatment 
variable has a reductive or harmful effect. Cohen acknowl- 
edged the danger of using such terms without noting the 
study’s context and the resources required to obtain the effect 
(e.g., a small effect from an easy-to-implement treatment 
accepted by staff and clients may be a more valuable outcome 
than a large effect obtained with a costly treatment disfavored 
by staff and clients). 

There is no current consensus on how to derive effect 
sizes for single-case design studies. The U.S. Department of 
Education’s What Works Clearinghouse (2018) recommends 
calculating the magnitude of treatment effects in single-case 
studies with a variety of tools, suggesting that stronger con- 
clusions may be drawn if similar results are obtained from 
multiple approaches. If the magnitude of treatment effects is 
properly calculated, anyone using the same statistical tools will 
obtain the same result. The difficulty is “choosing from the 
bewildering number of statistical methods available” (Brossart, 
Vannest, Davis, & Patience, 2014, p. 467). 

To explore the question of whether different statistical 
tools can be used for determining the magnitude of effects from 
single-case studies, Shadish (2014) asked the developers of 
five different statistical approaches to analyze the nine ABAB 
data sets from Lambert, Cartledge, Heward, and Lo’s (2006) 
study on the effects of response cards on disruptive behavior 


result in increased reinforcement (or decreased punishment) for 
the subject now or in the future? In some instances it is relevant 
to ask whether the subject (or significant others) believes that 
her behavior has improved (Wolf, 1978). 


Maintenance and Generalization 
of Behavior Change 


Improvements in behavior are most beneficial when they are 
long-lasting, appear in other appropriate environments, and spill 


during math lessons by nine fourth-grade students (Figure 8.3 
shows data for four of the students). The different approaches 
found effects “not identical but reasonably consistent with each 
other and with visual inspection” of the data (Shadish, 2014, 
p. 112). In commenting on these results, Fisher and Lerman 
(2014) noted that the data sets in Lambert et al.’s study showed 
“mostly large and unambiguous treatment effects” and that 
Shadish “acknowledged that more definitive comparison of 
visual and statistical analyses will require further refinements 
in statistical methods” (p. 247). 


Meta-Analysis: Statistical Synthesis of Related Studies 


Meta-analysis entails a variety of statistical techniques for 
aggregating the effect sizes of a group of studies in a specific 
area (Borenstein, Hedges, Higgins, & Rothstein, 2009). Two 
examples are Maggin, O’ Keefe, and Johnson’s (2011) syn- 
thesis of 25 years of special education studies, and Stockard, 
Wood, Coughin, and Khoury’s (2018) quantitative review 
of 328 studies on Direct Instruction published over a half 
century. 

Meta-analyses that aggregate the individual partici- 
pant’s data across studies are sometimes called mega-analysis 
(H. Cooper & Patall, 2009). An example is Eldevik et al.’s 
(2009) mega-analysis of 16 group design studies on behavioral 
intervention for children with autism. The investigators ana- 
lyzed the results for 453 individual participants (309 children 
who received behavioral intervention, 39 children who 
received comparison interventions, and 105 children who were 
in control groups). Results showed that a far higher percent- 
age of children who received behavioral intervention achieved 
reliable change in IQ score (at least +27 points) and adaptive 
behavior (at least 21 points) than did children in the compari- 
son and control groups. 

Readers interested in learning more about statistical tools 
for single-case data can examine the What Works Clearinghouse 
(2018) Standards Handbook; explore the free web-based calcu- 
lators and manuscripts archived by Vannest, Parker, Gonen, and 
Adiguzel (2016); and study articles by single-case researchers 
with expertise in statistical methods (e.g., Fisher & Lerman, 
2014: Kratochwill & Levin, 2014; Solomon, Howard, & 
Stein, 2015). 


over to other related behaviors. Producing these kinds of effects 
is a major goal of applied behavior analysis. (Chapter 30 exam- 
ines strategies and tactics that promote generalized behavior 
change.) When evaluating applied behavior analysis research, 
consumers should consider the maintenance and generalization 
of behavior change in their evaluation of a study. An impressive 
behavior change that does not last or is restricted to a specialized 
training setting has limited value. 

Did the researchers report the results of assessing main- 
tenance and generalization through follow-up observations and 
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measurement in nontraining environments? If maintenance and/ 
or generalization were not evident in follow-up observations, did 
the experimenters modify their design and implement procedures 
in an attempt to produce and analyze the occurrence of main- 
tenance and/or generalization? Additionally, the reader should 
ask whether response generalization—changes in functionally 
similar but untreated behaviors concomitant with changes in the 
target behavior(s)—is an appropriate concern in a given study. If 
so, did the experimenters attempt to assess, analyze, or discuss 
this phenomenon? 


External Validity 


As discussed earlier, the generality of an experiment’s findings 
to other subjects, settings, and behaviors cannot be assessed by 
inherent aspects of the study itself. The generality of a behavior— 
environment relation can be established only through the active 
process of systematic replication. Therefore, the reader of an 
applied behavior analysis study should compare the study’s 
results with those of other published studies sharing relevant 
features. The authors of a published report identify in the paper’ s 
introduction the experiments that they believe are most rele- 
vant. To judge whether and the extent to which a given study 
contributes to the generality of previous findings, the reader 
must examine previous studies in the literature and compare 
the results of these studies with those of the current experiment. 

Although external validity should not be considered a 
characteristic of a study per se, various features of a single-case 
experiment suggest an expected, or likely, level of generality 
for the results. For example, an experiment that demonstrated 
a functional relation of similar form and degree in six sub- 
jects of different ages, cultural and linguistic backgrounds, 
and skills levels suggests a higher probability of generality to 
other subjects than would an identical study demonstrating the 
same results across six subjects of the same age, background, 
and skill level.'° Additional confidence in the generality of the 
results may be warranted if the experiment was conducted in 
various settings and different people administered the indepen- 
dent variable. 

A research finding without extensive external validity can 
make a meaningful contribution to scientific knowledge. Johnston 
and Pennypacker (2009) point out, “A procedure can be quite valu- 
able even though it is effective in a narrow range of conditions, 
as long as we know what those conditions are” (pp. 343-344). 


Theoretical Significance and Conceptual Sense 


A published experiment should also be evaluated in terms of its 
scientific merit. It is possible for a study to demonstrate a func- 
tional relation between the independent variable and a socially 
important target behavior—and thus be judged significant from 
an applied perspective—yet contribute little to the advancement 
of the field.'® It is possible to reliably reproduce an important 
behavior change while at the same time not fully understand 
which variables are responsible for the observed functional rela- 
tion. Sidman (1960) differentiated this kind of simple reliability 
from “knowledgeable reproducibility,” a more complete level of 
analysis in which all of the important factors have been identi- 
fied and controlled. 


After examining the majority of experimental articles pub- 
lished in the first 10 volumes of the Journal of Applied Behavior 
Analysis (1968 to 1977), Hayes, Rincover, and Solnick (1980) 
concluded that a technical drift had occurred in the field away 
from conceptual analyses and toward an emphasis on client cure. 
They warned of a likely loss of scientific understanding as a 
result of focusing purely on the technical aspects of improv- 
ing behavior in applied settings, and they recommended an 
increased effort to perform more thorough analyses of behavior. 


[T]hese more complicated analyses may increase our 
knowledge of the actual functional variables and subse- 
quently increase our ability to generate more efficient and 
general behavioral programs. Perhaps, we have gone too 
far in our attempt to be immediately applied at the expense 
of being ultimately more effective, in failing to encourage 
more analogue and analytical studies that have treatment 
implications. (Hayes et al., 1980, p. 283) 


Baer, Wolf, and Risley (1987), writing in the 20th anni- 
versary issue of the Journal of Applied Behavior Analysis, 
emphasized the need to shift from demonstrations of behavior 
changes—as convincing as they might be—to a more complete 
analysis and conceptual understanding of the principles that 
underlie the successful demonstrations. 


Twenty years ago, analytic meant a convincing experimen- 
tal design, and conceptual meant relevance to a compre- 
hensive theory about behavior. . .. Now, applied behavior 
analysis is more often considered an analytic discipline 
only when it demonstrates convincingly how to make 
specified behavior changes and when its behavior-change 
methods make systematic, conceptual sense. In the past 

20 years, we have sometimes demonstrated convincingly 
that we had changed behavior as specified, but by methods 
that did not make systematic, conceptual sense—it was not 
clear why those methods had worked. Such cases let us see 
that we were sometimes convincingly applied and behay- 
ioral, yet even.so, not sufficiently analytic. (p. 318) 


Morris (1991) expressed concern over the rise of research 
that “demonstrates the effects of behavioral interventions at the 
expense of discovering . . . actual controlling relationships” 
(p. 413). Numerous authors have discussed the importance of 
focusing on the analysis side of applied behavior analysis as 
much as the applied side (e.g., Baer, 1991; Birnbrauer, 1979, 
1981; Deitz, 1982; Iwata, 1991: Johnston, 1991; see Schlinger, 
2017 for a review). 

We agree with the need for more sophisticated, thorough 
analyses of the variables controlling socially important behavior. 
Fortunately, examination of the recent literature reveals numer- 
ous examples of studies featuring component analyses and 
parametric analyses that are necessary steps to a more complete 
understanding of behavior—an understanding that is prerequi- 
site to the development of a thoroughly effective technology 
of behavior change. Several of the studies cited earlier in this 
chapter as examples of systematic replication incorporated com- 
ponent and parametric analyses. 

The extent of a phenomenon’s generality is known only 
when all of the necessary and sufficient conditions for its 
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reproducibility have been specified. Only when all of the vari- 
ables influencing a functional relation have been identified and 
accounted for can an analysis be considered complete. Even 
then, the notion of a complete analysis is misleading: “Further 
dissection or elaboration of either variable in a functional rela- 
tion inevitably reveals fresh variability, and analysis proceeds 
anew. . . . the analysis of behavior can never be complete” 
(Pennypacker, 1981, p. 159). 

Evaluation of scientific significance takes into consider- 
ation such things as the authors’ technological description of 
the experiment as well as their interpretation and discussion of 
the results. Are the procedures described in sufficient detail so 
that at least the unique aspects of the study can be replicated?!” 

Readers should consider the level of conceptual integrity 
displayed in an experimental report. Does the literature review 
reveal a careful integration of the study with previous research? 
Does the literature review provide sufficient justification for the 
study’s research questions? Are the authors’ conclusions based 
on the data obtained in the study? Have the authors respected 


SUMMARY A eepnandigns 


Importance of the Individual Subject in Behavior 
Analysis Research 


1. Behavior analysis research methods feature 
repeated measures of the behavior of individual 
organisms within and across experimental 
conditions. 


2. By contrast, a between-groups experiment follows a 
sequence like this: 


¢ A pool of subjects is selected randomly from the 
population of interest. 


¢ The subjects are divided randomly into an experimental 
group and a control group. 


¢ A pretest measure of the dependent variable 1s 
obtained for all subjects in each group, individual 
pretest scores for the subjects in each group are 
combined, and the means and standard deviations are 
calculated. 


¢ Subjects in the experimental group are exposed to the 
independent variable. 

¢ A posttest measure of the dependent variable is obtained 
for all subjects, and means and standard deviations are 
computed for each group. 


* Various statistical tests compare any changes in 
each group’s scores from pretest to posttest, apply- 
ing to the data that enable inferences regarding the 
likelihood that any differences between the two groups’ 
performances can be attributed to the independent 
variable. 


3. Knowing that the average performance of a group of sub- 
jects changed may not reveal anything about the perfor- 
mance of individual subjects. 


the difference between basic principles of behavior and behay- 
ior change tactics? Do the authors speculate beyond the data 
without making it clear that they are doing so? Do the authors 
suggest directions for additional research to further analyze the 
problem studied? Is the study important for reasons other than 
the results actually obtained? For example, an experiment that 
demonstrates a new measurement technique, investigates a new 
dependent or independent variable, or incorporates a novel tac- 
tic for controlling a confounding variable can contribute to the 
scientific advancement of behavior analysis, even though the 
study failed to achieve experimental control or produce socially 
significant behavior change. 

Numerous criteria and considerations are involved in eval- 
uating the “goodness” of an applied behavior analysis study. 
Although each criterion is important on one level or another, it is 
unlikely that any experiment will meet all of the criteria. And, in 
fact, it is unnecessary for an experiment to do so to be considered 
worthwhile. Nevertheless, incorporating these considerations 
into a study enhances its social significance and scientific value. 


4. To be most useful, a treatment must be understood at the 
level at which people come into contact with it and are 
affected by it: the individual level. 


5. Mean performance of a group of subjects hides intersubject 
variability in the data. Attempting to cancel out variability 
through statistical manipulation neither eliminates its pres- 
ence in the data nor controls the variables responsible for it. 


6. When repeated measurement reveals significant variability, 
the researcher should seek to identify and control the 
factors responsible for it. 


7. The researcher who attributes the effects of unknown or 
uncontrolled variables to chance is unlikely to identify and 
analyze important variables. 


8. To control the effects of any variable, a researcher must 
either hold it constant throughout the experiment or 
manipulate it as an independent variable. 


9. Group data may not represent actual behavioral processes. 


10. A great strength of within-subject experimental designs 
is the convincing demonstration of a functional relation 
made possible by replication within the design itself. 


11. The overall performance of a group is socially significant 
in many situations. 


12. When group results do not represent individual perfor- 
mances, researchers should supplement group data with 
individual results. 


13. When the behavior analyst cannot control access to 
the experimental setting or identify individual subjects, the 
dependent variable must consist of the responses made by 
individuals who enter the experimental setting. 
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Importance of Flexibility in Experimental Design 


14. 


15. 


16. 


17. 


18. 


An effective experimental design is any sequence of 
independent variable manipulations that produces data 
that effectively and convincingly address the research 
question(s). 


No ready-made experimental designs await selection, nor 
is there a set of rules that must be followed. 


To investigate the research question(s) of interest, an 
experimenter must often build an experimental design that 
employs a combination of analytic tactics. 


A component analysis is any experiment designed to 
identify the active elements of a treatment condition, 

the relative contributions of different components in a 
treatment package, and/or the necessity and sufficiency 
of treatment components. Two basic methods are used for 
conducting component analyses: 


* drop-out component analysis: the investigator presents 
and assesses the treatment package as a whole, then 
systematically removes components. 


* add-in component analysis: the investigator presents and 
assesses components individually or in combination, 
then presents the complete treatment package. 


The most effective experimental designs use ongoing 
evaluation of data from individual subjects as the basis 
for employing the three elements of baseline logic— 
prediction, verification, and replication. 


Internal Validity: Controlling Potential Sources 
of Confounding in Experimental Design 


19. 


20. 


21. 


22. 


23. 


24. 


Experiments that demonstrate a clear functional relation 
between the independent variable and the target behavior 
have a high degree of internal validity. 


An experimental design’s strength is determined by the 
extent to which it (a) demonstrates a reliable effect and 
(b) eliminates or reduces the likelihood that factors other 
than the independent variable produced the behavior change. 


Although the term experimental control implies that 
the researcher controls the subject’s behavior, the 
researcher can control only some aspect of the subject’s 
environment. 


A confounding variable is an uncontrolled factor known 
or suspected to have exerted influence on the dependent 
variable. 


Confounding variables can be viewed as related primarily 
to one of four elements of an experiment: subject, setting, 
measurement of the dependent variable, and independent 
variable. 


A placebo control separates effects that may be produced 
by a subject’s expectations of improvement as a result of 
receiving treatment from the effects actually produced by 
the treatment. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


53: 


34. 


With a double-blind control procedure neither the sub- 
ject(s) nor the observers know when the independent vari- 
able is present or absent. 


Treatment integrity refers to the extent to which the inde- 
pendent variable is implemented or carried out as planned. 


Procedural fidelity refers to the extent to which procedures 
in all conditions of an experiment, including baseline, are 
correctly implemented. 


Low treatment integrity invites a major source of con- 
founding into an experiment, making it difficult, if not 
impossible, to interpret the results with confidence. 


One threat to treatment integrity, treatment drift, occurs 
when application of the independent variable during later 
phases of an experiment differs from the way the treatment 
was applied at the study’s outset. 


Achieving a high level of treatment integrity begins with a 
precise operational definition of treatment procedures. 


Simplifying and standardizing the independent variable 
and providing criterion-based training and practice for the 
people responsible for implementing it enhance treatment 
integrity. 


Researchers should not assume that a person’s general 
competence or experience in the experimental setting, or 
that providing the intervention agent with detailed written 
instructions or a script, will ensure a high degree of treat- 
ment integrity. 


Treatment integrity/procedural fidelity data describe the 
extent to which the actual implementation of experimen- 
tal procedures matches their descriptions in the method 
section of a research report. 


Treatment integrity/procedural fidelity data are required 
for a study’s inclusion in systematic reviews of literature 
conducted by professional and government organizations. 


Social Validity: Assessing the Applied Value 
of Behavior Changes and the Treatments 
That Accomplish Them 


55: 


36. 


Sis 


38. 


The social validity of an applied behavior analysis study 
should be assessed in three ways: the social significance of 
the behavior change goals, the appropriateness of the inter- 
vention, and the social importance of the results. 


Experts and persons who use the prospective skill in nat- 
ural environments can help researchers identify socially 
valid target behaviors. 


Socially valid goals can be determined empirically by 
assessing the performance of individuals judged to be highly 
competent, and experimentally manipulating different levels 
of performance to determine socially valid outcomes. 


Several scales and questionnaires for obtaining consumers’ 
opinions of the acceptability of behavioral interventions 
have been developed. 
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39. Preference for interventions can be assessed directly by 
arranging experimental conditions in which practitioners 
or clients can choose which of multiple treatments to 
implement or receive. 


40. Methods for assessing the social validity of outcomes 
include: (a) having consumers rate the social validity 
of participants’ performance, (b) asking experts to 
evaluate participants’ performance, (c) comparing 
participants’ performance to that of a normative sample, 
(d) standardized assessment instruments, and (e) real- 
world tests of participants’ performance. 


External Validity: Replicating Experiments 
to Determine the Generality of Research Findings 


41. External validity refers to the degree to which a functional 
relation found reliable and socially valid in a given experi- 
ment operates under different conditions. 


42. Group design researchers assume that including many 
subjects increases the external validity, or generality, of a 
study’s findings to other individuals. 


43. The assumption of increased generality violates a funda- 
mental premise of the between-groups method: The proper 
inferences about the results of a groups design study are 
from the sample to the population, not from the sample to 
the individual. 


44. A between-groups experiment does not demonstrate a 
functional relation between the behavior of any subject 
and some aspect of his or her environment; the external 
validity of its findings is moot. 


45. Although between-groups designs and tests of statistical 
significance are necessary and effective tools for answer- 
ing certain types of research questions, they have contrib- 
uted little to an effective technology of behavior change. 


46. The generality of research findings in applied behavior 
analysis is assessed, established, and specified by replicat- 
ing experiments. 


47. In a direct replication, the researcher makes every effort to 
duplicate exactly the conditions of an earlier experiment. 


48. In a systematic replication, the researcher purposefully 
varies one or more aspects of an earlier experiment. 


ASTRON eco SIE RRUSNONNASEITESENIDTON 


KEY TERMS 


add-in component analysis placebo control 
component analysis procedural fidelity 
direct replication replication 
double-blind control social validity 


drop-out component analysis systematic replication 


49. 


50. 


When a systematic replication successfully reproduces 
the results of previous research, it not only demonstrates 
the reliability of the earlier findings but also adds to the 
generality of the earlier findings by showing that the same 
effect can be obtained under different conditions. 


Systematic replications occur in both planned and 
unplanned ways through the work of many experimenters 
in a given area, and they result in a body of knowledge 
with significant scientific integrity and technological 
value. 


Evaluating Applied Behavior Analysis Research 


51. 


52. 


53. 


54. 


Sh 


56. 


The quality and value of an applied behavior analysis 
study may be evaluated by seeking answers to a sequence 
of questions related to internal validity, social validity, 
maintenance and generalization of behavior change, exter- 
nal validity, and scientific and theoretical significance. 


A Type I error occurs when a researcher concludes that 
the independent variable had an effect on the dependent 
variable when it did not. A Type Ii error occurs when a 
researcher concludes that the independent variable did not 
have an effect on the dependent variable when it did. 


Visual analysis effectively identifies variables that produce 
strong, large, and reliable effects, which contribute to an 
effective and robust technology of behavior change. Sta- 
tistical analysis detects the slightest possible correlations 
between the independent and dependent variables, which 
may lead to the identification and inclusion of weak and 
unreliable variables in the technology. 


A study can yield significant results from an applied 
perspective—demonstrate a functional relation between 
the independent variable and a socially important target 
behavior—yet contribute little to the advancement of the 
field. 


Only when all of the variables influencing a functional 
relation have been identified and accounted for can an 
analysis be considered complete. 


When evaluating the scientific significance of a research 
report, readers should consider the technological descrip- 
tion of the experiment, the interpretation and discussion of 
the results, and the level of conceptual sense and integrity. 


treatment drift 
treatment integrity 
treatment package 
Type I error 

Type I error 
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== 7 ccnencoenenaneEnIIe SSSR PERERA LADO ISTE 
———— a SERIE SEER ERENCE SSIDEO OO RR SS SEEN SOSSDASASSOIOI poner TERI 


MULTIPLE-CHOICE QUESTIONS 


1. The most useful level to understand the effect of a 6. An uncontrolled factor that is known or suspected to have 


treatment is the: 

a. Group level 

b. Individual level 

c. Procedural level 

d. Implementation level 

Hint: (See “Importance of the Individual Subject in 
Behavior Analysis Research’) 


. Applied behavior analysis research has identified effective 
interventions for socially significant behavior by focusing 
on the 


a. Behavior of individuals with cognitive delays 

b. Common behavior of groups of individuals 

c. Behavior of individual subjects 

d. Behavior of professionals 

Hint: (See “Importance of the Individual Subject in 
Behavior Analysis Research’’) 

. Combining analytic tactics in an experimental design is a 
good idea when 


a. It allows for a more convincing demonstration of 
experimental effect 


b. The research question is too broad for a single 
design 


c. The procedures are quite complex 

d. None of these is a justification for combining tactics 
Hint: (See “Experimental Designs Combining Analytic 
Tactics’’) 

. The research tactic for examining two or more elements of a 
treatment package is called a(n) 

a. Alternating treatment design 

b. Multiple baseline across interventions 

c. Multiple treatments 

d. Component analysis 

Hint: (See “Experimental Designs Combining Analytic 


Tactics’’) 


- Experiments that demonstrate a clear functional relationship 
between the independent variable and the observed behavior 
are said to have a high degree of 


a. Internal validity 
b. External validity 
c. Social validity 
d. Reliability 


Hint: (See “Internal Validity: Controlling Potential 
Sources of Confounding in Experimental Design’) 


10. 


exerted influence on the dependent variable is called a(n) 
a. Extraneous factor 

b. Confounding variable 

c. Behavioral control 

d. Maintaining event 


Hint: (See “Internal Validity: Controlling Potential 
Sources of Confounding in Experimental Design’) 


. All of the following are methods for assessing the social 


validity of outcomes except: 

a. Comparing participants’ performance to the performance 
of a normative sample 

b. Using standardized assessment instruments 

c. Asking consumers to rate the social validity of participant 
outcomes 

d. Asking the researchers to rate the social validity of 
participant outcomes 


Hint: (See “Social Validity: Assessing the Applied Value of 
Behavior Changes and the Treatments That Accomplish Them’’) 


. The ultimate purpose of social validity assessment is to 


a. Guide behavior change program development and 
application 


b. Improve researchers’ chances of publication 
c. Improve the relationship between researchers and clients 
d. Increase the magnitude of behavior change 


Hint: (See “Social Validity: Assessing the Applied Value of 
Behavior Changes and the Treatments That Accomplish Them’) 


. The degree to which a functional relationship can be 


demonstrated in circumstances that are different from the 
experimental conditions is called: 


a. Internal validity 

b. Treatment acceptability 

c. Replication 

d. External validity 

Hint: (See “External Validity: Replicating Experiments to 
Determine the Generality of Research Findings”) 

Which of the following research designs have contributed the 
least in terms of effective technology for behavior change? 
a. Group comparison designs 

b. Multiple baseline across settings 

c. Reversal designs 

d. Component analyses 


Hint: (See “External Validity: Replicating Experiments to 
Determine the Generality of Research Findings’) 


11. 
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Which of the following is a factor not to consider when 


determining whether or not a functional relation has been 
demonstrated in an experiment? 


a. The degree to which the researcher controlled potential 
confounds 


b. An examination of the measurement system 
¢c. Visual analysis and interpretation of the data 
d. Systematic replication of effects 


Hint: (See “Evaluating Applied Behavior Analysis 
Research’’) 


SiC SSERSCS EE SDOE VE CRE EEE BS HON AIEEE OEIC LL OE 


ESSAY-TYPE QUESTIONS 


1. Give at least two reasons why knowledge of group 


performance means is inadequate information for the 
behavior analyst to evaluate the effectiveness of a 
treatment. 


Hint: (See “Group Data May Not Represent the 
Performance of Individual Subjects”) 

Explain two benefits of building an experimental design 
using a combination of analytic tactics. 


Hint: (See “Experimental Designs Combining Analytic 
Tactics’) 


. Define treatment integrity and explain its importance in 


applied behavior analysis research. 
Hint: (See “Independent Variable Confounds’”) 


psSOREHE GH DRNGIOONO NEE NNN NONLIN TERRES 


NOTES 


ile 


Between-groups experimental designs are characterized by few measures 
of the dependent variable (often just two: pretest and posttest) across many 
subjects. In contrast, single-case studies require many measures of the 
dependent variable across a few participants (sometimes just one). Behav- 
ioral researchers occasionally employ pretest-treatment-posttest designs, 
typically reporting the results for each subject as well as group means 
(e.g., Keintz, Miguel, Kao, & Finn, 2011; Sawyer et al., 2017). 

This brief sketch of the simplest form of a between-groups design study 
omits many important details and controls. Readers interested in a thor- 
ough explication of group research methods should consult an authori- 
tative text (e.g., Adams & Lawrence, 2019; Campbell and Stanley, 1963; 
Shaddish Cook, & Campbell, 2001). Blampied (2013), Branch (2014), and 
Colquhoun (2014, 2016) describe how the decades-long reliance on sta- 
tistical significance testing, first championed by the agricultural scientist 
and geneticist R. A. Fisher (1925, 1935), has impaired robust and reliable 
scientific progress. 

A complete discussion of the problems posed by commingling data from 
multiple subjects is beyond the scope of this text. We encourage readers 
seeking a more complete understanding of these important issues to read 
Branch and Pennypacker (2013), Johnston and Pennypacker (1980, 1993b, 
2009), and Sidman (1960). 

The posttest data point in the left-hand graph of Figure 10.1 is reminiscent of 
the man whose bare feet were in a bucket of ice while his head was on fire. 
Asked how he was feeling, the man replied, “On the average, I feel fine.” 


12. 


SER ETE TD CIT IEEE EEE ISEB EAE EL REET 


Which of the following is not a reason that applied behavior 
analysts favor visual analysis of data over statistical methods? 


a. Statistical significance is not the target outcome for 
behavior analysts. 


b. Visual analysis is well suited for demonstrating strong 
effects. 


c. Highly variable data may not be statistically significant. 


d. Predetermined criteria of statistical tests offer less 
flexibility in experimental design. 


Hint: (See “Evaluating Applied Behavior Analysis Research”) 


Describe the ways that Wolf (1978) suggested that social 
validity should be assessed in applied behavior analysis. 


Hint: (See “Social Validity: Assessing the Value of 


Behavior Changes and the Treatments That Accomplish 
Them’) 


. Compare and contrast direct replication with systematic 


replication. What does each type of replication 
demonstrate? 


Hint: (See “External Validity and Applied Behavior 
Analysis”) 


. Explain the difference between a Type I and a Type I 


error in experimental research. 


Hint: (See “Evaluating Applied Behavior Analysis Research’) 


The analytic tactics presented in Chapters 8 and 9 should not be considered 
experimental designs in and of themselves for another reason: All exper- 
iments incorporate design elements in addition to the type and sequence 
of independent variable manipulations (e.g., subjects, setting, dependent 
variable, measurement system). 

Time-out and differential reinforcement of other behavior (DRO) are 
described in Chapters 15 and 25, respectively. 

Treatment integrity can be manipulated as an independent variable to ana- 
lyze the effects of full versus various levels of fidelity, or types of treatment 
“mistakes,” in implementing an intervention on participants’ behavior 
(e.g., Carroll, Kodak, & Fisher, 2013; DiGennaro Reed, Reed, Baez, & 
Maguire, 2011; Groskreutz, Groskreutz, & Higbee, 2011; Pence & 
St. Peter, 2015; St. Peter, Byrd, Pence, & Foreman, 2016 [see discussion 
of omission errors and commission errors in Chapter 15]; Wilder, Atwell, 
& Wine, 2006 [see Figure 8.13]). 

Headsprout Comprehension and Headsprout Early Reading are computer- 
based reading programs for beginning readers and were developed by 
behavior analysts (Layng, Twyman, & Stikeleather, 2004; Twyman, Layng, 
Stikeleather, & Hobbins, 2005). 

Additional discussions of social validity and procedures for assessing it 
can be found in Carroll and St. Peter (2014); Common and Lane (2017); 
Snodgrass, Chung, Meadan, and Halle (2018); and the special section on 
social validity in the Summer 1991 issue of the Journal of Applied Behavior 
Analysis. 


282 Part3 * Evaluating and Analyzing Behavior Change 


10. 


ll. 


12. 
13. 


14. 


Assuming group data describe the behavior of individual participants poses 
little problem when intersubject differences are negligible or of no par- 
ticular interest (Blampied, 2013). One kernel of corn is genetically and 
functionally identical to the millions of other kernels harvested from the 
same acre of farmland. People are quite another matter. 

Johnston and Pennypacker (1980) pointed out a distinction between repli- 
cating an experiment and reproducing its results. They stated that the quality 
of the replication should be judged only by “the extent to which equivalent 
environmental manipulations associated with [the original experiment] are 
duplicated. . . . Thus, one replicates procedures in an effort to reproduce 
effects” (pp. 303-304). However, when most researchers report a “failure to 
replicate,” they mean that the results of the replication did not match those 
obtained in the earlier research (e.g., Ecott, Foate, Taylor, & Critchfield, 
1999; Friedling & O’Leary, 1979). 

Functional communication training (FCT) is described in Chapter 26. 
Response cards are small cards, signs, or items that students hold up to 
display their answers to teacher-posed questions or problems. 

The boy who cried wolf caused both Type I and Type II errors. First every- 
one believed there was a wolf, when there wasn’t. Next everyone believed 


ils), 
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there was no wolf, when there was. Substitute “effect” for “wolf” and you'll 
never confuse Type | and II errors again (Stafford, 2015). 

The American Psychological Association (2010) recommends that authors 
indicate the sex, disability status, socio-economic status, and ethnicity of 
research participants. Li, Wallace, Ehrhardt, and Poling (2017) found that 
only 4 of 288 intervention articles published in behavior analysis journals 
for 2013 through 2015 included information on all four descriptors. 

It is important to remember that although some research in applied behavior 
analysis can rightfully be criticized as superficial because it adds little to 
our conceptual understanding of behavior, studies in which meaningful 
target behaviors are improved to a socially valid level by the application 
of a socially valid treatment variable (whether a package or not) are never 
superficial to the participants and the significant others who share their 
environment. 

Ideally, published procedural descriptions should include sufficient detail 
to allow an experienced investigator to replicate the experiment. However, 
space limitations of most journals often prohibit such detail. The common 
and recommended practice in replicating published studies is to request 
complete experimental protocols from the original investigator(s). 


PART 


Reinforcement 


The three chapters in Part 4 are devoted to reinforcement, the most important and widely applied 
principle of behavior analysis. Reinforcement, a deceptively simple behavior—consequence relation, 
is the fundamental building block of operant behavior. Chapter 11 presents the operation and defining 
effect of reinforcement, explains how reinforcers are classified and identified, describes briefly how 
antecedent stimulus conditions modulate the effects of reinforcement, discusses factors that influence 
the effectiveness of reinforcement, outlines experimental control techniques for verifying whether a 
positive reinforcement contingency is responsible for increased responding, and offers guidelines for 
using reinforcement effectively. 

In Chapter 12, Richard Smith and Brian Iwata describe one of the most consistently 
misunderstood principles of behavior. Smith and Iwata define negative reinforcement, an operant 
contingency in which responding increases as a result of the termination, reduction, or postponement 
of a stimulus as a behavioral consequence; compare and contrast it with positive reinforcement and 
punishment; distinguish between escape and avoidance contingencies; describe events that may 
serve as negative reinforcers; illustrate how the application of negative reinforcement can be used to 
strengthen desired behavior; and discuss ethical issues that arise when using negative reinforcement. 

One of Skinner’s most important discoveries was that reinforcement does not have to follow 
every response. In fact, under many intermittent schedules of reinforcement—in which reinforcement 
follows some, but not all, occurrences of the target behavior—responding occurs at higher, more 
consistent, rates than under a continuous schedule of reinforcement, in which each response is 
reinforced. Chapter 13 describes how intermittent reinforcement can be scheduled based on various 
combinations of response and/or temporal requirements and identifies characteristic patterns of 
responding associated with each schedule. Practitioners who understand how different schedules 
of reinforcement influence behavior can program reinforcement for more effective and efficient 
acquisition of new skills and for improved performance and endurance of established skills. 
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Positive Reinforcement 


LEARNING OBJECTIVES 


¢ Define and provide examples of positive reinforcement. 


¢ Define and provide examples of conditioned and unconditioned reinforcement. : 
* Describe and provide examples of the operant conditioning paradigm (i.e., the three-term and four-term contingencies). 


e Identify potential reinforcers. 


e Use appropriate parameters and schedules of reinforcement to identify reinforcers. 
e Use response-deprivation procedures (e.g., the Premack principle). 


e Identify control procedures for positive reinforcement. 
e Use positive reinforcement effectively. 


In looking back, it seems to me that the most important 
thing I learned in graduate school was from another 
student, Burrhus Frederic Skinner (I called him Burrhus, 
others called him Fred). This man had a box, within which 
was a smaller box, within which he would place a hungry 
laboratory rat. When the animal, in its explorations, would 
depress a lever that projected from one wall, a pellet of 
food would be discharged into a tray beneath the lever. 
Under such conditions, the rat would learn, in a matter of 
minutes, sometimes seconds, how to get its meal by depres- 
sion of the lever. It would even keep on pressing, sometimes 
at a rapid rate, when pellets were delivered only now and 
then; and if the food supply was cut off entirely, the animal 
would still keep working for awhile. 

—Fred Keller (1982, p. 7) 


widely applied principle of behavior. Although some 

people continue to believe that the results of laboratory 
research with animals is not applicable to human learning, 
by the mid-1960s researchers established the significant role 
of positive reinforcement in education and treatment. “It is 
safe to say that without Skinner’s detailed laboratory analy- 
ses of reinforcement (Skinner, 1938), there would be no field 
of ‘applied behavior analysis’ today, least not as we know it” 
(Vollmer & Hackenberg, 2001, p. 241). 

Fittingly, the lead article in the first issue of the Journal 
of Applied Behavior Analysis reported an experiment showing 
the effects of positive reinforcement on student behavior (Hall, 
Lund, & Jackson, 1968). Six elementary students who were 
disruptive or dawdled frequently participated in this classic study. 
The dependent variable, study behavior, was defined individually 
for each student, depending on the subject matter being taught, 
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but it generally consisted of the student being seated and oriented 
toward the appropriate object or person (e.g., looking at course 
materials or the lecturing teacher) and class participation (e.g., 
writing the assignment, answering the teacher’s question). The 
independent variable was teacher attention, cued by an observer 
who held up a small square of colored paper not likely to be 
noticed by the target student. On this signal, the teacher attended 
to the child by moving to his desk and making a verbal comment 
or giving him a pat on the shoulder. 

The effects of contingent teacher attention on the behavior 
of all six students were striking. Figure 11.1 shows the results 
for Robbie, a third-grader who was “a particularly disruptive 
student who studied very little” (p. 3). During baseline, Robbie 
engaged in study behavior for an average of 25% of the observed 
intervals. The remainder of the time he snapped rubber bands, 
played with objects in his pocket, talked and laughed with 
classmates, and played with an empty milk carton from his 
earlier-served drink. The majority of attention Robbie received 
followed nonstudy, disruptive behaviors. 

Following baseline, the experimenters showed the teacher 
a graph of Robbie’s study behavior, presented the results of pre- 
vious studies in which contingent adult attention had improved 
child behavior, and discussed the fundamentals of providing 
contingent social reinforcement. 

The results of contingent reinforcement were as follows: 
During Reinforcement 1, Robbie’s study behavior increased 
to a mean of 71%. When a reversal to baseline conditions was 
introduced, his study behavior decreased to a mean of 50%; 
but when Robbie’s teacher again provided contingent attention 
for study behavior (Reinforcement 2), his study behavior recoy- 
ered and stabilized at a level ranging between 70% and 80% of 
the observed intervals. Followup observations over a 14-week 
period showed that Robbie’s study behavior had maintained at 
79%. The teacher reported positive behavior changes associated 
with Robbie’s increased study behavior. By the final week of 
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Reinforcement 2, Robbie completed his spelling assignments 
more consistently, his disruptive behavior had diminished, and 
he continued to study while drinking his milk and did not play 
with the carton afterwards. Robbie’s intervention was based on 
the principle of positive reinforcement. 

This chapter examines the definition and nature of positive 
reinforcement, describes methods for identifying potential rein- 
forcers and assessing their effects, outlines experimental con- 
trol techniques for verifying whether a positive reinforcement 
contingency is responsible for increased responding, and offers 
guidelines for using positive reinforcement effectively. 


POSITIVE REINFORCEMENT DEFINED 


The principle of reinforcement is deceptively simple. “The basic 
operant functional relation for reinforcement is the following: 
When a type of behavior (R) is followed by reinforcement (S®) 
there will be an increased future frequency of that type of behay- 
ior” (Michael, 2004, p. 30).! However, as Michael pointed out, 
three qualifications must be considered regarding the conditions 
under which the effects of reinforcement will occur: (a) the timing 
between the end of a given response and the onset of the stimulus 
change (i.e., the presentation of the reinforcer), (b) the relation- 
ship between stimulus conditions present when the response was 
emitted, and (c) the role of motivation. In this section we examine 
these qualifications and several other concepts requisite to acquir- 
ing a full understanding of reinforcement. 


Operation and Defining Effect of Positive 
Reinforcement 


Positive reinforcement occurs when a response is followed 
immediately by the presentation of a stimulus change that 
increases the future occurrence of similar responses. Figure 11.2 
illustrates the two-term contingency—a response followed closely 
in time by the presentation of a stimulus—and the increased future 
responding that defines positive reinforcement. This two-term 
contingency is the fundamental building block for the selection 
of all operant behavior. 
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Postchecks 


Figure 11.1 Percentage of intervals 

of study behavior by a third-grade 
student during baseline and 
reinforcement conditions. The arrow 
to the first postcheck data point shows 
when cueing the teacher to provide 
attention was discontinued. 


Based on “Effects of Teacher Attention 

on Study Behavior” by R. V. Hall, D. Lund, 
and D. Jackson, 1968, Journal of Applied 
Behavior Analysis, 1, p. 3. Copyright 1968 by 
the Society for the Experimental Analysis of 
Behavior, Inc. 


The stimulus that is presented as a consequence, and 
that is responsible for the subsequent increase in responding, 
is called a positive reinforcer, or, more simply, a reinforcer. 
Teacher attention in the form of positive praise was the rein- 
forcer that increased Robbie’s study behavior. Cold water flow- 
ing into a cup and the sight of a colorful bird are the reinforcers 
for the two behaviors shown in Figure 11.2. 

It is important to remember that a reinforcer does not (and 
cannot) affect the response that it follows. When making this 
point, Skinner (1953) reminded us that reinforcement affects the 
operant response class. 


It is not correct to say the operant reinforcement “strength- 
ens the response which precedes it.” The response has 
already occurred and cannot be changed. What is changed 
is the future probability of responses in the same class. It is 
the operant as a class of behavior, rather than the response 
as a particular instance, which is conditioned. (p. 87) 


Skinner (1938, 1966) used rate of responding as the funda- 
mental datum for his research on reinforcement. To strengthen an 
operant is to make it occur more often.” However, rate is not the 
only dimension of behavior selected, shaped, and maintained by 
reinforcement. Reinforcement can also alter the duration, latency, 
interresponse time, magnitude, and/or topography of behavior. 
For example, if reinforcement follows only responses that fall 
within a range of magnitude—that is, above a minimum force but 
below a maximum force—responses within that range will occur 
more frequently. Reinforcement contingent on responses meeting 
multiple criteria will alter the response class meeting those crite- 
ria (e.g., responses by a golfer practicing 10-foot putts must fall 
within a narrow range of force and form to be successful). 


immediacy of Reinforcement 


The direct effects of reinforcement involve “temporal relations 
between behavior and its consequences that are on the order 
of a few seconds” (Michael, 2004, p. 161). Although research 
with nonhuman animals suggests that as much as 30 seconds 
can elapse without critical loss of effect (e.g., Bradley & Poling, 
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Figure 11.2 Two-term contingency 
illustrating positive reinforcement: A 
response (R) is followed closely in time by 
a stimulus change (S"*) that results in an 
increased occurrence of similar responses 
in the future. Used with the permission of 
Skinner, B. F. (1953). Science and human 
behavior. New York: Macmillan. 


2010; Byrne, LeSage, & Poling, 1997; Critchfield & Lattal, 
1993; Lattal & Gleason, 1990; Wilkenfeld, Nickel, Blakely, & 
Poling, 1992), a response-to-reinforcement delay of as little as 
I second will be less effective than a reinforcer delivered imme- 
diately. This is because behaviors other than the target behavior 
can occur during the delay; the behavior temporally closest to 
the presentation of the reinforcer will be strengthened by its pre- 
sentation. As Sidman (1960) described, “If the reinforcer does 
not immediately follow the response that was required for its 
production, then it will follow some other behavior. Its major 
effect will then be upon the behavior that bears, adventitiously 
to be sure, the closest prior temporal relationship to the rein- 
forcement” (p. 371). 

Malott and Shane (2014) discussed the importance of a 
brief response-to-reinforcement delay, as follows: 


[I]f you're trying to reinforce a response, don’t push that 
60-second limit, push the other end—the 0-second end. 

The direct effect of reinforcement drops off quickly as 

you increase the delay, even to 3 or 4 seconds. And even a 
I-second delay may reinforce the wrong behavior. If you 

ask a young child to look at you and deliver the reinforcer 

1 second after the response, you’re liable to reinforce looking 
in the wrong direction. So, one problem with delayed rein- 
forcement is that it reinforces the wrong response—the one 
that occurred just before the delivery of the reinforcer. (p. 4) 


A common misconception is that delayed consequences 
can reinforce behavior, even when the consequences occur 
days, weeks, or even years after the responses occurred. “When 
human behavior is apparently affected by long-delayed conse- 
quences, the change is accomplished by virtue of the human’s 
complex social and verbal history, and should not be thought of 
as an instance of the simple strengthening of behavior by rein- 
forcement” (Michael, 2004, p. 36). 

For example, suppose that a piano student practiced dutifully 
every day for several months in preparation for a statewide com- 
petition, at which she received a first-place award for her solo 
piano performance. Although some might believe that the award 
reinforced her persistent daily practice, they would be mistaken. 
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Delayed consequences do not reinforce behavior directly. Delayed 
consequences can, when combined with language, influence 
future behavior through instructional control and rule following. 
A rule is a verbal description of a behavioral contingency (e.g., 
“Turnip seeds planted by August 15 will yield a crop before a kill- 
ing freeze.”). Learning to follow rules is one way that a person’s 
behavior can come under the control of consequences that are 
too delayed to influence behavior directly. A statement by the 
piano teacher such as “If you practice your assignments every 
day for one hour between now and the competition, you could 
win first place” could have functioned as a rule that influenced 
the piano student’s daily practice. The student’s daily practice 
was evidence of rule-governed behavior if daily practice occurred 
because of her teacher’s rule.* In other words, rule-governed 
behavior is behavior controlled by a rule (i.e., a verbal statement 
of an antecedent-behavior-consequence contingency) that enables 
human behavior to come under the indirect control of temporarily 
remote or improbable but potentially significant consequences. 
The following conditions provide strong indicators that behavior 
is the result of instructional control or is rule-governed behav- 
ior rather than a direct effect of reinforcement (Malott, 1988; 
Michael, 2004). 


* No immediate consequence for the behavior is apparent. 

* The response—consequence delay is greater than 30 seconds. 

* Behavior changes without reinforcement. 

* A large increase in the occurrence of the behavior occurs 
following one instance of reinforcement. 

* No consequence for the behavior exists, including no 
automatic reinforcement, but the rule exists. 


Reinforcement Is Not a Circular Concept 


Another common misconception is that reinforcement is the 
product of circular reasoning and therefore contributes nothing 
to our understanding of behavior. Not so. Circular reasoning 
is a form of faulty logic in which the name used to describe 
an observed effect is mistaken as the cause for the effect. This 
confusion of cause and effect is circular because the observed 
effect is the sole basis for identifying the presumed cause. 


In circular reasoning, the suspected cause is not independent of 
its effect—they are one and the same. 

Here is an example of circular reasoning that occurs often 
in education. A student’s persistent difficulties learning to read 
(effect) leads to a formal diagnosis of a learning disability, which 
is then offered as an explanation for the student’s struggles with 
reading: “Yoshi’s reading problem is due to his learning disabil- 
ity.” How do you know Yoshi has a learning disability? Because 
he hasn’t learned to read. Why hasn’t Yoshi learned to read? 
Because his learning disability has prevented him from learning 
to read. And around and around it goes. 

Similarly, it would be circular reasoning if we said that 
teacher attention increased Robbie’s study behavior because it 
is areinforcer. Instead, it is correct to say that because Robbie’s 
study behavior increased when (and only when) it was followed 
immediately by teacher attention, teacher attention is a rein- 
forcer. The difference is more than the direction of the relation, 
or some semantic sleight-of-hand. In circular reasoning, the sus- 
pected cause is not manipulated as an independent variable to 
see whether it affects the behavior. In circular reasoning, such 
experimental manipulation is impossible because the cause and 
effect are the same. Yoshi’s learning disability cannot be manip- 
ulated as an independent variable because, as we used the con- 
cept in this example, it is nothing more than another name for 
the dependent variable (effect). 

Reinforcement is not a circular concept because the two 
components of the response—consequence relation can be sepa- 
rated, allowing the delivery of a consequence to be manipulated 
to determine whether it increases the occurrence of the behavior 
it follows. Epstein (1982) described it as follows: 


If we can show that a response increases in frequency 
because (and only because) it is followed by a particu- 
lar stimulus, we call that stimulus a reinforcer and its 
presentation, reinforcement. Note the lack of circularity. 
Reinforcement is a term we invoke when we observe 
certain relations between events in the world. .. . 
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[However,] if we say, for example, that a particular 
stimulus strengthens a response behavior because it is a 
reinforcer, we are using the term reinforcer in a circular 
fashion. It is because it strengthens behavior that we call 
the stimulus a reinforcer. (p. 4) 


Epstein (1982) went on to explain the difference between 
using an empirically demonstrated principle such as reinforcement 
in a theoretical account of behavior and using a circular argument. 


In some of his writings, Skinner speculates that certain 
behavior (for example, verbal behavior) has come about 
through reinforcement. He may suggest, for example, 

that certain behavior is strong because it was reinforced. 
This use of the concept is not circular, only speculative or 
interpretive. Using the language of reinforcement in this 
way is reasonable when you have accumulated a large data 
base. ... When Skinner attributes some everyday behavior 
to past reinforcers, he is making a plausible guess based 
on a large data base and principles of behavior established 
under controlled conditions. (p. 4) 


Used properly, reinforcement describes an empirically 
demonstrated (or speculative, in a theoretical or conceptual 
analysis) functional relation between a stimulus change (con- 
sequence) immediately following a response and an increase 
in the future occurrence of similar responses. Table 11.1 shows 
restrictions and examples of appropriate use of the terms rein- 
forcer, reinforcing, reinforcement, and to reinforce, as suggested 
by Catania (2013). Box 11.1 describes four mistakes commonly 
made when speaking and writing about reinforcement. 


Relationship Between Reinforcement and 
Antecedent Stimulus Conditions 


Reinforcement does more than increase the future occurrence 
of behavior; it also changes the function of stimuli that imme- 
diately precede the reinforced behavior. By virtue of being 
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TABLE 11.1. The Vocabulary of Reinforcement* 
Term Restrictions 
reinforcer (noun) A stimulus 


reinforcing (adjective) A property of a stimulus 


As an operation, the delivery of conse- 
quences when a response occurs 


reinforcement (noun) 


As. a process, the increase in responding 
that results from the reinforcement 


to reinforce (verb) 
when a response occurs; responses are 


reinforced and not organisms 


As a process, to increase responding 
through the reinforcement operation 


As an operation, to deliver consequences 


Examples 
Food pellets were used as reinforcers for the rat’s lever presses. 


The reinforcing stimulus was produced more often than the 
other, nonreinforcing stimull. 


The fixed-ratio schedule of reinforcement delivered food after 
every 10th key peck. 


The experiment with monkeys demonstrated reinforcement 
produced by social consequences. 


When a period of free play was used to reinforce the child’s 
completion of school work, the child’s grades improved. 


The experiment was designed to find out whether gold stars 
would reinforce cooperative play among first-graders. 


*This vocabulary is appropriate if and only if three conditions exist: (1) A response produces consequences; (2) that type of response occurs more often when 
it produces those consequences than when it does not produce them; and (3) the increased responding occurs because the response has those consequences. 
A parallel vocabulary is appropriate to punishment (including punisher as a stimulus and punish as a verb), with the difference being that a punishing conse- 


quence reduces the likelihood of future responding. 


Based on Learning, Interim (Sth ed.) by A. C. Catania, 2013, p. 66. Cornwall-on-Hudson, NY: Sloan Publishing. 
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Common Mistakes in Talking and Writing About Reinforcement 


A standard set of technical terms is prerequisite to the mean- 
ingful description of any scientific activity. Effectively 
communicating the design, implementation, and outcome of 
an applied behavior analysis depends on the accurate use of the 
discipline’s technical language. The language of reinforcement 
includes some of the most important elements of the behavior 
analyst’s vocabulary. 

In this box we identify four mistakes made frequently 
by students of applied behavior analysis when describing 
reinforcement-based interventions. Perhaps the most common 
mistake—confusing negative reinforcement with punishment— 
is not discussed here. That terminology error was introduced in 
Chapter 2 and receives additional attention in Chapter 12. 


Reinforcing the Person 


Although it is proper to speak of presenting a reinforcer to a 
learner (e.g., “The teacher delivered a token to Bobby each 
time he asked a question”), statements such as “The teacher 
reinforced Bobby when he asked a question” and “Chloe was 
reinforced with praise each time she spelled a word correctly” 
are incorrect. Behaviors are reinforced, not people. Bobby’s 
teacher reinforced question asking, not Bobby. Of course, 
reinforcement acts on and affects the overall person, in that it 
strengthens behaviors within the person’s repertoire. However, 
the procedural focus and the primary effect of reinforcement 
are on the behaviors that it follows. 


Practice as Reinforcement for a Skill 


Educators will sometimes say that students should practice a 
skill because “practicing reinforces the skill.” The phrase poses 
no problem if the speaker is describing a common outcome of 
practice with the everyday language connotation of reinforce, 
as in “to make something stronger” (e.g., to reinforce concrete 
by embedding steel rods in it). Well-designed drill and practice 
on a skill usually yields stronger performance in the form of 
better retention, reduced latency, higher response rates, and/or 
increased endurance (e.g., Johnson & Layng, 1994; Swanson 
& Sachse-Lee, 2000). Unfortunately, a phrase such as “prac- 
ticing reinforces the skill” is often misused and misinterpreted 
as technical usage of the language of operant conditioning. 
Although a skill that has been practiced is often stronger 
as a result of the practice, the practice itself could not be a 
reinforcer tor the behavior practiced. Practice refers to the form 
and manner in which the target skill is emitted (e. g., answering 
as many math problems as you can in | minute). Practicing is 
a behavior that could be reinforced with various consequences, 
such as a preferred activity (e.g., “Practice solving these 
math problems; then you can have 10 minutes of free time”). 
Depending on a learner’s history and preferences, the oppor- 
tunity to practice a certain skill may function as a reinforcer 
for practicing another skill (e.g., “Finish your math problems; 
then you'll get to do 10 minutes of repeated reading practice’). 


Artificial Reinforcement 


A distinction between natural and artificial reinforcers is made 
sometimes, as in this statement: “As the students’ success rates 
improved, we gradually stopped using artificial reinforcers, 
such as stickers and trinkets, and increased the use of natural 
reinforcers.” Some authors have suggested that applications 
of the principles of behavior result in “artificial control” 
(e.g., Smith, 1992). A behavior—consequence contingency 
may be effective or ineffective as reinforcement, but none of 
its elements (the behavior, the consequence, or the resultant 
behavior change) is, or can be, artificial. 

The reinforcement contingencies and stimuli used as 
reinforcers in any behavior change program are always con- 
trived—otherwise, there would be no need for the program— 
but they are never artificial (Skinner, 1982). The meaningful 
distinction when talking about reinforcement contingencies 
is not between the natural and the artificial, but between 
contingencies that already exist in a given setting prior to 
a behavior change program and contingencies that are con- 
trived as part of the program (Kimball & Heward, 1993). 
Although the ultimate effectiveness of a behavior change 
program may depend on shifting control from contrived to 
naturally occurring contingencies, there is no such thing as 
artificial reinforcement. 


Reinforcement and Feedback as Synonyms 


Some speakers and writers mistakenly use reinforcement and 
feedback interchangeably. The two terms refer to different 
operations and outcomes, though some of each term encom- 
passes parts of the other term’s meaning. Feedback is infor- 
mation a person receives about a particular aspect of his or her 
behavior following its completion (e.g., “Very good, Kathy. 
Two quarters equal’ 50 cents.”). Feedback is most often pro- 
vided in the form of verbal descriptions of performance, but it 
can also be provided by other means, such as vibration or lights 
(e.g., Greene, Bailey, & Barber, 1981). Because feedback is a 
consequence that often results in the increased future occur- 
rence of behavior, it sometimes leads to the faulty assumption 
that reinforcement must involve feedback or that reinforcement 
is just a behaviorist’s term for feedback. 

Reinforcement always increases the future frequency of 
responding. Feedback may result in (a) an increase in the future 
occurrence of the student’s performance as a reinforcement 
effect and/or as a prompt or instruction on how to respond next 
time (e.g. “Your handwriting is improving, Jason, but don’t 
forget to cross your T’s”) and/or (b) a reduction in the occur- 
rence of some aspect of the learner’s performance as a function 
of punishment or instruction (e.g., “You dropped your elbow 
on that pitch. Don’t do that.”). Feedback may have multiple 
effects, increasing one aspect of performance and decreasing 
another. Feedback may also have no effect on future respond- 
ing whatsoever. 


Reinforcement is defined functionally by its effect on 
future responding; feedback is defined by its formal char- 
acteristics (information about some aspect of performance). 
The operation of either concept is neither necessary nor 
sufficient for the other. That is, reinforcement may occur in 


the absence of feedback, and feedback may occur without a 
reinforcement effect. 


Sometimes Commonsense Language Is Better 


The technical language of behavior analysis is complex, and 
mastering it is no simple matter. Beginning students of behav- 
ior analysis are not the only ones who commit terminology 
errors. Well-trained practitioners, established researchers, and 
experienced authors also make mistakes now and then when 
speaking and writing about behavior analysis. Using behav- 
ioral concepts and principles—such as positive reinforce- 
ment—to confidently explain complex situations involving 
multiple processes and uncontrolled and unknown variables is 
a mistake that catches the most attentive and conscientious of 
behavior analysts at times. 

Instead of invoking the terminology and concepts of 
reinforcement to explain the influence of temporally distant 


temporally paired with the response-reinforcer contingency, 
antecedent events acquire the ability to evoke (make more 
likely) instances of the reinforced response class. As introduced 
in Chapter 2, a discriminative stimulus (So pronounced “ess- 
dee’) is an antecedent stimulus correlated with the availability 
of reinforcement for a particular response class. Responding 
in the presence of the SP produces reinforcement; responding 
in the absence of the SP (a condition called stimulus delta (S - 
pronounced “ess-delta’”’) does not. As a result of this history of 
reinforcement, a person learns to make more responses in the 
presence of the SP than in its absence. The behavior is then 
considered to be under stimulus control (see Chapter 17). 

With the addition of the SP, the two-term contingency for 
reinforcement becomes the three-term contingency of the dis- 
criminated operant. Figure 11.3 shows examples of three-term 
contingencies for positive reinforcement. Assuming that cold 
water is currently reinforcing and the person has a history of 
obtaining cold water only under blue taps, he is more likely to 
hold his cup under the blue tap on the cooler (than, say, a red 
tap). Similarly, assuming that seeing a colorful bird is currently 
reinforcing and a person has a history of seeing birds more often 
when looking toward chirping sounds (than, say, other sounds 
or silence), turning one’s head and looking to the left will occur 
more often when chirping is heard to the left. 


The Role of Motivation 


The phrase assuming that cold water is currently reinforcing 
in the previous paragraph holds another key to understanding 
reinforcement. Although reinforcement is commonly thought 
of as a way to motivate people—and it can be—the momentary 
effectiveness of any stimulus change as reinforcement depends 
on an existing level of motivation with respect to the stimulus 
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consequences on behavior, it is probably wiser to follow Jack 
Michael’s (2004) advice and simply use everyday descriptive 
language and commonsense relations. 


Incorrectly used technical language is worse than common 
sense language because it suggests that the situation is well 
understood, and it may displace serious attempts at further 
analysis. Until we are able to provide an accurate analysis 
of the various processes relevant to indirect effects [of 
reinforcement], we are better off using ordinary descrip- 
tive language. Thus, say “the successful grant application 
is likely to encourage future efforts in the same direction,” 
but don’t say it as though you had the science of behavior 
behind you. Stop referring to successful settlements of a 
labor dispute as reinforcement for striking, and success- 
ful election of a political candidate as reinforcement for 
political activity. ... Don’t talk about good grades as rein- 
forcement for effective study behavior, although they are 
no doubt responsible for maintaining it in some cases. Just 
say that they’re responsible for maintaining it. Restraint 

of this sort will deprive some of us of an opportunity to 
(incorrectly) display our technical knowledge, but so much 
the better. (p. 165, emphasis in original) 


change in question. As introduced in Chapter 2, motivating 
operations alter the current effectiveness of stimulus changes 
as reinforcement. 


Motivating operations (MOs) are environmental variables 
that have two effects on behavior: (1) They alter the oper- 
ant reinforcing effectiveness of some specific stimuli, 
objects, or events (the value-altering effect); and (2) They 
alter the momentary frequency of all behavior that has 
been reinforced by those stimuli, objects, or events (the 
behavior-altering effect). The value-altering effect, like 
response-reinforcement delay, is relevant to the effec- 
tiveness of the reinforcer at the time of conditioning, and 
stating that the consequence is a form of reinforcement 
implies that a relevant MO is in effect and at sufficient 
strength. (Michael 2004, p. 31) 


In other words, for a stimulus change to “work” as rein- 
forcement at any given time, the learner must already want it. 
This is a critical qualification in terms of the environmental 
conditions under which the effects of reinforcement will be seen. 
Michael (2004) explained this qualification as follows: 


The behavior-altering effect is relevant to the increased 
future frequency of the reinforced behavior, and 

must be added as a third qualification to the operant 
reinforcement relation: In a given stimulus situation 
(S) when a type of behavior (R) is followed immedi- 
ately by reinforcement (S8) there will be an increase 

in the future frequency of that type of behavior in the 
same or similar stimulus conditions, but the increased 
frequency will only be seen when the MO relevant to the 
reinforcement that was used is again in effect. (p. 31, 
emphasis in original) 
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Effect on Future Occurrence 
of Similar Responses 
in Presence of S2 


sR+ 


Hold cup under tap and 
push lever 


Blue tap on water cooler 


Cold water flows into 
cup 


Chirping sound to left Turn head and look left 


See colorful bird 


Figure 11.3 Three-term contingency illustrating positive reinforcement of a discriminated operant: A response (R) emitted 
in the presence of a discriminative stimulus (S°) is followed closely in time by a stimulus change (S®*) and results in an 
increased occurrence of similar responses in the future when the S° is present. A discriminated operant is the product of a 
conditioning history in which responses in the presence of the S° produced reinforcement, and responses in the absence 
of the S° (a condition called stimulus delta [S“]) were not reinforced (or resulted in a reduced amount or quality of 


reinforcement than in the S° condition). 


Motivating operations take two forms. An MO that increases 
the current effectiveness of a reinforcer is called an establishing 
operation (EO) (e.g., food deprivation makes food more effective 
as a reinforcer); an MO that decreases the current effectiveness of 
a reinforcer is an abolishing operation (AO) (e.g., food ingestion 
reduces the effectiveness of food as a reinforcer).* 

Adding the establishing operation (EO) to a discriminated 
operant results in a four-term contingency, as shown in Figure 11.4. 
Spending several hours in a hot and stuffy room without water 
is an EO that (a) makes water more effective as a reinforcer and 
(b) increases the momentary frequency of all behaviors that have 


produced water in the past. Similarly, a park ranger stating prior 
to a hike that any hiker who describes the coloring of the bird 
that makes a certain chirping sound will receive a $5 token for 
the gift shop is an EO that will (a) make seeing a bird that makes 
the chirping sound effective as reinforcement and (b) increase the 
frequency of all behaviors (e.g., turning one’s head to look around, 
stepping quietly) that have produced similar consequences (in this 
case, seeing the source of sounds) in the past. 

In simpler terms, establishing operations (EOs) deter- 
mine what an individual wants at any particular moment. 
EOs are dynamic, always changing. Reinforcer effectiveness 


r 


Effect on Future Occurrence 


D R+ of Similar Responses 
EO S R Ss under Similar Conditions 
in the Future 
Water deprived for 2 Blue tap on Hold cup under tap Cold water flows 
; ——— 
hours in hot, stuffy room water cooler and push lever into cup 
| ee L 
——_—_—— 
Park ranger says, “After our 
hike, anyone who describes Chiroi d jornhend 
the bird that makes this ae Bee Ree eee See colorful bird 
chirping sound will receive a alae Medel 
$5 token for the gift shop.” 
S| 


Figure 11.4 Four-term contingency illustrating positive reinforcement of a discriminated Operant made current by a motivating 
operation: An Spealoiice operation (EO) increases the momentary effectiveness of a stimulus change as a reinforcer, which in 
turn makes the S” more likely to evoke behavior that has been reinforced by that stimulus change in the past. 


(the want) goes up with increasing levels of deprivation and 
goes down with levels of satiation. Vollmer and Iwata (1991) 
demonstrated how the reinforcing effectiveness of three classes 
of stimuli—food, music, and social attention—varied under 
conditions of deprivation and satiation. Participants were five 
adults with developmental disabilities, and the dependent vari- 
able was the number of responses per minute on two motor 
tasks—pressing a switch or picking up small blocks from 
a container and putting them through the hole in the top of 
another container. All sessions lasted 10 minutes and began 
with the experimenter saying, “Do this, [participant’s name],” 
and modeling the response. During baseline, participants’ 
responses received no programmed consequences. During the 
deprivation and satiation conditions, responses were followed 
by presentation of food, music, or social attention. Initially 
each response was followed by the programmed consequence; 
this gradually shifted to every third, fifth, or tenth response 
being followed by the consequence. 

Different procedures were used to create deprivation 
and satiation conditions for each stimulus class. With food, for 
example, baseline and deprivation condition sessions were con- 
ducted 30 minutes prior to a participant’s scheduled lunchtime; 
sessions during the satiation condition were conducted within 15 
minutes after the participant had eaten lunch. For social atten- 
tion, baseline and deprivation condition sessions were conducted 
immediately following a 15-minute period in which the partic- 
ipant had either been alone or been observed to have had no 
social interaction with another person. Immediately prior to each 
session in the satiation condition, the experimenter provided 
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continuous social interaction (e.g., played a simple game, con- 
versation) with the participant for 15 minutes. 

All five participants responded at higher rates under the 
deprivation conditions than during the satiation conditions. 
Figure 11.5 shows the effects of deprivation and satiation of social 
attention on the effectiveness of social attention as a reinforcer for 
two of the study’s participants, Donny and Sam. Other researchers 
have reported similar findings concerning the effects of depri- 
vation and satiation of various stimuli and events as motivating 
operations that affect the relative effectiveness of reinforcement 
(e.g., Hanley, Iwata, & Roscoe, 2006; Klatt, Sherman, & Sheldon, 
2000; North & Iwata, 2005; Zhou, Iwata, & Shore, 2002). 


Automaticity of Reinforcement 


A reinforcing connection need not be obvious to the indi- 
vidual reinforced. 
—B. F. Skinner (1953, p. 75) 


The fact that a person does not have to understand or verbalize 
the relation between his or her actions and a reinforcing con- 
sequence, or for that matter even be aware that a consequence 
has occurred, for reinforcement to occur is known as the auto- 
maticity of reinforcement. Skinner (1983) provided an interest- 
ing example of automaticity in the third and final volume of 
his autobiography, A Matter of Consequences. He described an 
incident that took place at a meeting of distinguished scholars 
who had been invited to discuss the role of intention in political 
activity. At one point during the meeting, the psychologist Erich 
Fromm began to argue that “people were not pigeons,” perhaps 
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Figure 11.5 Responses per minute 
by two students during baseline and 
when social attention was used as 
reinforcement under deprivation 
and satiation conditions for social 
attention. 
From “Establishing Operations and Reinforce- 
ment Effects,” by T. R. Vollmer and B. A. lwata, 
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implying that an operant analysis based on positive reinforce- 
ment could not explain human behavior, which Fromm believed 
is the product of thought and free will. Skinner recounted what 
happened next: 


I decided that something had to be done. On a scrap of paper 
I wrote, “Watch Fromm’s left hand. I am going to shape 
[reinforce by successive approximations] a chopping motion” 
and passed it down the table to Halleck [a member of the 
group]. Fromm was sitting directly across the table and 
speaking mainly to me. I turned my chair slightly so that 

I could see him out of the corner of my eye. He gesticulated a 
great deal as he talked, and whenever his left hand came up, 

I looked straight at him. If he brought the hand down, I nod- 
ded and smiled. Within five minutes he was chopping the air 
so vigorously that his wristwatch kept slipping out over his 
hand. (pp. 150-151, words in brackets added) 


Arbitrariness of the Behavior Selected 


So far as the organism is concerned, the only important 
property of the contingency is temporal. 
—B. F. Skinner (1953, p. 85) 


Reinforcement occurs independent of a logical or adaptive 
connection between behavior and a reinforcing consequence. 
In other words, reinforcement strengthens any behavior that 
immediately precedes it. This arbitrary nature of the behavior 
selected is critical to understanding reinforcement. All other 
relations (e.g., what’s logical, desirable, useful, appropriate) 
must compete with the temporal relation between behavior and 
consequence. “To say that reinforcement is contingent upon 
a response may mean nothing more than that it followed the 
response . . . conditioning takes place presumably because of the 
temporal relation only, expressed in terms of the order and prox- 
imity of response and reinforcement” (Skinner, 1948, p. 168). 
Skinner (1948) demonstrated the arbitrary nature of the 
behaviors selected by reinforcement in one of his most famous 
experimental papers, “Superstition in the Pigeon.” He gave pigeons 
a small amount of food every 15 seconds, “with no reference what- 
soever to the bird’s behavior” (p. 168). The fact that reinforcement 
will strengthen whatever behavior it immediately follows was soon 
evident. Six of the eight birds developed idiosyncratic behaviors 
“so clearly defined, that two observers could agree perfectly in 
counting instances” (p. 168). One bird walked counterclockwise 
around the cage; another repeatedly thrust its head into one of the 
upper corners of the cage. Two birds acquired a “pendulum motion 
of the head and body, in which the head was extended forward and 
swung from right to left with a sharp movement followed by a 
somewhat slower return” (p. 169). The pigeons had exhibited none 
of those behaviors at “any noticeable strength” during adaptation 
to the cage or before the food was periodically presented. 
Whatever behavior the pigeons happened to be doing 
when the food hopper appeared tended to be repeated, which 
made it more likely to be occurring when food appeared the 
next time. Reinforcement was not contingent (in the sense of, 
dependent) on the behavior; it was only a coincidence that rein- 
forcement sometimes followed the behavior. Such accidentally 
reinforced behavior is called “superstitious” because it has no 


influence on whether reinforcement follows. Humans engage in 
many superstitious behaviors. Sports provide countless exam- 
ples: A basketball player tugs on his shorts before shooting a 
foul shot, a golfer carries his lucky ball marker, a batter goes 
through the same sequence of adjusting his wristbands before 
each pitch, a college football fan wears a goofy-looking neck- 
lace made of inedible nuts to bring good luck to his team.” 

The importance of understanding the arbitrariness of rein- 
forcement goes far beyond providing a possible explanation for 
the development of harmless superstitious and idiosyncratic 
behaviors. The arbitrary nature of selection by reinforcement may 
explain the acquisition and maintenance of many maladaptive and 
challenging behaviors. For example, a caregiver’s well-meaning 
social attention provided in an attempt to console or divert 
a person who is hurting himself may help shape and maintain 
the very behavior the caregiver is trying to prevent or eliminate. 
Kahng, Iwata, Thompson, and Hanley (2000) documented with 
a functional analysis that social reinforcement maintained the 
self-injurious behavior (SIB) and aggression of three adults with 
developmental disabilities. Kahng and colleagues’ data support the 
hypothesis that aberrant behaviors may be selected and maintained 
by social attention because of the arbitrariness of reinforcement. 


Automatic Reinforcement 


Kennedy (1994) noted that applied behavior analysts use two 
meanings to define the term automatic reinforcement. In the first 
instance, automatic reinforcement is determined by the absence of 
social mediation (Voilmer, 1994, 2006). In this context, automatic 
reinforcement refers to the behavior—stimulus change relation that 
occurs without the presentation of consequences by other people 
(Vaughan & Michael, 1982; Vollmer, 1994, 2006). In short, auto- 
matic reinforcement occurs independent of social mediation of 
others. Response products that function as automatic reinforce- 
ment are often in the form of a naturally produced sensory conse- 
quence that “sounds good, looks good, tastes good, smells good, 
feels good to the touch, or the movement itself is good” (Rincover, 
1981, p. 1). Scratching an insect bite to relieve or attenuate the itch 
is acommon example of automatic reinforcement. 

In the second instance, automatic reinforcement is assumed 
when a behavior persists in the absence of any known reinforcer 
(Hagopian, Rooker, & Yenokyan, 2018). In these “default” situ- 
ations, automatic reinforcement is assumed to be the controlling 
variable (Fisher, Lindauer, Alterson, & Thompson, 1998; Ring- 
dahl, Vollmer, Marcus, & Roane, 1997: Roscoe, Iwata, & Goh, 
1998). Persistent, seemingly nonpurposeful, repetitive, self-stim- 
ulatory behaviors (e.g., flipping fingers, head rolling, body rock- 
ing, toe walking, hair pulling, fondling body parts) may produce 
sensory stimuli that function as automatic reinforcement. Such 
“self-stimulation” is thought to be a factor maintaining self-in- 
jurious behavior (Iwata, Dorsey, Slifer, Bauman, & Richman, 
1994), stereotypic repetitive movements, hair pulling (Rapp, 
Miltenberger, Galensky, Ellingson, & Long, 1999), nail biting, 
chewing on the mouth or lips, and twirling objects or fondling 
jewelry (Miltenberger, Fuqua, & Woods, 1998). 

Determining that a behavior may be maintained by auto- 
matic reinforcement, and when possible isolating or substitut- 
ing the source of that reinforcement (see Figure 8.5; Shore, 


Iwata, DeLeon, Kahng, & Smith, 1997), has important impli- 
cations for designing interventions to either capitalize on the 
automatic reinforcing nature of the behavior or counteract it. 
As Vollmer (1994) stated “. . . one clear goal of research should 
be to develop procedures designed to identify specific sources 
of automatic reinforcement. Effective assessment might allow 
for the development of treatments that involve reversing 
or eliminating the contingencies maintaining a behavior” 
(p. 191). Assuming that such assessment provides important clues 
for intervention, Vollmer further proposed five interventions 


to reduce or eliminate behaviors maintained by automatic 
reinforcement. 


1. Eliminate or minimize medically based establishing oper- 
ations. For example, a child whacking his ear or scratching 
his arm repeatedly might be treated by an antibiotic or 
medicated lotion to reduce or eliminate aversive stimu- 
lation produced by an earache or allergy-based itch that 
established whacking or scratching responses as a nega- 
tive reinforcement. 


2. Try a sensory extinction procedure, thus eliminating, or 
reducing, sources of visual, auditory, or proprioceptive 
stimulation. Wearing gloves, shielded clothes, helmets, 
and so forth serve as examples. 


3. Implement differential reinforcement of alternative behav- 
ior (DRA) interventions, thus providing a “choice” with 
an alternative source of reinforcement contingent on 
appropriate behavior. A course of action would provide 
a different source of visual, auditory, or proprioceptive 
stimulation that would compete with the automatically 
reinforced behavior. Teaching the person to turn on an 
electric vibrator, television, or musical carousel to obtain 
alternative stimulation serve as examples. 


4. Consider punishment to override current maintaining con- 
tingencies, and thus capitalize on the fact that (a) pun- 
ishment does not require the knowledge of the specific 
controlling variables, and (b) exigent circumstances do 
not allow for prolonged detective work to determine the 
existing reinforcement contingencies. 


5. Implement a “package” program. “It is unlikely that any 
given intervention derived from establishing operations, sen- 
sory extinction, differential reinforcement, or punishment 
would be used in isolation from other procedures. Package 
treatment approaches are common both in research and in 
clinical settings. In the case of automatic reinforcement, it is 
possible that a package approach would be highly suitable”. 
(Vollmer, 1994, p. 203) 


Each of these recommendations has advantages and disad- 
vantages, and none is a “silver bullet” that will instantly address 
automatic reinforcement’s effect on behavior. As Vollmer (1994) 
notes, “When reinforcers maintaining a behavior are not within 
the control of the therapist or experimenter (i.e., the social environ- 
ment), behavior is especially difficult to assess and treat” (p. 188). 
Still, these five recommendations provide a direction for analysts 
to determine their efficacy in individual instances. 

Responses may produce unconditioned or conditioned 
automatic reinforcement. Sundberg, Michael, Partington, and 
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Sundberg (1996) described a two-stage conditioning history that 
may account for conditioned automatic reinforcement. 


For example, a person may persist in singing or hum- 
ming a song while coming home from a movie despite no 
obvious direct reinforcement for singing. In order for this 
behavior to occur as automatically reinforced behavior, a 
special two-stage conditioning history is necessary. In stage 
one, some stimulus (e.g., a song) must be paired with an 
existing form of conditioned or unconditioned reinforce- 
ment (e.g., an enjoyable movie, popcorn, relaxation). As a 
result, the new stimulus can become a form of conditioned 
reinforcement (e.g., hearing the song may now be a new 
form of conditioned reinforcement). In stage two, the 
emission of a response (for whatever reason) produces a 
response product (i.e., the auditory stimuli produced by 
singing the song) that has topographical similarity to that 
previously neutral stimulus (e.g., the song), and may now 
have self-strengthening properties. (pp. 22—23) 


Several theorists have suggested that automatic reinforce- 
ment may help to explain the extensive babbling of infants and 
how babbling shifts naturally, without apparent intervention 
from others, from undifferentiated vocalizations to the speech 
sounds of their native language (e.g., Bijou & Baer, 1965; 
Mowrer, 1950; Skinner, 1957; Staats & Staats, 1963; Vaughan 
& Michael, 1982). Caregivers frequently talk and sing while 
holding, feeding, and bathing a baby. As a result of repeated 
pairing with various reinforcers (e.g., food, warmth), the care- 
giver’s voice may become a conditioned reinforcer for the 
baby. The baby’s babbling is automatically reinforced when it 
produces sounds that match or closely approximate the care- 
giver’s. At that point, “The young child alone in the nursery 
may automatically reinforce his own exploratory vocal behavior 
when he produces sounds that he has heard in the speech of 
others” (Skinner, 1957, p. 58). 

Although the idea that automatic reinforcement is a fac- 
tor in early language acquisition has been proposed repeatedly, 
few experimental analyses of the phenomenon have appeared in 
the literature (e.g., Miguel, Carr, & Michael, 2002; Sundberg, 
Michael, Partington, & Sundberg, 1996; Yoon & Bennett, 2000). 
Sundberg and colleagues (1996) reported the first study show- 
ing the effects of a stimulus—stimulus pairing procedure on 
the rate with which children emitted new vocal sounds with- 
out direct reinforcement or prompts to respond. Five children, 
ages 2 to 4 and representing a broad range of language abilities, 
served as subjects. During the prepairing (baseline) condition, 
the parents and in-home trainers sat a few feet away from the 
child and recorded each word or vocal sound emitted by the 
child as he played with a train set and several toys. Data were 
collected in consecutive 1-minute intervals. The adults did not 
interact with the subject during the prepairing baseline. The 
stimulus—stimulus pairing procedure consisted of a familiar 
adult approaching the child, emitting a target vocal sound, word, 
or phrase, and then immediately delivering a stimulus that had 
been established previously as a form of reinforcement for the 
child (e.g., tickles, praise, bouncing in a parachute held by 
adults). This stimulus—stimulus pairing procedure was repeated 
15 times per minute for | or 2 minutes. The adult used a variety 
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of pitches and intonations when voicing the target sound, word, 
or phrase. During the postpairing condition, which began imme- 
diately after the stimulus—stimulus pairings, the adult moved 
away from the child and conditions were the same as during the 
prepairing condition. 

The stimulus—stimulus pairing of a vocal sound, word, or 
phrase with an established reinforcer was followed by an increased 
frequency of the targeted word during the postpairing condition 
for all five children. Figure 11.6 shows the results of a represen- 
tative sample of one of three pairings conducted with Subject 2, 
a 4-year-old boy with autism. Subject 2 had a verbal repertoire of 
more than 200 mands, tacts, and intraverbals, but rarely emitted 
spontaneous vocalizations or engaged in vocal play.° During the 
prepairing condition, the child did not say the target word and 
emitted four other vocalizations at a mean rate of 0.5 per minute. 
The stimulus— stimulus pairing procedure consisted of pairing the 
word apple with tickles approximately 15 times in 60 seconds. 
Immediately after the pairing, the subject said “apple” 17 times 
in 4 minutes, a rate of 4.25 responses per minute. In addition, 
the child said “tickle” four times within the first minute of the 
postpairing condition. Sundberg and colleagues’ results provide 
evidence that the children’s vocal response products may have 
functioned as automatic conditioned reinforcement after being 
paired with other forms of reinforcement. 

In summing up the uses and limitations of automatic rein- 
forcement as a concept, Vollmer (2006) suggested that: 


* Practitioners should recognize that not all reinforcement 
is planned or socially mediated. 

* Some behaviors maintained by automatic reinforce- 
ment (e.g., self-stimulation, stereotypy) may not be 


reduced or eliminated with timeout, planned ignoring, or 
extinction. 

¢ Affixing the label automatic reinforcement to an observed 
phenomenon too quickly may limit our analysis and effec- 
tiveness by precluding further efforts to identify the actual 
reinforcer-maintaining behavior. 

¢ When socially mediated contingencies are difficult to 
arrange or simply not available, practitioners might con- 
sider automatic reinforcement as a potential aim. 


CLASSIFYING REINFORCERS 


In this section we review the technical classification of rein- 
forcers by their origin as well as several practical categories by 
which practitioners and researchers often describe and classify 
reinforcers by their formal characteristics. Regardless of type 
or classification, all reinforcers share their most important and 
defining characteristic: All reinforcers increase the future occur- 
rence of behavior that immediately precedes them. 


Classification of Reinforcers by Origin 


As introduced in Chapter 2, there are two basic types of reinforcers— 
a reinforcer that is the product of the evolution of the species (an 
unconditioned reinforcer) or a reinforcer that is the result of the 
learning history of the individual (a conditioned reinforcer). 


Unconditioned Reinforcers 


An unconditioned reinforcer is a stimulus change that func- 
tions as reinforcement even though the learner has had no partic- 
ular learning history with it. (The terms primary reinforcer and 
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Figure 11.6 Cumulative number of times a 4-year-old child with autism vocalized “apple” before and after “apple” had 
been paired repeatedly with an established form of reinforcement. Automatic reinforcement may explain the increased 


frequency of the child’s vocalizing “apple” after pairing. 


From “The Role of Automatic Reinforcement in Early Language Acquisition,” by M. L. Sundberg, J. Michael, J. W. Partington, and C. A. Sundberg, 1996 


The Analysis of Verbal Behavior, 13, p. 27. Copyright 1996 by the Association for Behavior Analysis, 


Inc. Used by permission. 


unlearned reinforcer are synonyms for unconditioned reinforcer.) 
Because unconditioned reinforcers are the product of the evolu- 
tionary history of a species (phylogeny), all biologically intact 
members of a species are more or less susceptible to reinforce- 
ment by the same unconditioned reinforcers. For example, food, 
water, oxygen, warmth, and sexual stimulation are examples of 
stimuli that do not have to undergo a learning history to function as 
reinforcers. Food will function as an unconditioned reinforcer for 
a human deprived of sustenance; water will function as an uncon- 
ditioned reinforcer for a person deprived of liquid, and so forth. 

Human touch may also be an unconditioned reinforcer 
(Gewirtz & Pelaez-Nogueras, 2000). Pelaez-Nogueras and 
colleagues (1996) found that infants preferred face-to-face 
interactions that included touch stimulation. Two condition- 
ing treatments were implemented in alternated counterbal- 
anced order. Under the touch condition, infants’ eye-contact 
responses were followed immediately by adult attention (eye 
contact), smiling, cooing, and rubbing the infants’ legs and feet. 
Eye-contact responses during the no-touch condition were fol- 
lowed by eye contact, smiles, and coos from the adult, but no 
touching. All of the babies in the study emitted eye contact for 
longer durations, smiled and vocalized at higher rates, and spent 
less time crying and protesting in the contingent condition that 
included touch. From these results and several related studies, 
Pelaez-Nogueras and colleagues concluded that “these results 
suggest that . . . touch stimulation can function as a primary 
reinforcer for infant behavior” (p. 199). 


Conditioned Reinforcers 


A conditioned reinforcer (sometimes called a secondary 
reinforcer or learned reinforcer) is a previously neutral stim- 
ulus change that has acquired the capability to function as a 
reinforcer through stimulus—stimulus pairing with one or more 
unconditioned reinforcers or conditioned reinforcers. Through 
repeated pairings, the previously neutral stimulus acquires the 
reinforcement capability of the reinforcer(s) with which it has 
been paired.’ For example, after a tone has been paired repeat- 
edly with food, when food is delivered as a reinforcer, the tone 
will function as a reinforcer when an EO has made food a cur- 
rently effective reinforcer. 

Neutral stimuli can also become conditioned reinforcers 
for humans without direct physical pairing with another rein- 
forcer through a pairing process Alessi (1992) called verbal 
analog conditioning. 


For example, a class of preschool children who have been 
receiving M&M candies for good school work might be 
shown pieces of cut up yellow construction paper and told, 
“These pieces of yellow paper are what big kids work for” 
(Engelmann, 1975, pp. 98-100). Many children in the 
group immediately refuse M&Ms, and work extra hard, but 
accept only pieces of yellow paper as their rewards. 

We might say that the pieces of yellow paper act 
as “learned reinforcers.” Laboratory research tells us that 
neutral stimuli become reinforcers only through direct 
pairing with primary reinforcers (or other “learned reinfore- 
ers”). Yellow paper was not paired with any reinforcer and 
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certainly not with the primary (M&Ms) reinforcers. Yellow 
paper acquired reinforcing properties even more powerful 
than the primary M&Ms reinforcers, as demonstrated by 
the children’s refusal to accept M&Ms, demanding instead 
pieces of yellow paper. (For the sake of this example, 
assume that the children had not been satiated with M&Ms 
Just before the session.) (p. 1368) 


It is sometimes thought that the “power” of a conditioned 
reinforcer is determined by the number of times it has been 
paired with other reinforcers. However, a statement such as, 
“The more often the tone is paired with food, the more reinforc- 
ing the tone will become” is not completely accurate. Although 
numerous pairings will increase the likelihood that the tone will 
function as a conditioned reinforcer in the first place (though a 
single pairing is sometimes sufficient), the momentary effective- 
ness of the tone as a reinforcer will be a function of the relevant 
EO for the reinforcer(s) with which the conditioned reinforcer 
has been paired. A tone that has been paired only with food will 
function as an effective reinforcer for a food-deprived learner, 
but the tone will have little effect as a reinforcer if the learner has 
just consumed a lot of food, regardless of the number of times 
it has been paired with food. 

A generalized conditioned reinforcer is a conditioned 
reinforcer that as a result of having been paired with many 
unconditioned and conditioned reinforcers does not depend on a 
current EO for any particular form of reinforcement for its effec- 
tiveness. For example, social attention (proximity, eye contact, 
praise) is a generalized conditioned reinforcer for many people 
because it has occurred simultaneously with many reinforcers. 
The more reinforcers with which a generalized conditioned rein- 
forcer has been paired, the greater the likelihood that it will be 
effective at any given time. Because it can be exchanged for a 
nearly limitless variety of backup reinforcers, money is a gen- 
eralized conditioned reinforcer whose effectiveness is usually 
independent of current establishing operations. 

It is sometimes thought that a conditioned reinforcer 1s 
called a generalized conditioned reinforcer because it can func- 
tion as reinforcement across a wide range of behaviors. This, 
however, is not so. Any reinforcer is capable of strengthening 
the future occurrence of any similar behavior that immediately 
precedes its occurrence. A conditioned reinforcer is called a gen- 
eralized conditioned reinforcer because it is effective as rein- 
forcement across a wide range of EO conditions. Because of 
their versatility across EO conditions, generalized conditioned 
reinforcers offer great advantages for practitioners, who often 
have limited control of the EOs for particular reinforcers. 

Generalized conditioned reinforcers provide the basis for 
implementing a token economy, a reinforcement-based system 
capable of improving multiple behaviors of multiple participants 
(Alstot, 2015; Campbell & Anderson, 2011; Hirst, Dozier, & 
Payne, 2016). In a token economy, participants receive tokens 
(e.g., points, checkmarks, poker chips) contingent on a vari- 
ety of target behaviors. Participants accumulate the tokens 
and exchange them at specific times for their choices from a 
menu of backup reinforcers (e.g., free time, computer time, 
snacks). Token economy systems and guidelines for designing 
and implementing them are described in Chapter 28. 
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Classification of Reinforcers by Formal 
Properties 


When applied behavior analysts describe reinforcers by their 
physical properties—a practice that can enhance communication 
among researchers, practitioners, and the agencies and people 
they serve—reinforcers are typically classified as edible, sen- 
sory, tangible, activity, or social. 


Edible Reinforcers 


Researchers and practitioners have used bites of preferred foods, 
snacks, candy, and sips of drinks as reinforcers. One interesting 
and important use of edibles as reinforcers is in the treatment of 
chronic food refusal in children. For example, Riordan, Iwata, 
Finney, Wohl, and Stanley (1984) used “highly preferred food 
items” as reinforcers to increase the food intake of four children 
at a hospital treatment facility. The treatment program consisted 
of dispensing the high-preference food items (e.g., cereal, 
yogurt, canned fruit, ice cream) contingent on the consumption 
of a target food item (e.g., vegetables, bread, eggs). 

Kelley, Piazza, Fisher, and Oberdorff (2003) used edible 
reinforcers to increase cup drinking by Al, a 3-year-old boy who 
had been admitted to a day treatment program for food refusal and 
bottle dependency. The researchers measured the percentage of 
trials in which Al consumed 7.5 ml of three different liquids from 
the cup. During baseline, when Al was praised if he consumed the 
drink, his consumption averaged 0%, 44.6%, and 12.5% of trials 
for orange juice, water, and a chocolate drink, respectively. During 
the positive reinforcement component of the cup-drinking inter- 
vention, each time Al consumed the drink, the therapist praised him 
(as was done in baseline) and delivered a level spoon of peaches 
(a preferred food) to his mouth. Al consumed all three beverages 
on 100% of the trials during the positive reinforcement condition. 

Fahmie, Iwata, and Jann (2015) found that 12 individuals 
diagnosed with intellectual disabilities or autism and ranging 
in age from 5 to 22 years preferred 15-sec access to a single 
edible reinforcer (e.g., M&M, raisin, pretzel) to 15-sec access 
to a leisure item (e.g., music box, plush bear, maraca) and that 
edible reinforcers maintained responding somewhat longer than 
leisure-item reinforcers. Regarding the implications of these 
results for effective practice, Fahmie and colleagues noted: 


These advantages provide evidence in favor of including 
edible items in preliminary preference assessments. One 
additional advantage of edible reinforcers is that they are 
delivered more easily and efficiently, which may have an 
overall beneficial effect on training in the long run. Never- 
theless, researchers (e.g., Ferrari & Harris, 1981; Rincover 
& Newsom, 1985) have emphasized the relative benefits of 
leisure stimuli during training, stating that leisure stimuli 
are more natural, are less deleterious to health (see also 
Behavior Analysis Certification Board, 2014, Guideline 
4.10), and promote more active engagement. Of course, 
behavior analysts also are responsible for providing the 
most effective treatment (Behavior Analyst Certification 
Board, 2014, Guideline 2.09). This study provides some 
additional evidence to help balance these goals. (p. 342, 
BACB citations updated to most recent version). 


Sensory Reinforcers 


Various forms of sensory stimulation such as vibration (e.g., 
massager), tactile stimulation (e.g., tickles, strokes with a 
feather boa), flashing or sparkling lights, and music have been 
used effectively as reinforcers (e.g., Bailey & Meyerson, 1969; 
Ferrari & Harris, 1981; Gast et al., 2000; Hume & Crossman, 
1992; Rincover & Newsom, 1985; Vollmer & Iwata, 1991). 


Tangible Reinforcers 


Items such as stickers, trinkets, school materials, trading cards, 
and small toys often serve as tangible reinforcers. An object’s 
intrinsic worth is irrelevant to its ultimate effectiveness as a 
positive reinforcer. Virtually any tangible item can serve as a 
reinforcer. Remember Engelmann’s (1975) kindergarten stu- 
dents who worked for yellow slips of paper! 


Activity Reinforcers 


When the opportunity to engage in a certain behavior functions 
as reinforcement, that behavior is an activity reinforcer. Activity 
reinforcers may be everyday activities (e.g., playing a board 
game, leisure reading, listening to music), privileges (e.g., lunch 
with the teacher, shooting baskets, first in line), or special events 
(e.g., a trip to the zoo). 

McEvoy and Brady (1988) evaluated the effects of contin- 
gent access to play materials on the completion of math work- 
sheets by three students with autism and behavior disorders. 
During baseline, the teacher told the students to complete the 
problems as best that they could and to either complete other 
unfinished assignments or “find something else to do” if they 
finished the worksheets before a 6-minute timing elapsed. 
No other prompts or instructions were given for completing 
the worksheets. The teacher praised the completion of the 
worksheets. 

On the first day of intervention for each student, he was 
taken to another room and shown a variety of toys and play mate- 
rials. The teacher told the student he would have approximately 
6 minutes to play with the materials if he met a daily criterion 
for completing math problems. Figure 11.7 shows the results. 
During baseline, the rate at which all three students correctly 
completed problems was either low (Dicky) or highly variable 
(Ken and Jimmy). When contingent access to the play activities 
was introduced, each student’s completion rate increased and 
eventually exceeded criterion levels. 

Premack (1959) hypothesized that activity reinforcers can 
be identified by looking at the relative distribution of behav- 
iors in a free operant situation. Premack believed that behaviors 
themselves could be used as reinforcers and that the relative 
frequency of behavior was an important factor in determining 
how effective a given behavior might be as a reinforcer if the 
opportunity to engage in the behavior is contingent on another 
behavior. The Premack principle states that making the oppor- 
tunity to engage in a behavior that occurs at a relatively high 
free operant (or baseline) rate contingent on the occurrence of 
low-frequency behavior will function as reinforcement for the 
low-occurrence behavior. For a student who typically spends 
much more time watching TV than doing homework, a con- 
tingency based on the Premack principle (informally known as 
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Sessions 


“Grandma’s Law”) might be, “When you have finished your 
homework, you can watch TV.” 

Building on Premack’s concept, Timberlake and Allison 
(1974) proposed the response-deprivation hypothesis as a 
model for predicting whether access to one behavior (the con- 
tingent behavior) will function as reinforcement for another 
behavior (the instrumental response) based on the relative base- 
line rates at which each behavior occurs and whether access to 
the contingent behavior represents a restriction compared to the 
baseline level of engagement. Restricting access to a behavior 
presumably acts as a form of deprivation that serves as an EO, 
thus making the opportunity to engage in the restricted behav- 
ior an effective form of reinforcement (Allison, 1993; Iwata & 
Michael, 1994). 

Iwata and Michael (1994) cited a series of three studies 
by Konarski and colleagues as demonstrating the veracity and 
applied implications of the response-deprivation hypothesis. 
In the first study, when students were given access to coloring 
(a high-probability behavior contingent on completing math 
problems (a low-probability behavior), they spent more time 
doing math, but only if the reinforcement schedule represented 
a restriction of the amount of time spent coloring compared to 


Figure 11.7 Number of math problems completed 
correctly and incorrectly per minute by three special 
education students during baseline and contingent 
access to play materials. Dashed horizontal lines 
indicate criteria. 


From “Contingent Access to Play Materials as an Academic Motiva- 
tor for Autistic and Behavior Disordered Children,” by M. A. McEvoy 
and M. P. Brady, 1988, Education and Treatment of Children, 11, 

p. 15. Copyright 1998 by the Editorial Review Board of Education 
and Treatment of Children. Used by permission. 


baseline (Konarski, Johnson, Crowell, & Whitman, 1980). The 
researchers found that a contingency in which students could 
earn more time coloring than they did in baseline for complet- 
ing math problems was ineffective. These basic findings were 
reproduced in a subsequent study in which access to reading 
(or math, depending on the subject) was contingent on math 
(or reading) (Konarski, Crowell, Johnson, & Whitman, 1982). 
In the third study, Konarski, Crowell, and Duggan (1985) took 
the response-deprivation hypothesis a step further by examin- 
ing the “reversibility of reinforcement” within subjects; that is, 
engaging in either of two activities—reading or math—could 
serve as reinforcement for increased performance in the other 
activity, in a response-deprivation condition for the contingent 
activity. Response deprivation for reading as the contingent 
response resulted in increases in math (instrumental response); 
conversely, response deprivation for math as the contingent 
response produced increases in reading. In all three studies, 
response restriction was the key factor in determining whether 
access to the contingent response would be reinforcing. 
Iwata and Michael (1994) concluded that the collective 
results of Konarski and colleagues’ studies illustrate each of 
three predictions based on the response-deprivation hypothesis 
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(assume the ratio of baseline rates of doing homework to watch- 
ing TV is 1:2 in the following examples): 


* Reinforcement of a low-rate target behavior when access 
to a high-rate contingent behavior is restricted below base- 
line levels (e.g., 30 minutes of homework gets access to 
30 minutes of TV). 

¢ Nonreinforcement of a low-rate behavior when access 
to a high-rate contingent behavior is not restricted below 
baseline levels (e.g., 30 minutes of homework gets access 
to 90 minutes of TV). 

¢ Reinforcement of a high-rate target behavior when access 
to the low-rate behavior is restricted below baseline levels 
(e.g., 30 minutes of TV yields 5 minutes of homework). 


Although recognizing that practitioners seldom design 
reinforcement programs to increase the rate of behaviors such as 
TV watching that already occur at high rates, Iwata and Michael 
(1994) noted that: 


There are a number of instances in which one may wish to 
produce highly accelerated performance (e.g., as in superla- 
tive academic or athletic performance that is good to begin 
with). In such cases, one need not find another activity 

that occurs at a higher rate to serve as reinforcement if one 
could arrange a suitable deprivation schedule with an activ- 
ity that occurs at a relatively low rate. (p. 186) 


As with all other descriptive categories of reinforcers, 
there is no a priori list that reveals what activities will or will not 
function as reinforcers. An activity that serves as effective rein- 
forcement for one learner might have quite another effect on the 
behavior of another learner. For example, in Konarski, Crowell, 
and colleagues’ (1982) study, access to math functioned as rein- 
forcement for doing more reading for three students, whereas get- 
ting to read was the reinforcer for completing math problems for 
a fourth student. Many years ago, a classic cartoon brought home 
this crucial point very well. The cartoon showed two students duti- 
fully cleaning the chalkboard and erasers after school. One student 
said to the other, ““You’re cleaning erasers for punishment!? I get 
to clean erasers as a reward for completing my homework.” 


Social Reinforcers 


Physical contact (e.g., hugs, pats on the back), proximity gg 
approaching, standing, or sitting near a person), attention, 
and praise are examples of events that often serve as social 
reinforcers. Adult attention is one of the most powerful and 
generally effective forms of reinforcement for children. The 
nearly universal effects of contingent social attention as rein- 
forcement has led some behavior analysts to speculate that 
some aspects of social attention may entail unconditioned rein- 
forcement (e.g., Gewirtz & Pelaez-Nogueras, 2000; Vollmer 
& Hackenberg, 2001). 

The original experimental demonstrations and discovery 
of the power of adults’ social attention as reinforcement for chil- 
dren’s behavior took place in a series of four studies designed 
by Montrose Wolf and carried out by the preschool teachers at 
the Institute of Child Development at the University of Wash- 
ington in the early 1960s (Allen, Hart, Buell, Harris, & Wolf, 


1964: Harris, Johnston, Kelly, & Wolf, 1964; Hart, Allen, Buell, 
Harris, & Wolf, 1964; Johnston, Kelly, Harris, & Wolf, 1966). 
Describing those early studies, Risley (2005) wrote: 


We had never seen such power! The speed and magnitude 
of the effects on children’s behavior in the real world 

of simple adjustments of something so ubiquitous as 
adult attention was astounding. Forty years later, social 
reinforcement (positive attention, praise, “catching them 
being good’) has become the core of most American 
advice and training for parents and teachers—making 

this arguably the most influential discovery of modern 
psychology. (p. 280) 


Because of the profound importance of this long-known 
but underused phenomenon, we describe a second study show- 
ing the effects of contingent attention as reinforcement for 
children’s behavior. The first volume of the Journal of Applied 
Behavior Analysis included no fewer than seven studies building 
on and extending Wolf and colleagues’ pioneering research on 
social reinforcement.® R. Vance Hall and colleagues conducted 
two of those studies. Like the Hall, Lund, and Jackson (1968) 
study, which introduced this chapter and from which we selected 
the example of a teacher’s use of positive reinforcement with 
Robbie, the three experiments reported by Hall, Panyan, Rabon, 
and Broden (1968) continue to serve as powerful demonstrations 
of the effects of teacher attention as social reinforcement. 

Participating in one of the experiments was a first-year 
teacher whose class of 30 sixth-graders exhibited such high rates 
of disruptive and off-task behaviors that the school principal 
described the class as “completely out of control.” Throughout 
the study, Hall, Panyan, Rabon, and Broden (1968) measured 
teacher attention and student behavior during a continuous 
30-minute observation period in the first hour of the school day. 
The researchers used a 10-second partial-interval observation 
and recording procedure to measure study behavior (e.g., writing 
the assignment, looking in the book, answering the teacher’s 
question) and nonstudy behavior (e.g., talking out, being out of 
seat, looking out the window, fighting or poking a classmate). 
The observers also recorded the occurrence of teacher atten- 
tion in each interval. Each instance of teacher verbal attention, 
defined as a comment directed to a student or group of students, 
was recorded with a “+” if it followed appropriate study behav- 
ior and with a “—” if it followed nonstudy behavior. 

During baseline the class had a mean percentage of inter- 
vals of study behavior of 44%, and the teacher made an aver- 
age of 1.4 comments following study behavior per session (see 
Figure 11.8). “Almost without exception those [comments] that 
followed study behavior were approving and those that followed 
nonstudy behavior were in the form of a verbal reprimand” 
(Hall, Panyan, Rabon, & Broden, 1968, p. 316). The level of 
study behavior by the class was 90% on one day when the help- 
ing teacher presented a demonstration lesson (see data points 
marked by solid arrows). On three occasions during baseline 
(data points marked by open arrows), the principal met with 
the teacher to discuss his organizational procedures in an effort 
to improve the students’ behavior. These counseling sessions 


Chapter 11 * Positive Reinforcement 299 
rl Baseline — Reversal Reinforcement, Post 
: J : 
4S 
S 30 
3 a 
a 
5 
& 60 \ - 
ovo 
£ 
= 
‘5 40- 
wo 
f=) 
ic) 
i 
S 20-4 
2 
Ve, I I iia = I 1 
5 10 15 20 25 30 35 40 45 50 55 
~ ~ 40 
ae | , 
Be " 
25 shad om bd 
oe @ 4 
Se ) 
Et 0+eees_*o—_o ee fet = 
5 10 iis 20 25 
Sessions 


Figure 11.8 A record of class study behavior and teacher attention for study behavior during reading period in a sixth- 


grade classroom. Baseline = before experimental procedures; Reinforcement 1 


increased teacher attention for study; 


Reversal = removal of teacher attention for study; Reinforcement 2 = return to increased teacher attention for study. 
Post-follow up checks occurred up to 20 weeks after termination of the experimental procedures. 


From “Instructing Beginning Teachers in Reinforcement Procedures Which Improve Classroom Control,” by R. V. Hall, M. Panyan, D. Rabon, and M. Broden, 
1968, Journal of Applied Behavior Analysis, 1, p. 317. Copyright by the Society for the Experimental Analysis of Behavior, Inc. Reprinted by permission. 


resulted in the teacher writing all assignments on the board (after 
the first meeting) and changing the seating chart (after the third 
meeting). Those changes had no apparent effect on the students’ 
behavior. 

Prior to the first day of the reinforcement condition, the 
teacher was shown baseline data on the class study behavior and 
the frequency of teacher attention following study behavior. The 
teacher was instructed to increase the frequency of positive com- 
ments to students when they were engaged in study behavior. 
After each session during this condition, the teacher was shown 
data on the level of class study behavior and the frequency of 
his comments that followed study behavior. During the first rein- 
forcement phase, teacher comments following study behavior 
increased to a mean frequency of 14.6, and the mean level of 
study behavior was 72%. The teacher, principal, and data col- 
lectors reported that the class was under better control and that 
noise had decreased significantly. 

During a brief return of baseline conditions, the teacher 
provided “almost no reinforcement for study behavior,” and a 
sharp downward trend in class study behavior was observed. 
The teacher, principal, and data collectors all reported that dis- 
ruptive behavior and high noise levels had returned. The rein- 
forcement conditions were then reinstated, which resulted in a 
mean frequency of 14 teacher comments following study behav- 
ior, and a mean level of 716% of intervals of study behavior. 


IDENTIFYING POTENTIAL REINFORCERS 


In the laboratory, we had learned to use a simple test: 
Place a candy in the palm of our hand, show it to the child, 
close our fist fairly tightly around the candy, and see if the 
child will try to pull away our fingers to get at the candy. 
If he or she will do that, even against increasingly tightly 
held fingers, the candy is obviously a reinforcer. 

—Murray Sidman (2000, p. 18) 


The ultimate success of many behavior change programs 
requires an effective reinforcer that the practitioner or researcher 
can deliver to increase future occurrences of the desired behav- 
ior targeted for improving. Fortunately, identifying effective 
and accessible reinforcers for most learners is relatively easy. 
Sidman (2000) described a quick and simple method for deter- 
mining whether candy would likely function as a reinforcer. 
However, every stimulus, event, or activity that might function 
as a reinforcer cannot be held in the palm of the hand. 
Identifying robust and reliable reinforcers for learners 
with severe and/or multiple disabilities may pose a major chal- 
lenge. Although many common events (¢.g., praise, music, free 
time, tokens) serve as effective reinforcers for most people, these 
stimuli do not function as reinforcers for all learners. Time, 
energy, and resources would be lost if planned interventions 
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were to fail because a practitioner used a presumed, instead of 
an actual, reinforcer. 

Also, reinforcer preferences shift, and the transitory and 
idiosyncratic nature of preference has been reported repeatedly 
in the literature (Bishop & Kenzer, 2012; Carr, Nicholson, & 
Higbee, 2000; DeLeon et al., 2001; Ortiz & Carr, 2000; Shapiro, 
Kazemi, Pgosjana, Rios, & Mendoza, 2016). Preferences may 
change with the person’s age, interest level, time of day, social 
interactions, deprivation and satiation levels, and the presence 
of establishing operations (EOs) (Gottschalk, Libby, & Graff, 
2000; Whitehouse, Vollmer, & Colbert, 2014). What a teacher 
asks in September to determine preferences may have to be 
repeated a month later or a day later. Likewise, a therapist who 
asks a client what is reinforcing during a morning session may 
find that this stimulus is not preferred in an afternoon session 
(Bishop & Kenzer, 2012). 

After reviewing 13 published studies that evaluated prefer- 
ences and reinforcers for people with profound multiple disabil- 
ities, Logan and Gast (2001) concluded that preferred stimuli do 
not always function as reinforcers, and preferred stimuli at one 
point in time changed later. Additionally, individuals with behav- 
ioral or severe to profound developmental disabilities may engage 
in activities for such a limited time, or shift preferences within 
teaching sessions, that it is difficult to clearly determine whether 
a stimulus change is a reinforcer (Bishop & Kenzer, 2012). 

To meet the challenge of identifying effective reinforc- 
ers, researchers and practitioners have developed a variety of 
procedures that fall under the twin headings of stimulus pref- 
erence assessment and reinforcer assessment. Stimulus prefer- 
ence assessment and reinforcer assessment are often conducted 
in tandem to determine if a stimulus identified as a reinforcer 
actually functions as one (Kang et al., 2013; Lee, Yu, Martin 
& Martin, 2010; Piazza, Fisher, Hagopian, Bowman, & Toole, 
1996; Whitehouse et al., 2014). Piazza et al. (1996) state: 


During preference assessments, a relatively large num- 
ber of stimuli are evaluated to identify preferred stimuli. 
The reinforcing effects of a small subset of stimuli (i.e., 
the highly preferred stimuli) are then evaluated during 


reinforcer assessment. Although the preference assessment 
is an efficient procedure identifying potential reinforcers 
from a large number of stimuli, it does not evaluate the 
reinforcing effects of the stimuli. (pp. 1-2) 


Stimulus preference assessment identifies stimuli that are 
likely highly preferred and thus likely to serve as reinforcers. 
As Kelley, Shillingsburg, and Bowen (2016) state, “The value 
of preference assessments is grounded in the extent to which the 
identified stimuli ultimately function as reinforcers for socially 
important behaviors” (p. 394). 

Reinforcer assessment, in contrast, puts the highly pre- 
ferred, potential reinforcers to a direct test by presenting them 
contingent on occurrences of a behavior and measuring perfor- 
mance effects (Fisher, Piazza, Bowman, & Amari, 1996). In this 
section we describe a variety of techniques developed by research- 
ers and practitioners for conducting stimulus preference assess- 
ments and reinforcer assessment (see Figures 11.9 and 11.10). 
These methods form a continuum of approaches ranging from 
simple and quick to more complex and time-consuming. 


Stimulus Preference Assessment 


Stimulus preference assessment (SPA) refers to a variety of 
procedures used to determine (a) the stimuli that the person dif- 
ferentially selects, (b) the relative hierarchical preference value 
of those stimuli (high preference to low preference), (c) the 
conditions under which those preference values change when 
task demands, deprivation states, or schedules of reinforcement 
are modified, and (d) whether highly preferred items ultimately 
serve as effective reinforcers. Generally speaking, stimulus 
preference assessment is conducted using a three-step process: 
(1) A large pool of stimuli that might serve as reinforcers is 
gathered, (2) those stimuli are presented to the target person sys- 
tematically to identify preference, and (3) high- and sometimes 
low-preference items are “tested” experimentally to determine 
the conditions under which they serve as reinforcers (Livingston 
& Graff, 2018). It is essential for practitioners to narrow the 
initial field of possible preferred stimuli to those that have good 
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Figure 11.9 Stimulus preference assessment and reinforcer assessment methods for identifying potential reinforcers. 
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Figure 11.10 Commonly used stimulus preference assessments. 


Description of Assessment 


Name of Assessment (Citation) 
Single-stimulus (SS) (Pace et al., 1985) 


Across a series of trials, stimuli are presented one at a time. Approach responses 
(e.g., moving hand or body toward the item) are recorded. Preference hierarchies 
are established by calculating the percentage of approach responses per stimulus. 


Across a series of trials, stimuli are presented two at a time; individuals can 
approach (i.e., select) only one item on a trial. Approach responses are recorded. 
Preference hierarchies are established by calculating the percentage of approach 
responses per stimulus. 


Paired-stimulus (PS) (Fisher et al., 1992) 


Multiple-stimulus-without-replacement 


At the start of each session, multiple stimuli are placed in front of the individ- 
(MSWO) (DeLeon & Iwata, 1996) 


ual, who can select one. Approach responses are recorded. The selected item 

is not replaced, and the positions of the remaining stimuli are changed. Then, 
the individual selects from the remaining items. Continue in this manner until all 
items have been selected or the individual stops selecting items. Typically, several 
sessions are conducted. Preference hierarchies are established by calculating the 
percentage of approach responses per stimulus across all sessions. 


Brief Free Operant (FO) (Roane et al., 1998) | Multiple stimuli are placed on a tabletop, and participants are free to engage 


with any of the items for 5 min. Duration of engagement with each object 
(e.g., manipulating objects) is measured. Preference hierarchies are established by 
ranking items according to the duration of object manipulation for each stimulus. 


From “Assessing Preferences of Individuals with Developmental Disabilities: A Survey of Current Practices,” by R. B. Graff and A. M. Karsten, 
2012a, Behavior Analysis in Practice 5(2), p. 38. Used with permission. 


odds of functioning as reinforcers; otherwise, the assessment 
process will be protracted and intervention delayed. 
Methodologically, SPA can be conducted using at least five 
forms: single-stimulus (SS), paired-stimulus (PS), free operant 
(FO), multiple-stimulus with item replacement (MSW), and mul- 
tiple-stimulus without item replacement (MSWO). Each SPA 
method has advantages or disadvantages with respect to bias, 
assessment time required, likelihood of identifying reinforcers, 
and the chance that the assessment activity itself may evoke prob- 
lem behavior (see Figure 11.11) (Karsten, Carr, & Lepper, 2011). 
As illustrated in Figure 11.9 these five methodological vari- 
ations can be clustered into three categories: (1) ask the person 
(or significant others) to identify preferred stimuli; (2) employ 


Method 


a free operant procedure by observing the person approaching, 
interacting, or engaging with various stimuli; and (3) measure 
responses to trial-based pairs or multiply presented stimuli by 
replacing or not replacing the stimulus after preference selection. 

In choosing which method to use, practitioners must bal- 
ance three competing perspectives: (a) gaining the maximum 
amount of preference assessment data within the least amount 
of time, but with false positives (i.e., believing a stimulus is 
preferred when it is not); (b) conducting a more extensive and 
time- and labor-intensive assessment that will delay interven- 
tion, but may yield more conclusive results; and (c) assessing 
the skill level of colleagues conducting the stimulus preference 
assessment and any initial or recurrent training needs they have 


Figure 11.11 Assets and barriers associated with stimulus preference assessments. 


Potential barriers 


activities 


Assets 
MSWO Likely to identify multiple reinforcers in minimal time Positional bias, limited to smaller tabletop items and 
fewer items; requires more time than FO assessment 
BS Likely to identify multiple reinforcers, accommodates Positional bias, requires more time than MSWO and 
larger tabletop items and a greater number of items FO assessments 
SS Likely to identify multiple reinforcers, accommodates __ False positive results, less likely to identify relative 
larger items and activities preferences than MSWO and PS methods except 
when duration of engagement is also measured 
FO Less likely to evoke problem behavior, requires Less likely to identify multiple reinforcers than other 


minimal time, accommodates larger items and 


methods except when assessment is repeated without 
most preferred item 


Note: MSWO = multipe stimulus without replacement; PS = paired stimulus, SS = single stimulus; FO = free operant. 


From “Description of a Practitioner Model for identifying Preferred Stimuli with Individuals with Autism Spectrum Disorders, " by A. M. Karsten, 
J. E. Carr, and T. L. Lepper, 2011, Behavior Modification, 35(4), p. 350. Copyright by Sage Publication. Reprinted by permission. 
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(Leaf et al., 2015). To assist analysts and practitioners with 
deciding which of the five methodological variations to employ, 
Karsten et al. (2011) proposed a three-step model that progresses 
from (a) choosing the SPA that is likely to yield an accurate 
hierarchy of high-preference stimuli, (b) modifying that SPA 
approach based on actual data collected during the SPA, and 
(c) verifying that the high-preference stimuli function as 
reinforcers. 

Stil, a practical roadblock may affect the implementation 
of the best of intended SPA models. Graff and Karsten (2012a) 
stated that upwards of one half of the respondents to their SPA 
questionnaire reported that they had little to no experience with 
conducting SPAs. Given that conducting some level of a SPA 
has become the gold standard of best practice for identifying 
highly preferred stimuli and ultimately reinforcers (Leaf et al., 
2018), it seems important that training programs at the pre- and 


in-service levels address this shortcoming. Deliperi, Vladescu, 
Reeve, Reeve, and DeBar (2015) state the issue succinctly: 


Due to the reliance on reinforcement in interventions for 
consumers with disabilities, assessing the preferences of 
these individuals is an important part of effective pro- 
gramming. The failure to identify stimuli that function as 
reinforcers may compromise skill acquisition and behavior 
reduction programs. Therefore, identifying and implement- 
ing effective training procedures are essential to increasing 
the likelihood that staff will conduct preference assessments 
and use data from these assessments to guide their selection 
of consequences during subsequent programming. (p. 324) 


Box 11.2 discusses a number of questions associated 
with training practitioners to conduct stimulus preference 
assessments. 


Training Practitioners to Conduct Stimulus Preference Assessments (SPA) 


Identifying reinforcers is key to a successful behavior change 
program. But busy, inexperienced, and inadequately trained 
preservice or in-service practitioners, who now also lack time 
and resources, are not likely to engage in conducting stimulus 
preference assessments without training from an experienced 
analyst (Graff & Karsten, 2012a). Although not intended to 
be exhaustive, the following frequently asked questions may 
provide a starting point to address stimulus preference assess- 
ment training issues for inexperienced, novice, or poorly 
trained preservice practitioners within either an individual or 
group-training context. 


For SPAs to Be Conducted, What Skills Must Be Taught? 


There appears to be consensus in the literature that practi- 
tioners need three main skills: (1) identifying stimulus items 
for the assessment; (2) conducting the SPA with a confederate 
or actual consumer; and (3) scoring and interpreting the results 
of the assessment to determine high- and low-preference stim- 
uli. (Source: Deliperi et al., 2015) 


Can Inexperienced Practitioners Learn the Skills 
Necessary to Conduct Reliable, Accurate, and 
Worthwhile Stimulus Preference Assessments? 


Yes. Although the methods might vary and depend on local 
available resources, broadband Internet connectivity, and skill 
level of the trainer, it is quite possible—even desirable—to 
train (or retrain) inexperienced practitioners to use SPA. Doing 
so will give the teacher, parent, or caregiver direction for using 
those assessment data to further determine actual reinforcers 
that will affect future occurrences of target behaviors for the 
persons in their care. (Source: Ringdahl et al., 1997) 


How Long Does It Take to Train an Inexperienced 
Person to Conduct a SPA? 


The good news is that within several training sessions trainees 
can gain sufficient skill to conduct an SPA, especially if the 
training provides combinations of video models, scripts, voice- 
overs, practice, feedback, and supervision. (Sources: Bovi, 
Vladescu, DeBar, Carroll, & Sarokoff, 2017; Deliperi et al., 
2015; Weldy, Rapp, & Capocosa, 2014) 


Will Warranty Work Be Required? Does the 
Training Stick? 


Whether future “booster” or “refresher” sessions will be 
required to maintain skills depends on a number of variables. 
For example, if novice practitioners only use SPA a few times 
per year, you can expect loss of initial skill acquisition. If, 
however, they use SPA regularly, it is more likely that initial 
training will be maintained, especially if they receive follow-up 
feedback and have access to review materials (prior training 
videos, embedded directions, etc.). In these cases, as the behav- 
ior analyst trainer, you would have more confidence that initial 
skills would be maintained. (Source: Deliperi et al., 2015) 


Does SPA Training Only Work for “Ask” or “Free 
Operant” Categories of SPA? 


No. A number of researchers have shown that analysts can train 
inexperienced practitioners to use ask, free operant, and trial-based 
variations of SPA. The complexity of the SPA methodology is not 
a determining or self-limiting factor in training. Inexperienced par- 
ticipants can learn a variety of SPA methods. (Sources: DeLeon 
& Iwata, 1996; Fisher et al., 1992: Hanley, Iwata, Roscoe, 2006; 
Pace, Ivancic, Edwards, Iwata, & Page, 1985) 


Will Brief Written Instructions Alone—Like Those 
Found in a Method Section of a Journal Article—Be 
Sufficient for Training? 


Probably not. Simply providing an opportunity for inexpe- 
rienced practitioners to read the method section of a journal 
article on SPA is not likely to be successful. A number of 
researchers have demonstrated that written instructions 
alone are not robust enough to bring inexperienced practi- 
tioners up to a standard whereby they can conduct an SPA 
without supervision. Additional and supplementary mate- 
rials are needed to “enhance” written instructions. These 
enhancements might include providing pictures, video mod- 
els, role playing, voiceovers, feedback, and so on. (Sources: 
Graff & Karsten, 2012b; Lavie and Sturmey, 2002; Ramon, 
Yu, Martin, & Martin, 2015; Roscoe & Fisher, 2008; 
Shapiro et al., 2016). 


If | Develop a Self-instruction Manual, Will 
That Work? 


It depends. Although self-instruction manuals can be an 
improvement over brief, written methodological synopses 
excerpted from journals, the scope and sequence of the man- 
ual would be of primary importance. Otherwise, your trainee 
might be left with the equivalent of “written instructions 
alone,” which has not been shown to be effective. Using a 
self-instruction manual alone in this manner would be tan- 
tamount to a “train and hope” process. However, if the man- 
ual provides introductory content, step-by-step directions, a 
procedural checklist, exercises, an answer key, and methods 
for recording, summarizing, and interpreting SPA data, the 
manual’s utility is enhanced considerably. If the trainer uses 
feedback from the trainees as a way to improve the accuracy 
and completeness of the manual, the manual would evolve 
over time into a more effective document. (Source: Ramon 
et al., 2015) 


Asking About Stimulus.Preferences 


A person’s preference for various stimuli might be determined 
by merely asking what she likes. Asking can greatly reduce the 
time needed for more intrusive stimulus preference assessment, 
and it often yields information that can be integrated within an 
intervention program. Several variations of asking exist: asking 
the target person, asking significant others in the person’s life, 
or offering a pretask choice assessment. 


Asking the Target Person. A straightforward method for deter- 
mining stimulus preference is to ask the target person what she 
likes. Typical variations include asking open-ended questions, 
providing the person with a list of choices or asking her to 
rank-order a list of choices. 
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Can | Use Video, Multimedia, or a Smartphone App to 
Teach SPA? 


Yes. Video methods have been shown to be effective, espe- 
cially when combined as a “package approach,” including 
feedback, embedded text, rehearsal, and supervision. Also, 
trainers can use smartphone apps (e.g., Plickers.com) to con- 
duct basic SPA formative assessments with groups of students. 
(Sources: Hansard & Kazemi, 2018; Lavie & Sturmey, 2002; 
Radley, Dart, Battaglia, & Ford, 2018) 


| Have a Number of Practitioners to Teach. Can | Teach 
Within a Group Context? 


Yes. Teaching practitioners within a small-group context can 
increase training efficiency, without loss of effect. (Source: 
Weldy et al., 2014) 


| Live in a Rural/Remote Area. Can | Use Distance 
Learning with My Staff? 


Yes. One of the remarkable benefits of high-speed, broadband 
Internet connections is that telecommunication links can be 
established and operated remotely. Training staff in distant loca- 
tions appears to be at least as comparable to in-person training 
with respect to initial skill acquisition and maintenance, even 
without in-person contact or on-site trainers. (Source: Higgins, 
Luczynski, Carroll, Fisher, & Mudford, 2017) 

Choosing a method to teach inexperienced practitioners rests 
with the trainer’s assessment of current staff resources (e.g., time, 
physical facilities, exigent circumstances, location) and consumer 
needs. Although a number of methods can be effective, depending 
on individual or group instruction, trainers would be advised to 
follow a “‘model-lead-test” sequence to ensure initial training to 
mastery. If necessary, be prepared to provide follow-up “booster” 
sessions to ensure maintenance and retention. Do not assume that 
“once taught is forever learned,” especially if the practitioner has 
few opportunities to implement SPA after initial training. 


* Open-ended questions. Depending on the learner's language 
abilities, an open-ended assessment of stimulus preference 
can be done orally or in writing. The person may be asked to 
name preferences among general categories of reinforcers— 
for example, “What do you like to do in your free time?” 
“What are your favorite foods and drinks?” “Are there any 
types of music or performers whose music you like?” An 
open-ended assessment can be accomplished simply by ask- 
ing the learner to list as many favorite activities or items as 
possible. She should list not only everyday favorite things 
and activities but also special items and activities. Figure 
28.15 is simply a sheet with numbered lines on which family 
members identify potential rewards they would like to earn 
by completing tasks on contingency contracts. 


304 Part4 ¢ Reinforcement 


* Choice format. This format could include asking questions 
such as the following: “Which would you do a lot of hard 
work to get?” “Would you rather get things to eat, like 
chips, cookies, or popcorn, or get to do things like art proj- 
ects, play computer games, or go to the library?” (Northup, 
George, Jones, Broussard, & Vollmer, 1996, p. 204) 

¢ Rank-ordering. The learner can be given a list of items 
or stimuli and instructed to rank-order them from most to 
least preferred. 


For learners with limited language skills, pictures of 
items, icons, or, preferably, the actual stimuli can be presented 
(Clevenger & Graff, 2005). For example, a teacher, while point- 
ing to an icon, might ask a student, “Do you like to drink juice, 
use the computer, ride the bus, or watch TV?” Students simply 
nod yes or no. 

Surveys have been developed to assess students’ prefer- 
ences. For example, elementary school teachers might use the 
Child Reinforcement Survey, which includes 36 rewards in four 
categories: edible items (e.g., fruit, popcorn), tangible items 
(e.g., stickers), activities (e.g., art projects, computer games), 
and social attention (e.g., a teacher or friend saying, “I like that’) 
(Fantuzzo, Rohrbeck, Hightower, & Work, 1991). Other surveys 
are the School Reinforcement Survey Schedule for students in 
grades 4 through 12 (Holmes, Cautela, Simpson, Motes, & Gold, 
1998) and the Reinforcement Assessment for Individuals with 
Severe Disabilities (Fisher et al., 1996). 

Although asking for personal preferences is relatively 
uncomplicated, the procedure is not entirely foolproof with 
respect to confirming that a preferred choice will later serve as 
a reinforcer (Whitehouse et al., 2014). “Poor correspondence 
between verbal self-reports and subsequent behavior has been 
long noted and often demonstrated” (Northup, 2000, p. 335). 
Although a child might identify watching cartoons as a preferred 
event, watching cartoons may function as a reinforcer only when 
the child is at home on Saturday mornings, but not at Grandma’s 
house on Sunday night. 

Further, surveys may not differentiate accurately between 
what children claim to be high-preference and low-preference 
items for reinforcers. Northup (2000) found that preferences of 
children with attention-deficit hyperactivity disorder (ADHD) 
did not rise beyond chance levels when survey results were later 
compared to reinforcer functions. “The relatively high number 
of false positives and low number of false negatives again sug- 
gest that surveys may more accurately identify stimuli that are 
not reinforcers than those that are” (p. 337). Merely asking chil- 
dren their preferences once might lead to false positives (i.e., 
children may choose an event or stimulus as a reinforcer, but it 
may not be reinforcing). 


Asking Significant Others. A pool of potential reinforcers can 
be obtained by asking parents, siblings, friends, or caregivers 
to identify the activities, items, foods, hobbies, or toys that 
they believe the learner prefers. For example, the Reinforcer 
Assessment for Individuals with Severe Disabilities (RAISD) is 
an interview protocol that asks caregivers to identify preferred 
stimuli across visual, auditory, olfactory, edible, tactile, and 
social domains (Fisher et al., 1996). Significant others then 


rank-order the selected preferences based on likely high-versus 
low-preference items. Finally, significant others are asked to 
identify the conditions under which they predict that specific 
items might function as reinforcers (e.g., cookies with milk ver- 
sus just cookies alone). As a cautionary note, however, analysts 
should be aware that stimuli identified as highly preferred by a 
significant other’s opinion alone may not always function as an 
effective reinforcer. Fisher et al. (1996) remind us: “Caregiver 
opinion alone is insufficient for accurate reinforcer identifi- 
cation” (p.14). However, when caregiver opinion is combined 
with a structured interview that provides prompts, cues, and a 
broader array of potential reinforcers for the significant other 
to consider, the caregiver report can be a helpful supplement to 
other methods. 


Offering a Pretask Choice. In this method, the practitioner asks 
the participant to choose what he wants to earn for doing a task. 
The participant then chooses one item from two or three options 
presented (Piazza et al., 1996). All of the stimuli presented as 
pretask choices will have been identified as preferred stimuli by 
other assessment procedures. For instance, a teacher might make 
the following statement: “Robyn, when you finish your math 
problems, you may have 10 minutes to play Battleship with 
Martin, read quietly, or help Ms. Obutu prepare the social stud- 
ies poster. Which activity do you want to work for?” A learner’s 
choice of a consequence will not necessarily be a more effective 
reinforcer than one selected by the researcher or practitioner 
(Smith, Iwata, & Shore, 1995). 


Free Operant Observation 


The activities that a person engages in most often when able to 
choose freely from among behaviors will often serve as effec- 
tive reinforcers when made contingent on engaging in low- 
probability behaviors. Observing and recording what activities 
the target person engages in when she can choose during a 
period of unrestricted access to numerous activities is called free 
operant observation. A total duration measure of the time the 
person engages with each stimulus item or activity is recorded. 
The longer the person engages with an item, the stronger the 
inference that the item is preferred. 

Procedurally, the person has unconstrained and simultane- 
Ous access to a predetermined set of items or activities or to the 
materials and activities that are naturally available in the envi- 
ronment. There are no response requirements, and all stimulus 
items are available and within the person’s sight and reach. An 
item is never removed after engagement or selection. According 
to Ortiz and Carr (2000) and Karsten et al. (2011), free operant 
responding is less likely to produce problem behavior during 
the assessment that might otherwise be observed if a stimulus 
is removed. Free operant observations can be contrived or con- 
ducted in naturalistic settings. 


Contrived Free Operant Observation. Practitioners use con- 
trived observation to determine whether, when, how, and the 
extent to which the person engages with each of a predetermined 
set of activities and materials. The observation is contrived 
because the researcher or practitioner “salts” the environment 
with a variety of items that may be of interest to the learner. 
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Figure 11.12 Number of minutes Mike spent engaged in activities during 2 hours of 


free time after school. 


Activity 


Leisure reading 
Watch TV 
Phone with friends 


Play video games 


Play with constuction toy 


Minutes observed 


Free operant assessment presupposes that the person has 
had sufficient time to move about and explore the environment 
and has had the chance to experience each of the stimuli, mate- 
rials, or activities. Just prior to the free operant observation 
period, the learner is provided brief noncontingent exposure to 
each item. All of the items are then placed within view and easy 
access to the learner, who then has the opportunity to sample 
and choose among them freely. Observers record the total dura- 
tion of time that the learner engages with each stimulus item or 
activity. 


Naturalistic Free Operant Observation. Naturalistic observa- 
tions of free operant responding are conducted in the learner’s 
everyday environment (e.g., playground, classroom, home). As 
unobtrusively as possible, the observer notes how the learner 
allocates his time and records the number of minutes the learner 
devotes to each activity. For instance, Figure 11.12 shows how 
a teenager, Mike, distributed his time during 2 hours of free 
time each day after school. Mike’s parents collected these data 
by keeping a chart of the total number of minutes their son 
was engaged in each activity. The summary chart for the week 
shows that Mike played computer video games, watched televi- 
sion, and talked on the phone to his friends every day. On two 
different days Mike spent 10 minutes reading a library book, 
and he played with a new construction toy for a brief time on 
Wednesday. Two activities—watching television and playing 
video games—occurred the most often and for the longest dura- 
tion. If Mike’s parents wanted to apply the Premack principle 
introduced earlier in this chapter to increase the amount of time 
he spends reading for pleasure or playing with the construction 
toy (i.e., low-probability behaviors), they might make watching 
television or playing video games (1.e., high-probability behav- 
iors) contingent on a certain amount of time spent leisure read- 
ing or playing with the construction toy. 


Trial-based Methods 


In trial-based methods of stimulus preference assessment, 
stimuli are presented to the learner in a series of tests, and the 
learner’s responses to the stimuli are measured as an index or 
hierarchy of preference. One or more of three measures of the 
learner’s behavior are recorded in trial-based stimulus prefer- 
ence assessment: approach, contact (DeLeon & Iwata, 1996), 
and engagement with the stimulus (DeLeon, Iwata, Conners, & 
Wallace, 1999; Hagopian, Rush, Lewin, & Long, 2001; Roane, 


et al., 1998). Approach responses typically include any detect- 
able movement by the person toward the stimulus (e.g., eye 
gaze, head turn, body lean, hand reach), a contact is tallied each 
time the person touches or holds the stimulus, and engagement 
is a measure of the total time or percentage of observed intervals 
in which the person interacts with the stimulus (e.g., in which 
the person held a massager against her leg). An assumption is 
made that the more frequently the person approaches, touches or 
holds, or engages with a stimulus, the more likely the stimulus 
is preferred. As DeLeon and colleagues (1999) stated, “duration 
of item contact is a valid index of reinforcer value” (p. 114). 

Preferred stimuli are typically labeled along a hierarchy 
from high-preference (HP) to low-preference (LP) stimuli based 
on predetermined criteria (e.g., stimuli chosen 80% or more of 
the time are HP) (Carr et al., 2000; Northup, 2000; Pace et al., 
1985; Piazza et al., 1996). Implicit, but testable, assumptions are 
that a highly preferred stimulus will serve as a reinforcer and 
that preference stability is necessary before treatment can begin. 
Although these assumptions do not always hold (Higbee, Carr, 
& Harrison, 2000; Verriden & Roscoe, 2016), they have proven 
to be efficient and workable assumptions with which to begin. 

The many variations of trial-based stimulus preference 
assessment can be grouped by presentation method as single 
stimulus (successive choice), paired stimuli (forced choice), and 
multiple stimuli with or without replacement.” 


Single Stimulus. A single-stimulus presentation method, also 
called a successive choice or single-stimulus engagement prefer- 
ence method, represents the most basic assessment available for 
determining preference. Simply stated, a stimulus is presented 
by a trained clinician or teacher, and the person’s reaction to, or 
engagement with, the stimulus is noted. Engagement behaviors 
include touching, playing with, looking at, or holding the stim- 
ulus for a predetermined time. Presenting one stimulus at a time 
“may be well suited for individuals who have difficulty selecting 
among two or more stimuli” (Hagopian et al., 2001, p. 477). 
To conduct a single-stimulus assessment, target stimuli 
across all sensory systems (i.e., visual, auditory, vestibular, tac- 
tile, olfactory, gustatory, and multisensory) are made available 
to the participant for a brief period. The participant is allowed 
to sample the stimuli one at a time, engage with it briefly, and, 
if a toy, see how it works, before the formal assessment protocol 
begins. Next, the stimuli are presented one at a time in ran- 
dom order, and the person’s reaction to and/or engagement with 
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each stimulus is recorded (Logan et al., 2001; Pace et al., 1985). 
Approach, avoidance, or rejection responses are recorded in 
terms of occurrence (yes or no), count (e.g., number of touches 
per minute), or duration (i.e., time spent engaged with an item). 
Each item in the array of proposed stimuli is presented sequen- 
tially. For example, a mirror might be presented to determine 
the duration of time the participant gazes into it, touches it, or 
rejects the mirror altogether (i.e., pushes it away). Each item is 
presented several times, and the order of presentation should be 
varied. Once all stimuli have been presented, a graphical hier- 
archy is constructed with the stimuli listed on the x-axis and 
the percentage of engagement on the y-axis. By analyzing the 
data with a bar graph, stimuli that are ranked the highest are 
presumed to be the preferred reinforcers. !° 

Boutot and DiGangi (2018) used a single-stimulus assess- 
ment procedure to identify preferred and nonpreferred toys by 
Dominic, a 5-month-old infant with Down syndrome. If Dominic 
looked at the adult-activated toy for longer than 5 seconds, 
changed facial expressions while looking at the object (e.g., 
smiled), or produced a marked bodily movement (kicked his 
legs or waved his arms), the toy was deemed preferred. Using 
this method, the researchers ultimately identified three toys and 
later employed them as reinforcers to increase the percentage of 
time Dominic sustained his head in an elevated position during 
tummy time over a “No Toy” condition (See Figure 11.13). 
Boutet and DiGangi (2018) concluded that the single-stimulus 


assessment procedure could be used in an efficient, probe-like 
manner to identify preferred stimuli and that those stimuli could 
be used by researchers and parents to improve important mus- 
cular skills for an infant with Down syndrome. 


Paired Stimuli. Each trial in the paired-stimuli presentation 
method, also sometimes called the “forced choice” method, 
consists of the simultaneous presentation of two stimuli. The 
observer records which of the two stimuli the learner chooses. 
To conduct a paired-stimuli assessment, each stimulus is 
matched randomly with all other stimuli in the proposed group 
of stimuli to be compared (Fisher et al., 1992). The number in 
the group is at the discretion of the analyst. For example, Piazza 
and colleagues (1996) used 66 to 120 paired-stimuli trials to 
determine reinforcer preference. Data from a paired-stimuli 
assessment show how many times each stimulus is chosen. The 
stimuli are then rank-ordered from high to low preference. Items 
that received engagement, play, manipulation, or selection over 
80% of the pairings are deemed high-preference items. Low- 
preference items were selected at the 50% level. Pace and col- 
leagues (1985) found that paired-stimuli presentations yielded 
more accurate distinctions between high- and low-preference 
items than did single-stimulus presentations. Paired-stimulus 
preference assessment has the advantage of arranging all stimuli 
sequentially, thereby providing choices that otherwise would not 
be possible during a single-stimulus presentation format. Thus, 
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the likelihood of ultimately identifying reinforcers is higher 
(Paclawskyj & Vollmer, 1995). 

Because every possible pair of stimuli must be pre- 
sented, paired-stimuli assessing is likely to take more time than 
the simultaneous presentation of a series of multiple stimuli. 
However, DeLeon and Iwata (1996) argued that ultimately the 
paired-stimuli method may be more time efficient because “the 
more consistent results produced by the PS method may indi- 


cate that stable preferences can be determined in fewer, or even 
single, sessions” (p. 5204 


Multiple Stimuli. The multiple-stimuli presentation method is 
an extension of the paired-stimuli procedure developed by Fisher 
and colleagues (1992). The participant to be assessed chooses 
a preferred stimulus from an array of three or more stimuli 
(Windsor, Piche, & Locke, 1994). By presenting multiple stim- 
uli together, assessment time is reduced. For example, instead 
of presenting a series of trials consisting of all possible pairs of 
stimuli from a group of six stimuli and continuing until all pairs 
have been presented, all six stimuli are presented simultaneously. 

The two major variations of the multiple-stimuli preference 
assessment are multiple stimuli with replacement (MSWI) and 
multiple stimuli without replacement (MSWO). The difference 
between the two lies in which stimuli are removed or replaced in 
preparation for the next trial after the participant indicates a pref- 
erence among the displayed items. In the multiple stimuli with 
replacement procedure, the item chosen by the learner remains in 
the array and items that were not selected are replaced with new 
items. In the multiple stimuli without replacement procedure, the 
chosen item is removed from the array, the order or placement 
of the remaining items is structurally rearranged, and the next 
trial begins with a reduced number of items in the array (Brod- 
head, Al-Dubayan, Mates, Abel, & Brouwers, 2016).!? 

In any case, each trial begins by asking the person, “Which 
one do you want the most?” (Higbee et al., 2000), or instructing 
the person to “Choose one” (Ciccone, Graff, & Ahearn, 2005) 
and then continues until all items from the original array, or the 
gradually reducing array, have been selected. The entire sequence 
is usually repeated several times, although a single round of trials 
may identify stimuli that function as reinforcers (Carr et al., 2000). 

The stimuli presented in each trial might be tangible 
objects themselves, pictures of the items, or verbal descriptions. 
Comparisons between conducting stimulus preference assess- 
ments with tangible objects themselves versus pictures of those 
objects have been mixed. Whereas Brodhead et al. (2016) have 
shown that video MSWO may be a suitable complement to tan- 
gible MSWI, Higbee, Carr, and Harrison (1999) found that the 
tangible objects produced greater variation and distribution of 
preferences than the picture objects did. Cohen-Almeida, Graff, 
and Ahearn (2000) found that the tangible object assessment 
was about as effective as a verbal preference assessment, but the 
clients completed the verbal preference assessment in less time. 

In sum, MSWO assessment may identify stimuli that func- 
tion as reinforcers, may require fewer sessions or less time to 
complete than other variations of preference assessments, yields 
a hierarchy of student preference, and appears to be suitable for 
a wide range of students due to fast administration and accuracy 
in identifying potential reinforcers (Brodhead et al., 2016). 
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DeLeon and Iwata (1996) used an adaptation of the mul- 
tiple-stimuli and paired-stimuli presentations they described as a 
brief stimulus assessment to reduce the time needed to determine 
stimulus preference. Basically, in the brief stimulus assessment, 
once a particular stimulus item is chosen, that item is nor returned 
to the array. Subsequent trials present a reduced number of items 
from which to choose (Carr et al., 2000; DeLeon et al., 2001; Roane 
et al., 1998). DeLeon and Iwata (1996) found that multiple stimuli 
without replacement identified preferred items in approximately 
half the time that a paired-stimulus comparison procedure did. 
According to Higbee and colleagues (2000), “With a brief stimu- 
lus preference procedure, practitioners have a method for reinforcer 
identification that is both efficient and accurate” (pp. 72-73). 


Guidelines for Selecting and Using Stimulus 
Preference Assessments 


Practitioners can combine assessment procedures to compare 
single versus paired, paired versus multiple, or free operant 
versus trial-based methods (Ortiz & Carr, 2000).'* In day-to- 
day practice, brief stimulus presentations using comparative 
approaches might facilitate reinforcer identification, thereby 
speeding up interventions using those reinforcers. For teaching 
purposes and to launch instruction as soon as possible, Karsten 
et al. (2011) recommend using FO and/or MSWO options, as 
they are efficient and likely to provide sufficient information 
on a reinforcer hierarchy. Tung, Donaldson, and Kang (2017) 
also state that the FO procedure may be the method of choice 
when problem behavior is anticipated when tangible stimuli are 
removed during the assessment process. 

In summary, an important goal of stimulus preference 
assessments is to identify stimuli that are most highly preferred 
and thus likely to function as reinforcers (Kelly, Roscoe, Hanley, 
& Schlichenmeyer, 2014). Each method for assessing prefer- 
ence has advantages and limitations with respect to identifying 
preferences (Karsten et al., 2011; Roane et al., 1998). Practi- 
tioners may find the following guidelines helpful when conduct- 
ing stimulus preference assessments (DeLeon & Iwata, L296; 
Gottschalk et al., 2000; Higbee et al., 2000; Ortiz & Carr, 2000; 
Roane et al.. 1998: Roscoe, Iwata, & Kahng, 1999): 


* Monitor the learner’s activities during the time before the 
stimulus preference assessment session to be aware of 
EOs that may affect the results. 

¢ Use stimulus preference assessment options that bal- 
ance the cost-benefit of brief assessments (but the pos- 
sible occurrence of false positives) with more prolonged 
assessments, which, while they may ultimately identify 
more highly preferred stimuli (and reinforcers), delay 
intervention. 

¢ Balance using a stimulus preference method that may 
yield a ranking of preferred stimuli against an assessment 
method that occurs without rankings, but occurs more fre- 
quently, to counteract shifts in preference. 

* Conduct a brief stimulus preference assessment with 
fewer items in an array (i.e., 3 instead of 5 or more) when 
assessment time is limited. 

¢ Combine data from multiple assessment methods and 
sources of stimulus preference (e.g., asking the learner 
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and significant others, free operant observation, and trial- 
based methods). 
* Verify that stimuli identified by any efficient SPA method 
ultimately serve that function as a reinforcer. 
* Recognize that whereas stimulus preference may be stable 
for individuals over time (Kelley et al., 2016), the possi- 
bility exists that stimulus preference will shift, and further 
SPAs may be necessary. 
To facilitate SPA assessment with respect to time, con- 
sider using categories of stimuli. Edible categories might 
include chocolate, gummy treats, crunchy snacks, and so 
forth. Toy categories might include, for example, trucks, 
cars, and fire engines. It’s likely that if SPA shows a pref- 
erence for one item within a category, other unassessed 
items within that category might also serve as reinforcers, 
even if they are not initially formally assessed. 


Reinforcer Assessment 


The only way to tell whether or not a given event is rein- 
forcing to a given organism under given conditions is to 
make a direct test. 

—B. F. Skinner (1953, pp. 72-73) 


Highly preferred stimuli may not always function as reinforcers 
(Higbee et al., 2000); even the candy that a child pried out of 
Sidman’s hand may not have functioned as reinforcer under cer- 
tain conditions. Conversely, least preferred stimuli might serve 
as reinforcers under some conditions (Gottschalk et al., 2000). 
The only way to know for sure whether a given stimulus serves 
as a reinforcer is to present it immediately following the occur- 
rence of a behavior and note its effects on responding. 

Reinforcer assessment refers to a variety of direct, data- 
based methods used to present one or more stimuli contingent 
on a target response and then measuring the future effects on the 
rate of responding. Researchers and practitioners have developed 
reinforcer assessment methods to determine the relative effects of 
a given stimulus as reinforcement under different and changing 
conditions and to assess the comparative effectiveness of multi- 
ple stimuli as reinforcers for a given behavior under specific con- 
ditions. Reinforcer assessment or analysis is often accomplished 
by presenting stimuli suspected of being reinforcers either “in the 
moment” or contingent on responding within concurrent, multi- 
ple, or progressive-ratio reinforcement schedules. !4 


In-the-Moment Reinforcer Analysis 


In-the-moment reinforcer analysis is conducted by practitioners 
when a learner’s response is followed immediately by the pre- 
sentation of a stimulus change—in this case, a stimulus pre- 
sumed to be a reinforcer—and the effect is noted on increases 
in the future occurrence of similar responses (Leaf et al., 2018). 
In practical applications, the analyst, with immediate access to 
numerous presumed reinforcers, makes an instantaneous judg- 
ment about which of those “reinforcers” is likely to function as 
such, and delivers that reinforcer to the learner. When deciding 
which reinforcer to deliver, the analyst considers the learner’s 
current affect (smiling, frowning), previous interactions with 
that stimulus item (approaching, engaging, contacting), past 


data showing the effectiveness of that stimulus with respect to 
performance (improved or decreased), interactions with similar 
items in the past (furry dog, fleece-covered cat), and other past 
observational preferences (Leaf et al., 2012). 

Leaf et al. (2018) compared formal paired-stimulus pref- 
erence assessments with “in-the-moment reinforcer analysis” 
used to determine rates of learning a simple picture-labeling 
task by two preschool boys with autism. Procedurally, the 
teachers conducted a paired-stimulus preference assessment 
to determine the top three preferred items. During teaching, if 
the boys answered a question correctly, they received access 
to one of the top three items for 10 seconds. If they answered 
incorrectly, they received feedback that their response was incor- 
rect. During the in-the-moment condition, the identical pool of 
paired-stimulus condition items was available, but the teachers 
had complete discretion about which item to deliver contingent 
on a correct response. They could deliver the same reinforcer 
during all learning trials or change the reinforcer after each trial. 
The determining factor was their moment-to-moment judgment 
based on the factors cited above. 

The results of Leaf et al. showed no clear difference in 
skill acquisition for the boys—they learned equally well in both 
conditions, but the in-the-moment procedure was far more effi- 
cient with respect to the division of time spent in assessment 
versus teaching conditions. Leaf and colleagues concluded by 
stating: “When teaching individuals with learning difficulties, 
any incremental gain in speed of acquisition through formal 
preference assessments must be weighed against the reduced 
teaching time that remains after completing the assessments, 
which is a point that seems to be too often overlooked” (p. 23). 


Concurrent Schedule Reinforcer Assessment 


When two or more contingencies of reinforcement operate 
independently and simultaneously for two or more behaviors, 
a concurrent schedule of reinforcement is in effect. When used 
as a vehicle for reinforcer assessment, a concurrent schedule 
arrangement essentially pits two stimuli against each other to 
see which will produce the larger increase in responding when 
presented as a consequence for responding. If a learner allo- 
cates a greater proportion of responses to one component of 
the concurrent schedule over the other, the stimulus used as a 
contingent consequence for that component is the more effec- 
tive reinforcer. Using a concurrent schedule in this way shows 
the relative effectiveness of high-preference (HP) and low- 
preference (LP) stimuli as reinforcers (Koehler, Iwata, Roscoe, 
Rolider, & O’Steen, 2005; Piazza et al., 1996). 

Concurrent schedules may also be used to determine dif- 
ferences between relative and absolute reinforcement effects of 
stimuli. That is, will an LP stimulus now presented contingently 
in the absence of the HP stimulus serve as a reinforcer? Roscoe 
and colleagues (1999) used concurrent schedules to compare the 
effects of HP and LP stimuli as reinforcers for eight adults with 
developmental disabilities. Following the preference assess- 
ments, a concurrent schedule of reinforcement was established 
using the high-preference and low-preference stimuli. The target 
response was pressing either of two micro switch panels. Each 
panel was a different color. Pressing a panel would illuminate 


a small light in the center of the panel. A training condition 
took place prior to baseline to establish panel pressing in the 
subjects’ repertoires and to expose them to the consequences of 
responding. During baseline, pressing either panel resulted in no 
programmed consequences. During the reinforcement phase, an 
HP stimulus was placed on a plate behind one of the panels, and 
an LP stimulus was placed on a plate behind another panel. All 
responses to either panel resulted in the participant immediately 
receiving the item on the plate behind the respective panel (i.e., 
an FR | schedule of reinforcement). 

Under the concurrent schedule of reinforcement that 
enabled participants to choose reinforcers on the same FR 1 
schedule, the majority of participants allocated most of their 
responding to the panel that produced the HP stimulus as rein- 
forcement (e.g., see results for Sean, Peter, Matt, and Mike on 
Figure 11.14). However, these same participants, when later pre- 
sented with the opportunity to obtain LP stimuli as reinforcers 
on a single-schedule contingency (i.e., only one panel to push), 
showed increased levels of responding over baseline, similar to 
those obtained with the HP stimuli in the concurrent schedule. 
The study by Roscoe and colleagues (1999) demonstrated how 
concurrent schedules may be used to identify the relative effects 
of stimuli as reinforcers. The study also showed that the poten- 
tial effects of a stimulus as a reinforcer may be masked or over- 
shadowed when that stimulus is pitted against another stimulus 
on a concurrent schedule. In such cases, a potentially reinforcing 
stimulus might be abandoned prematurely. 


Multiple Schedule Reinforcer Assessment 


A multiple schedule of reinforcement consists of two or more 
component schedules of reinforcement for a single response 
with only one component schedule in effect at any given time. 
A discriminative stimulus (S) signals the presence of each 
component schedule, and that stimulus is present as long as the 
schedule is in effect. One way that a multiple schedule could 
be used for reinforcer assessment would be to present the same 
stimulus event contingent (i.e., response dependent) on each 
occurrence of the target behavior in one component of the 
multiple schedule and on a fixed-time schedule (i.e., response 
independent) in the other component. For example, if a prac- 
titioner wanted to use a multiple schedule to assess whether 
social attention functioned as a reinforcer, she would provide 
social attention contingent on occurrences of cooperative play 
when one component of the multiple schedule is in effect, and 
during the other component the practitioner would present the 
same amount and kind of social attention except on a fixed-time 
schedule, independent of cooperative play (1.e., noncontingent 
reinforcement). The teacher could apply the response-depen- 
dent schedule during the morning play period, and the response- 
independent schedule during the afternoon play period. If social 
attention functioned as reinforcement, cooperative play would 
likely increase over its baseline rate in the morning periods, 
and because of no relationship with cooperative play, attention 
would likely have no effect in the afternoon period. This sit- 
uation follows a multiple schedule because there is one class 
of behavior (i.e., cooperative play), a discriminative stimulus 
for each contingency in effect (i.e., morning and afternoon 
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Figure 11.14 Responses per minute during concurrent- 
schedule and single-schedule baseline and reinforcement 
conditions for four adults with intellectual disabilities. 


“Relative Versus Absolute Reinforcement Effects: Implications for 
Preference Assessments” by E. M. Roscoe, B. A. Iwata, and S. Kahng, 
1999, Journal of Applied Behavior Analysis, 32, p. 489. Copyright 1999 
by the Society for the Experimental Analysis of Behavior, Inc. 


play periods), and different conditions for reinforcement 
(i.e., response dependent and response independent). 


Progressive-Ratio Schedule Reinforcer Assessment 


Stimulus preference assessments with low response require- 
ments (e.g., FR 1) may not predict the effectiveness of the 
stimulus as a reinforcer when presented with higher response 
requirements (e.g., on an FR 10 schedule, a student must com- 
plete 10 problems to obtain reinforcement). As DeLeon, Iwata, 
Goh, and Worsdell (1997) stated: 


Current assessment methods may make inaccurate predic- 
tions about reinforcer efficacy when the task used in train- 
ing regimens requires either more responses or more effort 
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before the delivery of reinforcement . . . for some classes 
of reinforcers, simultaneous increases in schedule require- 
ments may magnify small differences in preferences that 
are undetected when requirements are low. In such cases, 

a stimulus preference assessment involving low response 
requirements (FR1) schedules does not accurately predict 
the relative potency of reinforcers under increased response 
requirements. (pp. 440, 446) 


Progressive-ratio schedules provide a framework for 
assessing the relative effectiveness of a stimulus as a reinforcer 
as response requirements increase. In a progressive-ratio sched- 
ule of reinforcement, the response requirements for reinforce- 
ment are increased systematically over time independent of the 
participant’s behavior. In a progressive-ratio schedule, the prac- 
titioner gradually requires more responses per presentation of 
the preferred stimulus until a breaking point is reached and the 
response rate declines (Roane, Lerman, & Vorndran, 2001). For 
example, initially each response produces reinforcement (FR 1), 
and then reinforcement is delivered after every 2nd response 
(FR 2), then perhaps after every Sth, 10th, and 20th response 
(FR 5, FR 10, and FR 20). At some point, a preferred stimulus 
may no longer function as a reinforcer (Tustin, 1994). 


DeLeon and colleagues (1997) used a progressive ratio 
within a concurrent schedule to test the relative effectiveness 
of two similarly preferred stimuli (e.g., cookie and cracker) and 
two dissimilar stimuli (e.g., drink and balloon) as reinforcers for 
micro switch panel pressing for Elaine and Rick, two adults with 
intellectual disabilities. One panel was blue and one was yellow. 
The experimenters placed two reinforcers on separate plates, and 
put one plate behind each of the panels. Each trial (24 per ses- 
sion for Rick; 14 per session for Elaine) consisted of the subject 
pushing either one of the panels and immediately receiving the 
item on the plate behind that panel. During the first phase, an 
FR 1 schedule was used (i.e., each response produced the item 
on the plate). Later, the response requirement for obtaining the 
items was gradually increased (FR 2, FR 5, FR 10, and FR 20). 

Elaine and Rick made responses that produced the two 
dissimilar items at roughly the same rates during the FR | phase 
(see the top two tiers in Figure 11.15). As response requirements 
for receiving the dissimilar stimuli increased, Elaine and Rick 
continued to evenly allocate responding between the two panels. 
However, when initially equivalent and similar reinforcers (food 
in FR 1) were compared under increasing schedule requirements, 
the differences in responses rates on the two panels revealed clear 
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Figure 11.15 Responses per minute during 
¥: concurrent-schedule and singie-schedule baseline 
: and reinforcement conditions for two adults with 
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and consistent preferences (see the bottom two tiers in Figure 
11.15). For example, when Elaine needed to work more to receive 
food, she allocated the majority of her responses to the panel that 
produced chips rather than the one that produced pretzels. In the 
same way, as the number of responses required to receive rein- 
forcement increased, Rick showed a clear preference for cookies 
over crackers. These results suggest that “for some classes of 
reinforcers, simultaneous increases in schedule requirements may 
magnify small differences in preference that are undetected when 
requirements are low” (DeLeon et al., 1997, p. 446). 

Increasing response requirements within a concurrent 
schedule may reflect the effects of increasing response require- 
ments on the choice between reinforcers and also may reveal 
whether and under what conditions two reinforcers are substi- 
tutable for each other. If two reinforcers serve the same function 
(i.e., are made effective by the same establishing operation), an 
increase in the price (i.e., response requirement) for one of the 
reinforcers will lead to a decreased consumption of that item if 
a substitutable reinforcer is available (Green & Freed, 1993). 
DeLeon and colleagues (1997) used a hypothetical person with 
a slight preference for Coke over Pepsi as an analogy to explain 
the results shown in Figure 11.15. 


Assuming that Coke and Pepsi are both available for $1.00 
per serving and that a person has only a slight preference 
for Coke, the individual may allocate choices rather evenly, 
perhaps as a function of periodic satiation for the preferred 
item, but with slightly more overall selections of Coke. 
Now assume that the cost of each is increased to $5.00 per 
serving. At this price, the preference for Coke is likely to 
be expressed. By contrast, a similar arrangement involving 
Coke and bus tokens may produce different results. Again, 
at $1.00 per item, roughly equal selection between the two 
options would not be surprising, assuming that the estab- 
lishing operation for each dictates that both are momentarily 
equally valuable. However, these items serve distinctly 
different functions and are not substitutable; that is, the per- 
son is not free to trade one for the other and to continue to 
receive functionally similar reinforcement at the same rate. 
The person is more likely to continue choosing equally, even 
when the price for both reinforcers increases substantially. 
The same might be said for the results obtained in 
the present study. When choices involved two substitutable 
items, such as a cookie and a cracker, concurrent increases 
in the cost of each may have “forced” the expression of 
slight preference for one of the items. However, when 
reinforcers that were unlikely to be substitutes, such as a 
cookie and a massager, were concurrently available and 
equally preferred, increases in cost had little effect on pref- 
erence. (pp. 446-447) 


Although Stimulus X and Stimulus Y may each function 
4s reinforcers when task demands are low, or when the rein- 
forcement schedule is dense, when the task demands increase or 
when the schedule becomes leaner (i.e., more responses required 
per reinforcement), participants may choose only Stimulus Y. 
DeLeon and colleagues (1997) pointed out that practitioners 
who are alert to these relationships might be more skeptical 
in believing that original preferences will be sustained under 
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changing environmental conditions, and judicious in how they 
plan reinforcement delivery relative to task assignment once 
intervention is under way. That is, it might be better to save 
some types of preferred stimuli for when task demands are high 
rather than substituting them for other equally preferred stimuli 
when task demands are low. 


CONTROL PROCEDURES FOR POSITIVE 
REINFORCEMENT 


Positive reinforcement control procedures are used to manip- 
ulate the contingent presentation of a potential reinforcer and 
observe any effects on the future frequency of behavior. Control, 
as the term is used here, requires an experimental demonstration 
that the presentation of a stimulus contingent on the occurrence 
of a target response functions as positive reinforcement. Con- 
trol is demonstrated by comparing response rates in the absence 
and presence of a contingency, and then showing that with the 
absence and presence of the contingency the behavior can be 
turned on and off, or up and down (Baer, Wolf, & Risley, 1968). 
Historically, researchers and practitioners have used the reversal 
technique as the major control technique for positive reinforce- 
ment. Briefly, the reversal technique includes two conditions 
and a minimum of four phases (i.e., ABAB). In the A condition, 
the behavior is measured over time until it achieves stability 
in the absence of the reinforcement contingency. The absence 
of the contingency is the control condition. In the B condition, 
the reinforcement contingency is presented; the same target 
behavior continues to be measured to assess the effects of the 
stimulus change. The presence of the reinforcement contingency 
is the experimental condition. If the rate of responding increases 
in the presence of the contingency, the analyst then withdraws 
the reinforcement contingency and returns to the A and B con- 
ditions to learn whether the absence and presence of the contin- 
gency will turn the target behavior down and up. 

However, using extinction as the control condition during 
the reversal phase presents practical and conceptual problems. 
First, withdrawing reinforcement may result in extinction- 
produced side effects (e.g., an initial increase in response rate, 
emotional responses, aggression—see Chapter 24) that affect 
the demonstration of control. Second, in some situations it may 
be impossible to withdraw the reinforcement contingency com- 
pletely. For example, it is unlikely that a teacher could completely 
remove teacher attention during the A condition. In addition to 
these problems, Thompson and Iwata (2005) noted that, although 


extinction has often been successful in reversing the behay- 
ioral effects of positive reinforcement, its use as a control 
procedure presents interpretive difficulties. Essentially, 
extinction does not adequately isolate the reinforcement 
contingency as the variable controlling the target response, 
because mere stimulus presentation cannot be ruled out as 
an equally viable explanation. (p. 261, emphasis added) 


According to Thompson and Iwata (2005), “the ideal control 
procedure for positive reinforcement eliminates the contingent rela- 
tion between the occurrence of the target response and the presen- 
tation of the stimulus while controlling for the effects of the mere 
stimulus presentation” (p. 259). They reviewed the effectiveness of 
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three variations of the reversal technique as control procedures for 
determining reinforcement: noncontingent reinforcement (NCR), 
differential reinforcement of other behavior (DRO), and differen- 
tial reinforcement of alternative behavior (DRA).!> 


Noncontingent Reinforcement 


Noncontingent reinforcement (NCR) is the presentation of a 
potential reinforcer on a fixed-time (FT) or variable-time (VT) 
schedule independent of the occurrence of the target behavior. 
The response-independent presentation of the potential rein- 
forcer eliminates the contingent relation between the target 
behavior and the stimulus presentation, while allowing any 
effects of the stimulus presentation alone to be detected. Thus, 
NCR meets Thompson and Iwata’s (2005) criteria for an ideal 
control procedure for positive reinforcement. 

The NCR reversal technique entails a minimum of five 
phases (ABCBC): A 1s a baseline condition; B is an NCR con- 
dition, where the potential reinforcer is presented on a fixed- or 
variable-interval schedule independent of the target behavior; 
and C is a condition in which the potential reinforcer is pre- 
sented contingent on the occurrence of the target behavior. The B 
and C conditions are then repeated to learn whether the level of 
responding decreases and increases as a function of the absence 
and presence of the response-consequence contingency. The 
quality, amount, and rate of reinforcement should be approxi- 
mately the same during the contingent and noncontingent B and 
C conditions of the analysis. 

NCR often produces persistent responding, perhaps 
because of accidental reinforcement that sometimes occurs with 
a response-independent schedule, or because similar EOs and 
antecedent stimulus conditions evoke the persistent responding. 
Whatever the cause, persistent responding is a limitation of the 
NCR control procedure because it makes achieving a reversal 
effect (reduced responding) more time-consuming than the 
reversal technique with extinction. Achieving the effect may 
require lengthy contact with the NCR schedule. 


Differential Reinforcement of Other Behavior 


A practitioner using differential reinforcement of other behav- 
tor (DRO) delivers a potential reinforcer whenever the target 
behavior has not occurred during a set time interval or at a 
specific point in time. The DRO reversal technique includes a 
minimum of five phases (ABCBC): A is a baseline condition; 
B is areinforcement condition, in which the potential reinforcer 
is presented contingent on the occurrence of the target behav- 
ior; and C is the DRO control condition in which the poten- 
tial reinforcer is presented contingent on the absence of the 
target behavior. The analyst then repeats the B and C condi- 
tions to determine whether the level of responding decreases 
and increases as a function of the absence and presence of the 
response-consequence contingency. 

The DRO schedule allows for the continued presentation 
of the reinforcement contingency during the reversal phases of 
the control procedure. In one condition, the contingency is active 
with occurrences of the target behavior. In another condition, the 
contingency is active for the omission of the target behavior. The 
DRO control procedure may produce the reversal effect in less 


time than the NCR schedule, perhaps because of the elimination 
of accidental reinforcement of the target behaviors. 


Differential Reinforcement of 
Alternative Behavior 


When differential reinforcement of an alternative behavior (DRA) 
is used as a control condition, the potential reinforcer is presented 
contingent on occurrences of a desirable alternative to the target 
behavior.'° The DRA reversal technique includes a minimum of 
five phases (ABCBC): A is a baseline condition; B is a reinforce- 
ment condition, in which the potential reinforcer is presented 
contingent on the occurrence of the target behavior; and C is a 
condition in which the potential reinforcer is presented contin- 
gent on the occurrence of an alternative behavior (i.e., DRA). The 
analyst will then repeat phases B and C to ascertain whether the 
level of responding decreases and increases as a function of the 
absence and presence of the response—consequence contingency. 

Thompson and Iwata (2005) summarized the limitations 
of using DRO and DRA as control conditions procedures to test 
for positive reinforcement: 


[DRO and DRA] introduce a new contingency that was 
not present in the original experimental arrangement. As a 
result, reductions in the target response under a contingency 
reversal might be attributed to either (a) termination of the 
contingency between the target response and the reinforcer 
or (b) introduction of reinforcement for the absence of 

the target response or for the occurrence of a competing 
response. In addition, given that reinforcement is provided 
contingent on some characteristic of responding during 

the contingency reversal, it may be difficult to control for 
the rate of stimulus presentation across experimental and 
control conditions. If responding is not quickly reduced 
(DRO) or reallocated toward responses that produce rein- 
forcement (DRA), the rate of reinforcement in the control 
condition may be low relative to the rate of reinforcement 
in the experimental conditions. When this occurs, the 
contingency-reversal strategy is functionally similar to the 
conventional extinction procedure. (p. 267) 


Given the considerations for the reversal technique with 
extinction, and its three variations, Thompson and Iwata (2005) 
concluded that NCR offers the most thorough and unconfounded 
demonstration of the effects of positive reinforcement. 


USING REINFORCEMENT EFFECTIVELY 


We offer practitioners nine guidelines for applying positive rein- 
forcement effectively. These guidelines come from three main 
sources: the research literatures of the experimental analysis of 
behavior, applied behavior analysis, and our personal experiences, 


Set an Easily Achieved Initial Criterion 
for Reinforcement 


A common mistake in applications of reinforcement is setting 
the initial criterion for reinforcement too high, which prohib- 
its the learner’s behavior from contacting the contingency. To 


use reinforcement effectively, practitioners should establish an 
initial criterion so that the participant’s first responses produce 
reinforcement, and then increase the criterion for reinforcement 
gradually as performance improves. Heward (1980) suggested 
the following method for establishing initial criteria for rein- 
forcement based on the learner’s level of responding during 
baseline (see Figure 11.16). 


For a behavior you wish to increase, set the initial criterion 
higher than the child’s average baseline performance and 
lower than or equal to his best performance during base- 
line. For a behavior you want to decrease in frequency, the 
initial criterion for reinforcement should be set below the 
child’s average performance during baseline and greater 
than or equal to his lowest baseline performance. (p. 7) 


Use High-Quality Reinforcers of Sufficient 
Magnitude 


Reinforcers that maintain responding on simple tasks may not 
have the potency to produce similar levels of responding on 
more difficult or longer tasks. Practitioners will likely need to 
use a reinforcer of higher quality for behaviors that require more 
effort or endurance. A highly preferred stimulus chosen during 
preference assessments sometimes functions as a high-quality 
reinforcer. Neef, Mace, Shea, and Shade (1992), for example, 
found that behaviors that received a lower reinforcer rate but 
a higher quality reinforcer increased in occurrence, whereas 
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behaviors that received a higher reinforcer rate with a lower 
quality reinforcer decreased in occurrence. Reinforcer quality 
is relative also to other consequences for responding currently 
available to the learner. 

Applied behavior analysts define the magnitude (or 
amount) of a reinforcer as (a) the duration of time for access to 
the reinforcer, (b) the number of reinforcers per unit of time (i.e., 
reinforcer rate), or (c) the intensity of the reinforcer. Increases 
in reinforcer magnitude may correlate with an increased effec- 
tiveness of the behavior-reinforcer relation. However, the effects 
of reinforcer magnitude are not well understood because “few 
applied studies have examined the effects of magnitude on 
responding in a single-operant arrangement” (Lerman, Kelly, 
Vorndran, Kuhn, & LaRue, 2002, p. 30). Consideration of how 
much reinforcement to use should follow the maxim “Reinforce 
abundantly, but don’t give away the store.” We suggest that the 
amount of reinforcement be proportional to the quality of the 
reinforcer and the effort required to emit the target response. 


Use Varied Reinforcers to Maintain Potent 
Establishing Operations 


Reinforcers often decrease in effectiveness with frequent use. 
Presenting an overabundance of a specific reinforcer is likely to 
diminish the momentary effectiveness of the reinforcer due to 
satiation. Practitioners can minimize satiation effects by using a 
variety of reinforcers. If reading a specific book on sports func- 
tions as a reinforcer and the teacher relies solely on this reinforcer, 


Figure 11.16 Using data from learners’ baseline performance to set initial criteria for 


reinforcement. 


The criterion-setting formulas are 


For increasing behavior: 


baseline average < initial criterion = highest performance during baseline 


For decreasing behavior: 


baseline average > initial criterion = lowest performance during baseline 


Performance 


Examples 


Baseline Range for 


Lowest Highest 
14 min. 


9 letters 


Target Behavior Goal 
2 min. 
4 letters 


Playing alone Increase 


Identifying letters of Increase 


the alphabet 


Number of leg 
exercises completed 


Increase D2 


Percentage of math Increase 
problems correctly 


solved 


Number of typing Decrease 28 


errors in one letter 


Number of calories Decrease 3,980 


consumed per day 


Average 
6 min. 
5 letters 


22 


2,950 


Initial Criterion 
I-14 mine 
6-9 letters 


16-21 


2,260-2,900 


From: “A Formula for Individualizing Initial Criteria for Reinforcement,” by W. L. Heward, 1980, Exceptional 
Teacher, 1(9), p. 8. Used by permission. 
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ultimately reading that book may no longer produce reinforce- 
ment. Conversely, known reinforcers that are not always available 
may have increased effectiveness when they are reintroduced. If a 
teacher has demonstrated that “being first in line” is a reinforcer, 
but uses this reinforcer only once per week, the reinforcement 
effect will be greater than if “being first in line” is used frequently. 

Varying reinforcers may enable less preferred stim- 
uli to function as reinforcers. For example, Bowman, Piazza, 
Fisher, Hagopian, and Kogan (1997) found that some learners 
responded better to a variety of less preferred stimuli as com- 
pared to a continuous access to a single, more highly preferred 
stimulus. Also, using a variety of reinforcers may keep the 
potency of any one particular reinforcer higher. For example, 
Egel (1981) found that students’ correct responding and on-task 
behavior were higher when the students had access to one of 
three randomly selected reinforcers across trials versus a con- 
stant reinforcement condition in which one of the stimuli was 
presented following each successful trial. Even within a session, 
teachers could let students select a variety of consequences from 
a menu. Similarly, varying a property of a reinforcer may keep 
its reinforcing potency for a longer time. If comic books are used 
as reinforcers, having several different genres of comic books 
available is likely to maintain their potency. 


Use Direct Rather Than Indirect Reinforcement 
Contingencies When Possible 


With a direct reinforcement contingency, emitting the target 
response produces direct access to the reinforcer; the contin- 
gency does not require any intervening steps. With an indirect 
reinforcement contingency, the response does not produce rein- 
forcement directly. The practitioner presents the reinforcer. 
Research suggests that direct reinforcement contingencies may 
enhance performance (Koegel & Williams, 1980; Williams, 
Koegel, & Egel, 1981). Thompson and Iwata (2000), for exam- 
ple, linked the definitions of direct and indirect contingencies to 
the difference between automatic reinforcement (i.e., direct) and 
socially mediated contingencies (i.c., indirect) and summarized 
their research on response acquisition under direct and indirect 
contingencies of reinforcement this way: 


Under both contingencies, completion of identical tasks 
(opening one of several types of containers) produced 
access to identical reinforcers. Under the direct contin- 
gency, the reinforcer was placed inside the container to be 
opened; under the indirect contingency, the therapist held 
the reinforcer and delivered it to the participant upon task 
completion. One participant immediately performed the 
task at 100% accuracy under both contingencies. Three 
participants showed either more immediate or larger 
improvements in performance under the direct contingency. 
The remaining two participants showed improved perfor- 
mance only under the direct reinforcement contingency. 
Data taken on the occurrence of “irrelevant” behaviors 
under the indirect contingency (e.g., reaching for the 
reinforcer instead of performing the task) provided some 
evidence that these behaviors may have interfered with task 
performance and that their occurrence was a function of 
differential stimulus control. (p. 1) 


Whenever possible, practitioners should use direct rein- 
forcement contingency, especially with learners having limited 
behavioral repertoires. 


Combine Response Prompts 
and Reinforcement 


Response prompts are supplementary antecedent stimuli used 
to occasion a correct response in the presence of an S? that 
will eventually control the behavior. Applied behavior analysts 
give response prompts before or during the performance of a 
target behavior. The three major forms of response prompts are 
instructions, modeling, and physical guidance. 

Instructions describing the contingency may function as 
a motivating operation for learners with verbal skills, making 
it more likely they will contact the reinforcer more quickly. For 
example, Mayfield and Chase (2002) told college students who 
were learning five basic algebra rules that they would earn money 
for correct answers and not be penalized for incorrect answers. 

Bourret, Vollmer, and Rapp (2004) used verbal response 
prompts during an assessment of the vocal verbal mand reper- 
toires of three participants with autism. 


Each vocalization assessment session consisted of 10 trials, 
each | min in duration. A nonspecific prompt [describing 
the contingency] was delivered (e.g., “If you want this, ask 
me for it”) 10 s after the onset of the trial. A prompt includ- 
ing a model of the complete targeted utterance (e.g., “If 
you want this, say ‘chip’ ”) was delivered 20 s into the trial. 
The participant was prompted to say just the first phoneme 
of the targeted response (e.g., “If you want this, say ‘ch’ ”) 
30 s after the initiation of the trial. (pp. 131-132) 


Chapter 17 provides further discussion of response prompts, 
including specific procedures for combining response prompts with 
reinforcement, and additional examples of using verbal instruction, 
modeling, and physical guidance response prompts. 


Initially Reinforce Each Occurrence of 
the Behavior Initially 


Provide reinforcement for each occurrence of the target behavior 
(i.e., continuous reinforcement) to strengthen behavior, primar- 
ily during the initial stages of learning a new behavior. After the 
behavior is established, gradually thin the rate of reinforcement 
so that some, but not all, occurrences of the behavior are rein- 
forced (i.e., intermittent reinforcement). For example, a teacher 
might initially reinforce each correct response to sight words 
printed on flash cards and then use a ratio schedule to thin rein- 
forcement. To firm the responses after the initial learning, pro- 
vide reinforcement following two correct responses for a few 
trials, then following each set of four correct responses, and so 
on. Hanley and colleagues (2001) gradually shifted from a very 
dense fixed interval (FI) 1-sec schedule of reinforcement (on an 
FI schedule, the first target response following the end of the 
interval produces reinforcement) to thinner schedules with the 
following increments of intervals: 2 sec, 4 sec, 8 sec, 16 sec, 
25 sec, 35 sec, 46 sec, and finally to an FI 58-sec sched- 
ule. Chapter 13 provides more information on using continuous 
and intermittent reinforcement. 


Provide Contingent Attention and 
Descriptive Praise 


As discussed earlier in this chapter, social attention and praise 
are powerful reinforcers for most people. However, behavioral 
improvements following praise often involve something more, 
or altogether different from, the direct effects of reinforcement. 
Michael (2004) discussed the common conceptual mistake of 
assuming that increased responding following praise and atten- 
tion is a function of reinforcement. 


Consider the common use of descriptive praise, providing 
some general sign of social approval (a smile plus some 
comment such as “Good work!”’) and, in addition, a brief 
description of the behavior that is responsible for the 
approval (“I like the way you’re. .. !’’). When such praise 
is provided to a normally verbal person over 5 or 6 years 
of age, it probably functions as a form of instruction or as a 
rule, much as if the praiser had said, “If you want my con- 
tinued approval you have to...” For example, a factory 
supervisor walks up to an employee who is cleaning up 

an oil spill on the factory floor, smiles broadly, and says, 
“George, I really like the way you’re cleaning up that spill 
before anyone steps in it. That’s very considerate of you.” 
Now suppose that George cleans up spills from that time 
forward—a rather large change in behavior considering that 
it was followed by only a single instance of reinforcement. 
We might suspect that the praise functioned not simply as 
reinforcement but rather as a form of rule or instruction, 
and that George, for various reasons, provided himself with 
similar instruction every time another spill occurred. 

(pp. 164-165, emphasis in original) 


A study by Goetz and Baer (1973) investigating the 
effects of teacher praise on preschool children’s creative play 
with building blocks used descriptive praise in one condition of 
the study. “The teacher remarked with interest, enthusiasm, and 
delight every time that the child placed and/or rearranged the 
blocks so as to create a form that had not appeared previously in 
that session’s construction(s). .. . ‘Oh, that’s very nice—that’s 
different’!)” (p. 212). The three 4-year-old girls increased the 
construction of block form diversity during each phase of contin- 
gent descriptive praise. Goetz and Baer did not conduct a com- 
ponent analysis to determine how much of the girls’ improved 
performance could be attributed to reinforcement in the form of 
positive attention (“That’s very nice!”) or to the feedback they 
received (“That’s different!”), which enabled them to create a 
rule to follow (“Building different things with the blocks gets the 
teacher’s attention.”). The authors surmised that 


for some children, either [reinforcing attention or descrip- 
tive praise] will be sufficient without the other, but that for 
other children, the mix of the two will be more effective 
than either alone. If so, then for applied purposes a pack- 
age of positive attention and descriptive praise is probably 
the best technique to apply to children in general. (p. 216, 
words in brackets added) 


We recommend that, in the absence of data showing that 
attention and praise have produced counter-therapeutic effects for 
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a given learner, practitioners incorporate contingent praise and 
attention into any intervention entailing positive reinforcement. 


Gradually Increase the Response-to- 
Reinforcement Delay 


We recommended in a previous guideline that practitioners rein- 
force each occurrence of the target behavior during the initial 
stages of learning, and then thin the delivery of reinforcers by 
switching to an intermittent schedule of reinforcement. Because 
the consequences that maintain responding in natural environ- 
ments are often delayed, Stromer, McComas, and Rehfeldt 
(2000) reminded us that using continuous and intermittent 
schedules of reinforcement might be just the first steps of pro- 
gramming consequences for everyday situations. “Establishing 
the initial instances of a behavioral repertoire typically requires 
the use of programmed consequences that occur immediately 
after the target response occurs. However, the job of the applied 
behavior analyst also involves the strategic use of delayed rein- 
forcement. Behaviors that yield delayed reinforcement are 
highly adaptive in everyday life, but they may be difficult to 
establish and maintain” (p. 359).!7 

Examples of tactics that applied behavior analysts have 
used to help people learn to respond effectively for delayed con- 
sequences include (a) a delay-to-reinforcement time interval that 
begins with a short delay that is gradually increased (Dixon, 
Rehfeldt, & Randich, 2003; Schweitzer & Sulzer-Azaroff, 
1988); (b) a gradual increase in work requirements during the 
delay (Dixon & Holcomb, 2000); (c) an activity during the 
delay to “bridge the gap” between the behavior and reinforcer 
(Mischel, Ebbesen, & Zeiss, 1972); and, importantly, (d) verbal 
instruction in the form of an assurance that the reinforcer will 
be available following a delay (e.g., “The calculator will show 
the amount of money to be placed in a savings account for you. 
You will be given all the nickels in your savings account on 
[day]’”’) (Neef, Mace, & Shade, 1993, p. 39). We present more 
on using delayed consequences to promote generalization and 
maintenance of behavior changes in Chapter 30. 


Gradually Shift from Contrived to Naturally 
Occurring Reinforcers 


We end this chapter with an extract from Murray Sidman’s 
(2000) insightful and thought-provoking account of what he 
learned in the “early days” of applying behavioral principles to 
human behavior. In describing a project from 1965 to 1975 that 
emphasized the use of positive reinforcement with boys between 
the ages of 6 and 20 years who were diagnosed with intellectual 
disabilities and living in a state institution, Sidman recollected 
how introducing tokens as generalized conditioned reinforcers 
eventually led to praise from the project staff, and later to learn- 
ing itself, becoming powerful reinforcers for the boys. 


We began with tokens, which had the advantage of being 
visible and easily handled. Later, after the boys had learned 
to save tokens and to understand numbers, we were able to 
introduce points. For some, points led eventually to money. 
As the boys saw how pleased we were when they earned 
the tokens and points that brought them other reinforcers, 
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Success in manipulating the environment may be the 
ultimate naturally occurring reinforcer. As Skinner (1989) 
pointed out, this powerful reinforcer “does not need to be 
contrived for instructional purposes; it 1s unrelated to any par- 
ticular kind of behavior and hence always available. We call 
it success.” (p. 91) 


our pleasure also became important to them, and we 
became able to use praise as a reinforcer. As they learned 
more and more, many of the boys found that what they 
learned permitted them to deal more effectively with their 
gradually enlarging world. For them, learning itself became 
reinforcing. (p. 19) - 


sce CATES SNES SCOSNOSDOLODI HON IU HESECEDD SHA SEIEEE HS SECOCOSHNOU ONT E SoLSASTEONODUORES OTE 


SUMMARY 


Positive Reinforcement Defined 


between his or her actions and a reinforcing consequence, 


1. Positive reinforcement occurs when a response is followed 
immediately by the presentation of a stimulus change that 


or for that matter even be aware that a consequence has 
occurred, for reinforcement to happen. 


increases the future occurrence of similar responses. 11. Reinforcement strengthens any behavior that immediately 
precedes it; no logical or adaptive connection between 
2. The stimulus that is presented as a consequence, and that behavior and the reinforcing consequence is necessary. 
is responsible for the subsequent increase in responding, is 
called a positive reinforcer, or, more simply, a reinforcer. 12. The importance of understanding the arbitrariness of 
reinforcement goes far beyond providing a possible expla- 
3. The importance of the immediacy of reinforcement must nation for the development of harmless superstitious and 
be emphasized; a response-to-reinforcement delay of as idiosyncratic behaviors. The arbitrary nature of selection 
little as 1 second will be less effective than a reinforcer by reinforcement may explain the acquisition and mainte- 
are, yoeciatcly, nance of many maladaptive and challenging behaviors. 
4. The effects of long-delayed consequences on human 13. Automatic reinforcement refers to the behavior—stimulus 


behavior should not be attributed to reinforcement, as 
these effects drop off quickly as the delay is increased. 


5. A misconception held by some is that reinforcement is 
a circular concept. Circular reasoning is a form of faulty 
logic in which cause and effect are confused and not 
independent of each other. Reinforcement is not a circular 


10. Automaticity of reinforcement refers to the fact that a per- 


son does not have to understand or verbalize the relation 


change relation that occurs without the presentation of 
consequences by other people, or is assumed when a 
behavior persists in the absence of any known reinforcer. 


Classifying Reinforcers 


concept because the two components of the response— 14. An unconditioned reinforcer is a stimulus change that 
consequence relation can be separated and the conse- functions as reinforcement even though the learner has had 
quence manipulated to determine whether it increases the no particular learning history with it. (The terms primary 
frequency of the behavior it follows. reinforcer and unlearned reinforcer are synonyms for 

6. Reinforcement does more than increase the future occur- pet oe eee ecient SS, 
rence of similar behavior; it also changes the function of 15. A conditioned reinforcer (sometimes called a secondary 
stimuli that immediately precede the reinforced behavior. reinforcer or learned reinforcer) is a previously neutral stim- 
By virtue of being temporally paired with the response— ulus change that has acquired the capability to function as 
reinforcer contingency, antecedent events acquire the abil- a reinforcer through stimulus-stimulus pairing with one or 
ity to evoke (make more likely) instances of the reinforced more unconditioned reinforcers or conditioned reinforcers. 
response class. An antecedent stimulus that evokes behay- : ss F ' 3 
ior because it has been correlated with the availability ii aise Saeicaa eae aaa sell \ aia 
of reinforcement is called a discriminative stimulus (SP). fase bast ae LR pes oe Palvcue te 

many unconditioned and conditioned reinforcers, does not 

7. A discriminated operant is defined by a three-term contin- depend on a current EO for any particular form of rein- 
gency of SP >R— SRr. forcement for its effectiveness. 

8. The momentary effectiveness of any stimulus change as 17. When reinforcers are described by their physical proper- 
reinforcement depends on an existing level of motivation ties, they are typically classified as edible, sensory, tangi- 
with respect to that stimulus change. An establishing oper- ble, activity, or social reinforcers. 
ation (EO) (e.g., deprivation) increases the current effec- 
tiveness of a reinforcer; an abolishing operation (AO) (e. g., 18. The Premack principle states that making the opportunity 
satiation) decreases the current effectiveness of a reinforcer. to engage in a behavior that occurs at a relatively high free 

9. A complete description of reinforcement of a dis- eperans (onbaseling : a contingen 104 the, OCU TeRCe OF 
criminated operant entails a four-term contingency: jos fegiuen oa bebayien wa function as reinforcement for 
EO — SP > R —> sk+. the low-occurrence behavior. 

19. 


The response-deprivation hypothesis is a model for pre- 
dicting whether access to one behavior (the contingent 
behavior) wili function as reinforcement for another 


behavior (the instrumental response) based on the relative 
baseline rates at which each behavior occurs and whether 
access to the contingent behavior represents a restriction 
compared to the baseline level of engagement. 


Identifying Potential Reinforcers 


20. Stimulus preference assessment refers to a variety of pro- 
cedures used to determine (a) the stimuli that the person 
differentially selects; (b) the relative hierarchical prefer- 
ence value of those stimuli (high preference to low pref- 
erence); (c) the conditions under which those preference 
values change when task demands, deprivation states, or 
schedules of reinforcement are modified; and (d) whether 
highly preferred items ultimately serve as effective 
reinforcers. 


21. Stimulus preference assessment can be conducted using 


at least five forms: the single-stimulus (SS), paired-stim- 
ulus (PS), free operant (FO), multiple-stimulus with item 
replacement (MSWI), multiple-stimulus without item 
replacement (MSWO). These five forms can be clustered 
into three categories: ask the target person and/or signif- 
icant others, conduct free operant observations, and con- 
duct trial-based assessments. 


22. Preferred stimuli do not always function as reinforcers, 


and stimulus preferences often change over time. 


23. Reinforcer assessment refers to a variety of direct, data- 
based methods for determining the relative effects of a 
given stimulus as reinforcement under different and chang- 
ing conditions or the comparative effectiveness of multiple 
stimuli as reinforcers for a given behavior under specific 
conditions. Reinforcer assessment is often conducted 

with in-the-moment procedures, concurrent schedules of 
reinforcement, multiple schedules of reinforcement, and 


progressive reinforcement schedules. 
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KEY TERMS 


automatic reinforcement positive reinforcer 


conditioned reinforcer Premack principle 
generalized conditioned reinforcer reinforcer assessment 


positive reinforcement 
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MULTIPLE-CHOICE QUESTIONS 


1. A teacher wants to deliver a sticker to Johnny every time 
he says “Please.” This would be considered positive rein- 


forcement if: 

a. The behavior of saying “Please” increases over time. 

b. The behavior of saying “Please” decreases over time. 

c. The behavior of saying “Please” does not change over time. 
d. The behavior of sticker giving decreases over time. 


Hint: (See “Positive Reinforcement Defined”) 


response-deprivation hypothesis 


a aes nocSRRNRERRGSUES SSSA 


Chapter 11 * Positive Reinforcement 317 


Control Procedures for Positive Reinforcement 


24. Positive reinforcement control procedures are used to manip- 
ulate the contingent presentation of a potential reinforcer and 
observe any effects on the future frequency of behavior. Con- 
trol, as the term is used here, requires an experimental demon- 
stration that the presentation of a stimulus contingent on the 
occurrence of a target response functions as positive reinforce- 
ment. Control is demonstrated by comparing response rates in 
the absence and presence of a contingency, and then showing 
that with the absence and presence of the contingency the 
behavior can be turned on and off, or up and down. 


25. In addition to a reversal design using the withdrawal of the 


reinforcement contingency (i.e., extinction) as the control 
condition, noncontingent reinforcement (NCR), differen- 
tial reinforcement of other behavior (DRO), and differen- 
tial reinforcement of alternative behavior (DRA) can be 
used as control conditions for reinforcement. 


Using Reinforcement Effectively 


26. Guidelines for increasing the effectiveness of positive 


reinforcement interventions include: 
¢ Set an easily achieved initial criterion for reinforcement. 
¢ Use high-quality reinforcers of sufficient magnitude. 


¢ Use varied reinforcers to maintain potent establishing 
operations. 


¢ Use direct rather than indirect reinforcement contingen- 
cies when possible. 


¢ Combine response prompts and reinforcement. 

¢ Initially reinforce each occurrence of the behavior. 

¢ Provide contingent attention and descriptive praise. 

* Gradually increase the response-to-reinforcement delay. 


* Gradually shift from contrived to naturally occurring reinforcers. 


rule-governed behavior 
socially mediated contingencies 
stimulus preference assessment 


unconditioned reinforcer 


2. The qualifications to be considered when utilizing positive 
reinforcement are: 


a. Delay between response and consequence 

b. Stimulus conditions when response was emitted 
c. Strength of motivation 

d. All of these 

Hint: (See “Using Reinforcement Effectively”) 
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3. An example of unconditioned reinforcer(s) is: 


a. Food 

b. Water 

c. Oxygen 

d. Warmth 

e. Sexual stimulation 

f. All of these 

Hint: (See “Classifying Reinforcers”’) 

. A neutral stimulus can be made into a reinforcer by: 
a. Pairing it with an unconditioned stimulus 

b. Pairing it with an unconditioned response 

c. Presenting it individually after a response multiple times 
d. A neutral stimulus cannot be utilized as a reinforcer 
Hint: (See “Classifying Reinforcers’’) 

. Heather hears a phone ring, then picks up the phone. 


Her husband talks with her over the phone. What is the 
behavior in this three-term contingency? 


a. Husband talking 

b. Heather picks up the phone 

c. Phone ringing 

d. None of these 

Hint: (See “Positive Reinforcement Defined”) 
. An example of a motivating operation is: 

a. Hunger 

b. Hay fever 

c. Argument with significant other 

d. All of these 

e. None of these 

Hint: (See “Positive Reinforcement Defined’) 


. Which of the following is not an appropriate way to iden- 
tify potential reinforcers? 


a. Pick a stimulus that is reinforcing for someone else 

b. Ask the person or significant other what is reinforcing 
c. Observe the person 

d. Measure the person’s responses to trial-based tests 
Hint: (See “Identifying Potential Reinforcers”) 


- Stimulus preference assessment refers to a variety of pro- 
cedures to determine: 


a. The stimulus a person prefers 

b. The relative preference values of those stimuli 

c. The conditions under which those preference values hold 
d. All of these 


Hint: (See “Identifying Potential Reinforcers”) 


9, What information does a concurrent schedule reinforcer 


10. 


11. 


12. 


assessment provide? 
a. A rank order of preference for a variety of stimuli 


b. How often children will select one stimulus versus 
another to indicate preference 


c. Whether or not a stimulus functions as a reinforcer and 
how effective that stimulus is as a reinforcer compared 
to other stimuli 


d. All of these 

e. None of these 

Hint: (See “Reinforcer Assessment’) 

When two or more contingencies of reinforcement operate 


independently and simultaneously for two or more behav- 
iors, what schedule of reinforcement is in effect? 


a. Multiple 

b. Mixed 

c. Progressive-Ratio 

d. Concurrent 

e. All of these 

f. None of these 

Hint: (See “Reinforcer Assessment’) 

Which schedule reinforcer assessment consists of two or 
more component schedules of reinforcement for a single 


response with only one component schedule in effect at 
any given time? 

a. Progressive-Ratio 

b. Multiple 

c. Mixed 

d. Concurrent 

e. All of these 

f. None of these 

Hint: (See “Reinforcer Assessment’’) 

The model for predicting whether (a) access to one behav- 
ior will function as reinforcement for another behavior 
based on the relative baseline rates at which each behavior 
occurs and (b) whether contingent access to one of the 


behaviors represents a restriction compared to the baseline 
level of engagement is known as the: 


a. Response-satiation hypothesis 

b. Response-contingency hypothesis 
c. The Premack Principle 

d. Response-deprivation hypothesis 
e. None of these 


Hint: (See “Classification of Reinforcers by Formal 
Properties”’) 


13. 


14. 


15. 
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The response-deprivation hypothesis was built on whose 
concept? 


a. Premack 

b. Skinner 

c. Watson 

d. Pavlov 

e. None of these 

Hint: (See “Classification of Reinforcers by Formal 
Properties’’) 

Experimental control when evaluating positive reinforce- 
ment is demonstrated by comparing: 

a. Response rates in the presence of a contingency 

b. Response rates in the absence of a contingency 

c. Fixed time reinforcement to differential reinforcement 


d. Differential reinforcement of other behavior to another 
reinforcement schedule, such as differential reinforce- 
ment of alternate behavior 


e. All of these 

f. None of these 

Hint: (See “Control Procedures for Positive Reinforcement’) 
The presentation of a potential reinforcer on a fixed or 


variable schedule independent of the occurrence of the tar- 
get behavior is also referred to as: 


a. Contingent reinforcement 

b. Noncontingent reinforcement 
c. Contingent punishment 

d. Noncontingent punishment 
e. None of these 

f. All of these 


Hint: (See “Control Procedures for Positive Reinforcement’) 


ESSAY-TYPE QUESTIONS 


1. 


4. 


Briefly explain the terms positive reinforcement and posi- 
tive reinforcer. 


Hint: (See “Positive Reinforcement Defined”) 


. Briefly explain the terms conditioned and unconditioned 


reinforcers and give an example of each. 


Hint: (See “Classifying Reinforcers”) 


. Bob has just completed his morning workout routine. On 


the way to taking a shower, Bob walks past the refrigera- 
tor. Bob takes an apple from the refrigerator. Bob eats the 
apple on the way to taking his morning shower. Briefly 
describe the above example using the operant condition- 
ing paradigm. (Be sure to include motivation in your 
answer.) 


Hint: (See “Positive Reinforcement Defined’) 


Briefly describe both the pros and cons of the three basic 
methods for stimulus preference assessments. (The three 


16. 


17. 


18. 
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Which of the following could be used as a control condi- 


tion when evaluating the effects of a reinforcement-based 
procedure? 


a. Noncontingent reinforcement 
. Differential reinforcement of other behavior 


. Differential reinforcement of alternative behavior 
. All of these 


. None of these 
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Hint: (See “Control Procedures for Positive Reinforcement’) 
Which of the following are guidelines for effective 


reinforcement? 


a. Choosing reinforcers that are not relevant to the target 
behavior 


b. Using reinforcers of low magnitude 

c. Delaying the delivery of a reinforcer 

d. Using the same reinforcers over and over again 

e. None of these 

Hint: (See “Using Reinforcement Effectively”) 
Practitioners increase the effectiveness of positive rein- 


forcement when they attend to the of stimulus 
presentation. 


a. Arrangement 
b. Management 
c. Administration 
d. All of these 

e. None of these 


Hint: (See “Using Reinforcement Effectively”) 


methods are asking the person or significant others, 
observing the individual, and measuring the person’s 
responses to trial-based tests.) 


Hint: (See “Identifying Potential Reinforcers”) 


. After identifying possible reinforcers how could it 


be determined if the stimulus is a reinforcer? Briefly 
describe what assessment procedure you would use 
and why. 


Hint: (See “Identifying Potential Reinforcers’’) 


. Describe and give an example of the response-deprivation 


procedure. Explain your answer. 


Hint: (See “Classifying Reinforcers’”’) 


. Briefly explain whether or not “control” has been demon- 


strated in the graph. Support your explanation with 
evidence from the graph. (Control requires an experi- 
mental demonstration that the presentation of a stimulus 
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Rate of Target Behavior Per 


contingent on the occurrence of a target response functions 
as a positive reinforcement.) 
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Hint: (See “Control Procedures for Positive 
Reinforcement’) 


NOTES 


NO 


. Phrases such as strengthening the behavior and increasing the likelihood 


of future responding are sometimes used to describe the basic effect of 
reinforcement. Although such terms appear occasionally in this book, rec- 
ognizing Michael’s (1995) concern that use of such terms “encourages a 
language of intervening variables, or an implied reference to something 
other than an observable aspect of behavior” (p. 274), we most often use 
increased future occurrence (or rate) to refer to the primary effect of 
reinforcement. 

When the consequence that produced the increase in responding is best 
described as the termination or withdrawal of an already present stimulus, 
negative reinforcement has occurred. The fundamental nature and qual- 
ifying conditions for positive reinforcement and negative reinforcement 
are the same: Behavior increases in the future. Negative reinforcement is 
examined in detail in Chapter 12. 


. Excellent discussions of rule-governed behavior can be found in Baum 


(2017); Chase and Danforth (1991); Hayes (1989); Hayes, Zettle, and 
Rosenfarb (1989); Malott and Shane (2014); Reitman and Gross (1996); 
Schlinger and Blakely (1987); and Vaughan (1989). 


4. Motivating operations are described in detail in Chapter 16. 


It is a mistake to assume that all superstitious behavior is the direct result 
of adventitious reinforcement. Many superstitious behaviors are probably 
the result of following cultural practices. For example, high school baseball 
players may wear their caps inside out and backwards when a late-inning 
rally is needed because they have seen major leaguers don such “rally caps” 
in the same situation. 

Mands, tacts, and intraverbals—three elementary verbal operants first 
described by Skinner (1957)—are explained in Chapter 18. 

Remember, the environment does the pairing, not the learner. The learner 
does not have to “associate” the two stimuli. 

The first volume of the Journal of Applied Behavior Analysis (1968) is a 
treasure trove of classic studies in which simple and elegant experimental 
designs revealed the powerful effects of operant conditioning and contin- 
gency management. We strongly encourage any serious student of applied 
behavior analysis to read it from cover to cover, and to return to it now and 
then for inspiration. 

See https://www.kennedykrieger.org/sites/default/files/patient-care-files/ 
paired_stimulus_preference_assessment.pdf for detailed procedural steps 
and associated sample forms to complete a variety of SPAs. 


8. Shawn is overweight and does not currently do any form 


10. 


12.4For additional details on conducting multiple stimulus without replace- 


“ment (MSWO) preference assessments, including a sample data sheet, refer 


ily, 


16. 


ie 


of daily exercise. His doctor has recommended that he 
walk 3 miles daily. Briefly explain how positive reinforce- 
ment could be utilized to help Shawn increase the behav- 
ior of walking. (Include a proposed reinforcer, timing of 
proposed reinforcer delivery, contingency, motivation, 
and a statement about the proposed way of evaluating the 
effectiveness of the proposed reinforcer). 


Hint: (See “Using Reinforcement Effectively”) 


For additional details on conducting single-stimulus assessments, including 
a sample data sheet, refer to https://www.kennedykrieger.org/sites/default/ 
files/patient-care-files/single_stimulus_preference_assessment.pdf and 
Hagopian et al. (2001). 


. For additional details on conducting paired-stimulus preference assess- 


ments, including a sample data sheet, refer to https://www.kennedykrieger. 
org/sites/default/files/patient-care-files/paired_stimulus_preference_ 
assessment.pdf and Fisher et al. (1992). 


to https://www.kennedykrieger.org/sites/default/files/patient-care-files/ 
mswo_preference_assessment.pdf and DeLeon and Iwata (1996). 


. For additional information on comparative effects of SPA, see Verriden and 


Roscoe (2016) and Karsten et al. (2011). 


. These schedules of reinforcement and their effects on behavior are 


described in Chapter 13. 

Chapter 8 presents the ABAB, NCR, DRO, and DRA control techniques in 
the context of single-case experimental designs, and Chapter 25 discusses 
NCR, DRO, and DRA in detail. 

Chapter 25 describes DRO and DRA as behavior change tactics for decreas- 
ing the frequency of undesirable behavior. 

Moving from continuous reinforcement to an intermittent schedule of 
reinforcement is sometimes described as a means of increasing reinforcer 
delay (e.g., Alberto & Troutman, 2013; Kazdin, 2013). However, an inter- 
mittent schedule of reinforcement does not entail “delayed reinforcement” 
unless specified. Although only some occurrences of the target behavior are 
reinforced on an intermittent schedule of reinforcement (see Chapter 13), 
reinforcement is delivered immediately following the response that meets 
the contingency. For example, on a fixed ratio 10 schedule of reinforce- 
ment, every 10th response produces immediate reinforcement. Delay to 
reinforcement or reinforcement delay describes the time lapse between the 
response and delivery of the reinforcer after the contingency has been met 
(e.g., the reinforcer was delivered 45 seconds after every 10th response). 
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Negative Reinforcement 


Richard G. Smith and Brian A. lwata 


LEARNING OBJECTIVES 


¢ Define and provide examples of negative reinforcement. 


° Identify and use negative reinforcers. 


¢ Differentiate between escape and avoidance contingencies. 


e Identify the characteristics of negative reinforcement. 


e Use appropriate parameters and schedules of negative reinforcement. 
e State and plan for the possible unwanted effects of and ethical issues in the use of negative reinforcement. 


ave you ever been “that person” whose cell phone rings 

in an otherwise quiet classroom? If so, you probably 

felt the burning stares of the professor and the other 
students in the class as you quickly scrambled to turn off your 
ringer and sank deeply into your chair. You probably also make 
sure to turn your ringer off before you attend classes now. These 
are both instances of negative reinforcement contingencies, in 
which behavior increases when it results in escape from or 
avoidance of aversive events. Turning off the ringer in class 
produced escape from the judgmental stares of your classmates, 
and turning off the ringer before class now allows you to avoid 
any such future embarrassment. 

As noted in Chapter 11, positive reinforcement involves 
an increase in responding as a function of contingent presen- 
tation of a stimulus. In a complementary way, responding can 
lead to the termination or avoidance of a stimulus, and when 
responding increases as a result of this arrangement, learning has 
occurred through negative reinforcement. This chapter expands 
the discussion of operant contingencies to include negative 
reinforcement. We define negative reinforcement, distinguish 
between escape and avoidance contingencies, describe events 
that may serve as the basis for negative reinforcement, illustrate 


EO s? 


ways negative reinforcement may be used to strengthen behav- 
ior, and discuss ethical issues that arise when using negative 
reinforcement. Readers interested in more in-depth discussions 
of basic and applied research on negative reinforcement are 
referred to reviews by Hineline and Rosales-Ruiz (2013) and 
Iwata (1987). 


DEFINITION OF NEGATIVE REINFORCEMENT 


A negative reinforcement contingency is one in which the 
occurrence of a response produces the termination, reduc- 
tion, postponement, or avoidance of a stimulus, which leads 
to an increase in the future occurrence of that response. A full 
description of negative reinforcement requires specification of 
its four-term contingency (see Figure 12.1): (a) The establishing 
operation (EO) for behavior maintained by negative reinforce- 
ment is an antecedent event in whose presence escape (termina- 
tion of the event) is reinforcing, (b) the discriminative stimulus 
(SP) is another antecedent event in whose presence a response 
is more likely to be reinforced, (c) the response is the act that 
produces reinforcement, and (d) the reinforcer is the termination 
of the event that served as the EO. 


Effect on Frequency 
of Similar Occurrence 
under Similar Conditions 
in the Future 


Loud noise from 


Ask roommate 


3 - Roommate nearb - : 
construction outside y to close window 


Noise subsides 


Figure 12.1 Four-term contingency illustrating negative reinforcement. 
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Positive Versus Negative Reinforcement 


Positive and negative reinforcement have a similar effect on 
behavior in that both produce an increase in responding. They 
differ, however, with respect to the type of stimulus change that 
follows behavior, as illustrated in Figure 12.2. In both examples, 
a stimulus change (consequence) strengthens the behavior that 
preceded it: Asking the sibling to make a sandwich is strength- 
ened by obtaining food; carrying rain protection is strengthened 
by blocking the rain. However, behavior maintained by posi- 
tive reinforcement produces a stimulus that was absent prior to 
responding, whereas behavior maintained by negative reinforce- 
ment terminates a stimulus that was present prior to responding: 
Food was unavailable prior to asking for it but available after 
(positive reinforcement); rain was landing on one’s clothing 
before raising the umbrella but not after (negative reinforcement). 

Thus, the key distinction between positive and negative 
reinforcement is based on the type of stimulus change that occurs 
following a response. Many stimulus changes have discrete onsets 
and offsets and involve an “all-or-none” operation, as in turning 
on a television (positive reinforcement) or turning off the light in 
a bedroom (negative reinforcement). Other stimulus changes exist 
on a continuum from less to more, such as turning up the volume 
of a stereo to hear it better (positive reinforcement) or turning 
it down when it is too loud (negative reinforcement). Some- 
times, however, it is difficult to determine whether an increase 
in responding resulted from positive or negative reinforcement 
because the stimulus change is ambiguous. For example, although 
a change in temperature can be measured quantitatively so we 
know that it either increased or decreased following behavior, 
it is unclear whether turning on a heater when the temperature 
is 40° F (5° C) is an example of positive reinforcement because 
the response “produced heat” or negative reinforcement because 
the response “removed cold.” Another example can be found 
in a Classic study by Osborne (1969) on the use of free time as 


Positive Reinforcement 


EO sd 


a reinforcer in the classroom. During baseline, students were 
observed to get out of their seats frequently during long work 
periods. During treatment, the students were given 5 minutes of 
free time if they remained in their seats during 10-minute work 
periods, and in-seat behavior increased. At first glance, the free- 
time contingency appears to have involved negative reinforcement 
(termination of the in-seat requirement contingent on appropri- 
ate behavior). As Osborne noted, however, access to activities 
(games, social interaction, etc.) during free time may have func- 
tioned as positive reinforcement. 

Given the ambiguous nature of some stimulus changes, 
Michael (1975) suggested that the distinction between positive 
and negative reinforcement, based on whether a stimulus is pre- 
sented or removed, may be unnecessary. Instead, he emphasized 
the importance of specifying the type of environmental change 
produced by a response in terms of key stimulus features that 
constituted both the “prechange” and “postchange” conditions. 
This practice, he proposed, would eliminate the necessity of 
describing the transition between prechange and postchange 
conditions as one involving the presentation or removal of stim- 
uli and would facilitate a more complete understanding of func- 
tional relations between environment and behavior. 

Little has changed since the publication of Michael’s 
(1975) article; the distinction between positive and negative rein- 
forcement continues to be emphasized in every text on learning 
principles, and citations to the term negative reinforcement have 
increased in applied research (Iwata, 2006). In an attempt to renew 
the discussion, Baron and Galizio (2005) reiterated Michael’s 
position and included some additional points of emphasis. This 
terminological issue is a complex one that can be considered from 
several perspectives—conceptual, procedural, and historical— 
and debate on the issue continues at this time. Readers inter- 
ested in the topic are referred to a series of reactions to Baron 
and Galizio (Chase, 2006; Iwata, 2006; Lattal & Lattal, 2006; 
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Figure 12.2 Four-term contingency illustrating similarities and the difference between positive and 


negative reinforcement. 


Marr, 2006; Michael, 2006; Sidman, 2006) and their rejoinder 
(Baron & Galizio, 2006). One difficulty that arises when attempting 
to examine the issue empirically is equating the two types of con- 
tingencies. Magoon and colleagues did so in two human operant 
studies (Magoon & Critchfield, 2008; Magoon, Critchfield, 
Merrill, Newland, & Schneider, 2017) by pitting money-gain 
positive reinforcement against money-loss avoidance negative 
reinforcement. Their results showed differential outcomes under 
the two arrangements, supporting the proposition that the two 
types of contingencies are functionally unique. 


Negative Reinforcement Versus Punishment 


Negative reinforcement is sometimes confused with punish- 
ment for two reasons. First, because the lay term for positive 
reinforcement is reward, people mistakenly consider negative 
reinforcement as the technical term for the opposite of reinforce- 
ment (punishment). The terms positive and negative, however, do 
not refer to “good” and “bad” but to the type of stimulus change 
(presentation versus termination) that follows behavior (Catania, 
2013). A second source of confusion stems from the fact that the 
stimuli involved in both negative reinforcement and punishment 
are, by definition, “aversive.”! Although it is true that the same 
stimulus may serve as a negative reinforcer in one context and as 
a punisher in a different context, both the nature of the stimulus 
change and its effect on behavior differ. In a negative reinforce- 
ment contingency, a stimulus that was present is terminated by a 
response, which leads to an increase in responding; in a punish- 
ment contingency, a stimulus that was absent is presented follow- 
ing a response, which leads to a decrease in responding. Thus, a 
response that terminates loud noise would increase as a function 
of negative reinforcement, but one that produces loud noise would 
decrease as a function of punishment (see Chapter 14 for a more 
extensive discussion of punishment). 


ESCAPE AND AVOIDANCE CONTINGENCIES 


In its simplest form, negative reinforcement involves an escape 
contingency, in which a response terminates (produces escape 
from) an ongoing stimulus. An early study by Keller (1941) 
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illustrates typical laboratory research on escape. When a rat was 
placed in an experimental chamber and a bright light was turned 
on, the rat quickly learned to press a lever, which turned off the 
light. Osborne’s (1969) study on free-time contingencies may 
also serve as an example of escape learning in an applied con- 
text. To the extent that the important feature of the contingency 
was the termination of work requirements, in-seat behavior 
during 10-minute work periods produced 5 minutes of escape. 

Although situations involving escape are commonly encoun- 
tered in everyday life (e.g., we turn off loud noises, shield our eyes 
from the sun, flee from an aggressor), much behavior maintained 
by negative reinforcement is characterized by an avoidance 
contingency, in which a response prevents or postpones the presen- 
tation of a stimulus. Returning to the previous laboratory example, 
an experimenter can add to the escape contingency an arrangement 
in which another stimulus such as a tone precedes the presentation 
of the bright light, and a response in the presence of the tone 
eliminates the presentation of the light or postpones it until the 
tone is next presented. This type of arrangement has been called 
discriminated avoidance, in which responding in the presence of a 
signal prevents the onset of a stimulus from which escape is a rein- 
forcer. Because responses in the presence of the tone are reinforced, 
whereas those in the absence of the tone have no effect, the tone is a 
discriminative stimulus (SP) in whose presence there 1s an increased 
likelihood of reinforcement for responding. (See Chapter 17 
for more on stimulus control.) 

Avoidance behavior also can be acquired in the absence 
of a signal. Suppose the experimenter arranges a schedule in 
which the bright light turns on for 5 seconds every 30 seconds, 
and a response (or some number of responses) at any time during 
the interval resets the clock to zero. This type of arrangement 
is known as free-operant avoidance because the avoidance 
behavior is “free to occur” at any time and will delay the pre- 
sentation of the bright light. 

Each of the three types of contingencies described earlier 
was illustrated in an ingenious study by Azrin, Rubin, O’Brien, 
Ayllon, and Roll (1968) on postural slouching (see Figure 12.3). 
Participants wore an apparatus that closed an electrical circuit 
when slouching occurred. Closure of the switch produced an 
audible click, which was followed 3 seconds later by a 55-db 


Figure 12.3 Three types of negative reinforcement contingencies used by Azrin 
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tone. Postural correction in the presence of the tone turned off 
the tone (escape) but prevented the tone if correction occurred 
during the 3 seconds following the click (discriminated avoid- 
ance). Furthermore, maintenance of correct posture prevented 
the click (free-operant avoidance). A hypothetical example 
involving homework management also illustrates these contin- 
gencies. A parent who sends a child to his or her room imme- 
diately following school and allows the child to leave the room 
only after completing homework has arranged an escape con- 
tingency: Homework completion produces escape from the bed- 
room. A parent who first delivers a warning (e.g., “If you don’t 
start your homework in 10 minutes, you’ll have to do it in your 
bedroom”) has arranged a discriminated avoidance contingency: 
Starting homework following the warning avoids having to do 
it in the bedroom. Finally, the parent who waits until later in 
the evening to impose the in-room requirement has arranged a 
free-operant avoidance contingency: Homework completion at 
any time after school avoids having to do it in the bedroom later. 


As can be seen above, escape and avoidance contingen- 
cies illustrate very different arrangements, and one particular 
feature of some avoidance contingencies has raised questions 
about fundamental learning processes. Whereas the environment— 
behavior relationship in escape contingencies is clearly 
identifiable—that is, responding results in termination or reduc- 
tion of a stimulus that was present initially—the evocative and 
maintaining influences involved in avoidance are less clear. 
Except in the case of discriminated avoidance, avoidance behav- 
ior often occurs in the absence of any apparent evocative con- 
dition and does not produce obvious consequential changes in 
the environment. This seems inconsistent with our A-B-C notion 
of operant behavior: We see the “B,” but where are the “A” and 
“C”? How do we account for behavior that does not appear to 
fit a conceptual framework that emphasizes the importance of 
immediate antecedent and consequent events as determinants 
of behavior? (See Box 12.1 for further discussion of avoidance 
responding.) 


Is Avoidance a Different Type of Learning? 


Two-Factor Theory 


In the 1940s, Mowrer conducted research showing that, after several 
trials in which a tone was presented just before a shock, rats 
began to emit escape responses (moving over a barrier to another 
location) in the presence of only the tone. Mowrer proposed that 
this behavior resulted from two independent processes. First, the 
shock was considered to be an unconditioned stimulus, eliciting 
reflexive “fear” responses. After being paired with the shock, 
the tone acquired fear-eliciting properties through respondent 
conditioning. Thus, Mowrer asserted that responding in the 
presence of the tone produced escape from the fear and anxiety 
elicited by the tone, rather than escape from the shock. He proposed 
a “two-factor” theory, incorporating both respondent and operant 
mechanisms, to account for avoidance behavior. Some subsequent 
research provided further evidence in support of this theory, such 
as a study by Kamin (1957) showing that avoidance behavior was 
disrupted when it canceled upcoming shocks but did not terminate 
the warning tone. Results of other research, however, have been 
inconsistent with Mowrer’s theory. For example, Solomon, 
Kamin, and Wynne (1953) showed that dogs continued to emit 
avoidance responses in the presence of a warning signal long after 
shocks were no longer programmed to follow the signal. And 
work by Sidman (1953, 1962) and others (Herrnstein & Hineline, 
1966; Perone & Galizio, 1987) showed that when shocks were 
presented according to time-based schedules, animals learned to 
postpone shocks in the absence of a programmed signal, further 
calling Mower’s two-factor theory into question. 


Anxiety as derived relational responding 


Behavior analysts also have extended findings of research on 
derived relational responding to investigate clinical problems 


involving avoidance (Dymond, 2009). It has been suggested 
that anxiety and phobic disorders develop when aversive prop- 
erties spread across classes of stimuli whose function is estab- 
lished through associative, or indirect, conditioning processes. 


‘For example, Dougher, Hamilton, Fink, and Harrington (2007) 


showed that fear responses spread in an orderly, hierarchical 
way across three otherwise arbitrary stimuli to which college 
students had been trained to respond according to a small, 
medium, and large relation. After the “medium” stimulus was 
paired with a mild shock, a smaller fear response was recorded 
for the “small” stimulus, whereas a larger fear response was 
recorded in the presence of the “large” stimulus—even though 
neither of these stimuli had ever been paired with the shock! 
The authors suggested that these results were relevant to clin- 
ical problems involving excessive fear responses to stimuli 
that had not been previously paired with aversive experiences 
(e.g., excessive fear of photos of anything resembling a spider) 
and offered an alternative account to cognitive interpretations 
based on hypothetical constructs such as schema, beliefs, and 
expectancies. 

As behavior analysts continue to study the nature of 
avoidance, scientists in other fields are producing better ways 
to predict what sorts of challenges—environmental as well 
as social—we will face in the future. Solutions to problems 
such as disease, poverty, and pollution depend on being able 
not only to predict their occurrence but also to change human 
behavior so as to avoid, to the extent possible, their disastrous 
outcomes. Thus, behavior analysts’ contribution to an under- 
standing of the fundamental determinants of avoidance and of 
ways to promote effective avoidance repertoires can directly 


influence our ability to prosper in an increasingly challenging 
world. 


CHARACTERISTICS OF NEGATIVE 
REINFORCEMENT 


Responses Acquired and Maintained by 
Negative Reinforcement 


It is a well-known fact that aversive stimulation produces a variety 
of responses (Hutchinson, 1977). Some of these may be respon- 
dent behaviors (as in reflexive actions to intense stimuli), but the 
focus in this chapter is on operant behaviors. Recall that the pre- 
sentation of an aversive stimulus serves as an EO for escape and 
occasions behavior that has produced escape from similar stimula- 
tion in the past. Any response that successfully terminates the stim- 
ulation will be strengthened; as a result, a wide range of behaviors 
may be acquired and maintained by negative reinforcement. All 
of these behaviors are adaptive because they enable one to interact 
effectively with the environment; some behaviors, however, are 
more socially appropriate than others. As will be seen later in the 
chapter, negative reinforcement may play an important role in the 
development of academic skills, but it also can account for the 
development of disruptive or dangerous behavior. 


Events That Serve as Negative Reinforcers 


In discussing the types of stimuli that can strengthen behavior 
through negative reinforcement, a problem arises when attempt- 
ing to use the same terminology as that applied to the description 
of positive reinforcers. It is quite common to refer to positive 
reinforcers by listing stimuli such as food, money, praise, and so 
on. It is, however, the presentation of the stimulus that strength- 
ens behavior: Food presentation, and not food per se, is a posi- 
tive reinforcer. Nevertheless, we often simply list the stimulus 
and assume that “presentation” is understood. In a similar way, 
to say that negative reinforcers include shock, noise, parental 
nagging, and so on, is an incomplete description. It is important 
to remember that a stimulus described as a negative reinforcer 
refers to its removal because, as noted previously, the same 
stimulus serves as an EO when presented prior to behavior and 
as punishment when presented following behavior. 


Learning History 


As is the case with positive reinforcers, negative reinforcers 
influence behavior because (a) we have the inherited capacity to 
respond to them or (b) their effects have been established through 
a history of learning. Stimuli whose removal strengthens behav- 
ior in the absence of prior learning are unconditioned negative 
reinforcers. These stimuli typically are noxious events such as 
shock, loud noise, intense light, extremely high or low tempera- 
ture, or strong pressure against the body. In fact, any source of 
pain or discomfort (e.g., a headache) will occasion behavior, and 
any response that successfully eliminates the discomfort will be 
reinforced. Other stimuli are conditioned negative reinforcers, 
which are previously neutral events that acquire their effects 
through pairing with an existing (unconditioned or conditioned) 
negative reinforcer. A bicyclist, for example, usually heads for 
home when seeing a heavily overcast sky because dark clouds 
have been highly correlated with impending bad weather. Various 
forms of social coercion, such as parental nagging, are perhaps 
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the most commonly encountered conditioned negative reinfore- 
ers. For example, reminding a child to clean his or her bedroom 
may have little effect on the child’s behavior unless failure to 
respond is followed by another consequence, such as having to 
stay in the room until it is clean. To the extent that the nagging is 
reliably “backed up” by sending the child to the room, the child 
will eventually respond simply to stop or prevent the nagging. It 
is interesting that, in the case of negative reinforcement, neutral 
events (dark sky, nagging) function as both (a) discriminative 
stimuli because responding in their presence avoids another con- 
sequence and (b) conditioned aversive stimuli (i.e., conditioned 
negative reinforcers) because, due to their pairing with another 
consequence, they become stimuli to avoid or escape. 


The Source of Negative Reinforcement 


Another way to classify negative reinforcers is based on how 
they are removed (1.e., the source of the stimulus change). In 
Chapter 11, a distinction was made between socially mediated 
reinforcement, in which the consequence results from the action 
of another person, and automatic reinforcement, in which the 
consequence is produced directly by a response independent of 
the actions of another. This distinction also applies to negative 
reinforcement. Returning to the example in Figure 12.1, we can 
see that termination of the construction noise was an instance 
of social negative reinforcement (the roommate’s action closed 
the window). The person “being bothered” by the noise, how- 
ever, could simply have walked across the room and closed 
the window (automatic negative reinforcement). This example 
illustrates the fact that many reinforcers can be removed or ter- 
minated either way: You can consult a physician when experi- 
encing a headache (social) or take a pain medication (automatic), 
ask your roommate to turn off the stereo when you're studying 
for a test (social) or turn it off yourself (automatic), and so on. 
Considering the source of negative reinforcement may 
facilitate the design of behavior-change procedures by determin- 
ing the focus of intervention. For example, when faced with a 
perplexing work task, an employee may finish it incorrectly just 
to get it out of the way (automatic reinforcement) or ask for help 
(social reinforcement). Aside from reassigning the employee, 
the quickest solution would be to reinforce the employee’s seek- 


‘ing help by offering assistance. Ultimately, however, the super- 


visor would want to teach the employee the necessary skills to 
complete the work tasks independently. 


Identifying the Context of Negative 
Reinforcement 


Chapter 11 outlined several ways to identify positive reinforcers; 
the difference with negative reinforcers is that equal emphasis must 
be placed on the antecedent event (EO) as well as on the reinforc- 
ing consequence because, once the behavior occurs, the negative 
reinforcer may be gone and cannot be observed. The identification 
of EOs may be difficult with individuals who have limited verbal 
ability and cannot tell someone they are experiencing aversive 
stimulation. These people may engage in other behaviors, such 
as tantrums, attempts to leave the situation, destructive behavior, 
aggression, or even self-injury. Weeks and Gaylord-Ross (1981), 
for example, observed students with severe disabilities when no 
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task, an easy task, and a difficult task were presented. Little or no 
problem behavior occurred during the no-task condition, and prob- 
lem behavior occurred somewhat more often in the difficult-task 
condition than in the easy-task condition. These results suggested 
that the students’ problem behavior was maintained by escape 
from task demands and that difficult tasks were more “aversive” 
than were easy tasks. However, because the consequences that 
followed problem behavior were unknown, it is possible that the 
behaviors were maintained by some other consequence, such as 
attention, which would be a positive reinforcer. 

Iwata, Dorsey, Slifer, Bauman, and Richman (1994) devel- 
oped a method for identifying the types of contingencies that 
maintain problem behavior by observing people under a series 
of conditions that differed with respect to both antecedent and 
consequent events. One condition involved the presentation 
of task demands (EO) and removal of the demands (escape) if 
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problem behavior occurred; higher rates of problem behavior 
under this condition relative to others indicated that problem 
behavior was maintained by negative reinforcement (see 
Chapter 24 for further discussion of this approach to assessment). 

Smith, Iwata, Goh, and Shore (1995) extended the findings 
of Weeks and Gaylord-Ross (1981) and Iwata et al. (1994) by 
identifying some characteristics of task demands that make them 
aversive. After first determining that their participants’ (people 
with severe disabilities) problem behavior was maintained by 
escape from task demands, Smith and colleagues examined sev- 
eral dimensions along which tasks might differ: task novelty, 
duration of the work session, and rate of demand presentation. 
Results of one of these analyses are shown in Figure 12.4, which 
depicts frequency distributions and cumulative records of prob- 
lem behavior from the beginning to the end of sessions. These 
data illustrate the importance of individualized assessments in 
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identifying the basis for negative reinforcement because two par- 
ticipants (Evelyn and Landon) showed increasing rates of prob- 
lem behavior as work sessions progressed, whereas two other 
participants (Milt and Stan) showed decreasing rates of problem 
behavior as work sessions progressed, as illustrated by the declin- 
ing height of the bars in their histograms and of the slopes of 
their cumulative curves. In fact, subsequent studies (e.g., Roscoe, 
Rooker, Pence, & Longworth, 2009) have shown that failure to 
consider the idiosyncratic nature of aversive stimuli may lead to 
erroneous conclusions during assessment of problem behavior. 


Determinants of Negative 
Reinforcement Effects 


The factors that determine whether a negative reinforcement 
contingency will be effective in changing behavior are similar 
to those that influence positive reinforcement (see Chapter 11) 
and are related to (a) the strength of the contingency and (b) 
the presence of competing contingencies. In general, negative 
reinforcement for a given response will be more effective under 
the following conditions: 


1. The stimulus change immediately follows the occurrence 
of the target response. 


2. The magnitude of reinforcement is large, referring to 
the difference in stimulation present before and after the 
response occurs. 


3. Occurrence of the target response consistently produces 
escape from or postponement of the EO. 


4. Reinforcement is unavailable for competing (nontarget) 
responses. 


APPLICATIONS OF NEGATIVE 
REINFORCEMENT 


Negative reinforcement is a fundamental principle of learning 
that has been studied extensively in basic research (Hineline & 
Rosales-Ruiz, 2013). Although many examples of escape and 
avoidance learning can be found in everyday life, research in 
applied behavior analysis has heavily emphasized the use of 
positive reinforcement over negative reinforcement, mostly 
for ethical reasons, which are noted in the final section of this 
chapter. Still, negative reinforcement has been used as one means 
of establishing a variety of behaviors. This section illustrates 
several therapeutic uses of negative reinforcement, as well as the 
unintended role it may play in strengthening problem behavior. 


Acquisition and Maintenance of Appropriate 
Behavior 


Interventions for Chronic Food Refusal 


Pediatric feeding problems are common and are especially prev- 
alent among children with developmental disabilities. These dis- 
orders may take a variety of forms, including selective eating, 
failure to consume solid foods, and complete food refusal, and 
may be serious enough to require tube feeding or other artificial 
means to ensure adequate nutritional intake. A large proportion 
of feeding problems cannot be attributed to a medical cause but, 
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instead, appear to be learned responses most likely maintained 
by escape or avoidance. 

Results from a number of studies have shown that operant 
learning—based interventions can be highly effective in treat- 
ing many childhood feeding disorders, and a study by Ahearn, 
Kerwin, Eicher, Shantz, and Swearingin (1996) illustrated the 
use of negative reinforcement as a form of intervention. Three 
children admitted to a hospital who had histories of chronic food 
refusal were first observed under a baseline (positive reinforce- 
ment) condition in which food was presented and access to toys 
was available contingent on accepting food. Food refusal, how- 
ever, produced escape in that it terminated a trial. Subsequently, 
the experimenters compared the effects of two interventions. 
One treatment condition (nonremoval of the spoon) involved 
presenting food and keeping the spoon positioned at the child’s 
lower lip until the bite was accepted. The other treatment (phys- 
ical guidance) involved presenting food and, if the child did not 
accept, opening the child’s mouth so that the food could be deliv- 
ered. Both treatments involved a negative reinforcement contin- 
gency because food acceptance terminated the trial by producing 
removal of the spoon or avoidance of the physical guidance. 

Figure 12.5 shows the results obtained for the three chil- 
dren. All children exhibited low rates of acceptance during 
baseline in spite of the availability of positive reinforcement. 
The two interventions were implemented in a multiple base- 
line across subjects design and were compared in a multiele- 
ment design. As can be seen in the second phase of the study, 
both interventions produced immediate and large increases in 
food acceptance. Numerous subsequent studies have replicated 
these findings. Of particular interest is a study by Piazza, Patel, 
Gulotta, Sevin, and Layer (2003), who compared the effects of 
positive reinforcement (access to preferred toys and attention), 
negative reinforcement (physical guidance or nonremoval of the 
spoon, both labeled as “escape extinction” because food refusal 
no longer terminated feeding trials), and positive and negative 
reinforcement combined on chronic food refusal exhibited by 
four children. Results showed that positive reinforcement was 
not effective in increasing food consumption for any of the chil- 
dren. By contrast, negative reinforcement produced increases in 
consumption regardless of whether it was implemented alone or 
in combination with positive reinforcement. Collectively, results 
of these studies showed that positive reinforcement for appro- 
priate behavior may have limited effects if competing behavior 
(food refusal) produces negative reinforcement, and that neg- 
ative reinforcement, which maintained the children’s problem 
behavior, could be used to establish alternative behavior. 


Error-Correction Strategies 


As noted in Chapter 11, positive reinforcement is a basic moti- 
vational component of effective instruction. Teachers commonly 
deliver praise, privileges, and other forms of reward contingent 
on correct performance. Another common procedure, but one that 
has received less attention than positive reinforcement, involves 
the correction of student errors by repeating a learning trial, hav- 
ing the student practice correct performance, or giving the student 
additional work. To the extent that correct performance avoids 
these remedial procedures, improvements may be just as much 
a function of negative reinforcement as positive reinforcement. 
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Worsdell and colleagues (2005) examined the relative 
contributions of these contingencies during behavioral acqui- 
sition. The learning task involved reading words presented on 
flash cards, and the intervention of interest involved the correct 
repetition of misread words during remedial trials. The proce- 
dure provided additional practice of correct responses but also 
represented an avoidance contingency. To separate these effects 
(in Study 3), the authors implemented two error correction con- 
ditions. In the “relevant” condition, which combined the effects 
of practice and negative reinforcement, students were prompted 
to pronounce the misread word correctly 5 times contingent on 
an error. In the “irrelevant” condition, students were prompted 
to repeat an unrelated, nontarget word 5 times contingent on an 
error. The irrelevant condition contained only the negative rein- 
forcement contingency because repetition of irrelevant words 
provided no practice in correctly reading misread words. 

Figure 12.6 shows the results of Study 3, expressed as the 
cumulative number of words mastered by the 9 participants. All 
participants’ performance improved during both error-correction 
conditions relative to baseline, when no error-correction proce- 
dure was in effect. Performance by 3 participants (Tess, Ariel, 
and Ernie) was better during relevant error correction. However, 
Mark’s performance was clearly superior during irrelevant error 
correction, and performance of the remaining 5 participants (Hay- 
ley, Becky, Kara, Maisy, and Seth) was similar in both conditions. 
Thus, all participants showed improvement in reading perfor- 
mance even when they practiced irrelevant words, and most partic- 
ipants (6 of 9) did just as well or better practicing irrelevant rather 
than relevant words. These results suggest that the success of many 
remedial (error-correction) procedures may be due at least in part 
to negative reinforcement. Subsequent studies (Carroll, Joachim, 
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St. Peter, & Robinson, 2015; Kodak et al., 2016; McGhan & 
Lerman, 2013) have examined specific features of error-correction 
procedures that may enhance their effectiveness, and a consistent 
finding has been the contribution of avoidance contingencies to 
learning in instructional contexts. That is, the inclusion of a reme- 
dial-trial contingency produces faster learning than that resulting 
from positive reinforcement of correct responses alone. 


Acquisition and Maintenance of Problem 
Behavior 


Well-designed instructional procedures maintain a high degree 
of task engagement and lead to improved learning. Occasion- 
ally, however, the presentation of task demands may function as 
an EO for escape behavior due to some aversive feature of the 
work requirements. Initial forms of escape may include lack of 
engagement or mild forms of disruption. To the extent that pos- 
itive reinforcement for compliance is less than optimal, attempts 
to escape may persist and may escalate to severe forms of prob- 
lem behavior. In fact, research on the assessment and treatment 
of problem behavior has shown that escape from task demands 
is a common source of negative reinforcement for property 
destruction, aggression, and even self-injury. This topic is coy- 
ered more extensively in Chapter 27 and is included here as 
well because of its special relevance to negative reinforcement. 

O’Reilly (1995) conducted an assessment of a person’s 
episodic aggressive behavior. The participant was an adult 
with severe intellectual disability who attended a vocational 
day program. To determine whether aggressive behavior was 
maintained by positive versus negative reinforcement, O’Reilly 
observed the participant under two conditions, which were alter- 
nated in a multielement design. In one condition (attention), a 
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therapist ignored the participant (EO) except to deliver repri- 
mands following aggression (positive reinforcement). In the 
second condition (demand), a therapist presented difficult tasks 
to the participant (EO) and briefly terminated the trial following 
aggression (negative reinforcement). 

As Figure 12.7 shows, aggressive behavior occurred more 
often in the demand condition, indicating that it was maintained 
by negative reinforcement. Because anecdotal reports suggested 
that the participant also was more likely to be aggressive fol- 
lowing nights when he had not slept well, the data for both 
conditions were further divided based on whether the partici- 
pant slept for more or less than 5 hours the previous night. The 
highest rates of aggression occurred following sleep deprivation. 
These data are particularly interesting in that they illustrate the 
combined influence of two antecedent events—EOs on behav- 
ior maintained by negative reinforcement: Work tasks (EO #1) 


Analysis of Behavior, Inc. 


generally occasioned escape but even more so in the absence of 
sleep (EO #2). 

Other studies have had similar outcomes, suggesting that 
a number of “contextual” factors can mediate the “aversiveness” 
of demand situations. For example, task demands may be more 
likely to evoke escape behavior when a medical condition such 
as otitis media (ear infection) is present (O’Reilly, 1997) or 
when demands are presented in the context of low-quality atten- 
tion (e.g., negative verbal statements) from caregivers (Gardner, 
Wacker, & Boelter, 2009). By contrast, contextual manipulations 
such as presenting the opportunity to choose among tasks (Dyer, 
Dunlap, & Winterling, 1990) or to choose the order in which 
tasks are completed (Kern, Mantegna, Vorndran, Bailin, & Hilt, 
2001), or presenting demands in the context of pleasant stories 
(Carr, Newsom, & Binkoff, 1976), have been shown to decrease 
problem behaviors maintained by escape. 
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were exacerbated by sleep deprivation. 
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Extinction of Behavior Maintained by Negative 
Reinforcement 


When operant behavior no longer produces reinforcement, its 
frequency decreases through a process known as extinction. As 
is the case with reinforcement, extinction involves both an oper- 
ation (termination of reinforcement) and an effect (a reduction 
in responding). Although simple in concept, the application of 
extinction requires consideration of numerous factors: (a) its 
functional as well as procedural variations; (b) its direct and 
indirect effects, some of which may be undesirable; and (c) his- 
torical influences that may facilitate or impede its outcome. As a 
result, a chapter of this text (Chapter 24) is devoted to the topic 
of extinction. We introduce it here just to note that extinction 
represents one way (the most direct way) to decrease behavior 
maintained by negative reinforcement and to emphasize the fact 
that termination of a negative reinforcement contingency by “no 
longer reinforcing behavior” means presenting the consequence 
that responding previously escaped or avoided (i.e., withholding 
negative reinforcement). For example, if a child’s loud whining 
and complaining allow the child to delay preforming an unde- 
sirable task or perhaps to avoid it altogether, extinction would 
involve reinstating the requirement to complete the task. 


Behavioral Replacement Strategies 


Differential Negative Reinforcement 


Problem behaviors maintained by negative reinforcement can be 
treated in a number of ways. One strategy is to strengthen a more 
socially appropriate replacement behavior using negative rein- 
forcement, as illustrated in a study by Durand and Carr (1987). 
After determining that the “stereotypic’” behaviors of four spe- 
cial education students were maintained by escape from task 
demands, the authors taught the students an alternative response 
(“Help me”), which was followed by assistance with the task 
at hand. As can be seen in Figure 12.8, all students engaged 
in moderate-to-high levels of stereotypy during baseline. After 
being taught to use the phrase “Help me,” the students began to 
exhibit that behavior, and their stereotypy decreased. 


Demand (Home) Attention (Home) Demand (Home) 
Attention (Voc Facility) Demand (Voc Facility) Attention (Voc Facility) 
a Me ae ee | reg ae ee ee 


Demand < 5 hrs sleep ——— @ 
Demand > 5 hrs sleep ——— O 
Attention < 5 hrs sleep @ 
Attention > 5 hrs sleep fe) 


Sessions 


Results of the Durand and Carr (1987) study showed that 
a desirable behavior could replace an undesirable one; however, 
the replacement behavior might be considered less than ideal 
because it did not necessarily facilitate improved task engage- 
ment. This limitation was addressed in a subsequent study by 
Marcus and Vollmer (1995). After collecting baseline data on 
a young girl’s compliant and disruptive behavior, the authors 
compared the effects of two treatments in a reversal design. Ina 
differential negative reinforcement (DNR) communication con- 
dition, the girl was given a brief break from the task when she 
said “Finished.” In a second condition, called DNR compliance, 
the girl was given a break after complying with an instruction 
(the criterion for a break was later increased to compliance with 
three instructions). Results of this comparison (see Figure 12.9) 
showed that both treatments produced marked reductions in dis- 
ruptive behavior. However, only the DNR compliance condition 
produced an improvement in task performance. 


Differential Positive Reinforcement 


Research on the treatment of problem behavior has emphasized 
the importance of extinction as a critical component of interven- 
tion. Results of a small but growing number of studies, however, 
indicate that reinforcement-based procedures occasionally can 
be effective even though problem behavior might continue to 
be reinforced. Furthermore, some of these studies have shown 
that applications of positive reinforcement can be successful in 
strengthening appropriate replacement behavior when problem 
behavior has been maintained by negative reinforcement (see 
Payne & Dozier, 2013, for a recent review). 

A study by Slocum and Vollmer (2015) illustrated this 
approach with five children who engaged in high rates of prob- 
lem behavior maintained by escape from task demands and cor- 
respondingly low levels of compliance. The authors compared 
two interventions in a multielement design, both of which were 
implemented without extinction (i.e., problem behavior contin- 
ued to produce escape). Positive reinforcement involved deliv- 
ery of a small edible item contingent on compliance, whereas 
negative reinforcement involved delivering a 30-sec break 
from the task contingent on compliance. The authors reported 
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of intervals of stereotypic 
behaviors maintained by escape 
from task demands of four 
special education students 
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that positive reinforcement produced more consistent effects, 
as illustrated in Figure 12.10, which shows data on problem 
behavior for one of the participants. During the first treatment 
phase, both interventions were effective, although lower rates of 
problem behavior were observed under the positive reinforce- 
ment condition. During the second treatment phase, positive 
reinforcement was effective, whereas negative reinforcement 


was not. 


CHANGES IN TEACHER AND CAREGIVER 
RESPONDING AS A FUNCTION OF NEGATIVE 
REINFORCEMENT 


A great deal of research has focused on the use of procedures 
such as those described above to improve the behavior of stu- 
dents and clients. By contrast, very little work has examined the 
complementary effect of client behavior change on the behavior 
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Figure 12.10 The rate of problem 
behavior during baseline and 
treatment conditions (left) and the 
percentage of compliance during 
baseline and treatment conditions 
(right) across sessions. (Note: This 
graph shows results for one of the 
study’ five subjects). 

From “A comparison of positive and 
negative reinforcement for compliance 

to treat problem behavior maintained by 
escape.” by S. K. Slocum and T. R. Vollmer, 
2015, Journal of Applied Behavior Analysis. 
Reproduced with permission of John Wiley 
& Sons, |nc. 
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of the agent (teacher, therapist, etc.), yet this relation may reflect 
an important negative reinforcement contingency. When a 
teacher implements a behavior change procedure that results in a 
reduction of problem behavior, teacher behavior has terminated 
an aversive event. This type of relation may strengthen any 
behavior—inappropriate as well as appropriate—that has the 
same effect, a possibility suggested by results from descriptive 
studies of teacher behavior (Addison & Lerman, 2009; Sloman 
et al., 2005). 

Negative reinforcement effects have been examined 
more directly in two experimental simulations. Miller, 
Lerman, and Fritz (2010) observed seven teachers in training 
who were instructed to teach a variety of skills to a “student” 
(role played by a research assistant). The student engaged in 
problem behavior at prearranged times throughout sessions 
and either stopped when the teacher delivered a reprimand 
(negative reinforcement condition) or continued to engage 
in problem behavior regardless of the teacher’s behavior 
(extinction). Results showed that student behavior had a 
clear effect on teacher behavior in that the rate of reprimands 
was higher during the negative reinforcement condition. 
The authors also reported that three of the teachers termi- 
nated their participation in the study during an extinction 
condition, which provided further evidence that “problem 
behavior” was an aversive event. Using a similar method- 
ology, Thompson, Bruzek, and Cotnoir-Bichelman (2011) 
observed 11 undergraduate students during simulated care- 
taking sessions in which the student interacted with a “baby” 
(a doll). An experimenter presented a recorded infant cry at 
the beginning of sessions and terminated the cry contingent 
on different caregiver responses (horizontal rocking, vertical 
rocking, simulated feeding, playing) in different conditions. 
Results generally showed persistence of responses that were 
“effective” in terminating crying and extinction of responses 
that no longer were effective. 
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Data from this line of research have several implications. 
First, the occurrence of problem behavior appears to be an EO 
for caregiver escape and avoidance and is likely to occasion any 
caregiver response that is effective in terminating it, ranging 
from active (delivering a reprimand) to passive (limiting social 
interaction) responding. Second, this fact underscores the impor- 
tance of instructing parents and teachers in appropriate behavior 
management strategies to prevent the emergence of inappropri- 
ate ones. Finally, the maintenance of newly taught therapeutic 
skills may be more than a function of the quality of training; 
an equally important determinant may be how successful those 
skills are in quickly solving problems. 


ETHICAL CONSIDERATIONS IN USING 
NEGATIVE REINFORCEMENT 


Ethical concerns about the use of positive and negative rein- 
forcement arise primarily from the severity of the antecedent 
event (EO) that occasions the target behavior. Many EOs for 
behavior maintained by positive reinforcement can be character- 
ized as deprivation states, which, if severe, can constitute undue 
restriction of rights. By contrast, EOs for behavior maintained 
by negative reinforcement are, by definition, aversive events. 
Extremely noxious events, when presented as antecedent stimuli, 
cannot be justified as part of a typical behavior change program. 

Another concern with negative reinforcement is that the 
presence of aversive stimuli can itself generate behaviors that 
compete with the acquisition of desired behavior (Hutchinson, 
1977; Myer, 1971). For example, a socially withdrawn child, 
when placed in the midst of others, may simply scream and run 
away instead of playing with the peers, and running away is 
incompatible with social interaction. Finally, undesirable side 
effects typically associated with punishment (see Chapter 14) 
might also be observed when implementing behavior change 
programs based on negative reinforcement. 
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Definition of Negative Reinforcement 


I 


Negative reinforcement involves the termination, 
reduction, or postponement of a stimulus contingent on the 
occurrence of a response, which leads to an increase in the 
future occurrence of that response. 


. A negative reinforcement contingency involves (a) an 


establishing operation (EO) in whose presence escape is 
reinforcing, (b) a discriminative stimulus (SP) in whose 
presence a response is more likely to be reinforced, (c) the 
response that produces reinforcement, and (d) termination 
of the event that served as the EO. 


. Positive and negative reinforcement are similar in that 


both lead to an increase in responding; they differ in 
that positive reinforcement involves contingent stimulus 
presentation, whereas negative reinforcement involves 
contingent stimulus termination. 


. Negative reinforcement and punishment differ in that 


(a) negative reinforcement involves contingent stimulus 
termination, whereas punishment involves contingent 
stimulation, and (b) negative reinforcement leads to an 
increase in responding, whereas punishment leads to a 
decrease in responding. 


Escape and Avoidance Contingencies 


5, An escape contingency is one in which responding termi- 


nates an ongoing stimulus. An avoidance contingency is 
one in which responding delays or prevents the presenta- 
tion of a stimulus. 


. In discriminated avoidance, responding in the presence 


of a signal prevents stimulus presentation; in free-operant 
avoidance, responding at any time prevents stimulus 
presentation. 


Characteristics of Negative Reinforcement 


7. Any response that successfully terminates aversive stim- 


10. 


ulation will be strengthened; as a result, a wide range of 
behaviors may be acquired and maintained by negative 
reinforcement. 


. Negative reinforcement may play an important role in the 


development of academic skills, but 1t can also account for 
the development of disruptive or dangerous behavior. 


. Unconditioned negative reinforcers are stimuli whose 


removal strengthens behavior in the absence of prior 
learning. Conditioned negative reinforcers are stimuli 
whose removal strengthens behavior as a result of previous 
pairing with other negative reinforcers. 


Social negative reinforcement involves stimulus termi- 
nation through the action of another person. Automatic 
negative reinforcement involves stimulus termination as a 
direct result of a response. 


11. 


12. 
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Identification of negative reinforcers requires the speci- 
fication of the stimulus conditions in effect prior to and 
following responding. 


In general, negative reinforcement for a given response 
will be more effective when (a) the stimulus change 
immediately follows the occurrence of the target response, 
(b) the magnitude of reinforcement is large, (c) the target 
response consistently produces escape from or postpone- 
ment of the EO, and (d) reinforcement is unavailable for 
competing responses. 


Applications of Negative Reinforcement 


13. 


14. 


15. 


16. 
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19. 


Although negative reinforcement is a fundamental princi- 
ple of learning that has been studied extensively in basic 
research, applied behavior analysis has heavily empha- 
sized the use of positive reinforcement over negative 
reinforcement. 


Applied researchers have explored the therapeutic uses 
of negative reinforcement in treating pediatric feeding 
problems. 


Improvements in student performance as a result of error 
correction that involves repeating a learning trial, having 
the student practice correct performance, or giving the 
student additional work may be a function of negative 
reinforcement. 


The presentation of task demands during instruction may 
function as an EO for escape; initial forms of escape may 
include lack of attention or mild forms of disruption. To 
the extent that positive reinforcement for compliance is 
less than optimal, escape behaviors may persist and may 
even escalate. 


Extinction of behavior maintained by negative reinforce- 
ment involves not allowing a response to avoid or termi- 
nate its programmed consequence. 


One strategy for treating problem behaviors maintained 
by negative reinforcement is to strengthen a more socially 
appropriate replacement behavior through negative 
reinforcement. 


Results of some research have shown that interventions 
based on positive reinforcement also can be effective as 
treatment for problem behavior maintained by negative 
reinforcement. 


Changes in Teacher and Caregiver Responding as a 
Function of Negative Reinforcement 


20. 


Another dimension of negative reinforcement involves 
effects on caregiver behavior. Student problem behavior 
serves as an EO teacher escape and avoidance, and teacher 
responses to problem behavior are maintained by negative 
reinforcement. 
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Ethical Considerations in Using Negative 
Reinforcement 


21. Ethical concerns about the use of positive and negative 


reinforcement are similar and arise from the severity of the 
antecedent event (EO) that occasions behavior. Most EOs 


for behavior maintained by negative reinforcement can 
be viewed as aversive events. Extremely noxious events, 


‘smn SS REE CSSD DE SOT EO SOI TES SESE ELROD 


KEY TERMS 


avoidance contingency 
conditioned negative reinforcer 


discriminated avoidance 


seems iene 


MULTIPLE-CHOICE QUESTIONS 


escape contingency 


free-operant avoidance 


1. Negative reinforcement can be defined as: Stimulus 


, contingent upon a response, which 
the future probability of that response. 


a. Presented, increases 
b. Presented, decreases 
c. Removed, increases 
d. Removed, decreases 


Hint: (See “Definition of Negative Reinforcement”) 


. Which of the following is an example of a negative rein- 
forcement contingency? 


a. Jo is sitting near a window at Starbucks and having a 
cup of coffee. The sun is streaming in the window, and 
it is too warm for Jo—she is beginning to perspire. Jo 
moves to another chair away from the window, where 
it is shady. The next time Jo goes to Starbucks, she 
sees the sun shining in the window again and sits in the 
chair in the shade instead. 


b. Frank often goes to Starbucks for coffee as well. After 
receiving his coffee, he sits down in his seat. It is 9:00 
am, and Starbucks is very busy. Most of the tables are 
full. As a result, a smart woman approaches Frank and 
asks if she can share his table. Frank agrees, and the 
woman sits down and has coffee with Frank. Frank 
decides he will come to Starbucks at 9:00 am for coffee 
more often in the future. 


c. Cheryl loves coffee. She orders a coffee at Starbucks 
but burns her mouth on it because it is too hot. After a 
couple of occurrences of this, Cheryl stops coming to 
Starbucks for coffee. 


= 


Sam was walking to Starbucks for coffee. On his way, 
he lost his wallet to a pickpocket. Sam no longer walks 
to that Starbucks for coffee. 


Hint: (See “Definition of Negative Reinforcement’) 


22. 


3h 


4. 


Be 


when presented as antecedent stimuli, cannot be justified 
as part of a typical behavior change program. 


Another concern with negative reinforcement is that 
the presence of aversive stimuli can itself generate 
behaviors that compete with the acquisition of desired 
behavior. 


SCREENS 


negative reinforcement 


unconditioned negative reinforcer 


ee a 


Mary Jo decides to implement a negative reinforcement 
intervention with one of her students, Anjali, to increase 
the amount of work Anjali completes. Mary Jo tells Anjali, 
“If you complete 15 math problems today (Monday), you 
don’t have to do your math worksheet on Friday.”’ What is 
one problem that might arise with this intervention? 


a. Reinforcement is available for a competing behavior. 


b. There is little difference in the stimulus conditions 
before the response (completing work) occurs and after 
the response occurs. 


c. The stimulus change following the occurrence of target 
behavior is not immediate. 


d. Mary Jo is being inconsistent in her delivery of 
reinforcement. 


Hint: (See “Determinants of Negative Reinforcement 
Effects”) 


Unlike assessments for identifying positive reinforcers, 
assessments for negative reinforcers must place equal 
emphasis on as well as the 
consequence events for target behavior. 


a. The EO or antecedent event 

b. The future occurrence of the target behavior 
c. The stimulus 

d. Punishment 


Hint: (See “Identifying the Context of Negative 
Reinforcement’’) 


The key difference between an escape contingency and an 
avoidance contingency is: 


a. In an escape contingency, the future probability of the 
target behavior increases, while the future probability 
of the target behavior decreases with an avoidance 
contingency. 


b. In an escape contingency, the EO is present prior to 
the occurrence of the target behavior, while in an 


avoidance contingency, the EO is not present prior to 
the occurrence of the target behavior. 


c. In an escape contingency, the EO is not present prior 
to the occurrence of the target behavior, while in an 
avoidance contingency, the EO is present prior to the 
occurrence of the target behavior. 

d. In an escape contingency, there is a warning stimulus 
that signals an aversive stimulus is imminent, while 
in an avoidance contingency, there is no such warning 
stimulus. 


Hint: (See “Escape and Avoidance Contingencies’) 


. Which of the following is an example of free-operant 

avoidance? 

a. Jackie crosses the street when she sees Donna come 
around the corner on the next block so that she doesn’t 
have to talk to her. 


b. Lavonda puts on a bicycle helmet when she rides her 
bike so that she doesn’t hurt her head if she falls off 
her bike. 


c. Nathaniel puts down the hurricane shutters on his 
Florida home when he hears that a tropical storm is 
approaching so that his windows do not break. 


d. All of these 

Hint: (See “Escape and Avoidance Contingencies”) 

. Which of the following behaviors could be maintained by 
negative reinforcement? 

a. Completing school work. 

b. Cleaning a bedroom 

c. Tantrums 

d. All of these 

Hint: (See “Characteristics of Negative Reinforcement’) 


. An unconditioned negative reinforcer: 
a. Is one that began as a neutral stimulus 


b. Acquired its aversive qualities by being paired with 
another aversive stimulus 


c. Is one that strengthens behavior in the absence of prior 
learning 
d. Can be thought of as an inherited negative reinforcer 


e. Third and fourth choices 
Hint: (See “Characteristics of Negative Reinforcement”) 


_ The textbook describes a study by Ahearn and colleagues 
(1996), in which negative reinforcement was used to 
increase food acceptance in children. In this example, 
during baseline, bite acceptances produced access to toys 
and bite refusals produced removal of the spoon (negative 
reinforcement). During the intervention, bite refusals no 
longer produced removal of the spoon. Instead, the spoon 
was only removed if a bite was accepted. As soon as a 
bite was accepted and every time a bite was accepted, the 
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spoon was briefly removed. Which factors that are import- 
ant to consider for effectively changing behavior with neg- 
ative reinforcement are illustrated in this example? 


a. The stimulus change following the occurrence of the 


target behavior was immediate. 


b. The difference in stimulation prior to and after the 
response occurred was large. 


c. The occurrence of the target response consistently 


produced escape. 


d. Reinforcement was unavailable for competing 
responses. 


e. All of these 
f. None of these 


Hint: (See “Factors That Influence the Effectiveness of 
Negative Reinforcement’) 


. The study by Rodgers and Iwata (1991) that analyzed the 


effects of positive reinforcement, error correction proce- 
dures, and an avoidance procedure demonstrated that: 


a. Error correction procedures may produce learning, at 
least in part, due to an avoidance contingency. 


b. Positive reinforcement was the most effective proce- 
dure for producing correct performance. 


c. Error correction procedures and avoidance procedures 
produced the most correct responding. 


d. All of these 
e. None of these 


Hint: (See “Applications of Negative Reinforcement’) 


. Ethical concerns about the use of negative reinforcement 


stem from: 
a. Having to deprive the individual of positive reinforce- 
ment for an extended period of time 


b. The presence of antecedent aversive stimuli in the indi- 
vidual’s environment 


ie) 
° 


The potential for creating a context that generates unde- 
sired behaviors 


d. The presentation of aversive stimuli contingent upon a 
target behavior displayed by an individual 


e. Second and third choices 


Hint: (See “Ethical Considerations in Using Negative 
Reinforcement”) 


. The potential negative side effects (e.g., crying, running 


away) of negative reinforcement are similar to the side 
effects associated with: 


a. Positive Reinforcement 
b. Punishment 

c. Stimulus Control 

d. All of these 

e. None of these 


Hint: (See “Ethical Considerations in Using Negative 
Reinforcement”) 


336 Part4 ¢ Reinforcement 


ESSAY-TYPE QUESTIONS 


IIe 


Write the definition of negative reinforcement and provide 
an example of negative reinforcement that has not been 
discussed in class or presented in the textbook. When 
giving your example, write it in the form of the diagrams 
found in the textbook, and be sure to include all four com- 
ponents of the diagram. 


Hint: (See “Definition of Negative Reinforcement”) 


. What are some stimuli that might serve as negative 


reinforcers? Identify at least one negative reinforcer 
that is unconditioned and one negative reinforcer that is 
conditioned. Then, select one and show how that neg- 
ative reinforcer could be used to teach a behavior (the 
behavior can either be socially appropriate or socially 
inappropriate). Use the diagramming notation pre- 
sented in class to illustrate the negative reinforcement 
contingency. 


Hint: (See “Characteristics of Negative Reinforcement’) 


. What is the key difference between an escape contin- 


gency and an avoidance contingency? 


Hint: (See “Escape and Avoidance Contingencies’’) 


4. Explain the difference between social negative reinforce- 
ment and automatic negative reinforcement. 


Hint: (See “Characteristics of Negative Reinforcement’) 


. What are four things you can do to increase the effectiveness 


of negative reinforcement? 
Hint: (See “Determinants of Negative Reinforcement Effects”’) 


. You are working with a child with autism, who engages in 


problem behavior to escape completing daily living tasks, 
such as washing his hands before lunch. You would like 

to teach this student to politely say, “Don’t want to wash 
hands,” rather than to throw tantrums when told to wash 
his hands. You set up an instructional program where you 
will make several hand-washing requests throughout the 
day in order to provide the child with opportunities to 
practice saying “Don’t want to wash hands.” Each time the 
child makes this request, you allow him to escape washing 
his hands. What are the ethical issues that might arise from 
such a plan and how can you plan for and/or decrease 
these negative effects? 


Hint: (See “Ethical Issues in Using Negative Reinforcement’’) 


NOTES 


tl) 


The term aversive is not meant to describe an inherent characteristic of a 
stimulus but, rather, a stimulus whose presentation functions as punishment 
or whose removal functions as negative reinforcement. 
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Schedules of Reinforcement 


LEARNING OBJECTIVES 


Define schedule of reinforcement. 

Define continuous reinforcement. 

Define intermittent reinforcement. 

Be able to identify the importance of naturally occurring reinforcement. 
Identify the two main intermittent schedules of reinforcement. 
Compare and contrast fixed and variable ratio schedules. 

Explain the fixed ratio schedule consistency of performance. 

Explain the phenomenon of a post-reinforcement pause. 


e Explain and define a variable ratio. 


e Explain and describe fixed interval schedules of reinforcement. 

e Explain and define variable interval schedules of reinforcement. 

e Be able to define and explain the variables associated with “ratio strain.” 
e Identify four variations of basic intermittent schedules of reinforcement. 


schedule of reinforcement is a rule that describes a con- 

tingency of reinforcement, the environmental arrange- 

ments that determine conditions by which behaviors 
will produce reinforcement. Continuous reinforcement and 
extinction provide the boundaries for all other schedules of 
reinforcement. A schedule of continuous reinforcement (CRF) 
provides reinforcement for each occurrence of behavior. For 
example, a teacher using a continuous schedule of reinforce- 
ment would praise each occurrence of a correctly identified sight 
word emitted by a student. Examples of behaviors that tend to 
produce continuous reinforcement include turning on a water 
faucet (water comes out), turning on a light switch in a dark 
room (a light comes on), and putting money into a vending 
machine (a product is obtained). During extinction (EXT), 
no occurrence of the behavior produces reinforcement. (For a 
detailed description of extinction, see Chapter 24.) 


INTERMITTENT REINFORCEMENT 


Between continuous reinforcement and extinction, many 
intermittent schedules of reinforcement (INT) are possible in 
which some, but not all, occurrences of the behavior are reinforced. 
Only selected occurrences of behavior produce reinforcement 
with an intermittent schedule of reinforcement. CRF is used to 
strengthen behavior, primarily during the initial stages of learning 
new behaviors. INT is used to maintain an established behavior, 
especially during maintenance stages of learning. 


Maintenance of Behavior 


Maintenance of behavior refers to a lasting change in behavior. 
Regardless of the type of behavior change technique employed 
or the degree of success during treatment, applied behavior ana- 
lysts must be concerned with sustaining gains after terminating 
a treatment program. For example, Zofia is in the seventh grade 
and taking French, her first foreign language class. After a few 
weeks, the teacher informs Zofia’s parents that she is failing the 
course. The teacher believes that Zofia’s problems in French 
have resulted from lack of daily language practice and study. 
The parents and teacher decide that Zofia will record a tally on 
a chart kept on the family bulletin board each evening that she 
studies French for 30 minutes. Zofia’s parents praise her practice 
and study accomplishments and offer encouragement. During 
a follow-up meeting 3 weeks later, the parents and teacher 
decide that Zofia has done so well that the tally procedure can 
be stopped. Unfortunately, a few days later Zofia is once again 
falling behind in French. 

A successful program was developed to establish daily 
French language practice. However, gains were not main- 
tained after removing the tally procedure. The parents and the 
teacher did not establish intermittent reinforcement procedures. 
Let us review what happened and what could have happened. 
Continuous reinforcement was used correctly to develop daily 
study behavior. However, after the study behavior was estab- 
lished and the tally procedure removed, the parents should 
have continued to praise and encourage daily practice and 
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gradually offer fewer encouragements. The parents could have 
praised Zofia’s accomplishments after every second day of daily 
practice, then every fourth day, then once per week, and so on. 
With the intermittent praise, Zofia might have continued daily 
practice after removing the tally procedure. 


Progression to Naturally Occurring 
Reinforcement 


A major goal of most behavior change programs is the devel- 
opment of naturally occurring activities, stimuli, or events to 
function as reinforcement. It is more desirable for people to 
read because they like to read, rather than to obtain contrived 
reinforcement from a teacher or parent; to engage in athletics 
for the enjoyment of the activity, rather than for a grade or 
because of a physician’s directive; to help around the house for 
the personal satisfaction it brings, rather than to earn an allow- 
ance. Intermittent reinforcement is usually necessary for the 
progression to naturally occurring reinforcement. Even though 
some individuals spend hours each day practicing a musical 
instrument because they enjoy the activity, chances are good 
that this persistent behavior developed gradually. At first, the 
beginning music student needs a great deal of reinforcement 
to continue the activity: “You really practiced well today,” 
“IT can’t believe how well you played,” “Your mother told me 
you received a first place in the contest—that’s super!” These 
social consequences are paired with other consequences from 
teachers, family members, and peers. As the student develops 
more proficiency in music, the outside consequences occur less 
frequently, intermittently. Eventually, the student spends long 
periods making music without receiving reinforcement from 
others because making music has itself become a reinforcer for 
doing that activity. 

Some might explain the transition of our music student 
from an “externally reinforced person” to a “self-reinforced 
musician” as the development of intrinsic motivation, which 
seems to imply that something inside the person is responsi- 
ble for maintaining the behavior. This view is incorrect from a 
behavioral standpoint. Applied behavior analysts describe intrin- 
sic motivation as reinforcement that is received by manipulating 
the physical environment. Some individuals ride bicycles, go 
backpacking, read, write, or help others because manipulations 
of the environment provide reinforcement for engaging in those 
activities. 


DEFINING BASIC INTERMITTENT SCHEDULES 
OF REINFORCEMENT 


Ratio and Interval Schedules 


Applied behavior analysts directly or indirectly embed ratio 
and interval intermittent schedules of reinforcement in most 
treatment programs, especially ratio schedules (Lattal & Neef, 
1996). Ratio schedules require a number of responses before one 
response produces reinforcement. If the ratio requirement for a 
behavior is 10 correct responses, only the 10th correct response 
produces reinforcement. Interval schedules require an elapse of 


time before a response produces reinforcement. If the interval 
requirement is 5 minutes, reinforcement is contingent on the first 
correct response that occurs after 5 minutes has elapsed. There- 
after, reinforcement is contingent on the first correct response 
after 5 minutes has elapsed since the last reinforced response. 

Ratio schedules require a number of responses to be emit- 
ted for reinforcement; an elapse of time does not change the 
number contingency. The participant’s response rate determines 
the rate of reinforcement. The more quickly the person com- 
pletes the ratio requirement, the sooner reinforcement will occur. 
Conversely, interval schedules require an elapse of time before 
a single response produces reinforcement. The total number of 
responses emitted on an interval schedule is irrelevant to when 
and how often the reinforcer will be delivered. Emitting a high 
rate of response during an interval schedule does not increase the 
rate of reinforcement. Reinforcement is contingent only on the 
occurrence of one response after the required time has elapsed. 
The availability of reinforcement is time-controlled with interval 
schedules, and rate of reinforcement is “self-controlled” with 
ratio schedules, meaning that the more quickly the individual 
completes the ratio requirement, the sooner reinforcement will 
occur. 


Fixed and Variable Schedules 


Applied behavior analysts can arrange ratio and interval sched- 
ules to deliver reinforcement as a fixed or a variable contingency. 
With a fixed schedule, the response ratio or the time requirement 
remains constant. With a variable schedule, the response ratio or 
the time requirement can change from one reinforced response 
to another. The combinations of ratio or interval and fixed or 
variable contingencies define the four basic schedules of inter- 
mittent reinforcement: fixed ratio, variable ratio, fixed interval, 
and variable interval. 


SCHEDULE EFFECTS AND CONSISTENCY 
OF PERFORMANCE 


The following sections define the four basic schedules of inter- 
mittent reinforcement, provide examples of each schedule, and 
present some well-established schedule effects derived from 
basic research. 

Early experimental and applied behavior analysts (e.g., 
1950-1970) reported similar consistencies of schedule effects 
among nonhuman animals and humans. Many behavior ana- 
lysts today question the generality of the schedule effects 
presented in the next sections. This issue developed as an 
outcome of advancing research with human subjects, and 
the acceptance that humans—not nonhuman animals—have 
language. Verbal behavior allows humans to make and use 
rules that can influence their responses to schedules of rein- 
forcement (i.e., rule-governed behavior). Jack Michael (1987) 
provides an example of a verbal rule controlling a schedule 
contingency. 


Humans differ behaviorally from nonhumans in one over- 
whelmingly significant way; humans have an extensive 
language repertoire which influences most aspects of their 


behavior. With the exception of very young children and 
the profoundly retarded the human subjects used in studies 
of the effects of complex contingencies have all had this 
language repertoire, and one would certainly expect it to 
play a role in determining the experimental results. One of 
my first attempts to study human behavior in a situation 
somewhat analogous to a nonhuman operant conditioning 
experiment involved a simple discrimination schedule. The 
human subject pressed a telegraph key and received—on 

a variable interval schedule—nickels as reinforcement. 
The VI schedule, however, was alternated with a period 
of extinction, and a light on the response panel was on 
when the VI schedule was in effect, and off when it was 
not. Only minimal instructions were given, and they did 
not mention the role of the light. My first subject pressed 
the telegraph key at a moderately high rate throughout the 
30-minute session, and his rate during the light-off condi- 
tion was no different than during the light-on condition. 
When he finished his session I asked him to tell me as 
much as he could about what he was doing. He stated that 
he received nickels for pressing the telegraph key—not for 
every press, but once in a while, and never when the light 
was off. I asked him why he continued to press when the 
light was off. He said he thought that maybe I was trying 
to see if he was a persistent type of person, one who would 
continue trying even though success seemed unlikely, and 
he wanted to show me that he was exactly that type of 
person. 

His explanation was quite reasonable, at least in its 
suggestion of the possible role of his social history and lan- 
guage repertoire, but I certainly did not interpret this event 
as a challenge to the generality of the effect of discrimina- 
tion training on behavior. (pp. 38-39) 


As you study the schedule effects sections that follow, recog- 
nize that participants’ performance on schedules of reinforcement 
arranged in applied settings may differ considerably from that 
predicted by basic laboratory research as rule-governed behavior. 
Always remember, however, Michael’s concluding comment 
on generality in the above quote. (See also “Perspectives on 
Using Schedules of Reinforcement in Applied Settings: Applied 
Research with Intermittent Schedules” in this chapter.) 


Fixed Ratio Defined 


A fixed ratio (FR) schedule of reinforcement requires the 
completion of a fixed number of responses for a reinforcer. For 
example, every fourth correct (or target) response produces rein- 
forcement on an FR 4 schedule. An FR 15 schedule requires 
15 responses to produce reinforcement. Skinner (1938) concep- 
tualized each ratio requirement as a response unit. Accordingly, 
the response unit produces the reinforcer, not just the last 
response of the ratio. 

Some business and industrial tasks are paid on an FR 
schedule (e.g., piecework). A worker might receive a pay credit 
after completing a specified number of tasks (e.g., assembling 
15 pieces of equipment or picking a box of oranges). A stu- 
dent might receive either a happy face after learning 5 new sight 
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words or a certain number of points after completing 10 math 
problems. 

Jeffries, Crosland, and Miltenberger (2016) assessed 
the effectiveness of the tablet application Look in My Eyes 
Steam Train and differential reinforcement. During the 
tablet condition, a child looked into eyes on the iPad to see 
a number, and then picked a matching number on a grid. If 
the numbers matched, ‘“‘a piece of coal appeared on the screen 
and a praise statement was provided by the [iPad] narrator.” 
The interactive Steam Train game appeared on the tablet screen 
for | minute after the child received four pieces of coal. This 
example combined CRF and FR 4 schedules of reinforcement. 
Each correct number match produced a piece of coal and praise. 
Following four correct matches, the child received | minute of 
screen time to play the Steam Train game. (see also “Tandem 
Schedules” in this chapter under “Compound Schedules of 
Reinforcement’). 


Fixed Ratio Schedule Effects 


Consistency of Performance 


FR schedules produce a typical pattern of responding: (a) After 
the first response of the ratio requirement, the participant com- 
pletes the required responses with little hesitation between 
responses; and (b) a postreinforcement pause follows reinforce- 
ment (i.e., the participant does not respond for a period following 
reinforcement). The size of the ratio and the reinforcement mag- 
nitude can influence the duration of the postreinforcement pause 
(Powell, 1969; Schlinger, Derenne, & Baron, 2008): Large ratio 
requirements may produce long pauses; small ratios produce 
short pauses. Lower reinforcement magnitudes—an inverse 
relation to the large ratio—may produce long pauses “indicating 
that magnitude and ratio size interact to determine the extent of 
pausing” (Schlinger et al., 2008, p. 43). Following the pause, the 
participant again completes the fixed ratio with little hesitation 
between responses. Analysts often identify this characteristic of 
FR responding as a break-and-run pattern. 


Rate of Response 


FR schedules often produce high rates of response. Quick 
responding on FR schedules maximizes the delivery of reinforce- 
ment because the higher the rate of response, the greater the rate of 
reinforcement. People work rapidly with a fixed ratio because they 
receive reinforcement upon completion of the ratio requirements. 

The size of the ratio can influence the rate of response on 
FR schedules. To a degree, the larger the ratio requirement, the 
higher the rate of response. A teacher could reinforce every third 
correct answer to arithmetic facts. With this ratio requirement, 
the student might complete 12 problems within the specified 
time, producing reinforcement four times. The student might 
complete more problems in less time if the teacher arranged 
reinforcement contingent on 12 correct answers rather than 3. 
The higher ratio is likely to produce a higher rate of response. 
The rate of response decreases, however, if the ratio require- 
ments are too large. The maximum ratio is determined in part 
by the participant’s past FR history of reinforcement, motivating 
operations, the quality and magnitude of the reinforcer, and the 
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procedures that change the ratio requirements. For example, if 
ratio requirements are raised gradually over an extended period, 
extremely high ratio requirements can be reached. 

Figure 13.1 depicts and summarizes the schedule effects 
typically produced by FR schedules of reinforcement. 


Variable Ratio Defined 


A variable ratio (VR) schedule of reinforcement requires 
the completion of a variable number of responses to produce 
a reinforcer. A number representing the average (e.g., mean) 
number of responses required for reinforcement identifies 
the VR schedule. For example, with a VR 10 schedule every 
10th correct response on the average produces reinforcement. 
Reinforcement can come after 1 response, 14 responses, 
5 responses, 19 responses, or n responses, but the average 
number of responses required for reinforcement is 10 (e.g., 
T+ 1445+ 19 + 11 = 50; 50/5 = 10). 

The operation of a slot machine, the one-armed bandit, 
provides a good example of a VR schedule. These machines are 
programmed to pay off only a certain proportion of the times 
they are played. A player cannot predict when the next operation 
of the machine will pay off. The player might win 2 or 3 times in 
succession and then not win again for 20 or more plays. 

De Luca and Holborn (1992) examined the effects of 
a VR schedule on three obese and three nonobese children’s 
rate of pedaling an exercise bicycle. The children could use 
the exercise bicycle Monday to Friday during each week of 
the analysis, but received no encouragement to do so. The par- 
ticipants received the instruction to “exercise as long as you 
like” to initiate the baseline condition. De Luca and Holborn 
introduced the VR schedule of reinforcement after establishing 
a stable baseline rate of pedaling. They calculated the baseline 
mean number of pedal revolutions per minute and programmed 
the first VR contingency at approximately 15% faster pedaling 
than the baseline mean. The children received points on the VR 
schedule to exchange for backup reinforcers. De Luca and Hol- 
born increased the VR schedule in two additional increments 
by approximately 15% per increment. All participants had sys- 
tematic increases in their rate of pedaling with each VR value, 
meaning that the larger the variable ratio, the higher the rate of 


Figure 13.1 Summary of FR schedule effects during 
ongoing reinforcement. 


Definition: Reinforcement delivered 
contingent on emission of a specified 
number of responses. 


Schedule Effects: 

After reinforcement a 
postreinforcement pause 
occurs. After the pause the 
ratio requirement is completed 
with a high rate of response 
and very little hesitation 
between responses. The size 
of the ratio influences 

both the pause and the rate. 


response. De Luca and Holborn reported that the VR schedule 
produced higher rates of response than did the FR schedule in 
their previous study (De Luca & Holborn, 1990). Figure 13.2 
presents the participants’ performances under baseline and VR 
(i.e., VR ranges of 70 to 85, 90 to 115, 100 to 130) conditions. 

Student behaviors usually produce reinforcement follow- 
ing the completion of variable ratios. Normally a student cannot 
predict when the teacher will call on him to give an answer 
and thereby receive reinforcement. Good grades, awards, and 
promotions all may come after an unpredictable number of 
responses. And in checking seatwork, the teacher might rein- 
force a student’s work after the completion of 10 tasks, another 
student’s work after 3 tasks, and so on. 


Variable Ratio Schedule Effects 


Consistency of Performance 


VR schedules produce consistent, steady rates of response. They 
typically do not produce a postreinforcement pause, as do FR 
schedules. Perhaps the absence of pauses in responding is due 
to the absence of information about when the next response will 
produce reinforcement. Responding remains steady because the 
next response may produce reinforcement. 


Rate of Response 


Like the FR schedule, the VR schedule tends to produce a quick 
rate of response. Also similar to the FR schedule, the size of the 
ratio influences the rate of response. To a degree, the larger the 
ratio requirement, the higher the rate of response. Again like FR 
schedules, when variable ratio requirements are thinned gradually 
over an extended period, participants will respond to extremely 
high ratio requirements. Figure 13.3 summarizes the schedule 
effects typically produced by VR schedules of reinforcement. 


Variable Ratio Schedules in Applied Settings 


Basic researchers use computers to select and program VR sched- 
ules of reinforcement. In applied settings, VR schedules are 
seldom implemented with a planned and systematic approach. 
In other words, the reinforcer is delivered by chance, hit or miss 
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in most interventions. This nonsystematic delivery of reinforce- 
ment is not an effective use of VR schedules. Teachers can select 
and preplan VR schedules that approximate the VR schedules 
used in basic research. For example, teachers can plan variable 
ratios by (a) selecting a maximum ratio for a given activity (e.g., 
15 responses) and (b) using a table of random numbers to pro- 
duce the specific variable ratios for the schedule of reinforcement. 
A table of random numbers might generate the following sequence 
of ratios: 8, 1, 1, 14, 3, 10, 14, 15, and 6, producing a VR 8 schedule 
of reinforcement (on the average each eighth response produces 
the reinforcer) with the ratios ranging from | to 15 responses. 
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Figure 13.2 Mean revolutions per minute during baseline, 
VR 1 (VR range, 70 to 85), VR 2 (VR range, 90 to iS)), 

and VR 3 (VR range, 100 to 130), return to baseline, and 
return to VR 3 phases for obese and nonobese subjects. 
“Effects of a Variable-Ratio Reinforcement Schedule with Changing 
Criteria on Exercise in Obese and Nonobese Boys”, R. V. De Luca and 
S. W. Holborn, 1992, Reproduced with permission of John Wiley and 
Sons Inc. 


Teachers can apply the following VR procedures as indi- 


vidual or group contingencies of reinforcement for academic or 
social behavior: 


Tic-Tac-Toe VR Procedure 


1. The teacher establishes a maximum number for the indi- 
vidual student or group. The larger the maximum number 
selected, the greater the odds against meeting the contin- 
gency. For example, | chance out of 100 has less chance 
of being selected than | chance out of 20. 


2. The teacher gives the individual or group a tic-tac-toe grid. 


342 Part 4 ¢ Reinforcement 


Definition: Reinforcer is 
delivered after the emission 
of a variable number of 
responses. 
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the size of the ratio 
requirements. 


Figure 13.3 Summary of VR schedule effects during 
ongoing reinforcement. 


3. Students fill in each square of the grid with a number no 
greater than the maximum number. For example, if the max- 
imum number is 30, the score sheet might look like this: 


4. The teacher fills a box or some other type of container with 
numbered slips of paper (with numbers no higher than the 
maximum number). Each number should be included sev- 
eral times—for example, five Is, five 2s, five 3s. 


5. Contingent on the occurrence of the target behavior, 
students withdraw one slip of paper from the box. If the 
number on the paper corresponds to a number on the 
tic-tac-toe sheet, the students mark out that number on 
the grid. 

6. The reinforcer is delivered when students have marked 
out three numbers in a row—horizontally, vertically, or 
diagonally. 


For example, a student might withdraw one slip of paper 
for each homework assignment completed. Selecting an activity 
from the class job board (e.g., teacher’s helper, collecting milk 
money, running the projector) could serve as the consequence 
for marking out three numbers in a row. 


Classroom Lottery VR Procedure 


1. Students write their names on index cards after success- 
fully completing assigned tasks. 


2. Students put signature cards into a box located on the 
teacher’s desk. 


3. After an established interval of time (e.g., | week), the 
teacher draws a signature card from the box and declares 
that student the winner. The lottery can have first, second, 
and third place, or any number of winners. The more cards 
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students earn, the greater is the chance that one of their 
cards will be picked. 


Teachers have used classroom lotteries with a variety of 
student accomplishments, such as reading non-assigned books. 
For example, for each book read, students write their names and 
the title of the book they have read on a card. Every 2 weeks the 
teacher picks one card from the box and gives the winning stu- 
dent a new book. To make the book an especially desirable con- 
sequence, the teacher lets students earn the privilege of returning 
the book to the school, inscribed with the student’s name, class, 
and date (e.g., Brian Lee, fifth grade, donated this book to the 
High Street Elementary School Library on May 8, 2019). 


Desk Calendar VR Procedure 


1. Students receive desk calendars with loose-leaf date pages 
secured to the calendar base. 


2. The teacher removes the loose-leaf date pages from the 
calendar base. 


3. The teacher establishes a maximum ratio for the students. 


4. The teacher numbers index cards consecutively from 1 to 
the maximum ratio. Multiple cards are included for each 
number (e.g., five 1s, five 2s). If a large average ratio is 
desired, the teacher includes more large numbers; for 
small average ratios, the teacher uses smaller numbers. 


5. The teacher uses a paper punch to punch holes in the index 
cards for attaching the cards to the calendar base. 


6. The teacher or student shuffles the index cards to qua- 
si-randomize the order and attaches the index cards to a 
calendar base face down. 


7. Students produce their own VR schedules by turning over 
one index card at a time. After meeting that ratio require- 
ment, students flip the second card to produce the next 
ratio, and so on. 


Students can use the desk calendar base to program VR 
schedules for most curriculum areas (e.g., arithmetic facts). For 
example, after receiving an arithmetic worksheet, the student 
flips the first card. It has a 5 written on it. After completing 


five problems, she holds up her hand to signal her teacher that 
she has completed the ratio requirement. The teacher checks the 
student’s answers, provides feedback, and presents the conse- 
quence for correct problems. The student flips the second card; 
the ratio requirement is 1. After completing that single problem, 
she receives another consequence and flips the third card. This 
time the ratio is 14. The cycle continues until all of the cards in 
the stack are used. New cards can then be added or old cards 
reshuffled to create a new sequence of numbers. The average of 
the numbers does not change in the reshuff ling. (See Kauffman, 
Cullinan, Scranton, & Wallace, 1972, for a description of using 
a deck of playing cards, mounted on a refillable calendar base, 
to create FR and VR schedules of reinforcement that function 
in most environments.) 


Fixed Interval Schedules 


A fixed interval (FI) schedule of reinforcement provides rein- 
forcement for the first response following a fixed duration of 
time. With an FI 3-min schedule, the first response following 
the elapse of 3 minutes produces the reinforcer. A common 
procedural misunderstanding with the FI schedule is to assume 
that the elapse of time alone is sufficient for the delivery of a 
reinforcer, assuming that the reinforcer is delivered at the end 
of each fixed interval of time. However, more time than the 
fixed interval can elapse between reinforced responses. The 
reinforcer is available after the fixed time interval has elapsed, 
and it remains available until the first response. When the first 
response occurs sometime after the elapse of a fixed inter- 
val, that response is immediately reinforced, and the timing 
of another fixed interval is usually started with the delivery 
of the reinforcer. This FI cycle is repeated until the end of the 
session. 

Examples of FI schedules in everyday life are difficult to 
find. However, some situations do approximate and in reality 
function as FI schedules. For example, mail is often delivered 
close to a fixed time each day. An individual can make many 
trips to the mailbox to look for mail, but only the first trip to 
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the mailbox following the mail delivery will produce reinforce- 
ment. Many textbook examples of FI schedules, such as the 
mail example, do not meet the definition of an FI schedule; but 
the examples do appear similar to an FI schedule. For exam- 
ple, receiving a paycheck as wages for work by the hour, day, 
week, or month is contingent on the first response on payday 
that produces the paycheck. Of course, earning the paycheck 
requires many responses during the interval that eventually 
lead to receiving the paycheck. In a true FI schedule, responses 
during the interval do not influence reinforcement. 

FI schedules are relatively easy to use in applied settings. 
A teacher could make reinforcement available on an FI 2-min 
schedule for correct answers on an arithmetic worksheet. The 
teacher or student could use an electronic timer with a count- 
down function to signal the elapse of the 2-minute interval. The 
student’s first correct answer following the interval produces 
reinforcement, and then the teacher resets the timer for another 
2-minute interval. Similarly, the teacher could use a timing app 
or MotivAider® (www.habitchange.com) that vibrates to signal 
the elapse of an interval. 


Fixed Interval Schedule Effects 


Consistency of Performance 


FI schedules typically produce a postreinforcement pause in 
responding during the early part of the interval. An initially slow 
but accelerating rate of response is evident toward the end of the 
interval, usually reaching a maximum rate just before delivery of 
the reinforcer. This gradually accelerating rate of response toward 
the end of the interval is called an FI scallop because of the rounded 
curves that are shown on a cumulative graph (see Figure 13.4). 
FI postreinforcement pause and scallop effects can be 
seen in many everyday situations. When college students are 
assigned a term paper, they typically do not rush to the library 
and start to work on the paper immediately. More often they wait 
a few days or weeks before starting to work. However, as the 
due date approaches, their work on the assignment increases in 
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Figure 13.4 Summary of Fl schedule effects during 
ongoing reinforcement. 
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an accelerating fashion, and many are typing the final draft just 
before class. Cramming for a midterm or final examination is 
another example of the FI scallop effect. 

These examples with the reinforcement pause and scallop 
effects appear to be produced by FI schedules of reinforcement. 
They are not, however, because like the paycheck example, col- 
lege students must complete many responses during the interval 
to produce the term paper or a good grade on the examinations, 
and the term paper and examinations have deadlines. With FI 
schedules, responses during the interval are irrelevant, and FI 
schedules have no deadlines for the response. 

Why does an FI schedule produce a characteristic pause 
and scallop effect? After adjustment to an FI schedule, partic- 
ipants learn (a) to discriminate the elapse of time and (b) that 
responses emitted right after a reinforced response are never 
reinforced. Therefore, extinction during the early part of the 
interval might account for the postreinforcement pause. The 
effects of FI and FR schedules of reinforcement are similar in 
that both schedules produce postreinforcement pauses. How- 
ever, it is important to recognize the different characteristics of 
behavior that emerge under each schedule. Responses under an 
FR schedule are emitted at a consistent rate until completing the 
ratio requirement, whereas responses under an FI schedule begin 
at a slow rate and accelerate toward the end of each interval. 


Rate of Responding 


Overall, FI schedules tend to produce a slow to moderate rate 
of response. The duration of the time interval influences the 
postreinforcement pause and the rate of response; to a degree, 
the larger the fixed interval requirement, the longer the postre- 
inforcement pause and the lower the overall rate of response. 


Variable Interval Schedules 


A variable interval (VI) schedule of reinforcement provides 
reinforcement for the first correct response following the elapse 
of variable durations of time. The distinguishing feature of VI 
schedules is that “the intervals between reinforcement vary in 
a random or nearly random order” (Ferster & Skinner, 1957, 
p. 326). Behavior analysts use the average (i.e., mean) inter- 
val of time before the opportunity for reinforcement to describe 


Definition: The first 
response following varying 


VI schedules. For example, in a VI 5-min schedule the aver- 
age duration of the time intervals between reinforcement and 
the opportunity for subsequent reinforcement is 5 minutes. The 
actual time intervals in a VI 5-min schedule might be 2 minutes, 
5 minutes, 3 minutes, 10 minutes, or n minutes (or seconds). 


Variable Interval Schedule Effects 


Consistency of Performance 


A VI schedule of reinforcement tends to produce a constant, 
stable rate of response. The slope of the VI schedule on a cumu- 
lative graph appears uniform with few pauses in responding (see 
Figure 13.5). A VI schedule typically produces few hesitations 
between responses. For example, pop quizzes at unpredictable 
times tend to occasion more consistent study behavior from 
students than do quizzes scheduled at fixed intervals of time. 
Furthermore, students are less apt to engage in competing off- 
task behaviors during instructional and study periods when a 
pop quiz is likely. The pop quiz is used often as an example 
of a VI schedule because the performance effect is similar to 
a VI performance. The pop quiz does not represent a true VI 
schedule, however, because of the required responses during the 
interval, and the deadline for receiving reinforcement. 


Rate of Responding 


VI schedules of reinforcement tend to produce low to moder- 
ate rates of response. Like the FI schedule, the average dura- 
tion of the time intervals on VI schedules influences the rate of 
response; to a degree, the larger the average interval, the lower 
the overall rate of response. Figure 13.5 summarizes the sched- 
ule effects typically produced by VI schedules during ongoing 
reinforcement. 


Variable Interval Schedules in Applied Settings 


Basic researchers use computers to select and program VI sched- 
ules of reinforcement, as they do with VR schedules. Teachers 
seldom apply VI schedules in a planned and systematic way. For 
example, a teacher might set an electronic countdown timer with 
varied intervals of time ranging from | minute to 10 minutes 
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Figure 13.5 Summary of VI schedule effects during 
ongoing reinforcement. 
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without any prior plan as to which intervals or which order will 
be used. This set-them-as-you-go selection of intervals approx- 
imates the basic requirements for a VI schedule; however, it is 
not the most effective way of delivering reinforcement on a VI 
schedule. A planned, systematic application of varied intervals 
of time should increase the effectiveness of a VI schedule. 

For example, applied behavior analysts can select the max- 
imum time interval, whether in seconds or minutes, that will 
maintain performance and still be appropriate for the situation. 
Preferably, applied behavior analysts will use data from a direct 
assessment to guide the selection of the maximum VI interval, or 
at the least clinical judgment based on direct observation. Analysts 
can use a table of random numbers to select the varied intervals 
between | and the maximum interval, and then identify the VI 
schedule by calculating an average value for the VI schedule. The 
VI schedule may need adjustments following the selection of time 
intervals. For example, if a larger average interval of time appears 
reasonable, the teacher can replace some of the smaller intervals 
with larger ones. Conversely, if the average appears too large, the 
teachers can replace some of the higher intervals with smaller ones. 


Interval Schedules with a Limited Hold 


When a limited hold is added to an interval schedule, reinforce- 
ment remains available for a finite time following the elapse of 
the FI or VI interval. The participant will miss the opportunity 
to receive reinforcement if a targeted response does not occur 
within the time limit. For example, on an FI 5-min schedule with 
a limited hold of 30 seconds, the first correct response following 
the elapse of 5 minutes is reinforced, but only if the response 
occurs within 30 seconds after the end of the 5-minute inter- 
val. If no response occurs within 30 seconds, the opportunity 
for reinforcement has been lost and a new interval begins. The 
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abbreviation LH identifies interval schedules using a limited 
hold (e.g., FI S-min LH 30-sec, VI 3-min LH 1-min). Limited 
holds with interval schedules typically do not change the overall 
response characteristics of FI and VI schedules beyond a possi- 
ble increase in rate of response. 

Martens, Lochner, and Kelly (1992) used a VI schedule 
of social reinforcement to increase academic engagement 
of two 8-year-old boys in a third-grade classroom. The 
classroom teacher reported that the boys had serious off-task 
behaviors. The experimenter wore an earphone connected 
to a microcassette recorder that played a fixed-time cueing 
tape. During baseline, the cueing tape was programmed for 
a VI 20-sec schedule of reinforcement with a 0-sec limited 
hold for the availability of reinforcement. If the boys were 
not academically engaged when the 20-second interval timed 
out, they lost that opportunity for reinforcement until the next 
cue. Following baseline, the experimenter delivered contingent 
praise on VI 2-min or VI 5-min schedules of reinforcement that 
alternated daily on a quasi-random basis. Both boys’ academic 
engagement on the VI 5-min schedule resembled their baseline 
engagement (see Figure 13.6). When the VI 2-min schedule 
was in effect, both students were academically engaged at 
consistently higher levels than they were during baseline or the 
VI 5-min condition. 

Figure 13.7 summarizes and compares the four basic 
intermittent schedules of reinforcement. 


THINNING INTERMITTENT REINFORCEMENT 


Applied behavior analysts often use one of two procedures for 
schedule thinning. First, they thin an existing schedule by grad- 
ually increasing the response ratio or the duration of the time 
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Figure 13.7 Four basic intermittent schedules of reinforcement compared. 
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interval. If a student has answered addition facts effectively and 
responded well to a CRF schedule for two or three sessions, the 
teacher might thin the reinforcement contingency slowly from 
one correct addition fact (CRF) to a VR 2 or VR 3 schedule. 
The student’s performance should guide the progression from 
a dense schedule (i.e., responses produce frequent reinforce- 
ment) to a thin schedule (i.e., responses produce less frequent 
reinforcement). Applied behavior analysts should use small 
increments of schedule changes during thinning and ongoing 
evaluation of the learner’s performance to adjust the thinning 
process and avoid the loss of previous improvements. 

Second, teachers often use instructions to clearly com- 
municate the schedule of reinforcement, facilitating a smooth 
transition during the thinning process. Instructions include 
rules, directions, and signs. Participants do not require an aware- 
ness of environmental contingencies for effective intermittent 


reinforcement, but instructions may enhance the effectiveness 
of interventions when participants are told what performances 
produce reinforcement. 

Ratio strain can result from abrupt increases in ratio 
requirements when moving from denser to thinner reinforce- 
ment schedules. Also, ratio strain will occur when the ratio 
becomes so large that the reinforcement cannot maintain the 
response level, or the response requirement exceeds the partici- 
pant’s physiological capabilities. Common behavioral character- 
istics associated with ratio strain include avoidance, aggression, 
and unpredictable pauses in responding. Applied behavior ana- 
lysts should reduce the ratio requirement when ratio strain is 
evident. The analyst can again gradually thin ratio requirements 
after recovering the behavior. Preemptively, small and gradual 


increases in ratio requirements help to avoid the development 
of ratio strain. 


VARIATIONS ON BASIC INTERMITTENT 
SCHEDULES OF REINFORCEMENT 


Schedules of Differential Reinforcement of 
Rates of Responding 


Applied behavior analysts frequently encounter behavior 
problems that result from the rate that people perform certain 
behaviors. Responding too infrequently, or too often, may be det- 
rimental to social interactions or academic learning. Differential 
reinforcement provides an intervention for behavior problems 
associated with rate of response. Differential reinforcement of 
particular rates of behavior is a variation of ratio schedules. 
Delivery of the reinforcer is contingent on responses occurring 
at a rate either higher than or lower than some predetermined 
criterion. The reinforcement of responses higher than a prede- 
termined criterion is called differential reinforcement of high 
rates (DRH). When responses are reinforced only when they 
are lower than the criterion, the schedule provides differential 
reinforcement of low rates (DRL). DRH schedules produce a 
higher rate of responding. DRL schedules produce a lower rate 
of responding. 

Applied behavior analysts use three definitions of DRH 
and DRL schedules. The first definition states that reinforce- 
ment is available only for responses that are separated by a given 
duration of time. This first definition is sometimes called spaced- 
responding DRH or spaced-responding DRL. An interresponse 
time (IRT) identifies the duration of time that occurs between 
two responses. IRT and rate of response are functionally related. 
Long IRTs produce low rates of responding; short IRTs pro- 
duce high rates of responding. Responding on a DRH schedule 
produces reinforcement whenever a response occurs before a 
time criterion has elapsed. If the time criterion is 30 seconds, 
the participant’s response produces reinforcement only when the 
IRT is 30 seconds or less. 

Under the DRL schedule, a response produces reinforce- 
ment when it occurs after a time criterion has elapsed. If the 
stated DRL time criterion is again 30 seconds, a response pro- 
duces reinforcement only when the IRT is 30 seconds or greater. 

This first definition of DRH and DRL as IRT schedules 
of reinforcement has been used almost exclusively in laboratory 
settings. There are two apparent reasons for its lack of applica- 
tion in applied settings: (a) Most applied settings do not have 
sufficient automated equipment to measure IRT and to deliver 
reinforcement using an IRT criterion; and (b) reinforcement is 
delivered usually, but not necessarily, following each response 
that meets the IRT criterion. Such frequent reinforcement would 
disrupt student activity in most instructional settings. However, 
with greater use of computers for tutorial and academic response 
practice, opportunities should increase for using IRT-based 
schedules of reinforcement to accelerate or decelerate academic 
responding. Computers can monitor the pauses between aca- 
demic responses and provide consequences for each response 
meeting the IRT criterion, with little disruption in instructional 
activity. 

Based on the laboratory procedures for programming 
DRL schedules presented previously, Deitz (1977) labeled 
and described two additional procedures for using differential 
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reinforcement of rates of responding in applied settings: 
full-session DRH or DRL and interval DRH or DRL. Deitz 
initially used the full-session and interval procedures as a DRL 
intervention for problem behaviors. The full-session and interval 
procedures, however, apply also for DRH. 

A DRH full-session schedule provides reinforcement if the 
total number of responses during the session meets or exceeds 
a number criterion. If the participant emits fewer than the speci- 
fied number of responses during the session, the behavior is not 
reinforced. The DRL full-session schedule is procedurally the 
same as the DRH schedule, except reinforcement is provided for 
responding at or below the criterion limit. If the participant emits 
more than the specified number of responses during the session, 
reinforcement is not delivered. 

The interval definition for DRH and DRL schedules states 
that reinforcement is available only for responses that occur at 
a minimum or better rate of response over short durations of 
time during the session. To apply an interval DRH schedule, the 
analyst organizes the instructional session into equal intervals of 
time and dispenses a reinforcer at the end of each interval when 
the student emits a number of responses equal to, or greater than, 
a number criterion. The interval DRL schedule is procedurally 
like the DRH interval schedule, except that reinforcement is pro- 
vided for responding at or below the criterion limit. 

The differential reinforcement of diminishing rates 
(DRD) schedule provides reinforcement at the end of a prede- 
termined time interval when the number of responses is less than 
acriterion that is gradually decreased across time intervals based 
on the individual’s performance (e.g., fewer than five responses 
per 5 minutes, fewer than four responses per 5 minutes, fewer 
than three responses per 5 minutes). Deitz and Repp (1973) used 
a group DRD contingency to reduce off-task talking of 15 high 
school senior girls. They set the first DRD criterion limit at five 
or fewer occurrences of off-task talking during each 50-minute 
class session. The DRL criterion limits were then gradually 
reduced to three or fewer, one or fewer, and finally no responses. 
The students earned a free Friday class when they kept off-task 
talking at or below the DRD limit Monday through Thursday. 

The previous example of a DRD schedule used a proce- 
dure identical to that described for the full-session DRL. DRD is 
also a procedural variation on interval DRL schedules described 
by Deitz (1977) and Deitz and Repp (1983). The typical proce- 
dure for using an interval DRL as an intervention for problem 
behavior provided reinforcement contingent on emitting one or 
no responses per brief interval. After the problem behavior sta- 
bilizes at the initial criterion, the analyst maintains the maximum 
criterion of one or no responses per interval, but increases the 
duration of the session intervals to further diminish the behay- 
ior. Increasing the duration of session intervals continues grad- 
ually until the problem behavior achieves a terminal low rate of 
responding. 

Deitz and Repp (1983) programmed the interval DRL 
with a criterion greater than one response per interval, then grad- 
ually diminished the maximum number of responses per interval 
while the duration of the interval remained constant (e.g., fewer 
than five responses per 5 minutes, fewer than four responses 
per 5 minutes, fewer than three responses per 5 minutes, etc.). 
The DRD schedule and the interval DRL schedule that use a 
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maximum number criterion greater than one per interval are dif- 
ferent terms for the same procedure. Full-session and interval 
DRL have a long history of application in applied behavior anal- 
ysis. DRD offers applied behavior analysts a new, and perhaps 
improved, label for the interval DRL procedure. (For a detailed 
description of differential reinforcement of rates of response, 
see Chapter 25.) 


Increasing Response Variability with Lag 
Schedules of Reinforcement 


Response variability often sets the occasion for expanding 
important behavior repertoires such as developing prob- 
lem-solving skills, improving language acquisition and com- 
munication behaviors, and facilitating enhanced academic and 
social behaviors. Behavior analysts consider response variability 
an operant and therefore sensitive to reinforcement contingen- 
cies (Neuringer, 2004; Page & Neuringer, 1985). A lag schedule 
of reinforcement is one method to increase response variability. 
Reinforcement on a lag schedule is contingent on a response 
differing in some predetermined way (e.g., different topography, 
different sequence) from one or more responses that preceded 
it. With a Lag | schedule, each response that differs from the 
prior response produces reinforcement. Reinforcement on a Lag 
2 schedule is contingent on a response being different from the 
previous 2 responses, a Lag 3 requires a response to differ from 
the previous 3 responses, and so on. To produce reinforcement 
on a lag infinity schedule, a response must differ from all prior 
responses. 

Wiskow and Donaldson (2016) used a lag schedule to 
promote varied naming of items in the categories of vegetable 
and animal. Three children demonstrating low levels of vari- 
ability during initial testing served as participants. The children 
included two typically developing girls 3 and 4 years old, and a 
6-year-old boy diagnosed with high-functioning autism. 


Lag 0. The therapist gave the initial instruction, “We are 
going to play the category game and see how many things 
you can think of. Wait quietly until it is your turn. When 
you tell me a correct answer, you will earn a token. Let’s 
start with [category].” The therapist provided general 
feedback following an incorrect answer (e.g., “Nice try, 
but a banana is not a vegetable.”). The therapist delivered 
specific praise (e.g., “Great! Lettuce is a vegetable!”) and 
a token following correct responses. If the child did not 
respond, the therapist said nothing and moved on to the 
next child. 

Lag |. We arranged a Lag | schedule only in the 
animal category. The therapist gave the following initial 
instruction in Lag | sessions: “We are going to play the 
category game and see how many things you can think 
of. Wait quietly until it is your turn. When you tell me 
an answer that is different than what someone else said, 
you will earn a token. Let’s start with [category].” We did 
not include the first correct response in the lag schedule 
because this schedule required that a child’s response be 
different than the immediately prior correct response. That 
is, in the Lag | condition, the first participant could say 


any correct response and receive praise and a token. Incor- 
rect responses did not reset the schedule or factor into the 
schedule requirement and were not included when we cal- 
culated the percentage of variability (pp. 476-477). 


Behavior analysts have increasingly addressed the impor- 
tance of response variability in the development of verbal 
behavior (e.g., intraverbals, Contreras & Betz, 2016; tacts, 
Heldt & Schlinger, Jr., 2012; phonemic variability, Koehler- 
Platten, Grow, Schulze, & Bertone, 2013; echoics, Esch—J.W., 
Esch-B.E., & Love, 2009; [see also Chapter 18]). These 
researchers (and others) have demonstrated the effectiveness of 
Lag schedules on increasing variability with different behaviors, 
including verbal behavior and with other different categories of 
participants (e.g., age, gender, disability). 

Heldt and Schlinger (2012) investigated the effects of a 
Lag 3 schedule on increasing variability and, importantly, on 
maintaining the increased variability. A 13-year-old boy with 
autism and fragile X syndrome and a 4-year-old boy with mild 
intellectual disability served as participants. During the Lag 3 
schedule, the instructor began the 10 trials per session by (a) say- 
ing the participant’s name, (b) presenting a picture with an array 
of images, and (c) saying “What do you see?” Reinforcement 
was provided when the fourth response differed from the pre- 
vious three responses. “Therefore, for the first three responses, 
instructors utilized a reinforcement of different forms procedure, 
such as the one described by Goetz and Baer (1973). By the 
fourth response, the instructor implemented the Lag 3 schedule 
regardless of the prior four responses (i.e., whether they were 
all novel, all rote, all irrelevant responses, or some combina- 
tion of those). On this schedule, the instructor ignored incorrect 
responses, no response within 5 s, or responses that did not differ 
from the previous three responses, and proceeded to the next 
trial (p. 133). The follow-up occurred 3 weeks after termination 
of the Lag 3 sessions. The results showed that both participants 
increased variability in tacting during the Lag 3 sessions, and 
that variability in responding was maintained during a 3-week 
follow-up. (See also Lag schedule, Chapter 30.) 


Progressive Schedules of Reinforcement 


A progressive schedule of reinforcement systematically thins 
each successive reinforcement opportunity within the session 
independent of the participant’s behavior. Progressive-ratio and 
delay to reinforcement (sometimes called progressive interval, 
time-based progressive delay, delay fading) schedules of rein- 
forcement are used for reinforcer assessment and behavioral 
intervention. 


Progressive-ratio Schedule of Reinforcement 


In 1961, Hodos introduced a variation of the fixed ratio (FR) 
schedule of reinforcement that increased the ratio requirements 
in incremental steps within the session. For example, the par- 
ticipant begins the session on an FR 5 schedule of reinforce- 
ment, and receives reinforcement after emitting five responses. 
Following FR 5 responses and reinforcement, the schedule 
of reinforcement is now increased by some specified amount 
(e.g., from FR 5 to FR 10). Following the FR 10 responses 


and reinforcement, the ratio would again increase (e.g., from 
FR 10 to FR 15). This pattern of increasing ratio requirements 
continues to the session end. Analysts change PR schedule 
requirements using (a) arithmetic progressions to add a constant 
number to each successive ratio or (b) geometric progressions 
to add successively a constant proportion of the preceding ratio 
(Lattal & Neef, 1996). These within-session changes define the 
progressive-ratio (PR) schedule of reinforcement (Poling, 
2010; Roane, 2008; Schlinger et al., 2008). 

During the PR sessions, the ratios are typically increased 
to the “breaking point,” when the participant stops responding. 
Comparing the breaking points and corresponding number of 
responses associated with each PR condition offers an assess- 
ment for the potency of a reinforcer. For example, Jerome and 
Sturmey (2008) had three adults with developmental disabili- 
ties complete verbal and pictorial assessments to identify staff 
preferences. Following the verbal and pictorial assessments, 
Jerome and Sturmey used a PR schedule to evaluate the rein- 
forcer potency of interactions with preferred staff. The breaking 
points with the PR schedules were higher for all three adults 
under responses for positive social interactions with their pre- 
ferred staff than with their nonpreferred staff. 


In addition to the use of break points as a measure of rein- 
forcer efficacy, PR schedules are also characterized by the 
omission of a terminal schedule value. That is, PR schedule 
requirements typically increase throughout the course of 

a session until responding stops for a period of time (e.g., 

5 min) or until a predetermined duration of the observation 
has been reached (a session cap). Thus, total response out- 
put (as opposed to response rate) and break-point values are 
the primary measures of interest when evaluating behavior 
with PR schedules. (Roane, 2008, p. 155) 


In an article on the applied use of PR schedules, Roane 
(2008) recommended several areas for future research, 
including 


¢ The initial ratio size 

¢ The progression of the step-size ratios 

* The influence of response topographies on PR schedule 
performance 

¢ The likely interaction of increasing ratios and the values 
of reinforcer delivery 

¢ Using PR schedules to evaluate the therapeutic effects of 
psychopharmacological treatments 

© The use of translational research with PR schedules, link- 
ing applied and basic research 


Words of Caution When Using PR Schedules 


Poling (2010) identified the following concerns regarding PR 
schedules with protected populations and people with special 
needs. 


In applied research—at least, when applied is defined 

as proposed in Baer, Wolf, and Risley’s (1968) seminal 
article—the potency of a scheduled reinforcer is important 
primarily with respect to whether or not that reinforcer 
can be arranged to improve a socially significant target 
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behavior. How it affects behavior in other circumstances, 
even in the population of concern (e.g., children with 
autism), is of value only if (a) this information is easily 
obtained, so there is little cost for people with special 
needs; and (b) this information allows applied behavior 
analysts to predict the clinical utility of the scheduled 
reinforcer. PR schedules are not an especially quick 

way to scale reinforcer potency. In addition, exposure to 
long ratios (as when the breaking point is approached) is 
known to be aversive (e.g., Dardano, 1973). Therefore, 
members of protected populations should not be exposed 
to them unless there are clear offsetting benefits. To 
date, no such benefits have been demonstrated, and no 
compelling arguments for their existence have been 
provided. (p. 349). 


Delay to Reinforcement Schedules 


Applied behavior analysts have used delay to reinforcement 
schedules to develop self-control (e.g., Binder, Dixon, & Ghezzi, 
2000; Dixon & Cummins, 2001). For example, Dixon and Hol- 
comb (2000) used a delay schedule to develop cooperative work 
behaviors and self-control of six adults dually diagnosed with 
intellectual disabilities and psychiatric disorders. The adults par- 
ticipated in two groups composed of three men in Group | and 
three women in Group 2. During a natural baseline condition, 
the groups received instruction to exchange or share cards to 
complete a cooperative task of sorting playing cards into piles 
by categories (e.g., hearts with hearts). Dixon and Holcomb ter- 
minated a natural baseline session for the group when one of the 
adults stopped sorting cards. 

The groups received points for working on the card-sorting 
task during the choice baseline condition and the self-control 
training condition. Groups exchanged their points earned for 
items such as soda pop or cassette players, ranging in values 
from 3 points to 100 points. 

During the choice baseline conditions, the group’s partici- 
pants could choose an immediate 3 points before doing the card 
sorting or a delayed 6 points after sorting the cards. Both groups 
chose the immediate smaller number of points rather than the 
larger amount following a delay to reinforcement. 

During self-control training, the participants were asked 
while working on a cooperative task, “Do you want 3 points 
now, or would you like 6 points after sorting the cards for Z 
minutes and seconds?” (pp. 612-613). The delay was initially 
0 seconds for both groups. The progressive delay to reinforce- 
ment ranged from an increase of 60 seconds to 90 seconds fol- 
lowing each session that the group performance met the exact 
criterion for number of seconds of task engagement. The termi- 
nal goals for the delay to reinforcement were 490 seconds for 
Group | and 772 seconds for Group 2. Both groups achieved 
these delay-to-reinforcement goals. Following the introduction 
of the progressive delay procedure, both groups improved their 
cooperative work engagement and the self-control necessary to 
select progressively larger delays to reinforcement that resulted 
in more points earned. Figure 13.8 shows the performance of 
both groups of adults during natural baselines, choice baselines, 
and self-control training conditions. 
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Applied behavior analysts often use functional commu- 
nication training (FCT)—the most prescribed function-based 
intervention to reduce problem behavior (e.g., aggression, ste- 
reotypy, self-injurious behavior [SIB], making threats, elope- 
ment) (Greer, Fisher, Saini, Owen, & Jones, 2016). Resurgence 
may occur when FCT is discontinued (see Chapter 24), or 
with a rapid thinning of the schedule of reinforcement. If 
resurgence of a previously reduced problem behavior occurs, 
applied behavior analysts might apply a delay to reinforcement 
schedule. 

For example, Stevenson, Ghezzi, & Valenton (2016) used 
FCT (Carr & Durand, 1985) and a delay to reinforcement sched- 
ule (called delay fading) to treat the elopement behavior of a 
9-year-old boy with autism. The FCT began with a functional 
behavior assessment (see Chapter 27)—(a) to identify reinforc- 
ers and (b) to identify processes for accessing reinforcement 
with appropriate behavior. During FCT, the boy learned to 
ask appropriately for the identified reinforcers and his eloping 
decreased. Stevenson et al. applied the delay fading condition 
following the boy’s behavioral improvements during the FCT 
condition. The delay fading method used a conjunctive FR 1/VI 
schedule of reinforcement. 


Throughout these sessions, when Damon [the child] asked 
for an item or activity (i.e., the FR 1 schedule), the tutor said, 
“You have to wait for now, so let’s keep walking.” The VI 
schedule began timing at this moment, and the first appro- 
priate request that Damon made after the interval elapsed 
produced the relevant item or activity. The value of the VI 
component progressed through various phases, when no 
elopement occurred for three consecutive sessions, from VI 
I5 s to VI30s, VI 45 s, VI.90 s, and finally, VI 300 s. 
(Stevenson et al., 2016, p. 171) 


The results from the FCT and delay fading conditions indi- 
cated that the child continued with enduring appropriate com- 
munication and low elopements. He learned to wait for desired 
activity. (See Conjunctive Schedules later in this chapter). 


COMPOUND SCHEDULES OF REINFORCEMENT 


Applied behavior analysts combine two or more basic sched- 
ules of reinforcement—continuous reinforcement (CRF), the 
four intermittent schedules of reinforcement (FR, VR, FI, VI), 
differential reinforcement of various rates of responding (DRH, 
DRL), and extinction (EXT)—to form compound schedules of 
reinforcement. The basic component schedules: 


* can occur simultaneously or successively; 

* can occur with or without discriminative stimuli; and 

* make reinforcement contingent on meeting the response 
requirements of each component schedule independently, 
or contingent on meeting the response requirements of 
the combination of the compound schedules (Ferster & 
Skinner, 1957). 


Concurrent Schedules 


A concurrent schedule (conc) of reinforcement occurs when 
(a) two or more contingencies of reinforcement (b) operate 
independently and simultaneously (c) for two or more behaviors. 
Each schedule is correlated with a discriminative stimulus. 
People in the natural environment have opportunities for making 
choices among concurrently available events. For example, 
Sharon receives a weekly allowance from her parents, contingent 
on doing daily homework and cello practice. After school she can 
choose when to do homework and when to practice the cello, and 


she can distribute her responses between these two simultaneously 
available schedules of reinforcement. Applied behavior analysts 


use concurrent schedules for reinforcer assessment and for 
behavioral interventions. 


Using Concurrent Schedules for Reinforcer 
Assessment 


Applied behavior analysts have used concurrent schedules exten- 
sively to provide choices during the assessment of consequence 
preferences and the assessment of response quantities (e.g., rate, 
latency, magnitude) and reinforcer quantities (e.g., rate, duration, 
immediacy, amount). Responding to concurrent schedules pro- 
vides a desirable assessment procedure because (a) the participant 
makes choices, (b) making choices during assessment approx- 
imates the natural environment, (c) the schedule is effective in 
producing hypotheses about potential reinforcers operating in the 
participant’s environment, and (d) these assessments require the 
participant to choose between stimuli rather than indicating a pref- 
erence for a given stimulus (Adelinis, Piazza, & Goh, 2001; Neef, 
Bicard, & Endo, 2001; Piazza et al., 1999). 

Roane, Vollmer, Ringdahl, and Marcus (1998) presented 
10 items to a participant, 2 items at a time. The participant had 
5 seconds to select | item by using a reaching response to touch 
the selected item. As a consequence for the selection, the par- 
ticipant received the item for 20 seconds. The analyst verbally 
prompted a response if the participant did not respond within 
5 seconds, waiting another 5 seconds for the occurrence of a 
prompted response. Items were eliminated from the assessment 
(a) if they were not chosen during the first five presentations or 
(b) if they were chosen two or fewer times during the first seven 
presentations. The participant made a total of 10 choices among 
the remaining items. The number of selections out of the 10 
opportunities served as a preference index. (See also “Concur- 
rent Schedule Reinforcer Assessment” in Chapter 1 1.) 


Using Concurrent Schedules for Intervention 


Applied behavior analysts have used concurrent schedules exten- 
sively for improving vocational, academic, and social skills in 
applied settings (e.g., Cuvo, Lerch, Leurquin, Gaffaney, & Pop- 
pen, 1998; Reid, Parsons, Green, & Browning, 2001; Romaniuk 
et al., 2002). For example, Hoch, McComas, Johnson, Faranda, 
and Guenther (2002) arranged two concurrent response alter- 
natives for three boys with autism. The boys could play in one 
setting with a peer or sibling, or play alone in another area. Hoch 
and colleagues manipulated the duration of access to toys (i 
reinforcer magnitude) and preference (i.e., reinforcer quality). In 
one condition, the magnitude and quality of the reinforcer were 
equal in both settings. In the other condition, the magnitude 
and quality of the reinforcer were greater for play in the setting 
with a peer or sibling than in the play-alone setting. With the 
introduction of the condition with greater magnitude and qual- 
ity of the reinforcer, the boys allocated more play responses to 
the setting with the peer or sibling, rather than playing alone. 
The magnitude and quality of the reinforcer influenced choices 
made by the three boys. Figure 13.9 reports the percentage of 
responses allocated to the concurrent play areas. 
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Concurrent Schedule Performances: Formalizing the 
Matching Law 


Cuvo et al. (1998) reported that concurrent schedules typically 
produce two response patterns. With concurrent interval sched- 
ules (conc VI/VI, conc FI/FI), participants “typically do not 
allocate all of their responses exclusively to the richer schedule 
[i.e., the schedule producing the higher rate of reinforcement]; 
rather, they distribute their responding between the two sched- 
ules to match or approximate the proportion of reinforcement 
that is actually obtained on each independent schedule” (p. 43). 
Conversely, with concurrent ratio schedules (conc VR/VR, conc 
FR/FR), participants are sensitive to the ratio schedules and tend 
to maximize reinforcement by responding primarily to the ratio 
that produces the higher rate of reinforcement. 

Williams (1973) identified three types of interactions 
found with concurrent schedules. First, when similar reinforce- 
ment is scheduled for each of the concurrent responses, the 
response receiving the higher frequency of reinforcement will 
increase in rate, whereas a corresponding decrease will occur 
in the response rate of the other behavior. Second, when one 
response produces reinforcement and the other produces pun- 
ishment, responses associated with punishment will decrease 
in occurrence. That decrease may produce a higher rate of 
response for the behavior producing reinforcement. Third, with 
a concurrent schedule programmed for one response to pro- 
duce reinforcement and the other response to produce avoidance 
of an aversive stimulus, the rate of avoidance responding will 
accelerate with an increase in the intensity or the frequency 
of the aversive stimulus. As avoidance responding accelerates, 
typically responding on the reinforcement schedule will then 
decrease. 

The matching law addresses response allocation to 
choices available with concurrent schedules of reinforcement. 
Typically, the rate of responding is proportional to the rate of 
reinforcement received from each choice alternative. The char- 
acteristics of performance on concurrent schedules are con- 
sistent with the relationships formalized by Herrnstein (1961, 
1970) and as detailed by Cuvo et al. (1998) and Williams (1973). 


Discriminated Compound Schedules of 
Reinforcement 


A discriminative stimulus (SP) signals the presence of each 
component schedule in a discriminated compound schedule of 
reinforcement. 


Multiple Schedules 


A multiple schedule (mult) presents two or more basic schedules 
of reinforcement in an alternating, usually random, sequence. 
The basic schedules within the multiple schedule occur succes- 
sively and independently. A discriminative stimulus is correlated 
with each basic schedule, and that stimulus is present as long as 
the schedule is in effect. Applied behavior analysts have used 
multiple schedules of reinforcement for diverse treatment con- 
ditions—for example, thinning dense schedules of reinforcement 
(Greer, Fisher, Saini, Owen, & Jones, 2016), facilitating the ver- 
bal skill of children with autism (e.g., Sidener, Shabani, Carr, & 
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Figure 13.9 Percentage of responses allocated to 
the play area with the peer across experimental 
sessions (top panel) and in natural-setting probes 
with different peers in the classroom (middle panel) 
for analyzing the magnitude of reinforcement with 
Robbie, and the percentage of responses allocated 
to the play area with the sibling across experimental 
sessions for analyzing the quality of reinforcement 
with Abe (bottom panel). 

“The Effects of Magnitude and Quality of Reinforcement on Choice 
Responding During Play Activities” by H. Hoch, J. J. McComas, 
L. Johnson, N. Faranda, and S. L. Guenther, 2002, Journal of 
Applied Behavior Analysis, Reproduced with permission of John 
Wiley and Sons Inc. 


Roland, 2006), and reducing severe problem behavior (Neidert, 
Iwata, & Dozier, 2005). Academic behaviors also can become 
sensitive to the control of multiple schedules of reinforcement. 
For example, a student might respond to basic arithmetic facts 
with her teacher, and also with her tutor. With the teacher, the 
student responds to arithmetic facts during small-group instruc- 
tion. The tutor then provides individual instruction and practice 
on the facts. This situation follows a multiple schedule because 
there is one class of behavior (i.e., math facts), a discriminative 
stimulus for each contingency in effect (i.e., teacher/tutor, small 
group/individual), and different conditions for reinforcement 
(i.e., reinforcement is less frequent in group instruction). 

Tiger and Hanley (2005) used a three-component mul- 
tiple schedule of reinforcement (SR+/EXT,/EXT)) to teach 
two preschool children how and when to recruit teacher atten- 
tion at appropriate times. The children sat at individual tables 
with access to materials such as blocks and string beads. The 
analyst looked down except for giving contingent attention to 
one child’s approach responses (SR+), while the other child’s 
approach experienced extinction (EXT)), and both children’s 
approaching the analyst produced extinction (EXT). The ana- 
lyst wore a red lei (discriminative stimulus) when attention was 
available for one child, a blue lei when attention was avail- 
able to the other child, and a white lei when attention was not 
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available to either child. “The purpose of the MULT condition 
was to determine if correlating stimuli with the availability 
(SR+) and unavailability (EXT, and EXT;) of attention was 
sufficient to bring children’s social approaches under stimu- 
lus control.” (p. 502) The children demonstrated discriminated 
performances when—prior to the session—they received verbal 
reminders of the multiple schedule arrangements (cf. Tiger & 
Hanley, 2004; Tiger, Hanley, & Heal, 2006). 

Torelli, Lloyd, Diekman, and Wehby (2016) extended 
Tiger and colleagues’ (2004, 2005, 2006) line of research from 
preschool to elementary school classrooms. In the words of 
Torelli and colleagues, “The purpose of the current study 
was to evaluate the effects of a class-wide multiple sched- 
ule on differentiated rates of student recruitment of teacher 
attention in two public elementary classrooms. General educa- 
tion teachers implemented the multiple schedule intervention 
in the context of a common instructional routine (i.e., small 
group rotations among reading centers). Results indicated that 
the multiple schedule intervention was effective at decreasing 
disruptive bids for attention when teacher attention was not 
available” (p. 1). 

To learn more about the use of multiple schedules in clinical 
assessment and treatment, see Saini, Miller, and Fisher (2016) and 
“Multiple Schedule Reinforcer Assessment” in Chapter 11. 


Reynolds (1961) introduced the term behavioral contrast 
to describe the effects of (a) a schedule change that increases or 
decreases the rate of responding in one component of a multi- 
ple schedule of reinforcement, which is accompanied by (b) a 
change in the response rate in the opposite direction on another, 
unaltered component. That is, changing schedule requirements 
in one component may affect responding in other unaltered com- 
ponents (e.g., changing a mult FR 3/FI 30-sec to a mult FR 3/ 
EXT may result in an increased rate of response in the FR com- 
ponent, the unaltered component). 

For example, an analyst used the following intervention 
to decrease a student’s disruptive behavior in the classroom 
and on the playground. She established an interval of time 
and delivered reinforcement at the end of the interval if the 
student had emitted no disruptive classroom behavior during 
the interval. With the occurrence of a disruptive behavior, the 
analyst immediately reset a timer, starting a new interval. The 
analyst introduced no intervention on playground behavior. 
This procedure effectively decreased the student’s disruptive 
classroom behavior, and the number of playground disruptions 
remained low as well. With the effectiveness of differential 
reinforcement of other behavior (DRO; see Chapter 25), the 
analyst began to thin the classroom reinforcement schedule by 
extending the interval of time (e.g., 25-, 35-, and 45-min inter- 
vals). Schedule thinning effectively maintained the appropriate 
classroom behavior. However, the behavioral contrast of the 
increased occurrences of disruptive behavior on the playground 
surprised the analyst. 

Although most research on behavioral contrast has been 
conducted with nonhuman subjects, Pierce and Epling (1995) 
concluded the phenomenon likely occurs in humans. We concur 
and advise applied behavior analysts to be aware of behavioral 
contrast effects in their practice. 


Chained Schedules 


A chained schedule (chain) is similar to a multiple schedule. 
The multiple and chained schedules have two or more basic 
schedule requirements that occur successively, and have a dis- 
criminative stimulus correlated with each independent schedule. 
A chained schedule differs from a multiple schedule in three 
ways. First, the basic schedules in a chain schedule always 
occur in a specific order, never in the random or unpredictable 
order of multiple schedules. Second, the behavior may be the 
same for all elements of the chain, or different behaviors may be 
required for different elements in the chain. Third, conditioned 
reinforcement for responding in the first element in a chain is the 
presentation of the second element; conditioned reinforcement 
for responding in the second element is presentation of the third 
element, and so on until all elements in the chain have been 
completed in a specific sequence. The last element normally 
produces unconditioned reinforcement in a laboratory setting, or 
unconditioned or conditioned reinforcement in applied settings. 

Tarbox, Madried, Aguilar, Jacobo, and Schiff (2009) used 
two-component chain schedules (€.g., “mun” & “day” for “Mon- 
day”; “b” and “all” for “ball’”) to develop the echoic behavior of 
two children with autism and one child with developmental delay. 
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During each chaining session, the therapist presented 
one echoic in three sequential trials as rapidly as feasible 
(i.e., the therapist presented a trial immediately after the 
completion of reinforcement for the previous trial). On 
the first trial, the therapist modeled the first component 
(e.g., “say ‘mun’”). If the participant correctly imitated 
this component within 5 s, the therapist delivered 
reinforcement and immediately modeled the second 
component (e.g., “say “day’”). If the participant imitated 
the second component correctly, the therapist delivered 
reinforcement, modeled the entire target echoic (e.g., 
“say “Monday’”), and provided reinforcement for correct 
imitation. An incorrect response on any of the three trials 
resulted in a single repetition of that trial, after which the 
therapist resumed the sequence. (p. 903) 


The following example shows an elaborate sequence of 
different behaviors that must occur in a specific order. To ser- 
vice a bicycle headset, the mechanic will complete a chain 
with 13 components: (1) Disconnect the front brake cable; 
(2) remove handlebar and stem; (3) remove front wheel; 
(4) remove locknut; (5) unscrew adjusting race; (6) take fork 
out of frame; (7) inspect races; (8) grease and replace bearing 
balls for lower stack; (9) grease and replace bearing balls for 
upper race; (10) grease threads of steering column; (11) put 
fork into frame and thread the screwed race; (12) return lock 
washer; (13) adjust and lock the headset. The final outcome 
(i.e., a clean, greased, and adjusted bicycle headset) is contin- 
gent on the completion of all 13 components. (For a detailed 
description of behavior chains and how applied behavior ana- 
lysts help individuals learn new and more complex chains of 
behavior, see Chapter 23.) 


Nondiscriminated Schedules of Reinforcement 


Unlike with discriminated compound schedules of reinforcement, 
the presence of each component schedule in a nondiscriminated 
compound schedule of reinforcement is not associated with an a) 


Mixed Schedules 


A mixed schedule of reinforcement (mix) uses a proce- 
dure identical to that of the multiple schedules, except no 
discriminative stimuli signal the presence of the indepen- 
dent component schedules. For example, with a mix FR 10/ 
FI 1 schedule, reinforcement sometimes occurs after the 
completion of 10 responses and sometimes occurs with the 
first response after a 1-minute interval from the preceding 
reinforcement. 

The research of Tiger and Hanley (2005), presented in 
the section on multiple schedules, used a three-component mult 
schedule of reinforcement (SR+, EXT; and EXT,) to teach 
preschool children how to recruit teacher attention at appro- 
priate times. They also used a time-based rotation of those 
components without discriminative stimuli. This mix schedule 
(SR+, EXT;, and EXT) “served as a baseline from which the 
influence of schedule-correlated stimuli on children’s social 
approaches would be assessed.” (p. 501) 
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Tandem Schedules 


The tandem schedule (tand) operates like the chained schedule, 
except that discriminative stimuli are not used with the elements 
in the chain. In a tand FR 15/FI 2 schedule, after a participant 
makes 15 responses (FR 15), the first correct response following 
an elapse of 2 minutes (FI 2) produces reinforcement. 

Jeffries et al. (2016) assessed the effectiveness of a CRF/ 
FR 4 tandem schedule of reinforcement with children using an 
iPad platform. Essentially, a child examined a number (e.g., 7, 2) 
embedded within simulated eyes on the iPad with the direction 
to pick the matching number on a corresponding grid. If the 
number matched, a piece of coal appeared on the screen and 
a praise statement was emitted by the [iPad] narrator (CFR). 
After the child received four pieces of coal (FR 4), an interac- 
tive Steam Train game appeared, which the child was allowed 
to play for | min. 

Although basic researchers have produced considerable 
data on the effects of mixed and tandem schedules, there is little 
applied research on these schedules. As applied behavior analysts 
continue to improve the science’s knowledge base, how nondis- 
criminated compound schedules of reinforcement might contrib- 
ute to assessment and intervention may become more apparent. 


Schedules Combining Number of Responses 
and Time 


Alternative Schedules 


An alternative schedule (alt) provides an opportunity for rein- 
forcement on two or more simultaneously available basic sched- 
ules of reinforcement. The first schedule completed provides 
reinforcement, regardless of which schedule component is met 
first. All schedule requirements start over following reinforce- 
ment. With an alt FR 50/FI 5-min schedule, reinforcement is 
delivered whenever either of these two conditions has been met: 
(a) 50 correct responses, provided the 5-minute interval of time 
has not elapsed; or (b) the first response after the elapse of 5 min- 
utes, provided that fewer than 50 responses have been emitted. 

For instance, a teacher using an alt FR 25/FI 3-min 
schedule of reinforcement assigns 25 math problems and 
assesses the student’s correct and incorrect answers following 
the elapse of 3 minutes. If the student completes the 25 prob- 
lems before the elapse of 3 minutes, the teacher checks the 
student’s answers and provides a consequence consistent with 
the FR 25 schedule. However, if the ratio requirement of 25 
math problems has not been completed after an elapse of 3 min- 
utes, the first correct answer following the 3 minutes produces 
reinforcement. The alternative schedule offers the advantage 
of a second chance for reinforcement if the student has not met 
the FR requirement in a reasonable amount of time. The FI pro- 
vides reinforcement for one response, and that one reinforced 
response might encourage continued responding with the new 
start of the FR requirement. 


Conjunctive Schedules 


A conjunctive schedule (conj) of reinforcement is in effect 
whenever reinforcement follows the completion of response 
requirements for two or more schedules of reinforcement. For 
example, a student behavior produces reinforcement when at 
least 2 minutes have elapsed and 50 responses have been made. 
This arrangement is a conj FI 2/FR 50 schedule of reinforce- 
ment. With the conjunctive schedule of reinforcement, the first 
response following the conclusion of the time interval produces 
reinforcement if the criterion number of responses has been 
completed. 

A 14-year-old boy with autism had higher rates of 
aggression with two of his four therapists during instruction. 
The higher rates of aggression were directed toward the two 
therapists who previously worked with the boy at a different 
treatment facility. Progar et al. (2001) intervened to reduce the 
levels of aggression with the therapists from the different facility 
to the levels that occurred with the other two therapists in the 
current setting. The boy’s aggression occurred in demand situ- 
ations (e.g., making his bed) and was escape maintained. The 
initial intervention used three consequences: (1) a 10-minute 
chair time-out for attempts to choke, (2) escape extinction (see 
Chapter 24), and (3) differential reinforcement other behavior 
(DRO; see Chapter 25) for the omission of aggression during the 
10-minute sessions. This intervention was identical to the treat- 
ment used with the boy at the other facility. It was ineffective in 
reducing the boy’s aggression in the current setting. 

Because of the ineffectiveness of the initial intervention, 
Progar and colleagues added a conj FR/VI-DRO schedule of 
reinforcement to their initial intervention. They delivered edible 
reinforcers contingent on completing a three-component task, 
such as dusting or straightening objects (i.e., an FR 3 sched- 
ule), and the omission of aggression for an average of every 2.5 
minutes (i.e., a VI-DRO 150-sec). An occurrence of aggression 
reset the conj schedule. (Note: Resetting this conj schedule used 
a standard procedure because any occurrence of the problem 
behavior during a DRO interval immediately reset the time to the 
beginning of the interval.) Progar and colleagues demonstrated 
that the conj FR VI-DRO schedule produced a substantial reduc- 
tion in aggression directed toward the two therapists previously 
from the other treatment facility. 

Duvinsky and Poppen (1982) found that human perfor- 
mance on a conjunctive schedule is influenced by the ratio 
and interval requirements. When task requirements are high in 
relation to the interval requirements, people are likely to work 
steadily on the task throughout the time available. However, peo- 
ple are likely to engage in behaviors other than the task require- 
ments when a large time interval and a low ratio requirement are 
present. 

Table 13.1 describes and provides examples of the com- 
pound schedules of reinforcement presented above. Table 13.2 
compares the defining dimensions of these schedules. 


TABLE 13.1 


SCHEDULE 
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Descriptions and Examples of Compound Schedules of Reinforcement 
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Discriminated Compound Schedules of Reinforcement 


DESCRIPTION 


EXAMPLE 


a aa eee 


Concurrent 
(conc) 


Multiple (mult) 


Chained (chain) 


Two or more schedules of reinforcement, each with a 


correlated S°, operate independently and simultaneously 
for two or more behaviors. 


Two or more basic schedules of reinforcement for the 
same behavior operate successively, usually in randorn 
sequence. An S? is correlated with each component 
schedule. Reinforcement is delivered each time the 
response requirements are met for the schedule in 
effect. 


Two or more basic schedules, each correlated with an 
S> operate in a specified sequence. The component 
schedules may require the same behavior or different 
behaviors. Meeting the response requirement in the 
first component presentation of the S° for the second 
component must be accomplished, and so on until all 
elements in the chain have been completed. Completing 
the last element normally produces unconditioned or 
conditioned reinforcement in applied settings. 


Nondiscriminated Compound Schedules of Reinforcement 


SCHEDULE 
Mixed (mix) 


Tandem (tand) 


DESCRIPTION 


Two or more basic schedules are presented successively, 
usually in random sequence. Unlike with a multiple 
schedule, however, each component schedule is not 
signaled by an S°. 


Two or more basic schedules operate in a specified 
sequence like a chained schedule. Unlike with a chained 
schedule, however, an SP is not associated with each 
component schedule. 


A student is given two math worksheets, one blue 

with addition problems, one white with subtraction 
problems. The student can work on either or both types 
of problems, and earns a reinforcer for every 10 correct 
answers to addition problems and every 5 correct 
answers to subtraction problems. [conc FR 10/FR 5] 


During daily math practice activity, a student is given 

a blue worksheet or a white worksheet. Which color 
worksheet is assigned each day varies randomly. Regard- 
less of color, each worksheet contains a mix of 100 
addition problems. On blue worksheet days, the student 
earns a reinforcer for each 10 correct answers. On white 
worksheet days, the student receives a reinforcer each 
time he correctly answers a variable number (average, 5) 
of problems. [mult FR 10/VR 5] 


A student receives a worksheet containing 20 addition 
problems. When the student has correctly answered 
all of the problems, he receives a second worksheet, 
this one with 10 subtraction problems. After correctly 
answering the 10 subtraction problems, the student 
receives a reinforcer. [chain FR 20/FR 10] 


EXAMPLE 


A student is assigned daily math worksheets always 
of the same color. Sometimes the student receives 
a reinforcer after 10 correct answers; sometimes 
reinforcement follows the first correct answer after 
a 1-min interval has elapsed from the preceding 
reinforcement. [mix FR 10/Fl 1] 


A student receives a worksheet with math problems. 
She earns a reinforcer for completing 15 problems, and 
then, the first problem completed after an elapse of 

2 min also produces a reinforcer. [tand FR 15/Fl 2] 


Compound Schedules of Reinforcement Combining Number of Responses and Time 


SCHEDULE 
Alternative (alt) 


Conjunctive 
(con) 


DESCRIPTION 


Reinforcement is obtained by meeting the response 
requirements of any of two or more simultaneously 
available component schedules. Reinforcement is 
delivered regardless of which schedule component Is 
met. All component schedule requirements are reset 
following reinforcement. 


Reinforcement follows the completion of response 
requirements for two or more simultaneously operating 
schedules of reinforcement. 


EXAMPLE 


A student receives a worksheet with math problems and 
is instructed to begin. If 15 correct answers are emitted 
prior to the elapse of 3 min, a reinforcer Is delivered. 

If the FR requirement is not met after 3 min, the first 
correct answer from that time forward is reinforced. 

[Fl 3-min/FR 15] 


A student receives a worksheet with math problems. 

After 15 problems have been answered correctly, the 
first correct answer after an elapse of 2 min produces 
reinforcement. [conj FR 15/Fl 2] 
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TABLE 13.2 Defining Dimensions of Compound Schedules of Reinforcement 
ra er ee ear ne ee ee 


Dimension Concurrent Multiple Chained Mixed Tandem Alternative Conjunctive 
Number of component 2 or more 2 or more 2 or more 2 or more 2 or more 2 Or more 2 or more 
schedules 

Number of response 2 or more 1 1 or more 1 1 or more 1 1 

classes involved 

Discriminative stimuli or Possible Yes Yes No No Possible Possible 
cues for each component 

schedule 

Successive presentation of No Yes Yes Yes Yes No No 
component schedules 

Simultaneous presentation Yes No No No No Yes Yes 

of component schedules 

Reinforcement limited to No No Yes No Yes No Yes 

final component schedule 

Must meet response Yes Yes No Yes No Yes No 


requirements of each 
component schedule for 
reinforcement 


PERSPECTIVES ON USING SCHEDULES OF 
REINFORCEMENT IN APPLIED SETTINGS 


Applied Research with Intermittent Schedules 


Basic researchers have systematically analyzed the effects 
of intermittent schedules of reinforcement on behavior (e.g., 
Ferster & Skinner, 1957). Their results have produced well- 
established schedule effects. These schedule effects have 
strong generality across many species, response classes, and 
laboratories. However, a review of the applied literature on 
schedule effects (e.g., Journal of Applied Behavior Analysis, 
1968 to 2016) will show that applied behavior analysts have 
not embraced the analysis of schedule effects with enthu- 
siasm, as have basic researchers. Consequently, schedule 
effects have not been documented clearly in applied settings. 
Uncontrolled variables in applied settings, such as the follow- 
ing, influence a participant’s sensitivity and insensitivity to 
the schedule of reinforcement: 


1. Instructions given by the applied behavior analyst, self- 
instructions, and environmental aids (e.g., calendars, 
clocks) make human participants resistant to temporal 
schedule control. 


2. Past histories of responding to intermittent schedules of 
reinforcement can affect current schedule sensitivity or 
insensitivity. 

3. Immediate histories from schedules of reinforcement may 
affect current schedule performances more than remote 
past histories. 


4. Sequential responses required in many applied applica- 
tions of intermittent schedules of reinforcement Keg: 
work leading to the paycheck, studying for a pop quiz) are 
uncommon applications of schedules of reinforcement, 
particularly with interval schedules. 


5. Uncontrolled establishing operations in conjunction with 
schedules of reinforcement in applied settings will con- 
found schedule effects. 


Some well-established schedule effects found in basic 
research were presented earlier in this chapter. Applied behav- 
ior analysts, however, should use caution in extrapolating these 
effects to applied settings, for the following reasons: 


1. Most applied applications of schedules of reinforcement 
only approximate true laboratory schedules of reinforce- 
ment, especially the interval schedules that may occur 
rarely in natural environments (Nevin, 1998). 


2. Many uncontrolled variables in applied settings will influ- 
ence a participant’s sensitivity and insensitivity to the sched- 
ule of reinforcement (Madden, Chase, & Joyce, 1998). 


Applied Research with Compound Schedules 


Although a considerable body of applied research on multiple, 
concurrent, and chained schedules has been reported, scant 
research exists on the effects of nondiscriminated compound 
schedules of reinforcement in applied settings. We encourage 
applied behavior analysts to explore the use of mixed, tandem, 
alternative, and conjunctive schedules. Doing so will reveal the 
effects of these compound schedules in applied settings and their 
interactions with other environmental variables (e.g., antecedent 
stimuli, motivating operations) (Lattal & Neef, 1996). 


Applied Research with Adjunctive Behavior 


This chapter has stressed the effects of schedules of reinforcement 
on the specific behaviors that produce reinforcement. Other behav- 
iors can occur when an individual responds to a given contingency 
of reinforcement. These other behaviors occur independently of 
schedule control. Typical examples of such behaviors include 
normal time fillers, such as doodling, smoking, idle talking, and 


drinking. Such behaviors are called adjunctive behaviors, or 
schedule-induced behaviors, when the frequency of these time-fill- 
ing behaviors increases as a side effect of other behaviors main- 
tained by a schedule of reinforcement (Falk, 1961, 197 I): 

A substantial body of experimental literature is available 
on many types of adjunctive behaviors with nonhuman subjects 
(see reviews: Staddon, 1977; Wetherington, 1982) and some 
basic research with human subjects (e.g., Kachanoff, Leveille, 
McLelland, & Wayner 1973; Lasiter, 1979). Common diverse 
examples of adjunctive behaviors observed in laboratory exper- 
iments include aggression, defecation, pica, and wheel running. 
Some common excessive human problem behaviors might 
develop as adjunctive behaviors (e.g., the use of drugs, tobacco, 
caffeine, and alcohol; overeating; nail biting; self-stimulation; 
and self-abuse). These potentially excessive adjunctive behav- 
iors are socially significant, but the possibility that such excesses 
are developed and maintained as adjunctive behaviors has been 
essentially ignored in applied behavior analysis. 

Foster (1978), in an extended communication to the read- 
ership of the Journal of Applied Behavior Analysis, reported that 
applied behavior analysts have neglected the potentially import- 
ant area of adjunctive behavior. He stated that applied behavior 
analysis does not have a data or knowledge base for adjunc- 
tive phenomena. Similarly, Epling and Pierce (1983) called 


SUMMARY 


Intermittent Reinforcement 


1. A schedule of reinforcement is a rule that establishes the 
probability that a specific occurrence of a behavior will 
produce reinforcement. 


2. Continuous reinforcement and extinction provide the 
boundaries for all other schedules of reinforcement. 


3. Only selected occurrences of behavior produce reinforce- 
ment with an intermittent schedule of reinforcement. 


4. Continuous reinforcement (CRF) is used to strengthen 
behavior, primarily during the initial stages of learning 
new behaviors. 


5. Intermittent reinforcement (INT) is used to maintain an 
established behavior, especially during maintenance stages 
of learning. 


Defining Basic Intermittent Schedules of 
Reinforcement 


6. Ratio schedules require a number of responses before one 
response produces reinforcement. 


7. Interval schedules require an elapse of time before a 
response produces reinforcement. 


Schedule Effects and Consistency of Performance 


g. A fixed ratio (FR) schedule requires the completion of a 
specified number of responses before a response produces 
reinforcement. 
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for applied behavior analysts to extend the laboratory-based 
findings in adjunctive behavior to the understanding and con- 
trol of socially significant human behavior. To our knowledge, 
Lerman, Iwata, Zarcone, and Ringdahl’s (1994) article provides 
the only research on adjunctive behavior published in the Jour- 
nal of Applied Behavior Analysis from 1968 through 2016. Ler- 
man and colleagues provided an assessment of stereotypic and 
self-injurious behavior as adjunctive responses. Data from this 
preliminary study suggest that intermittent reinforcement did 
not induce self-injury, but with some individuals, stereotypic 
behavior showed characteristics of adjunctive behavior. 

Foster (1978) and Epling and Pierce (1983) cautioned that 
many teachers and therapists may apply interventions directly 
to adjunctive behaviors rather than to the variables functionally 
related to their occurrence. These direct interventions may be 
futile and costly in terms of money, time, and effort because 
adjunctive behaviors appear resistant to interventions using 
operant contingencies. 

The condition under which adjunctive behaviors are 
developed and maintained is a major area for future research in 
applied behavior analysis. Applied research directed to adjunc- 
tive behaviors will advance the science of applied behavior 
analysis and will provide an important foundation for improved 
practices in therapy and instruction. 


9. FR schedules produce a typical pattern of responding: (a) 
After the first response of the ratio requirement, the partic- 
ipant completes the required responses with little hesita- 
tion between responses; and (b) a postreinforcement pause 
follows reinforcement. 


10. The size of the ratio and the reinforcement magnitude 
can influence the duration of the postreinforcement 
pause. 


11. A variable ratio (VR) schedule of reinforcement requires 
the completion of a variable number of responses before a 
response produces reinforcement. A number representing 
the average (e.g., mean) number of responses required for 
reinforcement identifies the VR schedule. 


12. VR schedules produce consistent, steady rates of response. 
They typically do not produce a postreinforcement pause, 
as do FR schedules. 


13. A fixed interval (FI) schedule provides reinforcement for 
the first response following the elapse of a specific, con- 
stant duration of time since the last reinforced response. 
The reinforcer is available after the fixed time interval has 
elapsed, and it remains available until the first response is 
emitted. 


14. FI schedules typically produce a postreinforcement 
pause in responding during the early part of the interval. 
An initially slow, but accelerating, rate of response is 
evident toward the end of the interval, usually reaching 
a maximum rate just before delivery of the reinforcer. 
This gradually accelerating rate of response toward the 
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15. 


16. 


17. 


end of the interval is called an FI scallop because of the 
rounded curves that are shown on a cumulative graph. 


A variable interval (VI) schedule provides reinforcement 
for the first response following the elapse of a variable 
duration of time. 


A VI schedule of reinforcement tends to produce a low to 
moderate stable rate of response. On a cumulative graph, 
the slope of the VI schedule appears uniform with few 
pauses in responding. 


When a limited hold is added to an interval schedule, 
reinforcement remains available for a finite time following 
the elapse of the FI or VI interval. 


Thinning Intermittent Reinforcement 


18. 


19. 


20. 


Two procedures are used commonly for schedule thinning. 
First, the response ratio or the duration of the time interval 
is gradually increased. Instructions can be used to clearly 
communicate the schedule of reinforcement, facilitating a 
smooth transition during the thinning process. Instructions 
include rules, directions, and signs. 


To adjust the thinning process, and to avoid the loss of 
previous improvements, use small increments of schedule 
changes. 


Ratio strain can result from abrupt increases in ratio 
requirements when moving from denser to thinner rein- 
forcement schedules. 


Variations on Basic Intermittent Schedules of 
Reinforcement 


21. 


22. 


23. 


24. 


Differential reinforcement of high rates (DRH) and 
differential reinforcement of low rates (DRL) are vari- 
ations of ratio schedules that make reinforcement 
contingent on responses occurring above or below 
criterion response rates. 


¢ DRH schedules produce higher rates of responding. 


* DRL schedules produce lower rates of responding. 


The differential reinforcement of diminishing rates sched- 
ule provides reinforcement at the end of a predetermined 

time interval when the number of responses is less than a 

criterion that is gradually decreased across time intervals 

based on the individual’s performance. 


A lag schedule of reinforcement is one method to increase 
response variability. Reinforcement on a lag schedule is 
contingent on a response differing in some predetermined 
way (e.g., different topography, different sequence) from 
one or more responses that preceded it. 


A progressive ratio schedule of reinforcement is a vari- 
ation of the fixed ratio (FR) schedule in which the ratio 
requirements increase incrementally within the session. 


Compound Schedules of Reinforcement 


25. 


26. 


Dil 


28. 


29. 


30. 


3h 


32. 


33. 


34. 


Continuous reinforcement, the four basic intermittent 
schedules of reinforcement, differential reinforcement 
of rates of responding, and extinction, when combined, 
produce compound schedules of reinforcement. 


The basic component schedules within a compound 
schedule (a) can occur successively or simultaneously; 
(b) can occur with or without discriminative stimuli; 

and (c) make reinforcement contingent on meeting the 
response requirements of each component schedule inde- 
pendently, or in combination with compound schedules. 


A multiple schedule (mult) presents two or more basic 
schedules of reinforcement in an alternating, usually ran- 
dom, sequence. The basic schedules within the multiple 
schedule occur successively and independently. A dis- 
criminative stimulus is correlated with each basic sched- 
ule, and that stimulus is present as long as the schedule is 
in effect. 


Behavioral contrast describes the effects of (a) a schedule 
change that increases or decreases the rate of responding 
in one component of a multiple schedule of reinforcement, 
which is accompanied by (b) a change in the response rate 
in the opposite direction on another unaltered component. 


A concurrent schedule (conc) of reinforcement occurs when 
(a) two or more contingencies of reinforcement (b) oper- 

ate independently and simultaneously (c) for two or more 
behaviors. Each schedule is correlated with a discriminative 
stimulus. 


A chained schedule (chain) is similar to a multiple sched- 
ule. The multiple and chained schedules have two or more 
basic schedule requirements that occur successively, and 
have a discriminative stimulus correlated with each inde- 
pendent schedule. 


A mixed schedule of reinforcement (mix) uses a procedure 
identical to that of the multiple schedules, except no dis- 
criminative stimuli signal the presence of the independent 
component schedules. 


A tandem schedule (tand) operates like the chained sched- 
ule, except that discriminative stimuli are not used with the 
elements in the chain. 


An alternative schedule (alt) provides an opportunity for 
reinforcement on two or more simultaneously available 
basic schedules of reinforcement. The first schedule 
completed provides reinforcement, regardless of which 
schedule component is met first. All schedule require- 
ments start over following reinforcement. 


A conjunctive schedule (conj) of reinforcement is in effect 
whenever reinforcement follows the completion of response 
requirements for two or more schedules of reinforcement. 


Perspectives on Using Schedules of Reinforcement 
in Applied Settings 


35. Well-established schedule effects found in basic research 
were presented in this chapter. Applied behavior analysts, 
however, should use caution in extrapolating these effects 
to applied settings. 


36. Applied behavior analysts should include an analysis 


of the basic intermittent schedules and the compound 
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KEY TERMS 

adjunctive behaviors 
(DRL) 

fixed interval (FI) 

fixed ratio (FR) 


alternative schedule (alt) 
behavioral contrast 
chained schedule (chain) 


compound schedule of reinforcement 


concurrent schedule (conc) ON) 
conjunctive schedule (conj) aie aeS 
limited hold 


continuous reinforcement (CRF) 


differential reinforcement of diminishing EBs 2h 


rates (DRD) 


differential reinforcement of high rates 
(DRH) 


multiple schedule (mult) 


1. Every time Baxter raised his hand in class, Mrs. Tulley 
called on him. What schedule of reinforcement is Baxter’s 
hand raising on? 


a. Intermittent reinforcement 

b. Fixed-interval 

c. Continuous reinforcement 

d. Fixed-ratio 2 

Hint: (See “Continuous Reinforcement”) 


2. Timmy has just recently begun clearing his plate after 4, 


he finishes his meal. Timmy’s mother really likes this 
new behavior and would like to see more of it. Which 
schedule of reinforcement should Timmy’s mother use to 
strengthen Timmy’s new behavior of clearing his plate? 


a. Variable-ratio 8 

b. Intermittent reinforcement 
c. Fixed-ratio 10 

d. Continuous reinforcement 


Hint: (See “Intermittent Reinforcement’) 


Sie 


differential reinforcement of low rates 


intermittent schedule of reinforcement 


mixed schedule of reinforcement (mix) 
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schedules in their research agendas. A better understand- 
ing of the schedule effects in applied settings will advance 
the development of applied behavior analysis and its 
applications. 


The conditions under which adjunctive behaviors (e.g., 
time fillers, such as doodling, smoking, idle talking, and 
drinking) are developed and maintained are important 
areas for future research in applied behavior analysis. 


postreinforcement pause 


progressive schedule of 
reinforcement 


progressive-ratio (PR) schedule of 
reinforcement 


ratio strain 

schedule of reinforcement 
schedule thinning 

tandem schedule (tand) 
variable interval (VI) 
variable ratio (VR) 


3. Which schedule of reinforcement occurs when two or more 


contingencies of reinforcement operate independently and 
simultaneously for two or more behaviors? 


a. Variable-ratio schedules of reinforcement 

b. Concurrent schedules of reinforcement 

c. Fixed-interval schedules of reinforcement 

d. Multiple schedules of reinforcement 

Hint: (See “Compound Schedules of Reinforcement”) 
Which schedule of reinforcement presents two or more 
basic schedules of reinforcement in an alternating (usually 


random) sequence usually occurring successively and 
independently with no correlated discriminative stimuli? 


a. Multiple schedules of reinforcement 

b. Alternative schedules of reinforcement 

c. Variable-interval schedules of reinforcement 
d. Mixed schedules of reinforcement 


Hint: (See “Discriminated Compound Schedules of 
Reinforcement’) 


360 Part 4 ¢ Reinforcement 


5. Which schedule of reinforcement produces a post- 


reinforcement pause? 

a. Fixed ratio 

b. Continuous reinforcement schedule 

c. Tandem schedules of reinforcement 

d. Variable-interval schedules of reinforcement 

Hint: (See “Defining Basic Intermittent Schedules of 
Reinforcement’) 

. To gradually reduce a high-frequency behavior, which 
schedule of reinforcement is the most appropriate? 

a. Differential reinforcement of high rates of responding 
b. Differential reinforcement of low rates of responding 
c. Fixed-ratio schedules of reinforcement 

d. Concurrent schedules of reinforcement 

Hint: (See “Schedules of Differential Reinforcement of 
Rates of Responding’) 

. Schedule thinning refers to: 


a. Gradually decreasing the response ratio or the duration 
of the time interval 


b. Quickly increasing the response ratio or the duration of 
the time interval 


c. Gradually increasing the response ratio or the duration 
of the time interval 


d. Moving gradually from an intermittent schedule to a 
continuous reinforcement schedule. 


Hint: (See “Thinning Intermittent Reinforcement”) 


8. Common behavioral characteristics associated with 


include avoidance, aggression, and unpredict- 
able pauses in responding. 


a. Ratio strain 

b. Post-reinforcement pause 

c. Scallop effect 

d. Fixed interval 

Hint: (See “Thinning Intermittent Reinforcement’) 


. Which schedule of reinforcement is identical to the 


chained schedule except it does not use discriminative 
stimuli with the elements in the chain? 


a. Mixed schedule of reinforcement 

b. Fixed-ratio schedule of reinforcement 
c. Tandem schedule of reinforcement 

d. Chained schedule of reinforcement 


Hint: (See “Discriminated Compound Schedules of 
Reinforcement”; “Nondiscriminated Schedules of 
Reinforcement’) 


. When the frequency of a time-filling behavior increases 


as a side effect of some other behavior maintained by a 
schedule of reinforcement, this is referred to as: 


a. Ratio strain 

b. Post-reinforcement pause 
c. Schedule induced 

d. Schedule thinning 


Hint: (See “Applied Research with Adjunctive Behavior’) 
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ESSAY-TYPE QUESTIONS 


1. Discuss the term schedules of reinforcement. In your 5. Describe the process of how you would use a ratio sched- 
discussion, be sure to describe what it is, and discuss the ule to (1) teach a student a new skill and (2) move the skill 


“boundaries for all other schedules of reinforcement” 
identified in the chapter. 


Hint: (See “Defining Basic Intermittent Schedules of 
Reinforcement’’) 


. Explain in detail how you would utilize schedules of 
reinforcement for teaching a student a new skill and, once 
acquired, what schedule you would use to maintain it. 


Hint: (See “Defining Basic Intermittent Schedules of 
Reinforcement’) 


- Compare and contrast the four basic intermittent sched- 
ules of reinforcement. 


Hint: (See “Defining Basic Intermittent Schedules of 
Reinforcement’’) 


. Identify and describe the schedule of effects, specifically 
the issues of consistency of performance and rate of 
response, for each of the four basic intermittent schedules 
of reinforcement. 


Hint: (See “Defining Basic Intermittent Schedules of 
Reinforcement”) 


into maintenance. 


Hint: (See “Thinning Intermittent Reinforcement’) 


. During the treatment condition, a student became avoid- 


ant of work, began engaging in physical aggression and 
random pauses in responding. Upon inquiry, you discover 
that the staff was in the process of thinning the schedule 
of reinforcement the student was on. Identify the term that 
describes these phenomena and discuss how it occurred. 
How can it be remedied? 


Hint: (See “Thinning Intermittent Reinforcement”) 


. Julia is a 9-year-old student at Blackmore Elementary. 


You have been asked by your professor to help her 
teacher reduce the frequency of Julia’s hand raising in 
class using a differential reinforcement strategy. The 
teacher has recorded Julia raising her hand an average 
of 57 times an hour. Discuss which differential rein- 
forcement procedure you would use, why you would 
use it and how it would work. 


Hint: (See “Schedules of Differential Reinforcement of 
Rates of Responding”) 
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8. Identify and discuss the discriminated schedules of 


10. Discuss and explain two different ways applied behay- 
reinforcement. 


ior analysts use progressive schedules of reinforcement. 


Hint: (See “Discriminated Compound Schedules of Hint: (See “Progressive Schedules of Reinforcement”) 


Reinforcement’’) 


9. Identify and discuss the nondiscriminated schedules of 
reinforcement. 


Hint: (See “Nondiscriminated Schedules of Reinforcement’’) 


NOTES 


1. While we define the PR schedule of reinforcement as increasing response schedules in which response requirements are increased from one session 
requirements within the session, as introduced by Hodos (1961), we rec- to the next. 
ognize that applied behavior analysts may increase response requirements 2. An escalating schedule of reinforcement, in which consecutive responses 
over successive sessions rather than within the session (e.g., DeLeon, produce an ever-increasing magnitude of reinforcement (e.g., Romano- 
Iwata, Goh, & Worsdell, 1997; Wilson & Gratz, 2016). Jarmolowizc wich & Lamb, 2010, 2015), should not be confused with a PR schedule of 


and Lattal (2010) suggested the label progressive fixed-ratio (PFR) for reinforcement. 
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PART 


Punishment 


Punishment is a basic principle of learning. Although often considered bad—the unfortunate evil twin 
to reinforcement—punishment has as much importance to learning as reinforcement does. Learning 
from consequences that produce pain, discomfort, or the loss of reinforcers has survival value for the 
individual and for the species. 

As with reinforcement, stimulus changes that function as consequences in punishment 
contingencies can be described by two types of operations: A new stimulus is presented or an 
existing stimulus is removed. Chapter 14 begins by defining the basic principle of punishment 
and distinguishes positive punishment and negative punishment based on the operation of the 
response-suppressing consequence. The remainder of the chapter focuses on the side effects and 
limitations of punishment, factors that influence the effectiveness of punishment, examples of 
interventions involving positive punishment, guidelines for using punishment effectively, and ethical 
considerations related to using punishment in applied settings. Chapter 15 describes two behavior 
change tactics based on negative punishment: time-out from positive reinforcement and response 
cost. Examples show how time-out and response cost reduce or eliminate undesired behavior, and 
guidelines provide direction for effectively implementing these tactics. 
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Positive Punishment 


LEARNING OBJECTIVES 


¢ Define and discuss the definitions of positive and negative punishment. 
¢ Define and discuss factors that influence the effectiveness of punishment. 
e Identify and explain the potential side effects of punishment use. 


e List examples of positive punishment and describe their respective procedures. 


¢ Discuss the guidelines for the use of punishment as an intervention. 


ave you ever stubbed your toe while walking too fast in a 

darkened room and then walked slowly the rest of the way 

to the light switch? Have you ever smashed your finger 
with a hammer and then refrained from holding the nail in quite 
the same way before swinging the hammer again? Have you ever 
left a sandwich unattended at a beach party, watched a seagull fly 
away with it, and then did not turn your back on the next treat you 
pulled from the picnic basket? If you have had these or similar 
experiences, you have been the beneficiary of punishment. 

It may seem strange to refer to people who stubbed their 
toe, smashed a finger, or lost a tasty sandwich as benefiting 
from the experience, as opposed to referring to that person as 
a “victim” of punishment. Although many people consider 
punishment a bad thing—reinforcement’s evil counterpart— 
punishment is as important to learning as reinforcement. Learn- 
ing from consequences that produce pain, discomfort, or the loss 
of reinforcers has survival value for the individual organism and 
for our species. Punishment teaches us not to repeat behaviors 
that cause harm. Fortunately, it usually does not take too many 
stubbed toes or smashed fingers to reduce the occurrence of the 
behaviors that produced those outcomes. 

Although punishment is a natural phenomenon and one of 
the basic principles of operant conditioning, it is poorly under- 
stood and frequently misapplied, and its application is contro- 
versial. At least some of the misunderstandings and controversy 
surrounding the use of punishment in behavior change programs 
can be attributed to practitioners who, on the one hand, confuse 
punishment as an empirically derived principle of behavior with 
its commonplace usage and its legal connotations; and, on the 
other hand, misapply punishment altogether. 

Punishment is generally understood to be the application 
of aversive consequences—such as physical pain, psychologi- 
cal hurt, and the loss of privileges, or fines—for the purpose of 
teaching a lesson to a person who has misbehaved so she will 
not repeat the misdeed. Punishment is sometimes used as an act 
of retribution on the part of the person or agency administering 
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Identify and discuss ethical considerations regarding the use of punishment. 
Explain the current state of knowledge regarding the use of punishment as a treatment intervention. 


the punishment, or to provide “a lesson for others” on how to 
behave. Punishments meted out by the legal system, such as jail 
time and fines, are often considered a process by which con- 
victed lawbreakers must repay their debt to society. 

These everyday and legal notions of punishment, although 
having various degrees of validity and utility within their own 
contexts, have little, if anything, to do with punishment as a 
principle of behavior. In the everyday connotation of punish- 
ment, most people would agree that a teacher who sends a 
student to the principal’s office for misbehaving in class, or a 
police officer who issues a ticket to a speeding motorist, has 
punished the offender. However, as a principle of behavior, 
punishment is not about punishing the person; punishment is 
a response — consequence contingency that suppresses the 
future frequency of similar responses. From the perspective of 
both the science and the practice of behavior analysis, the trip 
to the principal’s office did not punish fooling around in class 
unless the future rate at which the student fools around in class 
decreases as a function of his trip to the principal’s office, and 
the police officer’s ticket did not punish speeding unless the 
motorist drives above the speed limit less often than she did 
before receiving the ticket. 

In this chapter, we define punishment, identify factors 
that influence the effectiveness of punishment, discuss its side 
effects and limitations, describe examples of several behavior 
change tactics that incorporate punishment, present guidelines 
for using punishment effectively, and discuss ethical consid- 
erations in the use of punishment. In the chapter’ s concluding 
section, we remind behavior analysts of punishment’s nat- 
ural and necessary role in learning and underscore the need 
for more basic and applied research on punishment, and state 
that a wide variety of differential reinforcement options, stim- 
ulus control procedures, time-out from positive reinforcement, 
response blocking, and contingent restraint, be employed in 
lieu of treatments featuring unconditioned punishers as default 
technologies. 


DEFINITION AND CHARACTERISTICS 
OF PUNISHMENT 


This section presents the functional relation that defines punish- 
ment, describes the two operations by which punishment can be 
implemented, identifies the types of consequences that function 
as punishers, explains the discriminative effect of punishment, 
and describes the recovery from the punishment effect. 


Operation and Defining Effect of Punishment 


Like reinforcement, punishment is a two-term functional relation 
defined by its effects on the frequency of behavior. Punishment 
has occurred when a response is followed immediately by a 
stimulus change that decreases the future frequency of that type 
of behavior. 

An early study by Hall and colleagues (1971) provides a 
straightforward example of punishment. Andrea, a 7-year-old girl 
with hearing impairments, “pinched and bit herself, her peers, 
the teacher, and visitors to the classroom at every opportunity 
and was so disruptive the teacher reported that academic instruc- 
tion was impossible” (p. 24). During an initial 6-day baseline 
period, Andrea bit or pinched an average of 71.8 times per 
day. Whenever Andrea bit or pinched anyone during the inter- 
vention condition, her teacher immediately pointed at her with 
an outstretched arm and shouted “‘No!” On the first day of inter- 
vention, the frequency of Andrea’s biting and pinching decreased 
substantially (see Figure 14.1). Her aggressive behavior followed 
a downward trend during the initial intervention phase, ending 
in an average of 5.4 incidents per day. A 3-day return to baseline 
conditions resulted in Andrea’s biting and pinching at a mean rate 
of 30 times per day. When the teacher reinstated the intervention, 
pointing her finger and stating “No!” each time Andrea pinched 
or bit, Andrea’s problem behavior dropped to 3.1 incidents per 
day. During the second intervention phase, the teacher reported 
that Andrea’s classmates were no longer avoiding her, perhaps 
because their behavior of being near Andrea was punished less 
often by bites and pinches. 
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It is important to point out that punishment is defined 
neither by the actions of the person delivering the consequences 
(in the case of socially mediated punishment) nor by the nature 
of those consequences.! A decrease in the future occurrence 
of the behavior must be observed before a consequence-based 
intervention qualifies as punishment. The intervention that 
proved successful in reducing the frequency of Andrea’s bit- 
ing and pinching—her teacher’s pointed finger and “No!’”—is 
classified as a punishment-based treatment only because of its 
suppressive effects. If Andrea had continued to bite and pinch 
at the baseline level of responding when the intervention was 
applied, her teacher’s pointing and “No!” would not have been 
punishment. 

Because the presentation of punishers often evokes 
behavior incompatible with the behavior being punished, the 
immediate suppressive effects of punishment can easily be 
overestimated. Michael (2004) explained and provided a good 
example: 


The decreased frequency of the punished response that is 
due to its having been followed by punishment will not be 
seen until after behavior evoked by the punishing stimulus 
changes has ceased. Because the evocative effect of the 
punishment stimulus change (as a respondent uncondi- 
tioned or conditioned elicitor or as operant S? or MO) is in 
the same direction as the future change due to punishment 
as a response consequence (weakening of the punished 
behavior), the former can be easily misinterpreted as the 
latter. For example, when a small child’s misbehavior is 
followed by a severe reprimand, the misbehavior will cease 
immediately, but primarily because the reprimand controls 
behavior incompatible with the misbehavior—attending to 
the adult who is doing the reprimanding, denying respon- 
sibility for the misbehavior, emotional behavior such as 
crying, etc. This sudden and total cessation of the misbe- 
haver does not imply, however, that the future frequency of 
its occurrence has been reduced, which would be the true 
effect of punishment. (pp. 36-37) 


Pointed Finger, “No”, 


Figure 14.1 Number of bites and pinches 

by a 7-year-old girl during baseline and 
punishment (“No” plus pointing) conditions. 
From “The Effective Use of Punishment to Modify 
Behavior in the Classroom,” by R. V. Hall, S. Axelrod, 
M. Foundopoulos, J. Shellman, R. A. Campbell, and 


S. S. Cranston, 1971, Educational Technology, 11(4), 
35 p. 25. Copyright 1971 by Educational Technology. 
Used by permission. 
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Extinction effects caused by withholding reinforcement 
for the problem behavior (a procedure that should be included 
in punishment-based interventions whenever possible) con- 
tribute to the difficulty of determining the effectiveness of 
punishment—or even whether the punishment contingency is 
responsible for reductions in responding. However, rapid reduc- 
tions in responding characteristic of punishment can usually be 
distinguished from gradual reductions in responding typical 
of extinction. Compare the immediate decrease in Andrea’s 
biting and pinching from baseline to punishment condition in 
Figure 14.1 with the gradual decrease in responding typical of 
extinction shown in Figure 24.4. 


Positive Punishment and Negative Punishment 


Like reinforcement, punishment can be accomplished by either of 
two types of stimulus change operations. Positive punishment 
occurs when the presentation of a stimulus (or an increase in the 
intensity of an already present stimulus) immediately following 
a behavior results in a decrease in the frequency of the behav- 
ior. Stubbing one’s toe on a chair leg is an example of positive 
punishment—f it suppresses the frequency of the behavior that 
preceded the toe stub—because the painful stimulation is best 
described as the presentation of a stimulus. Behavior change 
tactics based on positive punishment involve the contingent pre- 
sentation of a stimulus immediately following occurrences of the 
target behavior. Andrea’s teacher constituted positive punishment; 
the teacher’s pointed finger and “No!” were stimuli presented or 
added to Andrea’s environment. 

Negative punishment has occurred when the removal of 
an already present stimulus (or a decrease in the intensity of 
an already present stimulus) immediately following a behavior 
results in a decrease in the future frequency of the behavior. 
The beach party attendee’s behavior of turning his back on his 
food was negatively punished when the seagull flew off with his 
sandwich. For a stimulus change to function as negative punish- 
ment, which amounts to the removal of a positive reinforcer, a 
“motivating operation for the reinforcer must be in effect, oth- 
erwise removing it will not constitute punishment” (Michael, 
2004, p. 36). A seagull’s flying off with a hungry person’s sand- 
wich would function as punishment for his inattentiveness, but 
perhaps have little effect on the behavior of a person who has 
eaten his fill and set the sandwich down. 

Behavior change tactics based on negative punishment 
involve the contingent loss of available reinforcers immediately 
following a behavior (i.e., response cost, a procedure akin to a 
fine) or the removal of the opportunity to acquire additional rein- 
forcers for a period of time (i.e., time-out from reinforcement, 
a procedure akin to being sidelined during a game). Negative 
punishment is the focus of Chapter 15. 

As with reinforcement, the modifiers positive and negative 
when used with punishment connote neither the intention nor 
the desirability of the behavior change produced; they specify 
only that the stimulus change that functioned as the punishing 
consequence is best described as the presentation of a new stim- 
ulus (positive punishment) or the termination (or reduction in 
intensity or amount) of an already present stimulus (negative 


punishment). As Michael (2004) noted, positive punishment and 
negative punishment are susceptible to misunderstanding. 


Assuming that you must receive either positive or neg- 
ative punishment, which would you prefer? As with 
reinforcement, you should certainly not decide until you 
know specifically what each consists of. Would you prefer 
negative reinforcement or positive punishment? Of course, 
negative reinforcement. (p. 37) 


Positive punishment and negative reinforcement are 
frequently confused as well. Because aversive events are associ- 
ated with positive punishment and with negative reinforcement, 
the umbrella term aversive control is often used to describe 
interventions involving either or both of these two principles. 
Distinguishing between the two principles is difficult when 
the same aversive event is involved in concurrent positive pun- 
ishment and negative reinforcement contingencies. For exam- 
ple, Baum (2017) described how the application and threat of 
physical beatings might condition the behavior of people living 
in a police state. 


If speaking out results in a beating, then speaking out is 
positively punished. If lying avoids a beating, then lying 

is negatively reinforced. The two tend to go hand-in-hand, 
because if one activity is punished, some alternative action 
will avoid punishment. (p. 163) 


The keys to identifying and distinguishing concurrent 
positive punishment and negative reinforcement contingencies 
involving the same aversive stimulus event are (a) recognizing 
the opposite effects the two contingencies have on the future 
frequency of behavior and (b) realizing that two different 
behaviors must be involved because the same consequence 
(1.e., stimulus change) cannot serve as positive punishment 
and negative reinforcement for the same behavior. In a positive 
punishment contingency, the stimulus is absent prior to a 
response and is presented (or increased in intensity) as a conse- 
quence; in a negative reinforcement contingency, the stimulus 
is present prior to a response and is removed (or decreased in 
intensity) as a consequence. For example, positive punishment 
and negative reinforcement contingencies operated concurrently 
in a study by Azrin, Rubin, O’Brien, Ayllon, and Roll (1968) in 
which adults wore an apparatus throughout the normal work- 
ing day that automatically produced a 55-dB tone contingent on 
slouching (sustained rounding of the shoulders or upper back 
for 3 seconds) and immediately terminated the tone when they 
straightened their shoulders. Slouching produced a tone (posi- 
tive punishment), straightening the shoulders escaped the tone 
(negative reinforcement), and nonslouching avoided a tone 
(negative reinforcement). 

Threatening a person with punishment if she engages ina 
behavior should not be confused with punishment. Punishment 
is a behavior — consequence relation, and a threat of what will 
happen if a person behaves in a certain way is an antecedent 
event to the behavior. When the threat of punishment suppresses 
behavior, it may be due to the threat functioning as an establish- 
ing operation that evokes alternative behaviors that avoid the 
threatened punishment. 


Types of Punishers 


A punisher is a stimulus change that immediately follows a 
response and decreases future occurrences of that type of behav- 
ior. The stimulus change in a positive punishment contingency 
may be called a positive punisher or, more simply, a punisher. 
Likewise the term negative punisher can be applied to a stimu- 
lus change involved in negative punishment. Though technically 
accurate, the term negative punisher is clumsy because it refers 
to a positive reinforcer that is removed contingent on occurrences 
of the target behavior. Therefore, when most behavior analysts 
use the term punisher, they are referring to a stimulus whose 
presentation functions as punishment (i.e., a positive punisher). 
As with reinforcers, punishers can be classified as unconditioned 
or conditioned. 


Unconditioned Punishers 


An unconditioned punisher (also called primary punisher or 
unlearned punisher) is a stimulus whose presentation functions as 
punishment without having been paired with any other punishers. 
Because unconditioned punishers are the product of the evolu- 
tionary history (phylogeny) of a given species, all biologically 
intact members of that species are susceptible to punishment by 
the same unconditioned punishers. Painful stimulation such as 
that caused by physical trauma to the body, certain odors and 
tastes, physical restraint, loss of bodily support, and extreme 
muscular effort are examples of stimulus changes that typically 
function as unconditioned punishers for humans (Michael, 2004). 
Like unconditioned reinforcers, unconditioned punishers are 
“phylogenetically important events that bear directly on fitness” 
of the organism (Baum, 2017, p. 67). 

However, virtually any stimulus to which an organism’s 
receptors are sensitive—light, sound, and temperature, to name a 
few—can be intensified to the point that its delivery will suppress 
behavior even though the stimulus is below levels that cause tissue 
damage (Bijou & Baer, 1965). Unlike unconditioned reinforcers, 
such as food and water, whose effectiveness depends on a relevant 
establishing operation, under most conditions many unconditioned 
punishers will suppress any behavior that precedes their onset. An 
organism does not have to be “deprived of electric stimulation” for 
the onset of electric shock to function as punishment. (However, 
the behavior of an organism that has just received many shocks 
at a specified voltage level in a short period, particularly shocks 
of mild intensity, may be relatively unaffected by another 
shock at a similar intensity.) 


Conditioned Punishers 


A conditioned punisher (also called secondary punisher or 
learned punisher) is a stimulus change whose presentation 
functions as punishment as a result of a person’s conditioning 
history. A conditioned punisher acquires the capability to func- 
tion as a punisher through stimulus—stimulus pairing with one or 
more unconditioned, or previously conditioned, punishers. For 
example, as a result of its onset at or very near the same time as 
an electric shock, a previously neutral stimulus change, such as 
an audible tone, will become a conditioned punisher capable of 
suppressing behavior that immediately precedes the tone when 
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it occurs later without the shock (Hake & Azrin, 1965). If a 
conditioned punisher is repeatedly presented without the pun- 
isher(s) with which it was initially paired, its effectiveness as 
punishment will wane until it is no longer a punisher. 

Previously neutral stimuli can also become conditioned 
punishers for humans without direct physical pairing with 
another punisher through a pairing process Alessi (1992) called 
verbal analog conditioning. This is similar to the example of 
verbal pairing described in Chapter 11 for the conditioning of 
a conditioned reinforcer in which Engelmann (1975) showed 
cut-up pieces of yellow construction paper to a group of pre- 
school children and told them, “These pieces of yellow paper 
are what big kids work for” (pp. 98-100). From that point on, 
many children began working extra hard for yellow pieces of 
paper. Miltenberger (2016) gave the example of a carpenter 
telling his apprentice that if the electric saw starts to make 
smoke, the saw motor may become damaged or the blade may 
break. The carpenter’s statement establishes smoke from the 
saw as a conditioned punisher capable of decreasing the fre- 
quency of any behaviors that immediately precede the smoke 
(e.g., pushing too forcefully on the saw, holding the saw at an 
inappropriate angle). 

A stimulus change that has been paired with numerous 
forms of unconditioned and conditioned punishers becomes a 
generalized conditioned punisher. Reprimands (“No!” “Don’t 
do that!’””) and disapproving gestures (e.g., scowls, head shakes, 
frowns) are generalized conditioned punishers for many people 
because they have been paired repeatedly with a wide range 
of unconditioned or conditioned punishers (e.g., burned finger, 
loss of privileges). As with generalized conditioned reinforcers, 
generalized conditioned punishers are independent of specific 
motivating conditions and will function as punishment under 
most conditions. 

To restate a critical point, punishers are defined by their 
function (a decrease in the future occurrence of behavior), not 
by their physical properties. Even stimulus changes that would 
function as unconditioned reinforcers or punishers can have the 
opposite effect under certain conditions. For example, a bite of 
food will function as a punisher for a person who has eaten too 
much, and an electric shock may function as a reinforcer if it 
signals the availability of food for a food-deprived organism 
(e.g., Holz & Azrin, 1961). If a student receives a smiley face 
sticker and praise while completing math problems and his pro- 
ductivity decreases as a result, smiley face stickers and praise 
are punishers for that student. A consequence that might serve as 
a punisher at work might not do so at home. Although common 
experiences mean that many of the same stimulus events func- 
tion as conditioned punishers for most people, a punisher for 
one person may be a powerful reinforcer for another. Remember 
the cartoon described in Chapter 11 of two students cleaning 
erasers after school: The activity was punishment for one and 
reinforcement for the other. 


Discriminative Effects of Punishment 


Punishment does not operate in a contextual vacuum. The 
environmental situation in which punishment occurs plays an 
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important role in determining the antecedent stimulus conditions 
in which punishment’s suppressive effects will be observed. 
The three-term contingency for “the operant functional rela- 
tion involving punishment can be stated much like that involv- 
ing reinforcement: (1) In a particular stimulus situation (S), 
(2) some kinds of behavior (R), when followed immediately 
by (3) certain stimulus changes (S?), show a decreased future 
frequency of occurrence in the same or in similar situations” 
(Michael, 2004, p. 36). 

If punishment occurs only in some stimulus conditions 
and not in others (e.g., a child gets scolded for reaching into 
the cookie jar before dinner only when an adult is in the room), 
the suppressive effects of punishment will be most prevalent 
under those conditions. A discriminated operant for punishment 
is the product of a conditioning history in which responses in the 
presence of a certain stimulus have been punished and similar 
responses in the absence of that stimulus have not been punished 
(or have resulted in a reduced frequency or intensity of pun- 
ishment). Highway speeding is a discriminated operant in the 
repertoire of many motorists who drive within the speed limit 
in and around locations where they have been pulled over by the 
police for speeding but continue to drive over the speed limit on 
roads where they have never seen a police cruiser. 

There is no standard term or symbol in the behavior 
analysis literature for an antecedent stimulus that acquires stim- 
ulus control related to punishment. Some authors have modified 
the shorthand symbol for a discriminative stimulus for reinforce- 
ment (SP) to indicate the antecedent stimulus in a three-term 
punishment contingency, such as the S>~ used by Sulzer-Azaroff 
and Mayer (1977). Other authors refer to an antecedent stimulus 
correlated with the presence of a punishment contingency as 
a punishment-based Sb (e.g., Malott & Shane, 2016; Michael, 
2004). We have adopted SP as the symbol for the discriminative 


Grandma in the kitchen 
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stimulus for punishment, as proposed by O’ Donnell (2001). 
“An SPP can be defined as a stimulus condition in the presence 
of which a response has a lower probability of occurrence than it 
does in its absence as a result of response-contingent punishment 
delivery in the presence of the stimulus” (p. 262). Figure 14.2 
shows three-term contingency diagrams for discriminated oper- 
ants for positive punishment and negative punishment. 

A study by McKenzie, Smith, Simmons, and Soderlund 
(2008) illustrates the stimulus control of a client’s problem 
behavior by a stimulus associated with punishment. Diane, a 
woman with profound intellectual disabilities, had engaged 
in chronic eye poking that impaired her vision and on two 
occasions dislodged one of her eyes from the socket. Several 
reinforcement-based interventions had failed to produce clini- 
cally meaningful reductions in the behavior. 

During an initial no-wristbands baseline phase, Diane 
was observed through a one-way mirror and the duration of 
her eye poking measured. In the wristbands (no reprimands) 
condition that followed, the therapist entered the room at the 
first occurrence of eye poking, placed two red tennis wrist- 
bands on Diane’s wrists, and then left the area and remained 
out of Diane’s sight for the remainder of the session. When 
eye poking was first observed during the wristbands (repri- 
mands) condition, the therapist immediately entered the room 
and delivered a verbal reprimand (e.g., “Diane, stop poking!”). 
After presenting the reprimand and observing that Diane had 
stopped eye poking, the therapist left the room. The plan “was 
to repeat the reprimand at 3-s intervals until eye poking ceased, 
but it was never necessary to deliver more than one reprimand” 
(McKenzie et al., 2008, p. 257). 

Data from a phase in which wristbands (reprimands) and 
no-wristbands (baseline) sessions were alternated showed that 
the wristbands rapidly acquired and maintained stimulus control 
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Figure 14.2 Three-term contingencies illustrating positive and negative punishment of discriminated 
operants: A response (R) emitted in the presence of a discriminative stimulus (SPP) is followed closely in 
time by a stimulus change (S°) and results in a decreased frequency of similar responses in the future when 
the S°P is present. A discriminated operant for punishment is the product of a conditioning history in which 
responses in the presence of the S?P have been punished and similar responses in the absence of the SPP 
have not been punished (or have resulted in a reduced frequency or intensity of punishment than in the 


presence of the $"P). 


over eye poking (see Figure 14.3). During wristbands (repri- 
mands) sessions, Diane poked her eyes for a mean of 2 seconds 
(range, 0 to 7 sec) compared to a mean of 133 seconds (range, 
2 sec to 402 sec) during no-wristbands sessions. Discriminative 
control of the wristbands was replicated in a delayed multiple 
baseline across settings analysis. Results showed reprimands in 
the treatment setting had acquired stimulus control. Notably, no 
reprimands were ever delivered when Diane wore the wristbands 
in the canteen or at home. 


Recovery from Punishment 


Punishment’s suppressive effects on behavior are usually 
not permanent. This phenomenon, called recovery from 
punishment, is especially likely when the behavior being 


No Wristbands 


Wristbands (no reprimands) | 
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punished also obtains reinforcement. Figure 14.4 shows the 
results from one of a series of basic laboratory experiments 
by Azrin (1960) that demonstrated recovery from punishment. 
The top set of data paths is the cumulative record of a pigeon’s 
key pecking during a 1-hr session of food reinforcement on a 
variable interval (VI) 6-min schedule prior to any punishment. 
The data paths on the lower part of the figure show responding 
during punishment and reinforcement. Beginning on the Ist 
day, and for each daily 1-hr session thereafter for 35 consecu- 
tive sessions, every key peck was followed immediately by a 
brief electric shock of moderate intensity (5 milliamperes for 
100 milliseconds). Contingent shock suppressed pecking on the 
Ist day of punishment, but responding gradually recovered over 
the next 20 sessions and stabilized at about 900 responses per 
session, compared with 2800 responses prior to punishment. 
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Figure 14.4 Partial recovery of responding by a pigeon 
during punishment of every response concurrent with 
reinforcement on a variable interval schedule. 

“Effects of Punishment Intensity during Variable-Interval Reinforcement,” 
By N. H. Azrin, 1960, Journal of Experimental Analysis of Behavior, 3, 

p. 127. Reproduced with permission of John Wiley & Sons Inc. 


Recovery of responding to prepunished levels is more 
likely to occur when the punishment was mild or when the 
person can discriminate that the punishment contingency is 
no longer active. Sometimes responding after punishment is 
discontinued will not only recover but also briefly exceed 
the level at which it was occurring prior to punishment 
(Azrin, 1960; Holz & Azrin, 1962). Although the response- 
weakening effects of punishment often wane when punishment 
is discontinued, so, too, do the response-strengthening effects 
of reinforcement, which are often fleeting when previously 
reinforced behavior is placed on extinction (Vollmer, 2002). 
Michael (2004) noted that 


Recovery from punishment is sometimes given as an argu- 
ment against the use of punishment to cause a decrease 

in behavior. “Don’t use punishment because the effect is 
only temporary.” But of course the same argument could 
be made for reinforcement. The strengthening effect of 
reinforcement decreases when the behavior occurs without 
the reinforcement. The weakening effect of punishment 
decreases when the behavior occurs without the punish- 
ment. (p. 38) 


FACTORS THAT INFLUENCE THE 
EFFECTIVENESS OF PUNISHMENT 


Reviews of basic and applied research identify several vari- 
ables as keys to the effectiveness of punishment: immediacy, 
intensity, schedule or frequency, inadvertent reinforcement of 


the target behavior, and the availability of reinforcement for 
an alternative behavior (Axelrod, 1990; Azrin & Holz, 1966; 
Hineline & Rosales-Ruiz, 2013; Lerman & Vorndran, 2002). 


immediacy 


Maximum suppressive effects are obtained when the onset of the 
punisher occurs immediately after the occurrence of the target 
response. Even small time delays can decrease the effectiveness 
of punishment.* 


Basic studies with humans and nonhumans indicate 
that punishment procedures can fail to suppress 
responding when the consequence is delayed 

by just 10-30 seconds. ... When a punisher is 

delayed, other responses or multiple occurrences of the 
targeted behavior are likely to intervene before the con- 
sequence is delivered, weakening the contingency 
between the response and its consequence. (Lerman & 
Toole, 2011, p. 359) 


Intensity of Punishment 


Basic researchers examining the effects of punishers of varying 
intensity have reported three reliable findings: (1) A positive 
correlation exists between the intensity of the punishing stimulus 
and response suppression; (2) a negative correlation exists 
between the intensity of the punishing stimulus and recovery 
from punishment (described above and illustrated in Figure 14.4); 
and (3) a high-intensity stimulus that was previously presented at 
low intensity and gradually increased in intensity for subsequent 
responses may be ineffective as punishment (e.g., Azrin & Holz, 
1960; Hake, Azrin, & Oxford, 1967; Holz & Azrin, 1962; Terris 
& Barnes, 1969). 

Data from another of Azrin’s (1960) early laboratory 
experiments on the effects of different intensities of punishment 
illustrate the first two findings (see Figure 14.5). Each curve 
is the cumulative record of a pigeon pecking an illuminated 
key during a 1-hr period of concurrent food reinforcement on a 
VI 1-min schedule and punishment of each response by shocks 
of various intensities. Results for two subjects show the more 
intense the punishing stimulus, the more immediately and thor- 
oughly it suppresses behavior. 

Data from other studies demonstrated that recovery from 
punishment varies inversely with the intensity of punishment 
(e.g., Azrin, 1960; Hake, Azrin, & Oxford, 1967). Responding 
by subjects that received very mild punishment recovered more 
rapidly and completely than did responding by subjects that 
received more intense punishment. Virtually permanent response 
suppression may occur when complete suppression of behavior 
to a zero rate of responding has been achieved with intense pun- 
ishment. In their review of basic research on punishment, Azrin 
and Holz (1966) noted that: 


Intense punishment did not merely reduce responses 
to the unconditioned or operant level, but reduced 
them to an absolute level of zero. Since punishment 
is delivered only after a response occurs, there is 
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no opportunity for the subject to detect the absence 
of punishment unless he responds. If punishment is 
so severe as to completely eliminate the responses 
then the opportunity for detecting the absence of 
punishment no longer exists. (p. 410) 


A high-intensity stimulus may be ineffective as a punisher 
if that stimulus was initially presented at low intensity and 
the intensity is increased gradually (Hineline & Rosales-Ruiz, 
2013). Recovery of responding to pre-punished levels is 
more likely to rebound when (a) the punishment was mild, 
(b) habituation to the punisher occurred, (c) the person can dis- 
criminate that the punishment contingency is no longer active, 
(d) the punished behavior is reinforced inadvertently, or (e) staff 
training (or the lack of it) militates against treatment integrity 
(Lerman & Toole, 2011). 

However, as Lerman and Vorndran (2002) pointed out, rel- 
atively few applied studies have examined the relation between 


punishment intensity and treatment efficacy, and that research 
has yielded inconsistent and sometimes contradictory results 
(Cole, Montgomery, Wilson, & Milan, 2000; Singh, Dawson, 
& Manning, 1981; Williams, Kirkpatrick-Sanchez, & Iwata, 
1993). When selecting the intensity of a punishing stimulus, the 
practitioner should ask: Will this degree of the punisher suppress 
occurrences of the problem behavior? Lerman and Vorndran 
(2002) recommended, “While the punishing stimulus needs to 
be intensive enough for an effective application, it should not be 
more intense than necessary.” (p. 443) 


Schedule and Consistency of Punishment 


The suppressive effects of a punisher are maximized by a con- 
tinuous schedule of punishment (FR 1) in which each occur- 
rence of the behavior is followed by the punishing consequence. 
In general, the greater the proportion of responses that are 
followed by the punisher, the greater the response reduction 
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(Azrin, Holz, & Hake, 1963; Zimmerman & Ferster, 1962). 
Azrin and Holz (1966) summarized the comparative effects of 
punishment on continuous and intermittent schedules as follows: 


Continuous punishment produces more suppression 

than does intermittent punishment for as long as the 
punishment contingency is maintained. However, after 
the punishment contingency has been discontinued, 
continuous punishment allows more rapid recovery of the 
responses, possibly because the absence of punishment 
can be more rapidly discriminated. (p. 415) 


Intermittent punishment, however, may be partially 
effective under some conditions (Cipani, Brendlinger, 
McDowell, & Usher, 1991; Clark, Rowbury, Baer, & Baer, 
1973; Romanczyk, 1977). Lerman, Iwata, Shore, and DeLeon 
(1997) demonstrated that a gradual thinning of the punishment 


schedule might maintain the suppressive effects of punishment 
that was initially delivered on a continuous schedule (FR 1). 
Participants were five adults with profound intellectual dis- 
abilities and histories of chronic self-injurious behavior (SIB) 
in the form of hand mouthing or head hitting. Treatment by 
punishment (time-out from reinforcement for one participant 
and contingent restraint for the other four) delivered on a con- 
tinuous (FR 1) schedule produced marked reductions in SIB 
from baseline levels for all five participants. (Figure 14.6 shows 
the results for three of the five participants.) The participants 
were then exposed to intermittent schedules of punishment (FI 
120-sec or FI 300-sec). On the FI 120-sec schedule, the thera- 
pist delivered punishment contingent on the first SIB response 
after 120 seconds had elapsed since the previous application 
of punishment or the start of the session. The frequency of 
SIB under the intermittent schedule of punishment for all but 
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one participant (Wayne, not shown in Figure 14.6) increased 
to baseline levels. 

After re-establishing low levels of SIB for each participant 
with FR 1 punishment, the researchers gradually thinned the 
punishment schedules. For example, schedule thinning for Paul 
consisted of increasing the duration of the fixed interval duration 
by 30-sec increments to FI 300-sec (i.e., FR 30-sec, FR 60-sec, 
FR 90-sec, etc.). With the exception of a few sessions, Paul’s SIB 
remained at low levels as the punishment schedule was progres- 
sively thinned over 57 sessions. During the final 11 sessions, in 
which punishment was delivered on an FI 300-sec schedule, his 
SIB occurred at a mean of 2.4% of observed intervals (compared 
to 33% in baseline). A similar pattern of success with a gradually 
thinning punishment schedule was obtained for another subject, 
Wendy (not shown in Figure 14.6). 

The effectiveness of punishment on an FI 300-sec 
schedule for three of the five participants—Paul, Wendy, and 
Wayne (whose SIB remained low even though punishment 
was abruptly changed from FR | to FI 120-sec and then FI 
300-sec)—enabled infrequent application of punishment. In 
practice, this would free therapists or staff from continuous 
monitoring of behavior. 

For Melissa and Candace, however, repeated attempts to 
gradually thin the punishment schedule proved unsuccessful 
in maintaining the frequency of their SIB at levels attained by 
FR 1 punishment. Lerman and colleagues (1997) speculated 
that one explanation for the ineffectiveness of punishment on a 
fixed interval schedule was that after a person has experienced 
an FI schedule for some time, the delivery of punishment could 
function “as a discriminative stimulus for punishment-free peri- 
ods, leading to a gradual overall increase in responding under FI 
punishment” (p. 198). 


Reinforcement for the Target Behavior 


The effectiveness of punishment is modulated by the reinforce- 
ment contingencies maintaining the problem behavior. [fa problem 
behavior occurs at a frequency sufficient to cause concern, it is pre- 
sumably producing reinforcement. If the target response is never 
reinforced, then “punishment would scarcely be possible since the 
response would rarely occur” (Azrin & Holz, 1966, p. 433). 

To the extent that the reinforcement maintaining the prob- 
lem behavior can be reduced or eliminated, punishment will be 
more effective. Of course, if all reinforcement for the problem 
behavior was withheld—a condition difficult to achieve in many 
settings—the resulting extinction schedule would decrease the 
behavior independent of the presence of a punishment contin- 
gency. However, as Azrin and Holz (1966) pointed out: 


The physical world often provides reinforcement con- 
tingences that cannot be eliminated easily. The faster we 
move through space, the quicker we get to where we are 
going, whether the movement be walking or driving an 
auto. Hence, running and speeding will inevitably be rein- 
forced. Extinction of running and speeding could be accom- 
plished only by the impossible procedure of eliminating 

all reinforcing events that result from movement through 
space. Some other reductive method, such as punishment, 
must be used. (p. 433) 
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Reinforcement for Alternative Behaviors 


Holz, Azrin, and Ayllon (1963) found that punishment was inef- 
fective in reducing psychotic behavior when that behavior was 
the only means by which patients could attain reinforcement. 
However, when patients could emit an alternative response that 
resulted in reinforcement, punishment was effective in reduc- 
ing their inappropriate behavior. Summing up laboratory and 
applied studies that have reported the same finding, Millenson 
(1967) stated: 


If punishment is employed in an attempt to eliminate 
certain behavior, then whatever reinforcement the unde- 
sirable behavior had led to must be made available via a 
more desirable behavior. Merely punishing school chil- 
dren for “misbehavior” in class may have little permanent 
effect. ... The reinforcers for “misbehavior” must be 
analyzed and the attainment of these reinforcers perhaps 
permitted by means of different responses, or in other 
situations. .. . But for this to happen, it appears import- 
ant to provide a rewarded alternative to the punished 
response. (p. 429) 


A study by Thompson, Iwata, Conners, and Roscoe (1999) 
provides an excellent illustration of how the suppressive effects 
of punishment can be enhanced by reinforcement for an alterna- 
tive response. Four adults with developmental disabilities who 
had been referred to a day treatment program for self-injurious 
behavior (SIB) participated in the study. Twenty-eight-year-old 
Shelly, for example, expelled saliva and then rubbed it onto 
her hands and other surfaces (e.g., tables, windows), which 
led to frequent infections; and Ricky, a 34-year-old man with 
deaf-blindness, frequently hit his head and body, resulting in 
bruises (contusions). Previous interventions such as differential 
reinforcement of appropriate behavior, response blocking, and 
protective equipment had been ineffective in reducing the SIB 
of all four participants. 

A functional behavior analysis (see Chapter 27) with each 
participant suggested that SIB was maintained by automatic 
reinforcement. Reinforcer assessments were conducted to iden- 
tify materials that produced the highest levels of contact with the 
lowest levels of SIB (e.g., wooden stringing beads, a mirrored 
microswitch that produced vibration and music, a balloon). The 
researchers then conducted a punisher assessment to determine 
the least intrusive consequences that produced a 75% reduction 
in SIB for each participant. 

Using an experimental design that combined reversal, 
alternating treatments, and multiple baseline components, 
Thompson and colleagues (1999) analyzed the effects of 
punishment with and without reinforcement of alternative behav- 
ior on participants’ SIB. During the no-punishment condition, 
the therapist was present in the room, but did not interact with 
the participant or provide any consequences for SIB. During the 
punishment condition, immediately following each occurrence of 
SIB the therapist delivered the consequence previously identified 
as a punisher for the participant. For example: 


Each time Shelly expelled saliva, the therapist delivered 
a reprimand (“no spitting”) and briefly dried each of her 
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hands (and any other wet surfaces) with a cloth. Ricky’s 
hands were held in his lap for 15 s each time he engaged in 
SIB. Donna and Lynn both received a verbal reprimand and 
had their hands held across their chests for 15 s following 
SIB. (p: 321) 


Within each no-punishment and punishment phase, ses- 
sions of reinforcement and no-reinforcement conditions were 
alternated. During reinforcement sessions, participants had con- 
tinual access to leisure materials or activities previously iden- 
tified as highly preferred; during no-reinforcemient sessions, 
participants had no access to leisure materials. 

The study’s results are shown in Figure 14.7. During the 
no-punishment baseline phase, only Shelly’s SIB was consis- 
tently lower during the reinforcement sessions than during the 


Responses per Minute 


Responses per Minute 


% Intervals 


2 

= 

Figure 14.7 Self-injurious behavior by four adults s 
with developmental disabilities during alternating a 
reinforcement and no-reinforcement conditions M1 ay 

across no-punishment and punishment phases. ra 

8 

“Effects of Reinforcement for Alternative Behavior during le 

Punishment for Self-Injury,” by R. H. Thompson, B. A. Iwata, Pod 


J. Conners, and E. M. Roscoe, 1999, Journal of Applied 
Behavior Analysis, 32, p. 323. Reproduced with permission of 
John Wiley & Sons Inc. 


no-reinforcement sessions. Although the introduction of punish- 
ment reduced SIB from baseline levels for all four participants, 
punishment was more effective during those sessions in which 
reinforcement for alternative behaviors was available. Also, 
fewer punishers were delivered during the punishment condition 
sessions when reinforcement was available. 

Thompson and colleagues (1999) concluded that the 
effects of punishment can be improved when reinforcement is 
applied for alternative behaviors, and that practitioners need not 
increase the aversiveness of the punishing stimulus to increase 
the effectiveness of punishment. Adding a reinforcement 
component to the punishment procedure may be sufficient to 
achieve desired results, while simultaneously implementing a 
less restrictive approach. 


Punish No Punish Punish 


No Punish 


“ 'e is Salhi Lynn 
—-~ POry (i ~e-\-e-e, pe, 
10 20 30 40 50 60 70 


Sessions 


POSSIBLE SIDE EFFECTS AND PROBLEMS 
WITH PUNISHMENT 


A variety of side effects are often correlated with punishment, 
including the elicitation of undesirable emotional responses 
and aggression, escape and avoidance, and an increased rate of 
the problem behavior under nonpunishment conditions (Azrin 
& Holz, 1966; Hutchinson, 1977; Linscheid & Meinhold, 
1990). Other problems noted include modeling undesirable 
behavior and overusing punishment because reductions in 


problem behavior can function as negative reinforcement for 
caregivers. 


Emotional and Aggressive Reactions 


Punishment, especially positive punishment in the form of 
aversive stimulation, may evoke aggressive behavior with 
respondent and operant components (Azrin & Holz, 1966). 
For example, electric shock elicited reflexive forms of aggres- 
sion and fighting in laboratory animals (Azrin, Hutchinson, & 
Hake, 1963; Ulrich & Azrin, 1962; Ulrich, Wolff, & Azrin, 
1962). Such pain-elicited, or respondent aggression, is directed 
toward any nearby person or object. A student who is punished 
severely may begin to throw and destroy materials within her 
reach, and attack other students or the person delivering the 
punishment. Aggressive behavior following punishment that 
occurs because it has enabled the person to escape the aver- 
sive stimulation in the past is referred to as operant aggression 
(Azrin & Holz, 1966). 


Escape and Avoidance 


Escape and avoidance are expected reactions to aversive stim- 
ulation. Escape and avoidance behaviors take a wide variety of 
forms, some of which may be a greater problem than the target 
behavior being punished. For example, a student who is admon- 
ished repeatedly for sloppy work or coming to class unprepared 
may stop coming to class altogether. A person may lie, cheat, 
hide, or exhibit other undesirable behaviors to avoid punish- 
ment. Mayer, Sulzer, and Cody (1968) indicated that escape and 
avoidance do not always occur in the literal sense of those terms. 
Individuals may escape punishing environments by consuming 
drugs or alcohol, or by simply “tuning out.” 

As the intensity of a punisher increases, so, too, does the 
likelihood of escape and avoidance. For example, in a study 
evaluating the effectiveness of a specially designed cigarette 
holder that delivered an electric shock to the user when it was 
opened as an intervention for reducing cigarette smoking, Pow- 
ell and Azrin (1968) found the following: “As the punishment 
intensity increased, the duration decreased for which the sub- 
jects would remain in contact with the contingency; ultimately, 
an intensity was reached at which they refused to experience it 
altogether” (p. 69). 

Escape and avoidance as side effects to punishment, like 
emotional and aggressive reactions, can be minimized or pre- 
vented altogether by providing desirable alternative responses to 
the problem behavior that both avoid the delivery of punishment 
and provide reinforcement. 
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Behavioral Contrast 


Reynolds (1961) introduced the term behavioral contrast to 
refer to the phenomenon in which a change in one component 
of a multiple schedule that increases or decreases the rate of 
responding on that component is accompanied by a change in 
the response rate in the opposite direction on the other, unaltered 
component of the schedule.* Behavioral contrast can occur as a 
function of a change in reinforcement or punishment density on 
one component of a multiple schedule (Brethower & Reynolds, 
1962; Lattal & Griffin, 1972). For example, behavioral contrast 
for punishment takes the following general form: (a) Responses 
are occurring at similar rates on two components of a multiple 
schedule (e.g., a pigeon pecks a backlit key, which alternates 
between blue and green, reinforcement is delivered on the same 
schedule on both keys, and the bird pecks at roughly the same 
rate regardless of the key’s color); (b) responses on one com- 
ponent of the schedule are punished, whereas responses on the 
other component continue to go unpunished (e.g., pecks on the 
blue key are punished, and pecks on the green key continue to 
produce reinforcement at the prior rate); (c) rate of respond- 
ing decreases on the punished component and increases on the 
unpunished component (e.g., pecks on the blue key are sup- 
pressed and pecks on the green key increase even though pecks 
on the green key produce no more reinforcement than before). 
Here is a hypothetical example to further illustrate 
the contrast effect of punishment. A child is eating cookies 
before dinner from the kitchen cookie jar at equal rates in the 
presence and absence of his grandmother. One day, Grandma 
scolds the child for eating a cookie before dinner, which sup- 
presses his rate of pre-dinner cookie eating when she is in 
the kitchen (see Figure 14.2); but when Grandma’s not in the 
kitchen, the boy eats cookies from the jar at a higher rate than 
he did when unsupervised prior to punishment. Contrast effects 
of punishment can be minimized, or prevented altogether, by 
consistently punishing occurrences of the target behavior in 
all relevant settings and stimulus conditions, withholding or at 
least minimizing the person’s access to reinforcement for the 
target behavior, and providing alternative desirable behaviors. 
(With respect to our hypothetical case of the child eating cookies 
before dinner, we recommend simply removing the cookie jar!) 


Punishment May Involve Modeling of 
Undesirable Behavior 


Most readers are familiar with the example of the parent who, 
while spanking a child, says, “This will teach you not to hit 
your playmates!” Unfortunately, the child may be more likely 
to imitate the parent’s actions, not the parent’s words. More 
than three decades of research have found a strong correla- 
tion between young children’s exposure to harsh, excessive 
punishment and antisocial behavior and conduct disorders in 
adolescents and adults (Patterson, 1982; Patterson, Reid, & 
Dishion, 1992; Sprague & Walker, 2000). Although the appro- 
priate use of behavior change tactics based on the punishment 
principle need not involve harsh treatment or negative personal 
interactions, practitioners should heed Bandura’s (1969) valu- 
able counsel in this regard: 
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Anyone attempting to control specific troublesome 
responses should avoid modeling punitive forms of behav- 
ior that not only counteract the effects of direct training but 
also increase the probability that on future occasions the 
individual may respond to interpersonal thwarting in an 
imitative manner. (p. 313) 


Punishment May Be Overused due to 
Negative Reinforcement of the Punishing 
Agent's Behavior 


Negative reinforcement may be a reason for the widespread use 
of, and reliance upon, punishment in child rearing, education, 
and society. When Person A delivers a reprimand or other aver- 
sive consequence to Person B for misbehaving, the immediate 
effect is often the temporary cessation of the troubling behav- 
ior. The termination of this aversive stimulus serves as negative 
reinforcement for Person A’s behavior. Reese (1966) stated the 
case succinctly: “Punishment reinforces the punisher” (p. 37). 

Alber and Heward (2000) described how the natural con- 
tingencies at work in a typical classroom could strengthen a 
teacher’s use of reprimands for disruptive behavior while under- 
mining her use of contingent praise and attention for appropriate 
behavior. 


Paying attention to students when they are behaving inap- 
propriately (e.g., “Carlos, you need to sit down right now!”) 
is negatively reinforced by the immediate cessation of the 
inappropriate behavior (e.g., Carlos stops running around 
and returns to his seat). As a result, the teacher is more 
likely to attend to student disruptions in the future. . . . 
Although few teachers must be taught to reprimand stu- 
dents for misbehavior, many teachers need help increasing 
the frequency with which they praise student accomplish- 
ments. Teacher-praising behavior is usually not reinforced 
as effectively as teacher-reprimanding behavior. Praising a 
student for appropriate behavior usually produces no imme- 
diate effects—the child continues to do his work when 
praised. Although praising a student for working produc- 
tively on an assignment may increase the future likelihood 
of that behavior, there are no immediate consequences for 
the teacher. By contrast, reprimanding a student often pro- 
duces an immediate improvement in the teacher’s world 

(if only temporary }—that functions as effective negative 
reinforcement for reprimanding. (pp. 178-179) 


POSITIVE PUNISHMENT INTERVENTIONS 


Interventions featuring positive punishment interventions can 
take a wide variety of forms. We describe five in this section: 
reprimands, response blocking, response interruption and redi- 
rection (RIRD), contingent exercise, and overcorrection. 


Reprimands 


It may seem strange that immediately following a discussion 
of teachers’ overreliance on reprimands, our first example of 
a positive punishment intervention focuses on reprimands. 
The delivery of verbal reprimands following the occurrence of 


misbehavior is perhaps the most common form of attempted 
positive punishment. However, a number of studies have shown 
that a firmly said “No!” or “Stop! Don’t do that!” delivered 
immediately on the occurrence of a behavior can function as 
effective punishment (e.g., Hall et al., 1971; Thompson et al., 
1999; Van Houten, Nau, Mackenzie-Keating, Sameoto, & Cola- 
vecchia, 1982). 

In spite of the widespread use of reprimands in an effort 
to suppress undesired behavior, surprisingly few studies have 
examined their effectiveness as punishers. Results of a series 
of experiments by Van Houten, Nau, Mackenzie-Keating, 
Sameoto, and Colavecchia (1982) designed to identify variables 
that increased the effectiveness of reprimands as punishers for 
disruptive behavior in the classroom revealed that reprimands 
delivered with eye contact and in close proximity to the student 
were more effective than reprimands delivered from across the 
room. An interesting study by O’Leary, Kaufman, Kass, and 
Drabman (1970) found that quiet reprimands that were audi- 
ble only to the child being reprimanded were more effective in 
reducing disruptive behavior than loud reprimands that could be 
heard by other children in the classroom. 

If being reprimanded is the only way a child receives adult 
attention, it should not be surprising that reprimands function as 
reinforcement for that child rather than as punishment. Indeed, 
Madsen, Becker, Thomas, Koser, and Plager (1968) found that 
the repeated use of reprimands while students were out of seat 
served to increase, rather than reduce, the behavior. Consistent 
with research on other punishing stimuli, reprimands are more 
effective as punishers when motivation for the problem behavior 
has been minimized and the availability of an alternative behav- 
ior maximized (Van Houten & Doleys, 1983). 

A parent or teacher does not want to adopt a pattern of 
constant reprimanding. Reprimands should be used thoughtfully 
and sparingly in combination with frequent praise and attention 
contingent on appropriate behavior. O’Leary and colleagues 
(1970) recommended: 


An ideal combination would probably be frequent praise, 
some soft reprimands, and very occasional loud repri- 
mands. . . . Combined with praise, soft reprimands might be 
very helpful in reducing disruptive behaviors. In contrast, 

it appears that loud reprimands lead one into a vicious 
cycle of more and more reprimands resulting in even 

more disruptive behavior. (p. 155) 


Response Blocking 


Response blocking—physically intervening as soon as the per- 
son begins to emit the problem behavior, to prevent the com- 
pletion of the response—has shown effectiveness in reducing 
the frequency of some problem behaviors, such as chronic hand 
mouthing, eye poking, and pica (e.g., Lalli, Livezey, & Kates, 
1996; Lerman & Iwata, 1996; Reid, Parsons, Phillips, & Green, 
1993). In addition to preventing the response from occurring by 
using the least amount of physical contact and restraint possible, 
the therapist might issue a verbal reprimand or prompt to stop 
engaging in the behavior (e.g., Hagopian & Adelinis, 2001). 
Lerman and Iwata (1996) used response blocking in treat- 
ing the chronic hand mouthing (contact between any part of the 


hand and the lips or mouth) of Paul, a 32-year-old man with pro- 
found intellectual disabilities. Following a baseline condition, in 
which Paul was seated in a chair with no one interacting with 
him and no leisure materials were available, response blocking on 
an FR 1 schedule was implemented. A therapist sat behind Paul 
and blocked his attempts to put his hand in his mouth. “Paul was 
not prevented from bringing his hand to his mouth; however, the 
therapist blocked the hand from entering the mouth by placing the 
palm of her hand about 2 cm in front of Paul’s mouth” (p. 232). 
Response blocking produced an immediate and rapid decrease 
in hand-mouthing attempts to near-zero levels (see Figure 14.8). 

Response blocking is often implemented as a treatment 
for SIB or self-stimulatory behavior when functional analysis 
reveals consistent responding in the absence of socially mediated 
consequences, which suggests the possibility that the behavior 
is maintained by automatic reinforcement from sensory stimuli 
produced by the response. Because response blocking prevents 
the learner from contacting the sensory stimuli that are normally 
produced by completing the response, subsequent decreases in 
responding could be due to extinction. Lerman and Iwata (1996) 
presented their study as a potential method for distinguishing 
whether the suppressive effects of response blocking are due 
to punishment or extinction mechanisms. They explained their 
reasoning, in part, as follows: 


Depending on the mechanism through which behavior is 
reduced (extinction vs. punishment), different schedules 
of reinforcement or punishment are in effect when a given 
proportion of responses is blocked. .. . Thus, as larger pro- 
portions of responses are blocked, the reinforcement sched- 
ule becomes leaner and the punishment schedule becomes 
richer. If response blocking produces extinction, response 
rates should increase or be maintained as more responses 
are blocked (i.e., as the reinforcement schedule is thinned), 
until [the effects of] extinction [i.e., reduced response rate] 
occurs at some point along the progression. Conversely, 

if the procedure functions as punishment, response rates 
should decrease as more responses are blocked (i.e., as the 
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punishment schedule becomes richer). (pp. 231-232, words 
in brackets added) 


A condition in which all responses are blocked might 
function as an extinction schedule (i.e., reinforcement in the 
form of sensory stimuli is withheld for all responses) or as a 
continuous (FR 1) schedule of punishment (i.e., all responses 
are followed by physical contact). As Lerman and Iwata (1996) 
explained, if only some responses are blocked, the situation 
may function as an intermittent schedule of reinforcement or as 
an intermittent schedule of punishment. Therefore, comparing 
response rates among conditions in which different proportions 
of responses are blocked should indicate whether the effects are 
due to extinction or to punishment. 

If response blocking functioned as extinction for Paul’s 
hand mouthing, an initial increase in response rate would be 
expected when the blocking procedure was implemented for 
every response; however, no such increase was observed.> If 
response blocking functioned as punishment, blocking every 
response would constitute a continuous schedule of punishment 
and a rapid decrease in responding would be expected; and that 
is exactly what the results showed (see data for the first response 
block [1.0] phase in Figure 14.8). 

Conversely, if response blocking functioned as extinc- 
tion for Paul’s hand mouthing, then blocking some, but not all, 
responses would place the hand mouthing on an intermittent 
schedule of reinforcement and responding would be expected 
to increase from baseline levels. And blocking an ever-larger 
proportion of responses thins the reinforcement schedule further, 
causing the response rate to rise even higher. Instead, as a greater 
proportion of responses were blocked, the suppressive effects on 
Paul’s SIB became more pronounced, a result expected as a pun- 
ishment schedule becomes denser. Overall, therefore, the results 
of the experiment indicated that response blocking functioned as 
punishment for Paul’s hand mouthing. 

Smith, Russo, and Le (1999) conducted a systematic repli- 
cation of Lerman and Iwata’s (1996) experiment and found that 
the frequency of eye poking by a 41-year-old woman treated 
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with response blocking decreased gradually—a response pattern 
indicative of extinction. The authors concluded that “whereas 
blocking may reduce one participant’s behavior via punishment, 
it may extinguish another participant’s behavior” (p. 369). 

If response blocking occurs before the response is emitted, 
its suppressive effects cannot be due to punishment or extinction. 
Punishment and extinction are both behavior — consequence 
relations. Any procedure that prevents the target behavior from 
occurring in its entirety eliminates the behavior — consequence 
contingency and should be considered an antecedent intervention. 
However, if the problem behavior is a progression of movements or 
a response chain, then blocking the initial movement or response 
in the chain before the sequence produces the presumed reinforcer 
would alter the behavior — consequence contingency. For exam- 
ple, blocking a client’s open hand after he begins to move it toward 
contact with his head is a consequence whose suppressive effects 
can be analyzed in terms of punishment and/or extinction. 

Although response blocking may be viewed as a less 
restrictive and more humane intervention than delivering aversive 
stimulation after a response has occurred, it must be approached 
with great care. Side effects such as aggression and resistance 
to the response-blocking procedure have occurred (Hagopian & 
Adelinis, 2001; Lerman, Kelley, Vorndran, & Van Camp, 2003). 
Providing prompts and reinforcement for an alternative response 
can minimize resistance and aggression. For example, the aggres- 
sive behavior by a 26-year-old man with moderate intellectual 
disabilities and bipolar disorder during treatment with response 
blocking for pica (ingesting paper, pencils, paint chips, and 
human feces) was reduced by supplementing response blocking 
with a prompt and redirection to engage in an alternative behav- 
ior, in this case moving to an area of the room where popcorn 
was available (Hagopian & Adelinis, 2001). 


Response Interruption and Redirection (RIRD) 


Response interruption and redirection (RIRD) is a proce- 
dural variation of response blocking that includes interrupting 
stereotypic behavior at its onset and redirecting the individual 
to complete high-probability behaviors instead. Two types of 
RIRD have been identified and evaluated in the literature: motor 
RIRD and vocal RIRD (Ahearn, Clark, MacDonald, and Chung, 
2007; Cassella, Sidener, Sidener, & Progar, 2011; Shawler & 
Miguel, 2015; Wunderlich & Vollmer, 2015). Both types have 
been used primarily with individuals with autism who emitted 
high levels of repetitive, stereotypic, nonfunctional behaviors. 
Procedurally, RIRD is employed when, at the onset of a ste- 
reotypic behavior, the therapist immediately presents a series 
of instructions, or demands, requiring high-probability vocal 
or motor responses (e.g., “What am I holding?” “Tell me your 
name.” “Clap your hands.”’). RIRD is usually terminated upon a 
set criterion of successful responses (e.g., three correct in a row), 
although Saini, Gregory, Uran, and Fantetti (2015) found that 
one correct demand response was effective. RIRD has been used 
alone and combined with other procedures to reduce stereotypy 
so that students could engage in more interpersonal or socially 
appropriate behaviors. Practitioners who consider using RIRD 
procedures should have sufficient time, resources, and trained 


staff before implementation. RIRD can be labor- and time- 
intensive, and may extend teaching sessions beyond what can 
be accommodated in some educational or therapeutic settings. 


Contingent Exercise 


Contingent exercise is an intervention in which the client is 
required to perform a response that is not topographically related 
to the problem behavior. Contingent exercise has been found 
effective as punishment for various self-stimulatory, stereo- 
typic, disruptive, aggressive, and self-injurious behaviors (e.g., 
DeCatanzaro & Baldwin, 1978; Kern, Koegel, & Dunlap, 1984; 
Luce & Hall, 1981; Luiselli, 1984).° In perhaps the most fre- 
quently cited example of contingent exercise as a punisher, Luce, 
Delquadri, and Hall (1980) found that the repetition of mild exer- 
cise contingent on aggressive behavior by two boys with severe 
disabilities reduced it to near-zero levels. Figure 14.9 shows the 
results for Ben, a 7-year-old who frequently hit other children 
at school. Each time Ben hit someone, he was required to stand 
up and sit down 10 times. Initially, Ben had to be physically 
prompted to stand; an assistant held the child’s hand while pull- 
ing his upper body forward. Physical prompts were accompanied 
by verbal prompts of “stand up” and “sit down.” Soon, whenever 
hitting occurred, the nearest adult simply said, “Ben, no hitting. 
Stand up and sit down 10 times” and the verbal prompts alone 
were sufficient for Ben to complete the exercise. If a hitting epi- 
sode occurred during the contingent exercise, the procedure was 
reinstated. 


Overcorrection 


Overcorrection is a behavior reduction tactic in which, contin- 
gent on each occurrence of the problem behavior, the learner is 
required to engage in effortful behavior that is directly or logically 
related to the problem behavior. Originally developed by Foxx 
and Azrin (1972, 1973; Foxx & Bechtel, 1983) as a method for 
decreasing disruptive and maladaptive behaviors of adults with 
intellectual disabilities in institutional settings, overcorrection 
combines the suppressive effects of punishment and the educative 
effects of positive practice. Overcorrection includes either or both 
of two components: restitution and positive practice. 

In restitutional overcorrection, contingent on the prob- 
lem behavior, the learner is required to repair the damage caused 
by the problem behavior by returning the environment to its orig- 
inal state and then to engage in additional behavior that brings 
the environment to a condition better than it was prior to the 
misbehavior. A parent applying restitutional overcorrection with 
a child who repeatedly tracks mud onto the kitchen floor might 
require the child to first wipe up the mud and clean his shoes and 
then to over correct the effects of his misbehavior by mopping 
and waxing a portion of the floor and polishing his shoes. 

Azrin and Foxx (1971) used restitutional overcorrection 
in their toilet training program by requiring a person who had 
an accident to undress, wash her clothes, hang them up to dry, 
shower, dress in clean clothing, and then clean up a portion of 
the lavatory. Azrin and Wesolowki (1974) eliminated stealing 
of food by hospitalized adults with intellectual disabilities by 
requiring residents to return not only the stolen food, or the 
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Figure 14.9 Number of times a 7-year-old boy hit other children during 6-hour school 
days during baseline and contingent exercise. X’s represent response measures 


recorded by a second observer. 
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portion that remained uneaten, but also to purchase an additional 
item of that food at the commissary and give it to the victim. 

Azrin and Besalel (1999) differentiated a procedure they 
called simple correction from overcorrection. With simple cor- 
rection, the learner is required, subsequent to the occurrence of an 
inappropriate behavior, to restore the environment to its previous 
state. For example, a simple correction procedure is in effect when 
requiring a student who cuts to the head of the lunch line to go to 
the back of the line. Requiring the student to wait until all others 
get in line and are served before re-entering the line would con- 
stitute a form of overcorrection in this instance. Azrin and Besalel 
recommended that simple correction be used to reduce behaviors 
that are not severe, occur infrequently, are not deliberate, and do 
not severely interfere with or annoy other people. 

Correction is not possible if the problem behavior produces 
an irreversible effect (e.g., a one-of-a-kind dish is broken) or if 
the corrective behavior is beyond the person’s means or skills. In 
such instances, Azrin and Besalel (1999) recommended that the 
person be required to correct as much of the damage his behavior 
caused as possible, be present at all points in the correction, and 
assist with any parts of the correction that he is able to perform. 
For example, a child who breaks a neighbor’s window might be 
required to clean up the pieces of glass, measure the window, 
contact the store for a replacement pane, and be present and assist 
with each step when the new window pane is installed. 

In positive practice overcorrection, contingent on an 
occurrence of the problem behavior, the learner is required to 
repeatedly perform a correct form of the behavior, or a behavior 


incompatible with the problem behavior, for a specified duration 
of time or number of responses. Positive practice overcorrection 
entails an educative component in that it requires the person 
to engage in an appropriate alternative behavior. The parent 
whose son tracks mud into the house could add a positive prac- 
tice component by requiring him to practice wiping his feet on 
the outside door mat and then to enter the house for 2 minutes 
or 5 consecutive times. Overcorrection that includes restitution 
and positive practice helps teach what to do in addition to what 
not to do. The child who breaks an irreplaceable dish could be 
required to gently and slowly wash a number of dishes, perhaps 
with an exaggerated carefulness. 

Researchers and practitioners have used positive practice 
overcorrection to reduce the frequency of problem behaviors such as 
toilet training (Azrin & Foxx, 1971), self-stimulation and stereotypic 
behavior (Azrin, Kaplan, & Foxx, 1973; Foxx & Azrin, 1973b), pica 
(Singh & Winton, 1985), bruxism (Steuart, 1993), sibling aggres- 
sion (Adams & Kelley, 1992), and classroom disruptions (Azrin & 
Powers, 1975). Positive practice overcorrection has also been used 
for academic behaviors (Lenz, Singh, & Hewett, 1991), most often 
to decrease oral reading and spelling errors (e.g., Ollendick, Matson, 
Esveldt-Dawson, & Shapiro, 1980; Singh & Singh, 1986; Singh, 
Singh, & Winton, 1984; Stewart & Singh, 1986). 

Positive practice overcorrection can also be applied to 
reduce or eliminate behaviors that do not create permanent 
response products that can be repaired or restored to their orig- 
inal state. For example, Heward, Dardig, and Rossett (1979) 
described how parents used positive practice overcorrection to 
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help their teenage daughter stop making a grammatical error in 
her speech. Eunice frequently used the contraction “don’t” instead 
of “doesn’t” with the third person singular (e.g., “He don’t want 
to go.”). A positive reinforcement program in which Eunice 
earned points she could redeem for preferred activities each 
time she used “doesn’t” correctly had little effect on her speech. 
Eunice agreed with her parents that she should speak correctly 
but claimed the behavior was a habit. Eunice and her parents then 
decided to supplement the reinforcement program with a mild 
punishment procedure. Each time Eunice or her parents caught 
her using “don’t” incorrectly in her speech, she was required to 
say the complete sentence she had just spoken 10 times in a row, 
using correct grammar. Eunice wore a wrist counter that reminded 
her to listen to her speech and to keep track of the number of times 
she employed the positive practice procedure. 

When positive practice effectively suppresses the prob- 
lem behavior, it is not clear what behavioral mechanisms are 
responsible for the behavior change. Punishment may result in 
decreased frequency of responding, because the person engages 
in effortful behavior as a consequence of the problem behavior. 
A reduction in the frequency of the problem behavior as an out- 
come of positive practice may also be a function of an increased 
frequency of an incompatible behavior, the correct behavior that 
is strengthened in the person’s repertoire as the result of the 
intensive, repeated practice. Azrin and Besalel (1999) suggested 
that positive practice may be effective because it provides addi- 
tional positive learning opportunities that have been absent in 
prior environments, and also requires “extra effort” that discour- 
ages future misbehaviors. 

Specific procedures for implementing overcorrection vary 
greatly. Factors such as the type and severity of the problem 
behavior, its effects on the environment, the setting, the desired 
alternative behavior, and the learner’s current skills should 
be considered. Figure 14.10 provides general guidelines for 
implementing overcorrection (Azrin & Besalel, 1999; Foxx & 
Bechtel, 1983; Kazdin, 2001; Miltenberger & Fuqua, 1981). 


Figure 14.10 Guidelines for implementing overcorrection. 


Although rapid and long-lasting therapeutic effects have 
been reported after corrective training, practitioners should be 
aware of several potential problems and limitations associated 
with overcorrection. First, overcorrection is a labor-intensive, 
time-consuming procedure that requires the full attention of 
the practitioner implementing it. Implementing overcorrection 
usually requires the practitioner to monitor the learner directly 
throughout the overcorrection process. Second, for overcor- 
rection to be effective as punishment, the time that the learner 
spends with the person monitoring the overcorrection sequence 
must not be reinforcing. “If it is, it just might be worth waxing 
the entire kitchen floor if Mom chats with you and provides a 
milk and cookies break” (Heward et al., 1979, p. 63). 

Third, a child who misbehaves frequently may not execute 
a long list of “cleanup” behaviors just because he was told to do 
so. Azrin and Besalel (1999) recommended three strategies to 
minimize the likelihood of refusal to perform the overcorrection 
sequence: (1) remind the learner what the more severe disci- 
plinary action will be and, if the refusal persists, then impose 
that discipline; (2) discuss the need for correction before the 
problem behavior occurs; and (3) establish correction as an 
expectation and routine habit for any disruptive behavior. If the 
child resists too strongly or becomes aggressive, overcorrection 
may not be a viable treatment. Adult learners must voluntarily 
make the decision to perform the overcorrection routine. 


GUIDELINES FOR USING PUNISHMENT 


Assuming that habituation, treatment integrity, reinforcement, 
and training issues can be addressed, punishment can yield 
rapid, long-lasting suppression of problem behavior. Unfortu- 
nately, however, agency policies, human subject review proce- 
dures, and historical practices have limited the use of punishment 
for research and treatment in clinic-based settings (Grace, Kahng, 
& Fisher, 1994). Still, punishment may be a treatment of choice 
when (a) the problem behavior produces serious physical harm 


1. Immediately upon the occurrence of the problem behavior (or the discovery of its effects), in a calm, unemotional tone 
of voice, tell the learner that he has misbehaved and provide a brief explanation for why the behavior must be corrected. 
Do not criticize or scold. Overcorrection entails a logically related consequence to reduce future occurrences of the problem 
behavior; criticism and scolding do not enhance the tactic's effectiveness and may harm the relationship between the 


learner and practitioner. 


Provide explicit verbal instructions describing the overcorrection sequence the learner must perform. 


3. Implement the overcorrection sequence as soon as possible after the problem behavior has occurred. When circumstances 
prevent immediately commencing the overcorrection sequence, tell the learner when the overcorrection process will be 


conducted. Several studies have found that overcorrection conducted at a later time can be effective (Azrin & Powers, 1975: 


Barton & Osborne, 1978). 


1 


4. Monitor the learner throughout the overcorrection activity. Provide the minimal number of response prompts, including 
gentle physical guidance, needed to ensure that the learner performs the entire overcorrection sequence. 


5. Provide the learner with minimal feedback for correct responses. Do not give too much praise and attention to the learner 


during the overcorrection sequence. 


6. Provide praise, attention, and perhaps other forms of reinforcement to the learner each time he “spontaneously” performs 
the appropriate behavior during typical activities. (Although technically not part of the overcorrection procedure, reinforcing 
an alternative behavior is a recommended complement to all punishment-based interventions.) 


and must be suppressed quickly, (b) reinforcement-based treat- 
ments have not reduced the problem behavior to socially accept- 
able levels, or (c) the reinforcer maintaining the problem behavior 
cannot be identified or withheld (Lerman & Vorndran, 2002). 
If the decision is made to use a punishment-based interven- 
tion, steps should be taken to ensure that the intended punisher 
actually works. The guidelines that follow will help practitioners 
apply punishment with optimal effectiveness while minimizing 
undesirable side effects and problems. We based these guidelines 
on the assumption that the analyst has conducted a functional 
behavior assessment to identify the variables that maintain the 
problem behavior, that the problem behavior has been defined 
clearly, and that the participant cannot avoid or escape the punisher. 


Select Effective and Appropriate Punishers 


Conduct Punisher Assessments 


Aversive stimulus assessments are conducted to identify stim- 
uli that function as punishers. Fisher and colleagues (1994) 
identified two advantages of conducting punisher assessments. 
First, the sooner an ideal punisher can be identified, the sooner 
it can be applied to the problem behavior. Second, data from 
punisher assessments might reveal the magnitude or intensity of 
the punisher necessary for behavioral suppression, enabling the 
practitioner to deliver the lowest intensity punisher to achieve 
socially significant response suppression. 

Punisher assessment mirrors reinforcement assessment, 
except that instead of measuring engagement or duration of 
contact with each stimulus, the behavior analyst measures 
overall response suppression, negative verbalizations, avoid- 
ance movements, escape attempts, or disruptions associated 
with each potential punishing stimulus. Data from the punisher 
assessments are then used to develop a hypothesis on the relative 
comparative effects of each stimulus change as a punisher. 

The decision of which among several potential punishers 
to choose should be based on the relative degree of punisher intru- 
siveness, the ease with which the punisher can be consistently 
and safely delivered, and data on potential response suppression. 
Subsequent observation might reveal that a consequence that is 
less intrusive, time-consuming, or difficult to apply might pro- 
duce a significant decrease in the targeted behavior (Thompson 
etl L909). 


Use Varied Punishers of Sufficient 
Quality and Intensity 


The quality of a punisher is relative to a number of past and cur- 
rent variables that affect the participant. For example, Thompson 
and colleagues found that 15 seconds of physical restraint was a 
high-quality punisher for Ricky, Donna, and Lynn, but the physical 
restraint was not as ideal for Shelly. Although stimuli that reliably 
evoke escape and avoidance behaviors often function as high- 
quality punishers, practitioners should recognize that a stimulus 
change that effectively suppresses some behaviors may not affect 
other behaviors and that highly motivated problem behaviors may 
only be suppressed by a particularly high-quality punisher. 
Generally, basic and applied researchers have found that 
the greater the intensity, magnitude, or amount of a punishing 
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stimulus, the greater the suppression of behavior. This finding is 
conditional on delivering the punishing stimulus at its optimum 
level of magnitude initially, rather than gradually increasing the 
level over time (Azrin & Holtz, 1966). For example, Thompson 
and colleagues (1999) used the previously described punisher 
assessment procedure to determine an optimum magnitude or 
duration level for a punishing stimulus: a level and duration that 
produced a 75%, or greater, decrease in self-injurious behaviors 
(SIB) from baseline. Specifically, they determined that to meet the 
criterion of a punishing stimulus, one participant had his hands 
held in his lap for 15 seconds when SIB occurred, whereas two 
Other participants received a verbal reprimand and their hands 
were held against their body for 15 seconds following SIB. 

Beginning with a punishing stimulus of sufficient intensity 
is important, because participants may adapt to the punishing 
stimulus when levels of magnitude are increased gradually. It 
is possible that 15 seconds of movement restriction would have 
been a poor punisher for the participants if Thompson et al. 
(1999) had begun with a punishing stimulus of 3 seconds and 
gradually increased its magnitude from 3- to 6- to 9- to 15-sec 
intervals. Hineline and Rosales-Ruiz (2013) advise practitioners 
to begin “with an intense but safe, practical, and accepted pun- 
isher and then withhold further increases in intensity until the 
other variables are ruled out” (p. 488). 


Use Varied Punishers 


The effectiveness of a punishing stimulus can decrease with 
repeated presentations of that stimulus. Using a variety of pun- 
ishers may help to reduce habituation effects. In addition, using 
various punishers may increase the effectiveness of less intru- 
sive punishers. For example, Charlop, Burgio, Iwata, and Ivancic 
(1988) compared various punishers to a single presentation of one 
of the punishers (i.e., a stern “No!”, overcorrection, time-out with 
physical restraint, a loud noise). Three-, 5-, and 6-year-old chil- 
dren with developmental disabilities served as participants. Their 
problem behaviors included aggression (Child 1), self-stimulation 
and destructive behavior (Child 2), and aggression and out-of- 
seat behavior (Child 3). The varied-punisher condition was 
slightly more effective than the single-presentation condition and 
enhanced the sensitivity of the behavior to less intrusive punish- 
ing stimuli (see Figure 14.11). Charlop and colleagues concluded, 
“Tt appears that by presenting a varied format of commonly used 
punishers, inappropriate behaviors may further decrease without 
the use of more intrusive punishment procedures” (p. 94). 


Deliver the Punisher at the Beginning of a 
Behavioral Sequence 


Punishing an inappropriate behavior as soon as it begins is 
more effective than waiting until the chain of behavior has been 
completed (Solomon, 1964). Once the sequence of responses 
that make up the problem behavior is initiated, powerful sec- 
ondary reinforcers associated with completing each step of the 
chain may prompt its continuation, thereby counteracting the 
inhibiting or suppressing effects of the punishment that occurs 
at the end of the sequence. Therefore, whenever practical, the 
punishing stimulus should be presented early in the behavioral 


382 PartS » Punishment 


Treatment Treatment 


Baseline 


Treatment Baseline Treatment 


Baseline 


ilime-Out Overcorrection 


Percent Intervals 


ij 


20 25 30 35 40 45 50 
Sessions 


Figure 14.11 Percentage of intervals of occurrence of self-stimulation and destructive 


behavior for a 6-year-old girl with autism. 


“Stimulus Variation as a Means of Enhancing Punishment Effects,” by M. H. Charlop, L. D. Burgio, 
B. A. Iwata, and M. T. lvancic, 1988, Journal of Applied Behavior Analysis, 21, p. 92. Reproduced with 
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sequence rather than later (Heron, 1978). For example, if violent 
arm swinging is a reliable precursor to self-injurious eye poking, 
then punishment (e.g., response blocking, restraint) should be 
delivered as soon as arm swinging starts. 


Punish Each Instance of the Behavior Initially 


Punishment is most effective when the punisher follows each 
response. This consistency is especially important when a pun- 
ishment intervention is first implemented. Consistency goes 
beyond just programming an effective punishment schedule, that 
is, when punishment is delivered. Consistency refers to whether 
the punisher itself, its application as a behavior — consequence 
functional relation, and its procedural elements are being imple- 
mented as intended. Consistency, therefore, relates to treatment 
integrity. One can easily imagine the situation in which the 
disruptive behavior of an adolescent high school student with 
multiple teachers is supposedly being addressed with a plan 
whereby contingent on a disruptive behavior, each teacher will 
point his or her finger toward the student and emit a sharp rep- 
rimand, “No!”. However, in reality some of the teachers follow 
the plan, but others do not. Or, a few teachers say “Noooo” with 
a semi-joking intonation, while others issue a firm reprimand. 
When these inconsistent situations occur, treatment integrity 
is at risk. Consequently, it becomes difficult to determine if 
the failure to suppress the disruptive behavior is the result of a 
stimulus that does not serve as a punisher, or if the punishment 
procedure is compromised by one or more treatment implemen- 
tation failures. 


Gradually Shift to an Intermittent 
Schedule of Punishment 


Although punishment is most effective when the punisher 
immediately follows each occurrence of the problem behavy- 
ior (Hineline & Rosales-Ruiz, 2013), practitioners may find a 
continuous schedule of punishment impractical because they 
lack the resources and time to attend to each occurrence of the 
behavior (O’ Brien & Karsh, 1990). Several studies have found 
that, after responding has been reduced by a continuous sched- 
ule of punishment, an intermittent schedule of punishment may 
be sufficient to maintain the behavior at a socially acceptable 
frequency (Clark, Rowbury, Baer, & Baer, 1973; Lerman et al., 
1997; Romanczyk, 1977). 

We recommend two guidelines for using intermittent pun- 
ishment. First—and this is especially important—a continuous 
(FR 1) schedule of punishment should be used to diminish the 
problem behavior to a clinically acceptable level before gradu- 
ally thinning to an intermittent schedule of punishment. Second, 
combine intermittent punishment with extinction. It is unlikely 
that reduced responding will continue under intermittent pun- 
ishment if the reinforcer that maintains the problem behavior 
cannot be identified and withheld. If these two guidelines for 
intermittent punishment are met and the frequency of the prob- 
lem behavior increases to an unacceptable level, return to a 
continuous schedule of punishment and then, after recovery of 
acceptably low rates of responding, gradually shift to an inter- 
mittent schedule of punishment that is denser than the one used 
previously (e.g,, WR 2 rather than VR 4). 


Mediate Unavoidable Response-to-Punishment 
Delay 


Consequences intended to serve as punishers for problem behay- 
ior are frequently delayed in the natural environment. Realisti- 
cally, delays may occur because caregivers are not able to monitor 
behaviors continuously and may miss them, the individual may 
escape the delivery of the punisher, or the misbehavior may go 
undetected (Lerman & Vorndran, 2002). When punishers cannot 
be delivered immediately, applied behavior analysts should strive 
to mediate response-to-punishment delay. 

For example, before administering a delayed punisher, 
Rolider and Van Houten (1985) played audiotapes of the child’s 
tantrums earlier in the day. Explaining the tape recorder’s pur- 
pose to the child and leaving it in sight while the recordings 
were collected might have resulted in the tape recorder bridg- 
ing the temporal gap between inappropriate behavior and its 
consequence (1.e., by functioning as discriminative stimuli for 
punishment) (Lerman & Vorndran, 2002). 


Supplement Punishment with Complementary 
Interventions 


Applied behavior analysts typically do not use punishment as 
a single intervention: They combine punishment with other 
interventions, primarily differential reinforcement, extinction, 
and a variety of antecedent interventions. Basic and applied 
researchers consistently find that the effectiveness of punishment 
is enhanced when the learner can make other responses for 
reinforcement. In most circumstances, applied behavior analysts 
should incorporate differential reinforcement of alternative 
behavior (DRA), differential reinforcement of incompatible 
behavior (DRI), or differential reinforcement of other behaviors 
(DRO) (see Chapter 25) into a treatment program to supplement 
a punishment. When used as a reductive procedure for problem 
behavior, differential reinforcement consists of two components: 
(1) providing reinforcement contingent on the occurrence of a 
behavior other than the problem behavior and (2) withholding 
reinforcement for the problem behavior (i.e., extinction). The 
study by Thompson and colleagues (1999) presented previously 
provides an excellent example of how reinforcement of an alter- 
native behavior can enhance the suppressive effects of punish- 
ment and enable relatively benign punishment procedures to be 
effective even for chronic problem behaviors that have been 
resistant to change. 

We recommend that practitioners reinforce alternative 
behaviors copiously. Additionally, the more reinforcement the 
learner obtains by emitting appropriate behaviors, the less moti- 
vated he will be to emit the problem behavior. In other words, 
heavy and consistent doses of reinforcement for alternative 
behaviors function as an abolishing operation that weakens 
(abates) the frequency of the problem behavior. 

There is still another important reason for recommending 
the reinforcement of alternative behaviors. Applied behavior 
analysts are in the business of building repertoires by teach- 
ing the clients and students they serve new skills and more 
effective ways of controlling their environments and achieving 
success. Punishment (with the exception of some overcorrec- 
tion procedures) eliminates behaviors from a person’s repertoire. 
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Although the person will be better off without those behaviors in 
her repertoire, punishment only teaches her what nor to do—it 
does not teach her what to do instead. 

The effectiveness of antecedent interventions such as 
functional communication training (FCT), high-probability 
request sequence (high-p), and noncontingent reinforcement 
(NCR), which diminish the frequency of problem behaviors by 
decreasing the effectiveness of the reinforcers that maintain the 
problem behaviors (see Chapter 26), can be made more effec- 
tive when combined with punishment. For instance, Fisher and 
colleagues (1993) found that, although FCT did not reduce to 
clinically significant levels the destructive behaviors of four par- 
ticipants with severe intellectual disabilities and communication 
deficits, a combination of FCT and punishment produced the 
largest and most consistent reductions in problem behaviors. 


Watch and Prepare for Negative Side Effects 


It is difficult to predict the side effects that may result from 
punishment. The suppression of one problem behavior by pun- 
ishment may lead to an increase in other undesirable behay- 
iors. For example, punishment of self-injurious behavior may 
produce increased levels of noncompliance or aggression. Pun- 
ishing one problem behavior may lead to a parallel decrease 
in desirable behaviors. For example, requiring that a student 
rewrite an ill-conceived paragraph may result in his ceasing to 
produce any academic work. Although punishment may pro- 
duce no undesirable side effects, practitioners should be alert 
to problems such as escape and avoidance, emotional outbursts, 
and behavioral contrast, and have a plan for dealing with such 
events should they occur. 


Record, Graph, and Evaluate Data Daily 


Data collection in the initial sessions of a punishment-based 
intervention is especially critical. Unlike some behavior reduc- 
tion procedures whose suppressive effects are often gradual 
(e.g., extinction, differential reinforcement of alternative behav- 
ior), the suppressive effects of punishment are usually quick and 
abrupt. In their classic review of the basic research on punish- 
ment, Azrin and Holz (1966) wrote: 


Virtually all studies of punishment have been in complete 
agreement that the reduction of responses by punishment 

is immediate if the punishment is at all effective. When 

the data have been presented in terms of the number of 
responses per day, the responses have been drastically 
reduced or eliminated on the very first day in which punish- 
ment was administered. When the data have been presented 
in terms of moment-to-moment changes, the reduction of 
responses has resulted within the first few deliveries of 
punishment . . . or within a few minutes. (p. 411) 


The first data points from the punishment conditions in 
all of the graphs presented in this chapter provide additional 
empirical evidence for Azrin and Holz’s statement of the imme- 
diate effects of punishment. Because of the abrupt effect of pun- 
ishment, the practitioner should pay particular attention to the 
data from the first session or two of intervention. If a noticeable 
reduction of the problem behavior has not occurred within two 
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sessions of a punishment-based intervention, we recommend 
that the practitioner make adjustments to the intervention. 

Frequent inspection of the data from a punishment per- 
spective reminds those involved of the purpose of the inter- 
vention and reveals whether the problem behavior is being 
reduced or eliminated as intended. When the data indicate that 
a clinically or socially significant change has occurred and has 
been maintained, punishment can be shifted to an intermittent 
schedule or perhaps terminated altogether. 


ETHICAL CONSIDERATIONS REGARDING THE 
USE OF PUNISHMENT 


Ethical considerations regarding the use of punishment revolve 
around three major issues: the client’s right to safe and humane 
treatment, the professional’s responsibility to use least restric- 
tive procedures, and the client’s right to effective treatment.’ 


Right to Safe and Humane Treatment 


Dating from the Hippocratic Oath (Hippocrates, 460 Bc—377 Bc), 
the first ethical canon and responsibility for any human services 
provider is to do no harm. So, any behavior change program, 
be it a punishment-based intervention to reduce life-threatening 
self-injurious behavior or an application of positive reinforce- 
ment to teach a new academic skill, must be physically safe for 
all involved and contain no elements that are injurious, degrad- 
ing, or disrespectful to participants. 

Treatments are deemed safe when they put neither the 
caregiver nor the target individual at physical, psychological, or 
social risk (Favell & McGimsey, 1993). Although there exists no 
universally accepted definition of what constitutes humane treat- 
ment, a reasonable case could be made that humane treatments 
are (a) designed for therapeutic effectiveness, (b) delivered in a 
compassionate and caring manner, (c) assessed formatively to 
determine effectiveness and terminated if effectiveness is not 
demonstrated, and (d) sensitive and responsive to the overall 
physical, psychological, and social needs of the person. 


Least Restrictive Alternative 


A second canon of ethics for human services professionals is 
not to intrude on a client’s life any more than necessary to pro- 
vide effective intervention. The doctrine of the least restrictive 
alternative holds that less intrusive procedures should be tried 
and found to be ineffective before more intrusive procedures 
are implemented. Interventions can be viewed as falling along 
a continuum of restrictiveness from least to most. The more 
a treatment procedure affects a person’s life or independence, 
such as her ability to go about daily activities in her normal 
environment, the greater its restrictiveness. A completely unre- 
strictive intervention is a logical fallacy. Any treatment must 
affect the person’s life in some way to qualify as an intervention. 
At the other end of the continuum, absolute restrictiveness exists 
during solitary confinement, where personal independence is 
unavailable. All behavior change interventions developed and 
implemented by applied behavior analysts fall within these 
extremes. 


Selecting any punishment-based intervention essen- 
tially rules out as ineffective all positive or positive reductive 
approaches based on their demonstrated inability to improve the 
behavior. For example, Gaylord-Ross (1980) proposed a deci- 
sion-making model for reducing aberrant behavior that would 
have practitioners rule out assessment considerations, inap- 
propriate or ineffective schedules of reinforcement, ecological 
variables, and curriculum modifications before punishment is 
implemented. 

Some authors and professional organizations have 
advanced the position that all punishment-based procedures are 
inherently intrusive and should never be used (e.g., LaVigna & 
Donnellen, 1986; Mudford, 1995). Others have advanced the 
counter-position that punishment-based procedures, because of 
their inherent level of intrusiveness, should be used only as a last 
resort (Gaylord-Ross, 1980; Iwata, 1988). 

Most people would rate interventions based on positive 
reinforcement as less restrictive than interventions based on 
negative reinforcement; reinforcement interventions as less 
restrictive than punishment interventions; and interventions 
using negative punishment as less restrictive than those using 
positive punishment. However, the intrusiveness or restrictive- 
ness of an intervention cannot be determined by the principles 
of behavior on which it is based. Restrictiveness is a relative 
concept that depends on the procedural details and ultimately 
rests at the level of the person with whom it is applied. A 
positive reinforcement intervention that requires deprivation 
may be more restrictive than a positive punishment procedure 
in which a buzzer sounds each time an incorrect response is 
made, and what one person considers intrusive may pose no 
discomfort for another person. Horner (1990) suggested that 
most people accept punishment interventions to reduce chal- 
lenging behavior as long as those interventions: “(a) do not 
involve the delivery of physical pain, (b) do not produce effects 
that require medical attention, and (c) are subjectively judged 
to be within the typical norm of how people in our society 
should treat each other” (pp. 166-167). 

In their review of response-reduction interventions based 
on the principle of punishment, Friman and Poling (1995) 
pointed out that punishment-based tactics, such as those requir- 
ing the person to make an effortful response contingent on the 
occurrence of target behavior, meet Horner’s (1990) criteria for 
acceptable response-reduction interventions. They stated: “None 
of the procedures caused pain or required medical attention, nor 
did they contrast with societal norms. For example, coaches 
often require disobedient players to run laps, and drill instructors 
make misbehaving recruits do pushups” (p. 585). 

Although least restrictive alternative practices assume 
that the less intrusive procedures are tried and found ineffective 
before a more restrictive intervention is introduced, practitioners 
must balance that approach against an effectiveness standard. 
Gast and Wolery (1987) suggested that, if a choice must be made 
between a less intrusive but ineffective procedure and a more 
intrusive but effective procedure, the latter should be chosen. 
A Task Force on the Right to Effective Treatment appointed by 
the Association for Behavior Analysis International provided, 
in part, the following perspective on the importance of judging 


the ultimate restrictiveness of treatment options on the basis of 
their degree of proven effectiveness: 


Indeed, a slow-acting but nonrestrictive procedure could 
be considered highly restrictive if prolonged treatment 
increases risk, significantly inhibits or prevents participa- 
tion in needed training programs, delays entry into a more 
optimal social or living environment, or leads to adapta- 
tion and the eventual use of a more restrictive procedure. 
Thus, in some cases, a client’s right to effective treatment 
may dictate the immediate use of quicker-acting, but tem- 


porarily more restrictive, procedures. (Van Houten et al., 
1988, p. 114). 


Right to Effective Treatment 


Ethical discussions regarding the use of punishment revolve 
most often around its possible side effects and how experiencing 
the punisher may cause unnecessary pain and possible psycho- 
logical harm for the person. Although each of these concerns 
deserves careful consideration, the right to effective treatment 
raises an equally important ethical issue, especially for persons 
who experience chronic, life-threatening severe problem behav- 
iors. Some maintain that failing to use a punishment procedure 
that research has shown to be effective in suppressing self- 
destructive behavior similar to their client’s is unethical because 
doing so withholds a potentially effective treatment and risks 
maintaining a dangerous or uncomfortable state for the person. 
For example, Baer (1971) stated: “[Punishment] is a legitimate 
therapeutic technique that is justifiable and commendable when 
it relieves persons of the even greater punishments that result 
from their own habitual behavior” (p. 111). 

As Iwata (1988) explained, if all other less intrusive treat- 
ments with a sufficient research base to provide a legitimate 
chance of success have failed, the use of punishment procedures 
is the only ethical option: 


In the ideal world, treatment failures do not occur. But in the 
actual world failures do occur, in spite of our best efforts, 
leaving us with these options: continued occurrence of 

the behavior problem toward some devastating endpoint, 
restraint, sedating drugs, or aversive contingencies. I predict 
that if we apply our skills to maximize the effectiveness of 
positive reinforcement programs, we will succeed often. 
After following such a course, my further prediction is that 
if we reach the point of having to decide among these ulti- 
mate default options, the client’s advocate, the parent, or, if 
necessary, the courts will select or use the option of aversive 
contingencies. Why? Because under such circumstances it is 
the only ethical action. (pp. 152-153) 


Balancing the least restrictive alternative, the right to 
effective treatment, and Baer’s (1971) vision to relieve a person 
of greater punishments if his severe behavior problem remains 
untreated or ineffective treatments are allowed to continue, 
is a difficult challenge. However, behavior analysts are not 
without relief. In the intervening 40-plus years since the 
1980s, and in light of more recent successful developments 
using least restrictive alternative interventions informed by 
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functional behavior analysis, even the most serious life-threat- 
ening behaviors can be treated effectively and humanely 
without resorting to the use of noxious or painful uncondi- 
tioned punishers (e.g., water mist, aromatic ammonia, electric 
shock). Research has shown that the judicious use of stimulus 
control procedures (McKenzie et al., 2008), time-out and con- 
tingent restraint (Lerman et al., 1997), adding a significant 
reinforcement component (DRA) to a punishment regimen 
(Thompson et al., 1999), effectively manipulating reprimands 
(McKenzie et al., 2008), response blocking (Lalli, Lizevey, & 
Kates, 1996), contingent exercise (Luce et al., 1980), vary- 
ing commonly used punishers (Charlop et al., 1988), and 
overcorrection (Steuart, 1993) can reduce problem behaviors 
for which unconditioned punishers were often applied. The 
application of unconditioned punishers no longer meets the 
standards of least restrictive alternative or best practice. 

Applied behavior analysts who conduct functional behay- 
ior analyses, perform comprehensive punisher assessments, vary 
antecedent and consequence methods, and apply these methods 
alone or in combination will achieve as much, or more, success 
in reducing self-injurious and other dangerous behaviors than 
previously considered possible. In doing so, they will fulfill their 
mission of improving clients’ quality of life by ensuring the right 
to effective treatment, providing best practice, and using the 
least restrictive alternative. 


Punishment Policy and Procedural Safeguards 


Armed with knowledge from the experimental literature, real- 
world variables and contingencies with which to contend (i.e., 
chronic, life-threatening problems), and practices and procedures 
rooted in ethical codes of conduct, practitioners can consider 
punishment approaches, when necessary, to provide meaning- 
ful programs for the persons in their care. One mechanism for 
ensuring that best practice approaches are used (Peters & Heron, 
1993) is to adopt and use dynamic policies and procedures that 
provide clear guidelines and safeguards to practitioners. 

Agencies can protect and ensure their clients’ rights to 
safe, humane, least restrictive, and effective treatment by devel- 
oping policy and procedural guidelines that must be followed 
when any punishment-based intervention is implemented (Favell 
& McGimsey, 1993; Griffith, 1983; Wood & Braaten, 1983). 
Figure 14.12 provides an outline and examples of the kinds of 
components that might be included in such a document. 

Best practice dictates that practitioners consult their 
local, state, or national professional association policy state- 
ments regarding the use of punishment. For example, the Asso- 
ciation for the Advancement of Behavior Therapy (AABT) 
provides guidelines that address treatment selection, including 
punishment (Favell et al., 1982). The Association for Behavior 
Analysis International adheres to the American Psychological 
Association’s Ethics Code (2004), which, in turn, addresses 
treatment issues. Collectively, policy statements address imple- 
mentation requirements, procedural guidelines and precau- 
tions, and evaluation methods that an agency should use when 
implementing any punishment-based intervention. Chapter 31 
provides additional information on ethical standards for applied 
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Figure 14.12 Suggested components in an agency's policy and procedural guidelines to help ensure 


the ethical, safe, and effective use of punishment. 


Policy Requirements 


tual termination or reduction of punishment. 


Procedural Safeguards 


necessary, booster-training sessions provided. 


Evaluation Requirements 


prospects. 


behavior analysts and procedures for securing informed con- 
sent, protecting clients’ rights, and ensuring safe and humane 
interventions. 


CONCLUDING PERSPECTIVES 


We conclude this chapter with brief comments on our perspec- 
tives regarding the principle of punishment, and especially 
applications featuring unconditioned punishers as default tech- 
nologies. We believe that applied behavior analysis would be 
a stronger, more competent discipline, and its practitioners 
would be more effective, if (a) punishment’s natural role 
and contributions to survival and learning are recognized 
and valued within their limitations, (b) focused basic and 
applied research on punishment is conducted, and (c) treat- 
ments featuring differential reinforcement options, stimulus 
control procedures, time-out from positive reinforcement, 
response blocking, and contingent restraint, be employed in 
lieu of treatments featuring unconditioned punishers as default 
technologies. 


Punishment’s Natural and Necessary Role in 
Survival and Learning Should Be Recognized 


Behavior analysts ought not dismiss punishment out of hand as a 
possible therapeutic intervention. Positive and negative punish- 
ment contingencies occur naturally in everyday life as part of a 
complex mix of concurrent reinforcement and punishment con- 
tingencies, as Baum (2017) illustrated so well in this example: 


e Intervention must conform to all local, state, and federal statutes. 

e Intervention must conform to the policies and codes of ethical conduct of relevant professional organizations. 
e Intervention should include procedures for strengthening and teaching alternative behaviors. 

e Intervention must include plans for generalization and maintenance of behavior change and criteria for even- 


e Informed consent must be obtained from the client or a parent or legal advocate before intervention begins. 


e Before intervention begins, all relevant staff must be trained in (a) the technical details of properly administering 
the punishment procedure; (b) procedures for ensuring the physical safety of the client and staff and the humane 
treatment of the client; and (c) what to do in the event of negative side effects, such as emotional outbursts, 
escape and avoidance aggression, and noncompliance. 


e Supervision and feedback of staff administering the punishment intervention must be provided and, if 


e Each occurrence of the problem behavior must be observed and recorded. 
e Each delivery of the punisher must be recorded and the client’s reactions noted. 


e Periodic review of the data (e.g., daily, weekly) by a team of parent/advocates, staff, and technical consultants 
must be conducted to ensure that ineffective treatment is not prolonged or effective treatment terminated. 


¢ Social validity data must be obtained from the client, significant others, and staff on (a) treatment acceptability 
and (b) the real and likely impact of any behavior change on the client’s current circumstances and future 


Life is full of choices between alternatives that offer 
different mixes of reinforcement and punishment. Going 
to work entails both getting paid (positive reinforcement) 
and suffering hassles (positive punishment), whereas 
calling in sick may forfeit some pay (negative punish- 
ment), avoid the hassles (negative reinforcement), allow 
a vacation (positive reinforcement), and incur some 
workplace disapproval (positive punishment). Which 

set of relations wins out depends on which relations are 
strong enough to dominate, and that depends on both 
the present circumstances and the person’s history of 
reinforcement and punishment. (p. 68) 


Vollmer (2002) suggested that the scientific study 
of punishment continue because unplanned and planned 
punishment occur frequently and that planned, sophisticated 
applications of punishment are within the scope of inquiry for 
applied behavior analysts. We believe that studying punish- 
ment is within the realm of a science of behavior. As Vollmer 
states, “Scientists and practitioners are obligated to understand 
the nature of punishment if for no other reason than because 
punishment happens.” (p. 469) 


Focused Research on Punishment Is Needed 


Many inappropriate and ineffective applications of punishment 
may reflect our incomplete knowledge of this principle (Lerman 
& Vorndran, 2002). Our technical understanding about, and 
recommendations for, applying punishment are derived from 


decades-old basic research. Although sound scientific data that 
describe still-relevant questions have an unlimited shelf life, 
additional basic research on the mechanisms and variables that 
produce effective punishment no doubt will uncover elements 
that remain tangled in complex variables and their interactions. 

Basic laboratory research allows for the control of vari- 
ables that is difficult or impossible to attain in applied settings. 
Once mechanisms are revealed in basic research, however, 
applications to real-world challenges may be devised or adapted 
with more confidence, but with the simultaneous recognition of 
limitations and ethics. Practitioners must acknowledge when, 
how, why, and under what conditions punishment produces 
behavioral suppression for the students, clients, or patients 
with whom they work. Further, we must come to the realization 
that while a laboratory finding, with all of its controls and high 
internal validity, might produce a resounding suppressive effect, 
an applied practitioner using an adaptation of that punishment 
application may only yield a fraction of that effect. Still, for 
the child who is self-mutilating, self-destructive, and seemingly 
only coming into contact with life-threatening situations, obtain- 
ing a fractional improvement in behavior suppression may be 
cause for celebrating progress. 

There are at least two explanations for why it might be 
difficult to conduct experimental work on punishment. First, 
for a variety of reasons, Human Subjects Review Committees 
may be less than enthusiastic about supporting research on pun- 
ishment. To the extent that fewer proposals to conduct basic 
research on punishment are approved, then fewer data will be 
available on its effects, side effects, and applicability in applied 
settings. Second, applied researchers, in response to complying 
with ethical considerations and the rights of individuals to least 
restrictive alternative treatments, have shifted their focus to pos- 
itive reductive procedures or less intrusive negative punishment 
applications. 

We support Horner’s (2002) call for practical applica- 
tions of punishment research in field settings. It is important 
to determine how punishment works in situations in which 


al 


SUMMARY 
Definition and Characteristics of Punishment 


1. Punishment has occurred when a response is followed 
immediately by a stimulus change that decreases the 
frequency of that type of behavior in the future. 


_ Punishment is defined neither by the actions of the person 
delivering the consequences nor by the nature of those 
consequences. A decrease in the future occurrences of the 
behavior must be observed before a consequence-based 
intervention qualifies as punishment. 


. Positive punishment has occurred when the presen- 
tation of a stimulus (or an increase in the intensity of 
an already present stimulus) immediately following a 
behavior results in a decrease in the frequency of the 
behavior. 
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environmental variables may not be as well controlled and in 
which the practitioner delivering the punisher might not be a 
trained professional. Educational and clinical applications of 
punishment also demand that behavior analysts understand the 
individual, contextual, and environmental variables that pro- 
duce effective application. Without a clear and refined knowl- 
edge base about these variables and conditions, applied behavior 
analysis as a science of behavior cannot make a legitimate claim 
to have achieved a comprehensive analysis of its own basic con- 
cepts (Lerman & Vorndran, 2002; Vollmer, 2002). 


Interventions Featuring Unconditioned 
Punishers as Default Technologies 


Iwata (1988) recommended that punishment-based interventions 
involving the contingent application of aversive stimulation be 
treated as default technologies. A default technology is one that 
a practitioner turns to when all other methods have failed. Twata 
recommended that behavior analysts not advocate for the use of 
aversive technologies (because advocacy is not effective, not 
necessary, and not in the best interests of the field), but that they 
be involved in research and development of effective aversive 
technologies. 


We must do the work because, whether or not we like it, 
default technologies will evolve whenever there is fail- 
ure and, in the case of aversive stimulation, we are in a 
unique position to make several contributions. First, we 
can modify the technology so that it is effective and safe. 
Second, we can improve it by incorporating contingencies 
of positive reinforcement. Third, we can regulate it so that 
application will proceed in a judicious and ethical manner. 
Last, and surely most important, by studying the conditions 
under which default technologies arise, as well as the tech- 
nologies themselves, we might eventually do away with 
both. Can you think of a better fate for the field of applied 
behavior analysis? (p. 156) 


. Negative punishment has occurred when the termination of an 
already present stimulus (or a decrease in the intensity of an 
already present stimulus) immediately following a behavior 
results in a decrease in future occurrences of the behavior. 


. Because aversive events are associated with positive 
punishment and with negative reinforcement, the term 
aversive control is often used to describe interventions 
involving either or both of these two principles. 


. A punisher is a stimulus change that immediately follows 
the occurrence of a behavior and reduces the future fre- 
quency of that type of behavior. 


. An unconditioned punisher is a stimulus whose presenta- 
tion functions as punishment without having been paired 
with any other punishers. 
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8. 


10. 


11. 


A conditioned punisher is a stimulus that has acquired its 
punishing capabilities by being paired with unconditioned 
or conditioned punishers. 


. A generalized conditioned punisher will function as 


punishment under a wide range of motivating operations 
because of its previous pairing with numerous uncondi- 
tioned and conditioned punishers. 


A discriminative stimulus for punishment, or sp. isa 
stimulus condition in the presence of which a response 
class occurs at a lower frequency than it does in the 
absence of the S?? as a result of a conditioning history in 
which responses in the presence of the S?P have been pun- 
ished and similar responses in the absence of that stimulus 
have not been punished (or have resulted in a reduced fre- 
quency or magnitude of punishment). 


Punishment’s suppressive effects on behavior are usually 
not permanent. Recovery from punishment is especially 
likely when the behavior being punished also obtains 
reinforcement. 


Factors That Influence the Effectiveness 
of Punishment 


12. 


Results of basic and applied research show that punish- 
ment is more effective when 


* the onset of the punisher occurs as soon as possible after 
the occurrence of a target response, 


* the intensity of the punisher is high, 


* each occurrence of the behavior is followed by the 
punishing consequence, 


* reinforcement for the target behavior is reduced, and 


¢ reinforcement is available for alternative behaviors. 


Possible Side Effects and Problems with Punishment 


13. 


Punishment sometimes causes undesirable side effects and 
problems, such as the following: 


¢ Emotional and aggressive reactions 
¢ Escape and avoidance behaviors 


¢ Behavioral contrast: Reduced responding from punish- 
ment in one setting or situation may be accompanied by 
increased responding in another setting or situation in 
which responses go unpunished 


¢ Modeling of undesirable behavior 


* Overuse of punishment caused by the negative rein- 
forcement of the punishing agent’s behavior 


Positive Punishment Interventions 


14. 


LIS 


Reprimands: Used sparingly, a firm reprimand such as 
“No!” can suppress future responding. 


Response blocking: When the learner begins to emit the 
problem behavior, the therapist physically intervenes to 
prevent or “block” the completion of the response. 


16. 


17. 


18. 


19. 


20. 


21. 


Response interruption and redirection (RIRD): This 
approach is a variation of response blocking that con- 
sists of interrupting stereotypic behavior at its onset and 
redirecting the individual to complete high-probability 
behaviors instead. 


Contingent exercise: In this intervention, the client is 
required to perform a response that is not topographically 
related to the problem behavior. 


Overcorrection: This is a punishment-based tactic in 
which, contingent on each occurrence of the problem 
behavior, the learner is required to engage in effortful 
behavior that is directly or logically related to the problem. 


Restitutional overcorrection: In this intervention, the 
learner is required to repair the damage caused by the 
problem behavior and then to engage in additional behav- 
ior that brings the environment to a condition better than it 
was prior to the misbehavior. 


Positive practice overcorrection: In this type of overcor- 
rection, the learner repeatedly performs a correct form of 
the behavior, or a behavior incompatible with the problem 
behavior, for a specified time or number of responses. 


Contingent exercise: In this intervention, the client is 
required to perform a response that is not topographically 
related to the problem behavior. 


Guidelines for Using Punishment 


22. 


To apply punishment with optimal effectiveness while 
minimizing undesirable side effects, a practitioner should: 


* Select effective and appropriate punishers: (a) conduct 
punisher assessmenis to identify the least intrusive pun- 
isher that can be applied consistently and safely; (b) use 
punishers of sufficient quality and intensity; (c) use a 
variety of punishers to combat habituation and increase 
the effectiveness of less intrusive punishers. 


* If problem behavior consists of a response chain, 
deliver the punisher as early in the response sequence as 
possible. 


e Punish each occurrence of the behavior. 


¢ If maintaining a continuous schedule is impractical, 
gradually shift to an intermittent schedule of punishment 
combined with extinction. 


* Mediate unavoidable response-to-punishment delays. 


* Supplement punishment with complementary interven- 
tions, in particular, differential reinforcement, extinc- 
tion, and antecedent interventions. 


* Watch and be prepared for negative side effects. 


* Record, graph, and evaluate data daily. 


Ethical Considerations Regarding the Use 
of Punishment 


23. 


The first ethical responsibility for any human services 
professional or agency is to do no harm. Any intervention 


must be physically safe for all involved and contain no ele- 
ments that are degrading or disrespectful to the client. 


24. The doctrine of the least restrictive alternative holds 
that less intrusive procedures (e.g., positive reductive 
approaches) must be tried first and found to be ineffective 


before more intrusive procedures are implemented (e.g., a 
punishment-based intervention). 


25. A client’s right to effective treatment raises an important 
ethical issue. Some maintain that the failure to use a pun- 
ishment procedure that research has shown to suppress 
self-destructive behavior is unethical because it withholds 
a potentially effective treatment and may maintain a dan- 
gerous or uncomfortable state for the client. 


26. Agencies and individuals providing applied behavior 
analysis services can help ensure that applications of pun- 
ishment-based interventions are safe, humane, ethical, and 


behavioral contrast 

conditioned punisher 

discriminative stimulus for punishment punisher 
generalized conditioned punisher punishment 
negative punishment 


overcorrection response blocking 
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MULTIPLE-CHOICE QUESTIONS 


1. Which of the following statements describes the definition 
of positive punishment? : 


a. The presentation of a stimulus that increases the rate of 
occurrence of the target behavior 


b. The presentation of a stimulus that decreases the rate of 
occurrence of the target behavior 


c. The removal of a stimulus that decreases the rate of 
occurrence of the target behavior 


d. The removal of a stimulus that increases the rate of 
occurrence of the target behavior 


Hint: (See “Positive Punishment and Negative Punishment’) 


2. Which of the following statements describes the definition 

of negative punishment? 

a. The presentation of a stimulus that increases the rate of 
occurrence of the target behavior 

b. The presentation of a stimulus that decreases the rate of 
occurrence of the target behavior 

c. The removal of a stimulus that decreases the rate of 
occurrence of the target behavior 

d. The removal of a stimulus that increases the rate of 
occurrence of the target behavior 

Hint: (See “Positive Punishment and Negative Punishment’) 
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positive practice overcorrection 


positive punishment 
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effective by creating and following a set of policy stan- 
dards, procedural safeguards, and evaluation requirements. 


Concluding Perspectives 


27. Applied behavior analysts should recognize and appreci- 
ate the natural role of punishment and the importance of 
punishment to learning. 


28. Many misapplications of punishment reflect the field’s 
incomplete knowledge of the principle. More basic and 
applied research on punishment is needed. 


29. Applied behavior analysts should advocate for the judicious 
and ethical application punishment, incorporate a full range 
of differential reinforcement contingencies, antecedent 
interventions, and extinction to reduce behaviors, study the 
conditions under which default technologies are considered 
and used, so as to eventually make them unnecessary. 


response interruption and redirection 
(RIRD) 


restitutional overcorrection 


unconditioned punisher 


recovery from punishment 


3. Which statement best describes recovery from punishment? 


a. After punishment is discontinued, sometimes the 
behavior that experienced the punishment contingency 
will recover. 


b. When the punishment contingency is implemented, the 
behavior of interest may initially increase in rate, 
followed by a rapid decline in frequency. 


After punishment is discontinued, the behavior of inter- 
est may gradually reappear, only to disappear on its own, 
regardless of existing environmental contingencies. 


Cc 


. 


d. The period of time the implementer of the punishment 
contingency requires to “recover” from the emotional 
side effects of using punishment. 


Hint: (See “Recovery from Punishment’) 
4. Which of the following is not a factor that influences the 
effectiveness of punishment listed in the chapter? 
a. Immediacy 
b. Intensity 
c. Variation 
d. Schedule 


Hint: (See “Factors That Influence the Effectiveness of 
Punishment”) 
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5. The use of a punishment procedure may cause 


unwanted side effects to appear. The chapter presents 
several side effects associated with the use of punish- 
ment. Which of the following is not a side effect listed 
in the chapter? 


a. Undesirable emotional responses 

b. Avoidance and escape 

c. Increased rate of problem behavior 

d. Decreased self-esteem 

Hint: (See “Possible Side Effects and Problems with 
Punishment’) 


. Little Peter was walking along the fence of his uncle’s 
dairy farm. Peter reached up and grabbed the wire along 
the fence, not realizing it was electrified. After recovering 
from the pain caused by the shock, Peter never again 
touched the fence wires. Which statement best describes 
the shock stimulus felt by Peter? 


a. Negative Reinforcement 
b. Negative Punishment 

c. Bad Judgment 

d. Positive Punishment 


Hint: (“Positive Punishment Interventions’) 


. Which statement best describes positive punishment? 


a. The delivery of a stimulus, following a behavior, which 
increases the likelihood that the behavior will occur 
again 

b. The delivery of a stimulus, following a behavior, which 
decreases the likelihood that the behavior will occur 
again 

c. A stimulus delivered with good intentions 


d. The removal of a stimulus, following a behavior, which 
decreases the likelihood that the behavior will occur 
again 

Hint: (See “Positive Punishment Interventions’’) 


. Which statement best describes the procedure of response 

blocking? 

a. Blocking the occurrence of the antecedent stimulus 
associated with the occurrence of the problem behavior 


b. Delivering a wood block to the student prior to the 
occurrence of a behavioral response 


c. Physically intervening as soon as the student begins to 
emit a problem behavior to prevent the completion of 
the response 

d. The interventionist closing his or her eyes to deny the 
student pleasure after engaging in the problem behavior 


Hint: (See “Response Blocking”) 


9. Punishment should be thought of as an “eye-for-an-eye’ 


10. 


11. 


12. 


13. 


> 


procedure. 

a. True 

b. False 

Hint: (See “Guidelines for Using Punishment’) 

Which statement is not recommended as a guideline for 
the use of punishment?: 


a. Test punisher on your pet prior to implementing it with 
your student 


b. Conduct punisher assessment 

c. Punish each occurrence of the behavior 

d. Deliver the punisher immediately 

Hint: (See “Guidelines for Using Punishment’) 

Carr and Lovaas (1983) recommended that practitioners 


experience any punisher personally before the treatment 
begins for what reason? 


a. It reminds the practitioner that it is a sadistic procedure. 


b. It reminds the practitioner that the technique produces 
physical discomfort. 


c. It reminds the practitioner that the technique is designed 
to increase the appropriate behavior of the student. 


d. It allows the practitioner to determine a baseline of 
discomfort which he/she then increases rapidly as it is 
administered to the student. 


Hint: (See “Guidelines for Using Punishment’) 

Which of the following is not listed in the chapter as an 
ethical consideration regarding the use of punishment? 
a. Right to safe and humane treatment 

b. Least restrictive alternative 

c. Right to effective treatment 

d. Most restrictive alternative 

Hint: (See “Ethical Considerations Regarding the Use of 


Punishment’) 


Sean was responsible for designing an intervention for 

a fifth-grade student who engaged in severe problem 
behavior consisting of throwing chairs at the teacher 
when an assignment was given. Sean decided to utilize a 
contingent physical restraint every time the child threw a 
chair. Sean is violating the ethical consideration of: 

a. Right to safe and humane treatment 

b. Least restrictive alternative 

c. Right to effective treatment 

d. Sean is not violating any ethical considerations 


Hint: (See “Ethical Considerations Regarding the Use of 
Punishment’) 


14. 


1. 


ESSAY-TYPE QUESTIONS 


Which statement regarding the strength of applied 


behavior analysis is not supported in the punishment 
chapter? 


a. Recognize and appreciate punishment’s natural role 
and contributions to survival and learning 


b. Increase the use of punishment procedures over the use 
of reinforcement-based interventions 


c. Increase basic and applied research on punishment 


d. View treatments featuring positive punishment as 
default technologies 


Hint: (See “Concluding Perspectives’’) 
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Explain the procedures of positive punishment and 
negative punishment. Be sure to include in your answer 
factors that may influence the effectiveness of the punisher. 


Hint: (See “Definition and Characteristics of Punishment”) 


Discuss the possible side effects of using punishment. 


Hint: (See “Possible Side Effects and Problems with 
Punishment’) 


Define and give examples of the five types of positive 
punishment given in the chapter: reprimands, response 
blocking, contingent exercise, overcorrection, and 
response interruption and redirection. 


Hint: (See “Positive Punishment Interventions”) 


With the decision to use punishment-based treatment, 
everything should be done to ensure that the punishment is 
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Although the term automatic punishment is used infrequently in the behavior 
analysis literature, it is similar to automatic reinforcement. Automatic punish- 
ment occurs when a punishing consequence (e.g., burned finger) is a socially 
unmediated, unavoidable outcome of a response (e.g., touching a hot stove). 
A stimulus that becomes a conditioned punisher does not have to be a neu- 
tral stimulus prior to its pairing with a punisher. The stimulus could already 
function as a reinforcer under other conditions. For example, the onset 
of a blue light that has been paired repeatedly with reinforcement in one 
setting and with punishment in another setting is a conditioned reinforcer 
or a conditioned punisher, depending on the setting. 

Instructions or conditioned stimuli that have been paired with known aver- 
sives can mitigate the time-delay gap. 

Multiple schedules of reinforcement are described in Chapter 13. 

When an extinction procedure is first implemented, an increase in respond- 
ing, called an extinction burst, is sometimes observed before the response 
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15. Many of the recommendations for punishment are derived 


from basic research conducted more than 
a. 5 

b. 20 

c. 10 

d. 40 


Hint: (See “Concluding Perspectives”) 


years ago. 


as effective as possible. Identify and discuss the suggested 
guidelines for the use of punishment. 


Hint: (See “Guidelines for Using Punishment”) 


The use of punishment presents some major ethical 
concerns. Identify and discuss two of the ethical concerns 
associated with the use of punishment and give an 
example of a situation in which the concern is involved. 


Hint: (See “Ethical Considerations Regarding the Use of 
Punishment”) 


. Most of the punishment research is derived from basic 


studies that occurred more than 40 years ago. Discuss 
and contribute to Horner’s (2002) call for practical 
applications of punishment research in field settings. 


Hint: (See “Concluding Perspectives”) 


rate begins to decline. The principle, procedure, and effects of extinction 
are detailed in Chapter 22. 


_ Increasing the effort or force required to perform a behavior can also be an 


effective tactic for reducing responding (Friman & Poling, 1995). There is 
no consensus as to whether punishment accounts for the reduced respond- 
ing. As with response blocking, one perspective from which increased 
response effort can be conceptualized as a punishment procedure is to 
consider the movement necessary to come into contact with the increased 
effort requirement as a member of the target behavior response class. In that 
case, the increased effort required to continue the response to completion is 
(a) a consequence for the response that brought the learner into contact with 
it and (b) aversive stimulation that functions as punishment as the frequency 
of future responding decreases. 


. Chapter 31 discusses ethical issues and practices for applied behavior 


analysts. 


So mot toro mmm 15 


Negative Punishment 


LEARNING OBJECTIVES 
¢ Define negative punishment. 
¢ Define nonexclusion time-out. 
¢ Define exclusion time-out. 


e State the procedures for implementing planned ignoring. 


¢ State the procedures for implementing contingent observation. 


¢ State the procedures for implementing withdrawal of a specific positive reinforcer. 


¢ State the procedures for using a time-out room. 


e State the procedures for implementing a partition time-out. 


e List the decisions a practitioner must make prior to, during, and after a time-out application. 


e Define response cost. 
¢ List the undesirable aspects of negative punishment. 


Half a minute into watching the funniest Internet video 
ever, Ruth clicked a button. Instead of seeing the full- 
screen view that she expected, the video disappeared. She 
reopened the video, advanced the cursor to the previous 
time stamp, clicked the same button, and once again lost 
access to the video. Ruth never again clicked the button 
that makes videos disappear! 

Kent selected the last two front-row tickets for an 
upcoming concert by his favorite band. After entering 
his credit card information and billing address, he was 
prompted to review the details before clicking the pur- 
chase button. Just then Kent’s cell phone buzzed with a 
text message, which led to several minutes of exchanged 
messages. When Kent returned to the ticket seller’s 
website, the two tickets were no longer available. Kent 
no longer accepts cell phone texts when completing 
online purchases. 


egative punishment contingencies changed the behavior 

of both actors. Ruth no longer clicks the button that 

causes videos to disappear; Kent does not reply to text 
messages when shopping online. In negative punishment, a 
response is followed immediately by the removal of a stimu- 
lus that decreases future occurrences of similar responses. By 
contrast, in positive punishment, a response is followed imme- 
diately by the presentation of a stimulus that decreases future 
occurrences of similar responses. Both negative and positive 
punishment contingencies decrease future occurrences of behav- 
iors. The distinction is in the type of stimulus change—removal 
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or presentation—that functions as the punishing consequence 
(see Figures 15.1 and 14.2). 

Applied behavior analysts implement negative punishment 
with two principal tactics: time-out from positive reinforcement 
and response cost. This chapter defines and operationalizes 
time-out and response cost, describes how these negative punish- 
ment tactics are used in applied settings, outlines desirable and 
undesirable aspects of each procedure, and offers practical 
and ethical guidelines to help practitioners implement time- 
out and response cost effectively. 


TIME-OUT FROM POSITIVE REINFORCEMENT 
DEFINED 


Time-out from positive reinforcement (or simply time-out) is 
the immediate response-contingent withdrawal of the opportu- 
nity to earn positive reinforcers or the immediate loss of access 
to positive reinforcers for a specified time. To function as neg- 
ative punishment, the time-out procedure must decrease future 
occurrences of similar behaviors. Implicit in the definition of 
time-out are three important factors: (a) The discrepancy between 
the “time-in” and the time-out environments must be discrimina- 
tive, (b) the response-contingent loss of access to reinforcement 
must occur immediately, and (c) a resultant decrease in the future 
frequency of the time-out—producing behavior must occur.! 
Time-out can be viewed from a procedural, conceptual, 
or functional perspective. Procedurally, time-out entails remov- 
ing the person from a reinforcing environment, or prohibiting 
access to reinforcers within his current environment for a spec- 


ified period of time contingent upon the occurrence of problem 
behavior. 


Negative Punishment 


he 
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ee Effect on future 
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Teacher says, “Work S 
j tudent sent to time-out 
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Positive Punishment 
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Crash into ditch 
(stimulus presented) 


Figure 15.1 Negative punishment and positive punishment are distinguished by the type of 
stimulus change that functions as the punishing consequence. 


Conceptually, the client must discriminate between the 
time-out and the time-in environment: that is, the more dis- 
criminatively reinforcing the time-in setting, the more effec- 
tive time-out is as a negative punisher (Solnick, Rincover, & 
Peterson, 1977). In other words, the greater the difference 
between the reinforcing value of time-in and absence of that 
reinforcing value in the time-out setting, the more effective 
time-out will be. 

From a functional perspective, time-out involves reduced 
future occurrences of similar behaviors. Without a reduction 
of future similar behavior, time-out is not in effect, even if the 
person has been removed from the setting, segregated briefly 
from the group but remains in the setting, or loses access to rein- 
forcers. Assuming an exclusion time-out procedure was intended, 
if a teacher removes a student from the classroom—presumably 
the time-in setting—but upon the student’s return, the problem 
behavior continues, effective time-out cannot be claimed. 

So, contrary to popular thinking, time-out does not involve 
simply removing an individual to a sequestered setting. Although 
such removal may arguably describe an element of isolation, 
exclusion, or seclusion (Costenbader & Reading-Brown, 1995), 
the mere removal to another location falls far short of being 
accurately named time-out from positive reinforcement. Pro- 
cedural, conceptual, and functional criteria need to be met to 
assert that claim. 


TIME-OUT TACTICS FOR APPLIED SETTINGS 


There are two basic categories of time-out available in applied 
settings: nonexclusion and exclusion. Within each type, several 
procedural variations provide options for the practitioner when 
deciding on a course of action to reduce similar behavior in the 
future (see Figure 15.2). As a rule, nonexclusion time-out should 


be the method of first choice because practitioners are ethi- 
cally bound to employ the most powerful, but least restrictive, 
alternative when deciding on eligible variations. Regardless of 
which category is employed, we believe that, as a rule, time-out 
sessions should be brief, lasting not more than 2 to 5 minutes. 


Nonexclusion Time-Out 


Nonexclusion time-out means that contingent on the occurrence 
of the target behavior, the participant remains physically within 
the time-in setting, but elements of that setting are changed so 
that (a) the opportunity to earn reinforcers is removed; (b) a 
specific reinforcer is terminated; (c) repositioning occurs so that 
observations of ongoing activity continue, but without reinforce- 
ment; or (d) the individual is removed to a different space within 
the time-in setting. 

Nonexclusion time-out is typically implemented using one 
of four tactics: planned ignoring, terminate specific reinforcer con- 
tact, contingent observation, and partition/select space time-out. 


Planned Ignoring 


Planned ignoring occurs when, contingent on the occurrence 
of an inappropriate behavior, the opportunity to earn social 
reinforcers—usually attention, physical contact, or verbal 
interaction—is removed. Operationally, planned ignoring 
involves systematically looking away from the individual, 
remaining quiet, or refraining from any interaction whatsoever 
for a specific time (Kee, Hill, & Weist, 1999). For example, if a 
behavior targeted for time-out occurred in a one-to-one instruc- 
tional session, the therapist would simply turn her back to the 
learner for the duration of the time-out interval. The therapist's 
turning away essentially removes the time-in environment and 
denies the learner access to reinforcers. 
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Time-Out from Positive Reinforcement 


Exclusion Time-Out 
(time-out period outside time-in setting) 


Nonexclusion Time-Out 
(time-out period within time-in setting) 


Remove person from 


Planned ignoring time-in setting 


Remove time-in setting 


Terminate specific 


- from person 
reinforcer contact P 


Contingent observation 


Partition/Select space 


Figure 15.2 Tactics for implementing 
time-out from positive reinforcement. 


Terminate Specific Reinforcer Contact 


Terminate specific reinforcer contact is a variation of time- 
out whereby each occurrence of the target behavior immedi- 
ately stops an activity or sensory reinforcer. West and Smith 
(2002) reported an application of this tactic to reduce noise 
level during school lunchtime. They mounted a facsimile traf- 
fic signal light to the cafeteria wall and fitted the light with 
a sensor to detect noise levels. When the sensor detected an 
acceptable conversation decibel level, a green light registered 
on the traffic signal and music played through cafeteria speak- 
ers (i.e., music was identified as a reinforcer and constituted 
the time-in condition). The music played as long as the students 
engaged in an appropriate level of conversation. If the conver- 
sational decibel level increased to a predetermined threshold, 
the traffic light changed from green to yellow, visibly warning 
the students that further increases in noise would lead to time- 
out from music. If the decibel level rose still higher, the traffic 
signal turned from yellow to red, and the music ceased for 
10 sec. Under this group procedure, louder decibel conversa- 
tions were reduced. 

Bishop and Stumphauzer (1973) demonstrated that the 
contingent termination of television cartoons successfully 
reduced thumb sucking in three young children. Unknown to 
the children, a remote on/off switch was attached to the tele- 
vision. Baseline data indicated that each child emitted a high 
rate of thumb sucking while viewing the cartoons. During the 
terminate contact with a specific reinforcer variation of time- 
out, the television was immediately turned off when thumb 


time-out 


sucking occurred, and it was turned back on when thumb suck- 
ing stopped. The procedure was effective in reducing thumb 
sucking not only in the treatment location (an office) but also 
during a story period at school. 


Contingent Observation 


In contingent observation, the person is repositioned within 
an existing setting such that he or she can observe ongoing 
activities, but access-to available reinforcers is lost. A teacher 
uses contingent observation after an undesirable behavior when 
she redirects the offending student to move from the group, but 
remain visible, and reinforcers are withheld for a specified time. 
In short, the student is directed to “‘sit and watch” (Twyman, 
Johnson, Buie, & Nelson, 1994; White & Bailey, 1990). When 
the contingent observation period ends, the student rejoins 
the group, and is then able to earn reinforcers for appropriate 
behavior. An athletic analog to contingent observation occurs 
when an athlete commits an unnecessary foul during a game, 
and, in turn, the coach immediately “sidelines” the player. The 
player can still see the game; however, he cannot participate in 
it, or receive social reinforcers for contributing to the outcome 
(e.g., crowd cheers, praise from fellow players). 

Figure 15.3 shows the effects of White and Bailey’s (1990) 
use of contingent observation in reducing the number of disrup- 
tive behaviors in two inclusive physical education classes. When 
contingent observation was in effect (i.e., sit and watch), the 
number of disruptive behaviors across two classrooms decreased 
and remained near zero levels. 


Base- Behavior Checklist Sit and Watch 


Alternative 
Education 


Class 


Number of Disruptive Behaviors 
per 10 Minute Session 
oO 


371624 2 7 Tost 


Days 
Figure 15.3 Number of disruptive behaviors per 10-min 
observation period. The numbers above the data points 
represent the number of times “sit and watch” was 
implemented during the class. 


From “Reducing Disruptive Behaviors of Elementary Physical Education 
Students with Sit and Watch”, A. G. White and J. S. Bailey, 1990, 
Journal of Applied Behavior Analysis, 23, p. 357. Reproduced with 
permission of John Wiley & Sons Inc. 


Partition/Select Space Time-Out 


In partition or select space time-out, the student remains within 
the time-in setting, but his view within the setting is restricted by 
a panel or cubicle, or a select space is arranged to serve as the 
time-out area (i.e., a carpet, a corner). A teacher would use parti- 
tion time-out by directing a student, contingent on a misbehavior, 
to move from his assigned seat to that partitioned location for a 
specified time period. Although partition or select space time- 
out has the advantage of keeping the student within the time-in 
setting—presumably to hear academic content and the teacher 
praising other students for appropriate behavior—it also can 
be disadvantageous. That is, the individual may still be able to 
obtain reinforcement from other students, reducing effectiveness. 
Second, the student might become the object of disparaging com- 
ments from other students (e.g., “Brandon is a dummy; he’s in 
trouble.”). Third, since the individual's view—and the teacher’s 
view—can be at least partially restricted, undesirable surrepti- 
tious behaviors may occur. Also, public perceptions must again 
be taken into account with this form of exclusion. Finally, even 
though the student remains in the room, the partition time-out area 
might be viewed as exclusionary. 


Exclusion Time-Out 


Exclusion time-out is the physical separation of the participant 
from the time-in environment when time-out goes into effect. 
There are two basic tactics for implementing exclusion time-out 
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contingent on the occurrence of the target behavior: (a) The par- 
ticipant is removed immediately from the time-in environment, 
or (b) the time-in environment is removed immediately from 
the participant. 


Participant Removed from Time-I/n Setting 


A time-out room is any confined, safe, secure, supervised, and 
continuously monitored area within the participant’s normal edu- 
cational or treatment facility that is devoid of existing positive 
reinforcers, that does not have access to potential positive reinforc- 
ers, and in which the person can be securely placed. A time-out 
room should preferably be located in close proximity to the time-in 
setting, and it should have minimal furnishing (e.g., a chair and 
a table). It should have adequate light, heat, safety features, and 
ventilation, but should not contain potentially reinforcing items 
(e.g., pictures on the wall, telephones, breakable objects, windows 
with access to playgrounds, or busy streets). The room should not 
be locked, and it must be monitored continuously by staff. 

A time-out room has several advantages that make it 
attractive to practitioners. First, the opportunity to acquire rein- 
forcement during time-out is eliminated or reduced substantially 
because the time-out environment is physically constructed to 
minimize such stimuli. Second, after a few exposures to the 
time-out room, students learn to discriminate this room from 
other rooms in the building. The room assumes conditioned 
aversive properties, thus increasing the probability that the 
time-in setting will be more reinforcing. Third, the risk of a 
student hurting other students in the time-in setting is reduced 
when the offending student is removed to this space. 

However, before using a time-out room, the practitioner 
must consider several strong and compelling disadvantages. 
Escorting a student to the time-out room is not without its 
potential challenges: Practitioners can encounter resistance and 
outbursts, and they should also anticipate and be fully prepared 
to deal with emotional escalations. Using a time-out room may 
serve as a negative reinforcer for the teacher, thus increasing its 
use (Skiba & Raison, 1990). Also, the time-out room may set the 
occasion for the person to engage in undetected self-destructive or 
self-stimulatory behaviors that should be stopped (Solnick et al., 
1977). This situation compromises the effectiveness of time- 
out. Practitioners must be sensitive to the public’s perception of 
the time-out room. Even a benign-appearing time-out room can 
be viewed with distrust by persons who perceive the intent of the 
room differently and are misinformed about its purpose. 

In recent years, the use of the time-out room has increas- 
ingly fallen into disfavor with practitioners, members of pro- 
fessional associations, and client rights advocacy groups, all of 
whom embrace less intrusive, but still effective, interventions 
for problem behavior.” 

Hallway time-out used to be common practice in schools, 
and some teachers continue to employ it, despite increasing 
administrative mandates to prohibit it. In hallway time-out, 
the student is directed to leave the classroom and sit (or stand) 
outside the classroom in the hallway or corridor until the time- 
out period expires. We do not recommend this procedure for 
several reasons: (a) Many school districts and child service 
agencies prohibit teachers and staff from placing children in 
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unsupervised settings; (b) the student can obtain “bootleg” 
reinforcement from a multitude of sources (e.g., students in 
other rooms, individuals walking in the hallway, school visi- 
tors); (c) there is increased likelihood of the student escaping 
(i.e., the student may run away from the school altogether); 
(d) the teacher is less likely to be able to monitor the student 
outside of her line of sight; and (e) unattended students, espe- 
cially those in the midst of a behavioral episode, may represent 
a safety hazard to themselves or others. Therefore, for ethical, 
safety, and programmatic reasons, we strongly recommend that 
practitioners seek alternatives to hallway time-out when trying 
to decrease problematic behavior. 


Time-In Setting Removed from Participant 


This variation of exclusion time-out does not require changing 
the participant’s physical location. Instead, contingent on the 
occurrence of the target behavior, the time-in environment is 
removed from the participant. This form of time-out is a feature 
of some computer-based learning programs whereby certain 
error responses are followed immediately by the screen going 
dark (or blank) for several seconds, denying the user access to 
the activity, and, hence, its reinforcers. 


Desirable Aspects of Time-Out 


Ease of Application 


Time-out, especially in its nonexclusion variations, is relatively 
easy to apply. The teacher can accomplish even physically 
removing a student from the environment with comparative 
ease if the teacher acts in a businesslike fashion and does not 
attempt to embarrass the student. Issuing a direction privately 
(e.g., “Deion, you have disrupted Monique twice; time-out is 
now in effect.”) can help the teacher handle a student who has 
misbehaved but who does not want to leave the room. If the 
behavior warrants exclusion time-out, the teacher must insist 
that the student leave; however, the teacher should communicate 
this insistence at close range so that the student is not placed in 
a position of challenging the teacher openly to save face with a 
peer group. Teachers should consult district policy on whether 
an administrator, or other staff member, should be called to 
assist with escorting the student to the time-out location. 


Acceptability 


Time-out, especially in its nonexclusion variations, meets an 
acceptability standard because practitioners regard it as appro- 
priate, fair, and effective. Even so, practitioners should always 
check with the appropriate administering body to ensure com- 
pliance with school or agency policy before applying time-out 
for major or minor infractions. 


Rapid Suppression of Behavior 


When effectively implemented, time-out suppresses the tar- 
get behavior in a moderate to rapid fashion. Sometimes only 
a few applications are needed to achieve acceptable reduction 
levels. Other reductive procedures (e.g., extinction, differential 
reinforcement of low rates) also produce decreases in behavior, 
but they can be time-consuming. Practitioners may not have 


the luxury of waiting several days or a week for a behavior to 
decrease. In such instances, time-out merits consideration. 


Can Be Combined with Other Applications 


Time-out can be combined with other procedures, extending 
its usability in applied settings (Dupuis, Lerman, Tsami, & 
Shireman, 2015). When combined with differential reinforce- 
ment, time-out can increase desirable behavior and decrease 
undesirable behavior (Byrd, Richards, Hove, & Friman, 2002). 


USING TIME-OUT EFFECTIVELY 


Although time-out has shown effectiveness in reducing behav- 
ior, it should not be the method of first choice. Practitioners 
faced with the task of reducing behavior should initially consider 
nonpunitive methods for reducing the target behavior (e.g., 
differential reinforcement of alternative behavior, extinction). 
Time-out should be considered only when the evidence shows 
that less intrusive procedures have failed. 

Further, effective implementation of time-out requires 
the practitioner to make several decisions prior to, during, and 
after time-out implementation. Figure 15.4 shows Powell and 
Powell’s (1982) time-out implementation checklist as an aid in 
the decision-making process. The following sections expand 
on several major points underlying many of these decisions. 


Enrich the Time-In Environment 


Selected behaviors in the time-in environment must be rein- 
forced. In choosing the behaviors to reinforce during the 
time-in setting, practitioners should identify behaviors that are 
alternative to, or incompatible with, behaviors that will lead to 
time-out (e.g., using differential reinforcement of alternative 
behavior, differential reinforcement of incompatible behavior). 
Differential reinforcement will facilitate the development of 
appropriate behaviors. Further, the ratio of reinforcing state- 
ments to reprimanding comments should strongly fall in favor 
of reinforcing statements. Additionally, upon return from time- 
out, reinforcement for appropriate behavior should be delivered 
as quickly as possible. 

Assuming that the effectiveness of any reinforcing or 
punishing stimulus is derived from the antecedent-behavior- 
consequence contingency, practitioners should examine all 
aspects of this relationship, including the time-in setting. 
Answering the following questions will help in that regard. 


* Are the presumed reinforcers actually functioning as 
reinforcers? 

* What function does the target behavior serve (i.e., atten- 
tion, escape)? 

* Is the time-out setting appropriate, and is the contingency 
being delivered immediately? 


Within this context, practitioners are then charged with 
individualizing time-out to the specific circumstances at hand. 
Solnick et al. (1977) provide guidance: “This view of timeout 
suggests that there can be no ‘standard’ timeout procedure that 
will reliably reduce problem behavior. Alternatively, we are left 


necessary). 


5 

6 Decide on time-out area. 

Ve Decide on length of time-out. 
8 


12. Write up the entire procedure. 


program in the child’s IEP. 


18. Modify the procedure as needed. 


with the task of monitoring the relative reinforcing properties of 
the timein and timeout setting for each child” (p. 423). 


Define the Behavior That Will Lead to Time-Out 


Before time-out is implemented, all of the appropriate parties 
must be informed of the target behaviors that will lead to time- 
out. If a teacher decides to use time-out, she should describe in 
explicit, observable terms the behavior that will result in time-out. 
Merely informing students that disruptive behavior will result in 
time-out is not sufficient. Giving specific examples and nonex- 
amples of what is meant by disruptive behavior is a better course 
to follow. Donaldson and Vollmer (2011) provided operational 
definitions using measurable criteria for screaming, disrupting, 
and aggressive behaviors. It is important to measure behaviors 
precisely because in post-time-out interviews, Readdick and 
Chapman (2000) discovered that children were not always aware 
of the reasons time-out was implemented. Providing explicit 
examples and nonexamples addresses this problem. 


Determine the Form and Variation of Time-Out 


School board policy or institutional constraints may be estab- 
lished that set the parameters for time-out variations that can be 


Figure 15.4 Steps for planning, implementing, evaluating, and terminating time-out. 


Step Task Date Teacher's 
Completed Initials 
1. Try less aversive techniques and document results. soning 
a Operationally define disruptive behaviors. eee ile 6h 
3. Record baseline on target behaviors. col ony le 
4. 


Consider present levels of reinforcement (strengthen if 


Decide on time-out procedure to be used. 


Decide on command to be used to place child in time-out. 


eh Set specific criteria that will signal discontinuance of time-out. 
10. Set dates for formal review of the time-out procedure. 
Vi: Specify back-up procedures for typical time-out problems. 


Wey Have the procedure reviewed by peers and supervisors. 
14. Secure parental/guardian approval and include the written 


(S. Explain procedure to the student and class (if appropriate). 
16. Implement the procedure, take data, and review progress daily. 
‘eas Formally review procedure as indicated. 


Record results for future teachers/programs. 


From “The Use and Abuse of Using the Timeout Procedure for Disruptive Pupils” by T. H. Powell and |. Q. Powell, 
1982, The Pointer, 26(2), p. 21. Reprinted by permission of the Helen Dwight Reid Educational Foundation. 
Published by Heldref Publications, 1319 Eighteenth St., N.W., Washington DC 20036-1802. Copyright © 1982. 
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used in applied settings. In addition, physical factors within the 
building (e.g., a lack of available space) may prohibit exclusion- 
ary forms of time-out. As stated previously, the behavior analyst 
should first consider nonexclusion time-out tactics. 


Obtain Permission 


One of the more important tasks a practitioner must perform prior 
to using time-out, especially in its exclusionary variations, is to 
obtain permission from supervisors or administrators. Because of 
the potential to misuse time-out (e.g., leaving the person in time- 
out too long, continuing to use time-out when it is not effective), 
practitioners must obtain administrative approval before employ- 
ing it. However, interactions can happen so fast in applied settings 
that obtaining permission on an instance-by-instance basis would 
be unduly cumbersome. A preferred method is for the practi- 
tioner, in cooperation with administrators, to decide beforehand 
the type of time-out that will be implemented for certain offenses 
(e.g., nonexclusion versus exclusion), the duration of time-out 
(e.g., 2 to 5 minutes), and the behaviors that will demonstrate 
reinstatement to the time-in environment. Communicating these 
procedures and/or policies to other practitioners and caretakers in 
the building, as well as to parents, is advisable. 
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Explain the Time-Out Procedure and Rules 


In addition to posting the behaviors that will lead to time-out, the 
teacher should also post the rules. Posting helps make explicit the 
behaviors that produce the contingencies (Van Houten, 1979). 
Minimally, these rules should focus on the initial duration of time- 
out and the exit criteria (1.e., rules that determine when time-out 
is in effect, when it is over, and what happens if ongoing inappro- 
priate behavior occurs when the scheduled time-out period ends). 


Determine the Duration of the Time-Cut Period 


Time-out periods should be as brief as possible while main- 
taining effectiveness. Time-out durations of 2 to 5 minutes are 
often effective in reducing behavior, although a short duration 
may be ineffective initially if an individual has had a history of 
longer time-out periods. As a rule, time-out periods exceeding 
15 minutes are not effective. 

Longer time-out periods are counterproductive for sev- 
eral reasons. First, the person may develop a tolerance for the 
longer duration and find ways to obtain reinforcers during time- 
out. Observing the person unobtrusively in time-out may pro- 
vide evidence of the reinforcers being obtained (Solnick et al., 
1977). This situation occurs with people who have a history of 
self-stimulatory behavior. The longer the duration of time-out, 
the more opportunity there is to engage in self-stimulation, and 
the less effective time-out becomes. Second, longer time-out 
periods remove the person from the educational, therapeutic, or 
family time-in environment in which the opportunity to learn 
and earn reinforcers is available. Third, given the undesirable 
practical, legal, and ethical aspects of longer time-out periods, a 
prudent course of action is for the practitioner to use relatively 
short, but consistent, time-out periods. When these periods are 
short, student academic achievement is not affected adversely 
(Skiba & Raison, 1990). 


Apply Time-Out Consistently 


Each occurrence of the undesirable behavior should lead to 
time-out. If a teacher, parent, or therapist is not in a position 
to deliver the time-out consequence after each occurrence of 
the target behavior, it may be better to use an alternative reduc- 
tive technique. Using time-out occasionally may lead to student 
or client confusion about which behaviors are acceptable and 
which are not. 


Establish Exit Criteria 


Practitioners can consider exit criteria options to include: (a) 
extending the time-out period for a predetermined amount of 
additional time, (b) continuing the time-out period until the dis- 
ruptive behavior ceases, or (c) requiring the absence of misbe- 
havior for a specified time prior to the end of the fixed time-out 
period (e.g., no misbehavior for the last 30 seconds of the 5-min 
period). This latter tactic is termed a release contingency. The 
rationale for implementing a release contingency is twofold: 
(1) to mitigate the risk of adventitiously reinforcing the behavior 
that leads to time-out by releasing the person while that behay- 
ior is being emitted (i.e., an escape contingency); and (2) to 
reduce the effects of the time-out behavior from carrying over, 


or spilling into, the time-in setting. A release contingency has 
been evaluated in several studies (e.g., Donaldson & Vollmer, 
2011; Erford, 1999; Mace, Page, Ivancic, & O’Brien, 1986). 

Donaldson and Vollmer (2011) compared fixed-duration 
versus a release contingency time-out procedures. Participants 
were four preschool children who exhibited high rates of prob- 
lem behavior (e.g., screaming, crying, aggression, disruptions). 
The researchers collected data on (a) behaviors that led to time- 
out and (b) delay-producing responses (i.e., problem behavior 
that occurred during the time-out period). During baseline, 
teachers, parents, and experimenters conducted regularly sched- 
uled playground or circle time activities as usual. Children were 
expected to participate in a variety of low-demand activities, and 
programmed consequences were not delivered should a disrup- 
tive episode occur. If children engaged in potentially dangerous 
behavior—jumping from playground equipment, for example— 
teachers or experimenters intervened by blocking and redirect- 
ing, but not by reprimanding. 

During the fixed-duration time-out condition, the child 
was shown a yellow card prior to the start of each session and 
told, “If you are sent to time-out, you only have to stay for 
4 minutes, no matter what” (p. 698). The yellow card was then 
placed in a visible location, and each occurrence of problem 
behavior resulted in a fixed-duration 4-min time-out. 

During the release contingency time-out condition, the 
participant was shown a red card prior to the start of each ses- 
sion and told, “If you are sent to time-out, you will have to 
stay for 4 minutes, but you cannot leave time-out until you are 
calm” (p. 698). The red card was placed where the child could 
see it, and each occurrence of problem behavior resulted in a 
fixed-duration 4-min time-out. However, if disruptive behavior 
occurred during the last 30 seconds of that 4-min interval, time- 
out was extended until 30 seconds elapsed without problem or 
until 10 minutes passed. 

Both time-out procedures reduced problem behavior by all 
four participants (see Figure 15.5). Donaldson and Vollmer (2011) 
reported that problem behavior that occurred during either the 
fixed-time or the release contingency time-out periods did not 
predict spillover effects in the classroom, home, or playground 
after time-out periods were terminated. They concluded the dis- 
cussion of the study’s results with this recommendation: “A fixed 
duration time-out should be considered first because there was 
no apparent benefit to including a release contingency, release 
contingencies resulted in longer time-outs, and the effort required 
to implement a release contingency time-out is greater than that 
required to implement a fixed-duration time-out” (p. 704). 

The decision to exit time-out should consider not only 
the child’s behavior during or shortly before time-out ends but 
also the effect of reintroducing an explosive, out-of-control cli- 
ent back into the time-in setting. Reintroducing a misbehaving 
student to the time-in setting may bring about two undesirable 
outcomes: (1) the offending student may learn that regardless 
of his behavior, he will be able to rejoin his classmates after 
a short period, and (2) classmates predisposed to act out may 
realize that they, too, can escape but later rejoin the group 
without having to change their behavior. Time-out should not 
be terminated until such behaviors are under control, or an 
alternative form of behavior suppression can be executed. 
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If the practitioner anticipates that the inappropriate behav- 
ior that led to time-out may occur when time-out ends, then the 
individual should not be released from time-out. Exit criteria 
should establish this contingency, thereby increasing the poten- 
tial that the problem behavior will not carry over to the time-in 
setting. Use of prompts, cues, or other discriminative stimuli 
toward the end of the time-out session may set the occasion for 
improved performance during the remaining minutes of time-out 
(e.g., “Rosario, when your time-out ends in 2 minutes, you may 
rejoin your classmates’). 


Ensure Compliance with Legal 
and Ethical Guidelines 


The use of time-out in applied settings has been much dis- 
cussed. Although litigation surrounding the use of this pro- 
cedure has focused primarily on institutionalized populations 
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Figure 15.5 Rate of time-out-producing problem 
behavior across sessions for each participant. 


“An Evaluation and Comparison of Time-Out Procedures with and 
without Release Contingencies”, J. M. Donaldson and T. R. Vollmer, 
2011, p, 699, Journal of Applied Behavior Analysis, 44. Reproduced 
with permission of John Wiley & Sons Inc. 


(e.g., Morales v. Turman, 1973; Wyatt v. Stickney, 1974), the 
rulings in these cases have had a profound effect on the use of 
time-out in classroom, center, and clinical settings. The issues 
before the courts focus on the protection of client rights, whether 
time-out represents cruel and unusual punishment, and the 
degree of public acceptability of the time-out procedure. 

The upshot of major court rulings has been that a per- 
son’s right to treatment includes the right to be free from 
unnecessary and restrictive isolation. However, rulings also 
include language that permits the use of time-out in a behavior 
change program as long as the program is closely monitored 
and supervised and is designed to protect the individual or 
others from bodily harm. 

In response to changing conditions in practice, court cases, 
increased knowledge about effective instruction, best practice, 
and input from practitioners, parents, and caretakers, the Behav- 
ior Analyst Certification Board (BACB) (2014) codified its view 
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Figure 15.6 Key distinctions between time-out, seclusion, and restraint. 


Definition 


Immediate response-contingent ° 
withdrawal of the opportunity 

to earn positive reinforcers or 

the immediate loss of access to 
positive reinforcers for a specified 
time. 


Time-Out from 
Positive Reinforcement. 
(Time-Out) 


Seclusion Involuntary isolation of the indi- 
vidual to a private, escape-proof 
setting, physically displacing the 
person from time-in to interrupt 


problem or harmful behavior. 


Restraint Restricting the free movement 
of a person physically (e.g., 
basket hold) or mechanically 
(by a device) to prevent severe 
episodic, or self-injurious, 
behavior. 


Requirements 


A behavioral intervention 
plan (BIP) if exclusion 
time-out is used 


Informed consent 


A BIP prohibiting use of a 
locked room & involuntary 
sedation 


Informed consent 


Alternative, evidence-based 
practices to be reviewed 


Continuous monitoring 


Training on positive behavioral 
methods 


De-escalation training 
Regular safety, fire, and 
programmatic inspections 
Continuous monitoring 
Training to ensure health 


and welfare of the individual 
during restraint 


Compatible with Behavior 


Intervention Program (BIP)? 


e Yes, to the extent time-out is 
protective and therapeutic. 


Yes, to the extent seclusion is 
protective and therapeutic. 


Yes, to the extent restraint is 
protective and therapeutic. 


Yes, when used in an 
emergency situation to 
prevent immediate harm. 


Based on The Council for Exceptional Children’s (CEC, 2009) Policy on Physical Restraint and Seclusion Procedures in School Settings is an important 
source of guidance for school-based practitioners. 


on ethical issues of punishment in its Professional and Ethical 
Compliance Code for Behavior Analysts. In this document, the 
BACB reaffirmed three guiding principles: (1) The welfare of the 
individual served is the highest priority, (2) individuals (and par- 
ents/guardians) have a right to choose, and (3) the principle of least 
restrictiveness should be followed. The BACB also provides its 
view on the use of punishment in applied settings in this document. 

The Association for Behavior Analysis International’s 
(ABAI) Position Statement on Restraint and Seclusion differen- 
tiates time-out, restraint, and seclusion in the context of serving 
clients with severe problem behavior. Figure 16.5 defines and 
summarizes key distinctions between these interventions. 

Behavior analysts who serve clients with severe problem 
behavior should familiarize themselves with the entire ABAI 
position statement, which includes sections on client rights, clin- 
ical oversight, procedural safeguards, and ongoing evaluation 
based on objective data. 

Applied behavior analysts who serve students with chal- 
lenging behavior must also abide by policies and regulations 
governing the application of crisis interventions in schools. The 
Council for Exceptional Children’s (CEC: (2009) Policy on 
Physical Restraint and Seclusion Procedures in School Settings 
is an important source of guidance for school-based practitioners 
(see https://www.cec.sped.org/~/media/Files/Policy/CEC%20 
Professional %20Policies%20and%20Positions/restraint%20 
and%20seclusion.pdf). Chapter 31 provides additional details 
on the ethical use of negative and positive punishment. 


Evaluate Effectiveness 


At the very least, data need to be obtained on the inappropriate 
behavior that initially led to time-out. If time-out is effective, 
the level of that behavior should be reduced substantially, and 
that reduction should be noticed by other persons in the envi- 
ronment. For legal and ethical reasons, additional records should 
be kept, documenting the use of time-out; the duration of each 
time-out episode; and the individual’s behavior before, during, 
and after time-out (Yell, 1994). 

Resources permitting, in addition to collecting data on 
the target behavior, it is sometimes beneficial to collect data 
on collateral behaviors, unexpected side effects, and the target 
behavior in other settings. For instance, time-out may produce 
emotional behavior (e.g., crying, aggressiveness, withdrawal) 
that might overshadow positive gains. Keeping a record of these 
episodes is helpful. Also, Reitman and Drabman (1999) sug- 
gested that for home-based applications of time-out, parents 
keep a record of the date, time, and duration of time-out, as 
well as a brief anecdotal description of the effects of time-out. 
When reviewed with skilled professionals, these anecdotal logs 
provide a session-by-session profile of performance, making 
adjustments to the time-out procedure more informed. In the 
view of Reitman and Drabman: 


The time-out record and discussions centered around it 
yielded a wealth of information including the type, fre- 
quency, context, and rate of misbehavior. . . . Graphing these 


data provided visual feedback to both the therapists and the 
clients regarding the extent to which behavior change had 
occurred and facilitated the setting of new goals. (p. 143) 


Terminate the Time-Out Program 


Time-out has the potential to be employed well beyond its effec- 
tive date. “Given the clinical and administrative effort required 
to initiate punishment programs, agencies may be inclined to 
keep them in place longer than needed rather than to expend 
more effort in evaluating their continued necessity” (Iwata, 
Rolider, & Dozier, 2009, p. 204). 

Notwithstanding the general recommendation that the 
overall length of any behavior change program should be 
determined by data, the plan to eliminate time-out should begin 
shortly after the procedure is initiated. That is, once time-out 
has been introduced, and data show reduced levels of the target 
behavior, efforts should be made to switch to positive behavior 
reductive approaches (e.g., differential reinforcement of alter- 
native behaviors, differential reinforcement of incompatible 
behaviors) or to reduce implementation over time. Conceptual- 
izing time-out as existing on a continuum may give practitioners 
context for deciding for whom and under what conditions time- 
out is warranted. Stated in the form of questions, analysts might 
ask: Is time-out still necessary for this individual? Are there 
other less intrusive behavior reduction procedures that could 
be employed? If time-out is still considered the best method 
to reduce behavior, how can it be implemented less intrusively 
(nonexclusion versus exclusion)? Continuing time-out—or any 
punishment approach—beyond the point of its demonstrated 
effectiveness risks ethical violations and is counterproductive. 


RESPONSE COST DEFINED 


Response cost is the response-contingent loss of a specific 
number of positive reinforcers that has the effect of decreas- 
ing the future occurrence of similar responses. Essentially, 
response cost constitutes a fine for misbehavior. 
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Response cost typically involves the loss of one or more 
generalized conditioned reinforcers (e.g., money, tokens), tan- 
gibles (e.g., stickers, marbles), or minutes of already-earned 
time with a preferred activity (e.g., listening to music, play- 
ing a computer game) (Lee, Penrod, & Price, 2017; Musser, 
Bray, Kehle, & Jenson, 2001; Tanol, Johnson, McComas, & 
Cote, 2010; Wahl, Hawkins, Haydon, Marsicano, & Morrison, 
2016). A teacher implements response cost by removing a 
specific number of positive reinforcers already held by a stu- 
dent contingent on the occurrence of a targeted inappropriate 
behavior. 

The punitive value of the lost reinforcers depends 
on the individual reinforcement preferences of the partici- 
pant. Hall and colleagues (1970), for example, found that 
the response-contingent removal of slips of paper bear- 
ing the student’s name had a reductive effect on the num- 
ber of complaints by a boy during 30-minute reading and 
math periods. Prior to each period during the response cost 
condition, the teacher placed five slips of paper bearing the 
boy’s name on his desk. Each time the boy cried, whined, or 
complained during the period, the teacher removed a slip. 
A multiple baseline across academic periods with a reversal 
to baseline analysis revealed a functional relation between 
response cost and a marked decrease in disruptive behaviors 
(see Figure 15:7): 


Desirable Aspects of Response Cost 


Effectiveness 


Response cost reduces the future occurrence of similar 
behaviors. According to Lerman and Toole (2011), “. . . the 
contingent removal of a specific amount of reinforcement 
(e.g., tokens) can function as an effective punisher” (p. 352). 
Boerke and Reitman (2011) underscore this point, citing 
the wide range of successful reductive response cost appli- 
cations in home-, academic-, and hospital-based settings. 
Practitioners who face important challenges with disruptive, 


Figure 15.7 The number of cries, whines, and 
complaints by Billy during the 30-minute reading and 
arithmetic periods. 

Erom “Modification of Disrupting and Talking-Out Behavior with 
the Teacher as Observer and Experimenter,” by R. V. Hall, R. Fox, 
D. Williard, L. Goldsmith, M. Emerson, M. Owen, E. Porcia, and 
R. Davis, 1970, paper presented at the American Educational 
Research Association Convention, Minneapolis. Reprinted with 
permission. 


402 Part5 ¢ Punishment 


off-task, non-compliant, and even life-threatening behaviors 
need a procedure that produces reliable reductive results. 
Response cost addresses this criterion. 

Even though the data from numerous studies show the 
shorter term effectiveness of response cost, the longer term ben- 
efits for the individual, and the significant others with whom the 
individual interacts, are still largely unknown (Lerman & Toole, 
2011). In their summary on developing function-based punish- 
ment procedures, of which response cost is a prime example, 
Lerman and Toole provide a number of recommendations for 
improving the long-term effectiveness of response cost: 


... punishment can be improved by (1) selecting punishers 
via pretreatment avoidance, choice, or activity assessments; 
(2) delivering the consequences immediately following 
each instance of the behavior; (3) ensuring that alternative 
sources of reinforcement are readily available; (4) estab- 
lishing discriminative control... ; and (5) .. . using condi- 
tioned punishers. (pp. 363-364) 


Practitioners would do well to follow Lerman and Toole’s 
guidelines. Furthermore, applied behavior analysts charged with 
training field-based practitioners would also be wise to incor- 
porate these five suggestions into their initial and recurrent 
response cost training programs. Chapter 31 contains additional 
strategies for trainers placed in supervisory positions. 


Least Restrictive Alternative 


We state repeatedly throughout this text, and especially in 
Chapter 31, that behavior analysts must comply with an ethi- 
cal code of conduct. One important application of this code is 
that practitioners use interventions that are consistent with the 
principle of the least restrictive alternative as detailed in the Pro- 
fessional and Ethical Compliance Code for Behavior Analysts 
(2014). Briefly, this principle states that the most powerful, yet 
least intrusive, method be employed first and found to be ineffec- 
tive, before more restrictive options are exercised. Hence, when 
analysts face challenging behaviors, especially those that have 
been resistant to other reductive treatment (e.g., extinction, differ- 
ential reinforcement of lower rates of behavior), it is incumbent 
on the analyst to continue exploring least restrictive measures. 
Lerman and Toole (2011) add the following guidance. 


Ethical guidelines mandate that priority should be given 
to the least restrictive alternative procedure that is clini- 
cally effective. .. . Inherent in this approach to treatment 
selection is the need to arrange punishment procedures 
hierarchically based on the degree of restrictiveness 

(i.e., how much the procedure limits an individual’s free- 
dom or intrudes into an individual life in some manner) or 
aversiveness (i.e., how much the procedure produces dis- 
comfort, pain, or distress). Nonexclusionary time-out and 
response cost typically are considered the least restrictive 
of the procedures, followed by exclusionary time-out, over- 
correction, and other physical punishers. (p. 353) 


Moderate to Rapid Decrease in Behavior 


Like other forms of punishment, response cost usually pro- 
duces a moderate to rapid decrease in behavior. Unlike other 


behavior-reductive procedures that may require more time to 
show a significant effect (e.g., extinction, differential reinforce- 
ment of alternative behaviors), response cost tends to produce 
changes quickly. Suppressive effects are noted rapidly, within 
two to three sessions, as evidenced by Hall and colleagues’ data 
(see Figure 15.7) and replicated by Donaldson, DeLeon, Fisher, 
and Kahng (2014) (see Figure 15.8). 

For instance, when the response cost contingency was 
introduced during “Tokens” (denoted by the square data points 
in Figure 15.8), students lost one token per disruption, and the 
number of disruptive responses per minute fell to near zero. Ina 
contrasting Token Earn condition, students could earn one token 
for emitting any appropriate behavior at the time a signal was 
emitted to record occurrences or nonoccurrences. This rapid 
loss was again noted after “Tokens: Choice” was reintroduced 
following Baseline 2. Overall, the data show an immediate and 
significant reduction in disruptive behavior. Interestingly, when 
students were given a choice (i.e., token earn or token loss), all 
four students shown in Figure 15.8, and two thirds of the other 
students in the entire study, preferred token loss over token gain. 


Convenience 


Response cost is convenient to implement and can be used in a 
variety of school-, home-, hospital-, and center-based settings 
(Ashbaugh & Peck, 1998; Musser et al., 2001). For example, if 
students are expected to follow explicitly stated classroom pro- 
cedures, rules, or contracts, then any rule infraction will produce 
a loss of a specified reinforcer (e.g., a token), and it signals to 
the student that future occurrences of the misbehavior will result 
in the same consequence. Under these conditions, decision rules 
about implementation can be made expediently. 

At home, response cost is convenient to the extent that it 
can be incorporated into an existing routine. For instance, if a 
child earns an allowance of $7 per week ($1 per day) for clean- 
ing her room and placing her dishes in the sink after eating, 
noncompliance with one or both of these behaviors would lead 
to a monetary loss (e.g., 50¢ per infraction). In this setting, a 
response cost chart, posted in a visible location, would provide 
a convenient feedback mechanism to show the child and her 
parents accumulated losses (Bender & Mathes, 1995). 

Figure 15.9 shows a sample response cost chart as it might 
be used in a classroom or tutorial setting. To implement the 
response cost procedure, the teacher writes a column of decreasing 
numbers on the board or chart. Immediately following the occur- 
rence of a disruptive behavior, the teacher crosses off the highest 
remaining number. If the teacher places the numbers 15 through 0 
on the board and three disruptive behaviors happened, the students 
would receive 12 minutes of preferred activity time that day. 


Can Be Combined with Other Approaches 


Response cost can be combined with other behavior change 
procedures (Athens, Vollmer, Sloman, St. Peter Pipkin, 2008; 
DeLeon et al., 2008; Long, Miltenberger, Ellingson, Ott, 1999; 
Watkins & Rapp, 2014). For instance, practitioners might add 
response cost to an existing treatment to further reduce levels 
of a target behavior. Researchers might add response cost to an 
existing procedure, withdraw it, and then reinstitute response 
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Figure 15.8 Responses per minute of disruptive 
behavior across sessions for Damon, Shaun, 
Lamar, and Tabitha. 


“Effects of and preference for conditions of token earn 
versus token loss” by J. M. Donaldson, |. G. DeLeon, 
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cost, as part of a component analysis to calibrate the occur- 
rence, rate, or percentage reduction. Such analysis might address 
important questions as to whether response cost alone is suffi- 
cient to produce desirable reductive levels, or whether it must be 
applied with another procedure to gain this effect. The following 
sections describe a few of these alternatives. 
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Figure 15.9 A response cost chart showing number of 
minutes of free time remaining after three rule infractions. 


A.B. Fisher, and S. W. Kahng, 2014, Journal of Applied 
20 Behavior Analysis, p. 544. Reproduced with permission of 
John Wiley & Sons Inc. 


Response Cost with Positive Reinforcement. Response cost 
can be combined with positive reinforcement. For example, a 
student might receive a point, a star, or a sticker for each aca- 
demic assignment completed during the morning seatwork 
period, but lose a point, a star, or a sticker for calling out with- 
out permission. Donaldson et al. (2014) used this dichotomous 
gain-loss procedure in their study. 

Musser and colleagues (2001) combined response cost 
and positive reinforcement to improve the compliant behavior of 
three students with emotional disturbance and oppositional defi- 
ant behavior. Briefly, the procedures involved the teacher mak- 
ing a normal request for compliance (e.g.., “Take out your book, 
please”). If the student complied within 5 seconds, the teacher 
delivered praise. If a 30-min period expired and no episodes of 
noncompliance were recorded, a sticker was delivered to the 
student. If the student failed to comply with a request within 
5 seconds, the teacher waited 5 seconds and issued another 
directive (e.g., “Take out your book”). The teacher then waited 
an additional 5 seconds for compliance. If the student complied, 
the teacher issued praise. If not, one sticker was deducted for 
noncompliance. Stickers were later exchanged for a “mystery 
motivator” (e.g., a backup reinforcer from a grab bag). Results 
for all three students showed marked decreases in noncompli- 
ant behavior. Thus, a dual system, which was user friendly and 
convenient, proved to be effective insofar as the students earned 
stickers for intervals of compliant behavior, but lost stickers— 
the response cost contingency—for noncompliant behavior. 

This method of combining response cost with positive rein- 
forcement has at least two advantages: (1) If all of the tokens or 
points are not lost through response cost, the remaining ones can 
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be traded for backup conditioned or unconditioned reinforcers; 
and (2) the practitioner’s behavior is not entirely focused on deliv- 
ering a negative punishment contingency. Presumably, the practi- 
tioner will also focus on the occurrence of positive, constructive 
behaviors to reinforce. In doing so, the practitioner teaches new 
behaviors to the individual and reduces legal and ethical concerns. 


Response Cost with Group Consequences. Response cost can 
be combined with group consequences. That is, contingent on 
the inappropriate behavior of any member of a group, the whole 
group loses a specific number of reinforcers. The Good Behavior 
Game, discussed in Chapter 28, serves as an excellent illustra- 
tion of how response cost can be applied with groups. 


Response Cost with Differential Reinforcement of Alternative 
Behavior (DRA). Response cost has been combined with differ- 
ential reinforcement of alternative behavior (DRA) to treat the 
food refusal behavior of a 5-year-old child with developmental 
delay. Kahng, Tarbox, and Wilke (2001) replicated the Keeney, 
Fisher, Adelinis, and Wilder (2000) study, except that response 
cost was combined with DRA. During a baseline condition, 
various food group trials were presented every 30 seconds. If 
food was accepted, praise was delivered. If food was refused, the 
behavior was ignored. Based on reinforcer assessments, during 
DRA and response cost, books and audiotapes were made avail- 
able at the meal. If food was accepted, praise was delivered. If 
food was refused, the books and audiotapes were removed for 
30 seconds. The books and audiotapes were replaced during the 
next trial contingent on accepting food (i.e., DRA). Parents and 
grandparents were also trained in the DRA plus response cost 
procedures. Results showed a substantial increase in the percent- 
age of intervals of food acceptance and a dramatic decrease in 
the percentage of intervals of food refusal and disruption. 


Other Combinations. Watkins and Rapp (2014) combined 
response cost with environment enrichment to reduce the stereo- 
typy behavior of five preadolescent and adolescent individuals with 
autism. In their study, participants were exposed to three condi- 
tions: no interaction; environment enrichment, during which rein- 
forcers identified through a stimulus preference assessment were 
available; and environment enrichment, during which those pre- 
ferred reinforcers were removed for 15 seconds contingent upon 
each occurrence of stereotypy behavior. Results showed that when 
environment enrichment and response cost were combined, the ste- 
reotypy behavior of all five individuals decreased immediately and 
markedly. Further, after these two procedures were withdrawn, the 
stereotypy behavior of three of the five participants did not increase. 

In sum, the findings of studies directly combining response 
cost with other procedures have been positive. Whether response 
cost is used alone, or in combination with other procedures, 
it works. Still, practitioners should remember that (1) punish- 
ment procedures do not teach “new” behaviors; instead, they 
are designed to reduce the frequency of problem behavior: and 
(2) the environment where the target behavior occurs also needs 
to provide an ample number of reinforcers for appropriate 
behaviors. Stated differently, response cost should not be used 
as a sole behavior change approach without also considering 
the beneficial effects that accrue when positive reinforcement is 


provided. Combining response cost procedures with behavior- 
building approaches (e.g., DRA) is a good plan. 


Established Social Validity 


As discussed in Chapter 10, a procedure is considered socially 
valid if it meets three criteria: (1) The target behavior is signif- 
icant for the individual, (2) the methodological procedures are 
acceptable and meet the standard of best care, and (3) the results 
produce meaningful positive change. 

On the basis of these criteria, a legitimate claim can be 
made that response cost is socially valid. Typically, the behay- 
iors selected for a response cost procedure call attention to them- 
selves and have not improved in any meaningful way given prior 
interventions. Also, teachers and the students themselves report 
that response cost procedures are acceptable (Donaldson et al., 
2014; Hirst, Dozier, & Payne, 2016). The dramatic reductive 
effect that response cost produces reaches the level of signifi- 
cance for those involved. That is, their lives are improved. Ulti- 
mately, they come into contact with more reinforcers and fewer 
punishers in the future. 

Embry and Biglan (2008) included response cost in their 
list of 52 fundamental units of behavioral influence. In their 
view, response cost meets this designation because it is socially 
valid, is easy to implement, and produces clear and discrete out- 
comes that have been verified repeatedly by experimental and 
applied analyses. 


RESPONSE COST METHODS 


Response cost is implemented using two principal methods. The 
practitioner can (a) levy a fine against an individual’s exist- 
ing cache of previously earned reinforcers; or (b) levy a fine 
against an individual’s surplus of extra, noncontingently pro- 
vided reinforcers. 


Existing Cache Response Cost 


When a fine is levied against a person’s existing cache of posi- 
tive reinforcers, the individual loses a specific number of posi- 
tive reinforcers from a supply of currently available reinforcers. 
For example, each occurrence of noncompliant behavior would 
produce a loss of | minute of free time from a current allocation 
of 15 minutes. In this case, response cost is applied to stimuli 
known to be reinforcing and accessible to the person. 


Bonus Response Cost 


Bonus response cost operates when practitioners make addi- 
tional reinforcers available noncontingently to the participant, 
specifically for removal when a target behavior occurs. Box 15.1 
presents a vignette of how a father might implement a bonus 
response cost procedure with his two sons to reduce their fight- 
ing and squabbling at the dinner table. 


USING RESPONSE COST EFFECTIVELY 


To use response cost effectively, the behavior targeted for 
response cost, the timing of the fine, and the amount of the fine 
need to be explicitly stated. For instance, the behavior analyst 


Chapter 15 * Negative Punishment 405 


Father and Sons Vignette 


Father: Boys, Vd like to talk to you. 
Tom and Pete: OK. 


Father: We have to find a way to stop the fighting and squabbling 
at the dinner table. The way you two go at each other upsets 
everyone in the family, and I can’t eat my dinner in peace. 


Tom: Well, Pete usually starts it. 
Pete: I do not! 


Tom: You do, too... 


Father: (Interrupting) That’s enough! This is just what I mean, 
nit-picking at each other. It’s ridiculous and it needs to 
stop. I’ve given it some thought, and here’s what we’re 
going to do. Since each of you earns an allowance of 
$5 per week for doing household chores, I was tempted 
to take it away from you the next time either of you started 
squabbling. But I’ve decided against doing that. Instead, 
I’m going to make an additional $5 available to each of 
you, but for each sarcastic comment or squabble you get 
into at the dinner table, you’ ll lose $1 of that extra money. 
So, if you act properly, you’ll get an additional $5 per 
week. If you have two squabbles, you’ll lose $2 and earn 
only an additional $3. Do you understand? 


might say, “Any occurrence of an off-task behavior during 
reading will be subject to an immediate loss of one token.” 
Further, any rules that would apply for refusal to comply with 
the response cost procedure need to be explained. However, 
in situations in which multiple behaviors are subject to the 
response cost contingency, or in which the degree of severity 
of the behavior determines the response cost, correspondingly 
greater fines should be associated with more severe behaviors. 
In effect, the magnitude of the punishment (response cost) 
should reflect the magnitude of the targeted behavior. 
According to Weiner (1962)—the originator of the 
term response cost—the magnitude of the response cost fine 
is important. As the magnitude of the fine increases, larger 
reductions in the rate of the undesirable behavior may occur. 
However, if the loss of reinforcers is too great and too rapid, 
the person will soon be exhausted of his or her supply of rein- 
forcers, and a deficit situation will exist. As a general rule, the 
fine should be large enough to suppress the future occurrence 
of the behavior, but not so large as to bankrupt the person or 
cause the system to lose its effectiveness. In short, if a client 
loses tokens or other reinforcers at too high a rate, he or she 
may give up and become passive or aggressive, and the pro- 
cedure will become ineffective. Furthermore, fines should not 
be changed arbitrarily. If a teacher imposes a |-unit token loss 
for each noncompliant response in the morning, the teacher 
should not impose a 3-unit token loss for the same behavior 


in the afternoon. 


Pete: How about if Tom starts something and I don’t? 


Father: Whoever does any fighting or squabbling loses $1. If 
Tom starts something and you ignore him, only he loses. 
Tom, the same thing holds true for you. If Pete starts some- 
thing and you ignore him, only Pete loses. One more thing. 
Dinnertime starts when you are called to the table; it ends 
when you are excused from the table. Any other questions? 


Tom: When do we begin? 
Father: We'll begin tonight. 


It is important to note that the father described the contingen- 
cies completely to his sons. He told them what would happen 
if no squabbles occurred—they would earn an additional $5 
per week—and he told them what would happen if one sibling 
or the other started to bicker but the other ignored him. In 
addition, he clearly defined when dinnertime would begin and 
end. Each of these explanations was necessary for a complete 
description of the contingencies. Note that the money the boys 
normally earned as an allowance for doing household chores 
was not affected by this bonus response cost contingency. 
Ideally, the father praised his sons for their appropriate table 
manners and decorum when it occurred so that these behaviors 
could be reinforced and strengthened. 


Deliver Fines Immediately 


Ideally, the fine should be imposed immediately after the occur- 
rence of each undesirable behavior. The more quickly response 
cost is applied subsequent to the occurrence of the behavior, the 
more effective the procedure becomes. 


Response Cost or Bonus Response Cost? 


Three considerations may help practitioners decide whether to ini- 
tiate a response cost or a bonus response cost procedure. First, the 
least aversive procedure should be attempted initially. Consistent 
with the principle of the least restrictive alternative, an effort should 
be made to ensure that the minimum loss of reinforcers occurs for 
each infraction. Bonus response cost may be the less aversive of 
the two variations because the reinforcers are not deducted directly 
from the individual cache of reinforcers; instead, they are lost from 
a pool of noncontingently available (i.e., bonus) reinforcers. 

The second consideration is similar to the first and can 
be stated in the form of a question: What is the potential for 
aggressive, emotional outbursts? From a social validity perspec- 
tive, students (and their parents) would probably find it more 
agreeable to lose reinforcers from an available reserve than from 
their earnings. Consequently, a bonus response cost procedure 
may spark fewer aggressive or emotional outbursts, or be less 
offensive to students or their parents. 

A third consideration is the need to reduce the behavior 
quickly. Combative or noncompliant behavior may be more 
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appropriately suppressed with response cost because the con- 
tingency directly reduces the student’s available reinforcers. The 
response-contingent withdrawal of reinforcers serves to reduce 
the behavior swiftly and markedly. 


Ensure Reinforcer Reserve 


Positive reinforcers cannot be removed from a person who does 
not have any. Prior to using response cost, the practitioner must 
ensure a sufficient reinforcer reserve. Without such a reserve, 
the procedure is unlikely to be successful. For example, if a 
teacher used response-contingent withdrawal of free time for 
each occurrence of inappropriate behavior in a highly disrup- 
tive class, the students could exhaust all of their available free 
time before the end of the first hour, leaving the teacher won- 
dering what to do for the remainder of the day. Deducting free 
time from time earmarked for succeeding days would hardly be 
beneficial. 

Two suggestions apply to reduce the likelihood of hav- 
ing no reinforcers available. First, the ratio of points earned to 
points lost can be managed. If baseline data indicate that the 
inappropriate behavior occurs at a high rate, more reinforcers 
can be programmed for removal. Also, determining the magni- 
tude of the fine, and stating it explicitly beforehand, is helpful. 
Minor infractions may warrant relatively smaller fines, whereas 
major infractions may warrant substantially higher fines. Sec- 
ond, if all reinforcers are lost and another inappropriate behav- 
ior occurs, the analyst might consider implementing time-out 
from positive reinforcement as a supplemental procedure. 

To establish the initial number of available reinforcers, 
practitioners collect baseline data on the occurrence of inap- 
propriate behavior during the day or session. The mean base- 
line figure can be increased for the number of reinforcers to 
ensure that all reinforcers are not lost when response cost is in 
effect. Although no empirically verifiable guidelines are avail- 
able, a prudent approach is to increase the number of reinforcers 
25% above the mean number of occurrences during baseline. 
For example, if baseline data indicate that the mean number 
of disruptions per day is 20, the practitioner might establish 
25 minutes of free time (the positive reinforcer) as the initial 
level (i.e., 20 X 1.25). If the practitioner calculates points 
instead of percentages, she could add an additional 10 to 
20 points to ensure an adequate buffer. 


Recognize the Potential for Unplanned 
or Unexpected Outcomes 


Two situations may require the implementation of a contingency 
plan. One occurs when the repeated imposition of response cost 
serves to reinforce, and not punish, the undesirable behav- 
ior. When this situation arises, the practitioner should stop 
using response cost and switch to another reductive procedure 
(e.g., time-out or DRA). The second situation occurs when the 
person refuses to hand over positive reinforcers. To reduce the 
likelihood of this event, the practitioner should clarify the con- 
sequences of such refusal beforehand and (a) make sure that an 
adequate supply of backup reinforcers is available, (b) impose 
an additional penalty for not giving up the reinforcer (e.g., the 
sticker) (Musser et al., 2001), (c) refund some fractional portion 


of the fine for complying with immediate payment, or (d) side- 
step the whole possibility of refusal by maintaining control of 
the tokens on a ledger, electronically, or in a jar placed out of 
the participant’s reach. 


Avoid Overusing Response Cost 


Response cost should be saved for those major undesirable 
behaviors that call attention to themselves and need to be 
suppressed quickly. A teacher’s or parent’s primary attention 
should always focus on positive behavior to reinforce; response 
cost should be implemented as a last resort and should be com- 
bined with other procedures that ultimately are designed to build 
adaptive behavior. 


Keep Records 


Each occurrence of response cost and the behavior that occa- 
sioned it should be recorded. Minimally, the analyst should 
record the number of times fines are imposed, the persons to 
whom fines are issued, and the effects of the fines. Daily data 
collection helps to determine the efficacy of the response cost 
procedure that might otherwise remain undetected (Boerke & 
Reitman, 2011). By graphing the effects of the response cost 
intervention, a chart is created that the behavior analyst can use 
to (1) determine the immediacy, the extent, and the duration 
of the suppressive effect of the procedure, and (2) document 
the effectiveness to a skeptical practitioner predisposed to using 
response cost based on biased personal beliefs, or inadequate 
preservice training (Cooke, 1984). 


RESPONSE COST CONSIDERATIONS 
Increased Aggression 


Response-contingent withdrawal of positive reinforcers may 
increase student verbal and physical aggressiveness. The student 
who loses several tokens, especially within a short time, may 
verbally or physically assault the teacher. Emotional behaviors 
should be ignored whenever possible if they accompany the 
implementation of response cost (Walker, 1983). Still, teachers 
should anticipate this possibility and (a) preempt a decision to 
use response cost if they suspect that a worse condition will 
result in the aftermath of an emotional episode, or (b) be pre- 
pared to “ride out the storm.” 


Avoidance 


The setting in which response cost occurs or the person who 
administers it can become a conditioned aversive stimulus. If 
this situation occurs in school, the student may avoid the school, 
the classroom, or the teacher by being absent or tardy. A teacher 
can reduce the likelihood of becoming a conditioned aversive 
stimulus by contingently delivering positive reinforcement for 
appropriate behavior. 


Collateral Reductions of Desired Behavior 


The response-contingent withdrawal of positive reinforcers for 
one behavior can affect the occurrence of other behaviors as 


well (St. Peter, Byrd, Pence, & Foreman, 2016). If the teacher 
fines Angelina | minute of recess each time she calls out during 
math class, the response cost procedure may reduce not only 
her call-outs but also her math productivity. Angelina may 
say to the teacher, “Since I lost my recess time for calling out, 
1 am not going to do my math.” She could also become passive- 
aggressive by sitting in her seat with arms folded and not work. 
Teachers and other practitioners should anticipate such collat- 
eral behaviors and clearly explain the response cost rules, rein- 
force other classmates as models of appropriate behavior, and 
avoid face-to-face confrontations. 


Calling Attention to the Punished Behavior 


Response cost calls attention to the undesirable behavior. That 
is, upon the occurrence of an inappropriate behavior, the student 
is informed of reinforcer loss. The teacher’s attention—even 
in the form of notification of reinforcer loss—could serve as 
an attention-getting reinforcing event. In effect, the teacher’s 
attention may increase the occurrence of future misbehavior. For 
instance, the teacher may have difficulty because every mark 
that she places on the chalkboard indicates that a positive rein- 
forcer has been lost (e.g., minutes of free time). To counteract 
the possibility of calling attention to inappropriate behavior, the 
teacher should ensure that the delivery ratio of reinforcers to the 
removal ratio of reinforcers favors positive reinforcement. 


Unpredictability 


As with other forms of punishment, the side effects of response 
cost can be unpredictable and are related to a number of vari- 
ables that are not fully understood or well investigated across 
participants, settings, or behaviors. These variables include the 
magnitude of the fine, the previous punishment and reinforce- 
ment history of the individual, the number of times behaviors 
are fined, and the availability of alternative responses that are 
eligible for reinforcement. 
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SUMMARY 


Time-Out from Positive Reinforcement Defined 


1. Time-out from positive reinforcement (or simply 
time-out) is the immediate response-contingent with- 
drawal of the opportunity to earn positive reinforcers or 
the immediate loss of access to positive reinforcers for a 
specified time. 


2. To function as negative punishment, the time-out pro- 
cedure must decrease future occurrences of similar 
behaviors. Implicit in the definition of time-out are three 
important factors: (a) The discrepancy between the 
“time-in” and the time-out environments must be dis- 
criminative, (b) the response-contingent loss of access to 
reinforcement must occur immediately, and (c) a resultant 
decrease in the future frequency of the time-out—producing 
behavior must occur. 
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Treatment Integrity 


Treatment integrity is the accuracy and procedural consistency 
with which a planned intervention is executed. If the plan is 
not implemented with high fidelity, then it becomes extremely 
difficult to determine if the lack of demonstrated improvement 
is due to the intervention not being implemented as designed, 
or if the treatment itself was not robust enough to affect the 
change. 

Response cost, like other behavior-building or -suppressing 
procedures, suffers from two possible threats to treatment 
integrity: omission errors and commission errors. Omission 
errors occur when the analyst fails to apply the response cost 
contingency and levy the appropriate fine after target behavior 
occurrence. In this situation, a target behavior occurs, and the 
practitioner either does not register the behavior as an occur- 
rence or observes the behavior but still fails to apply the fine. In 
either case, an omission error occurs. Commission errors occur 
when practitioners levy fines for behavior that should not have 
been subjected to response cost. In other words, a conditioned 
generalized reinforcer is removed from the individual when it 
should not have been. 

Three points are worth noting about omission and com- 
mission errors. First, providing supervisory training, practice, 
feedback, and recurrent maintenance instruction reduces the 
chances of these types of errors. Second, the higher the fidelity 
of treatment integrity—that is, approaching 100%—the more 
suppressed the behavior becomes. Finally, at least some data 
in nonclinical applications suggest that even if a response cost 
procedure is implemented partially incorrectly (e.g., a mix of 
omission and/or commission errors), acceptable suppression 
may still result (St. Peter et al., 2016). On the continuum of 
errors scale, obviously, practitioners should aim for zero, or 
few, errors, but be comforted by the fact that some errors can 
exist without the entire response cost program being severely 
compromised. 


Time-Out Tactics for Applied Settings 


3. There are two basic types of time-out: nonexclusion and 
exclusion. 


4. Nonexclusion time-out means that contingent on the occur- 
rence of the target behavior, the participant remains physi- 
cally within the time-in setting, but elements of that setting 
are changed so that (a) the opportunity to earn reinforcers 
is removed; (b) a specific reinforcer is terminated; (c) repo- 
sitioning occurs so that observations of ongoing activity 
continue, but without reinforcement; or (d) the individual 
is removed to a different space within the time-in setting. 


5. Nonexclusion time-out is typically implemented using one 
of four tactics: planned ignoring, terminate specific rein- 
forcer contact, contingent observation, and partition/select 
space time-out. 
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6. Exclusion time-out is the physical separation of the par- 
ticipant from the time-in environment when time-out goes 
into effect. 


7. There are two basic tactics for implementing exclu- 
sion time-out contingent on the occurrence of the 
target behavior: (a) The participant is removed imme- 
diately from the time-in environment, or (b) the 
time-in environment is removed immediately from the 
participant. 


8. Time-out is a desirable alternative for reducing behavior 
because of its ease of application, acceptability, rapid sup- 
_ pression of behavior effects, and ability to be combined 
with other approaches. 


Using Time-Out Effectively 


9. The time-in environment must be reinforcing if time-out is 
to be effective. 


10. Effective use of time-out requires that the behaviors 
leading to, the duration of, and exit criteria for time-out 
be explicitly stated. Further, practitioners must decide 
whether to use nonexclusion or exclusion time-out. 


11. The exit criteria for time-out can be conceptualized in 
three dimensions: (a) the duration of any individual time- 
out session, (b) the behavioral criteria for leaving time-out 
and returning to the time-in setting, and (c) the overall 
length of the time-out program. 


12. The duration of any individual time-out session, the 
behavioral criteria that prompt termination of time-out, 
and the duration of the time-out program need to be made 
explicit before launching time-out, and appropriate ethical 
and legal guidelines should be followed. 


13. In the majority of applications, permission is required 
before time-out can be implemented. 


14. Practitioners should be aware of legal and ethical 
considerations before implementing time-out. As a 
punishment procedure, it should be used only after 
positive reductive procedures have failed, and with 
planned monitoring, supervision, and evaluation 
considerations in place. 


Response Cost Defined 


15. Response cost is the response-contingent loss of a 
specific number of positive reinforcers that has the 
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KEY TERMS 


bonus response cost 
contingent observation 
exclusion time-out planned ignoring 


negative punishment response cost 
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nonexclusion time-out 


partition or select space time-out 


effect of decreasing the future occurrence of similar 
responses. 


16. Response cost constitutes a fine for misbehavior. 


17. Response cost typically involves the loss of one or more 
generalized conditioned reinforcers, tangibles, or minutes 
of already-earned time with a preferred activity. 


18. The punitive value of the lost reinforcers depends entirely on 
the individual reinforcement preferences of the participant. 


19. Response cost affords practitioners a number of desirable 
features. It is effective, meets the standard for least restric- 
tive alternative, produces rapid suppression of behavior, is 
convenient, can be combined with other procedures, and is 
socially valid. 


Response Cost Methods 


20. Response cost is implemented using two principal meth- 
ods. The practitioner can (a) levy a fine against an indi- 
vidual’s existing cache of previously earned reinforcers; 
or (b) levy a fine against an individual’s surplus of extra, 
noncontingently provided reinforcers. 


21. When a fine is levied against a person’s existing cache of 
positive reinforcers, the individual loses a specific number 
of positive reinforcers from a supply of currently available 
reinforcers. 


22. Bonus response cost operates when practitioners make 
additional reinforcers available noncontingently to the 
participant, specifically for removal when a target behavior 
occurs. 


Using Response Cost Effectively 


23. To use response cost effectively, practitioners should 
deliver fines immediately, decide whether bonus response 
cost is a preferred option, ensure reinforcer reserve, recog- 
nize the potential for unplanned or unexpected outcomes, 
avoid overusing response cost, and keep records on its 
effects. 


Response Cost Considerations 


24. Implementing response cost may increase aggression, 
produce avoidance responses, affect collateral reduc- 
tions of desired behaviors, call attention to the punished 
behavior, be unpredictable, and suffer treatment integrity 
challenges. 


terminate specific reinforcer contact 


time-out from positive reinforcement 
(time-out) 
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MULTIPLE-CHOICE QUESTIONS 


1. A punishment procedure has been put in place for Carlos. 4. When implementing the contingent observation program, 


Whenever he swears in class his teacher says, “Carlos, 
swearing is not allowed in this classroom,” and he is 
escorted to the time-out room where he has to stay for 

at least five minutes. His teacher has used the procedure 
for three weeks and his swearing has not decreased at all. 
What is the problem in this example? 


a. The time-out room is not functioning as 
punishment. 


b. Carlos should be placed in the time-out room for a 
longer period of time. 


c. His teacher needs to explain to him why he is being 
placed in time-out. 


d. His teacher has not done enough to reinforce the 
time-in setting. 

Hint: (See “Exclusion Time-Out’) 

. Before implementing a contingent observation program 

for cutting in line at the drinking fountain, Mrs. Salkey 

made sure to send a note home to guardians telling them 

about the program and asking the guardians to contact 

her if there were any objections. In addition she checked 

with her principal to make sure she was following the 

schoolwide behavior management program. Which part 

of using time-out effectively does this example best 

exemplify? 

a. Reinforce and enrich the time-in environment. 

b. Define the behaviors leading to time-out. 

c. Define the procedures for the duration of time-out. 

d. Explain the time-out rules. 

e. Evaluate effectiveness. 

f. Obtain permission. 

Hint: (See “Using Time-Out Effectively”) 

. When implementing a contingent observation program for 

cutting in line at the drinking fountain, Mrs. Salkey made 

sure to provide lots of tokens for students who waited their 

turn. Which part of using time-out effectively does this 

example best exemplify? 


a. Reinforce and enrich the time-in environment. 

b. Define the behaviors leading to time-out. 

c. Define the procedures for the duration of time-out. 
d. Explain the time-out rules. 

e. Evaluate effectiveness. 

f. Obtain permission. 

Hint: (See “Using Time-Out Effectively”) 


Mrs. Salkey kept a record of each student who cut in line 
every day for one week. By Friday, there were no students 
who cut in line. Which part of using time-out effectively 
does this example best exemplify? 


a. Reinforce and enrich the time-in environment. 

b. Define the behaviors leading to time-out. 

c. Define the procedures for the duration of time-out. 
d. Explain the time-out rules. 

e. Evaluate effectiveness. 

f. Obtain permission. 

Hint: (See “Using Time-Out Effectively”) 


. Mrs. Salkey used tokens for waiting along with a contingent 


observation program. This approach best exemplifies: 
a. Rapid suppression of behavior 

b. Combined approaches 

c. Ease of application 

d. Acceptability 

Hint: (See “Desirable Aspects of Time-Out”) 


. Mr. Marley docks the class one minute of extra recess time 


every time he has to say, “people quiet down please.” This 
best exemplifies which response cost method? 


a. Direct fines 

b. Bonus response cost 

c. Using a response cost with a group 

d. Combining with positive reinforcement 
Hint: (See “Response Cost Methods”) 


. In addition to using a response cost with an entire group, 


Mr. Marley is also using which procedure? 
a. Direct fines 

b. Bonus response cost 

c. Using a response cost with a group 

d. Combining with positive reinforcement 
Hint: (See “Response Cost Methods”) 


. Mrs. Webb uses a response-cost procedure in which a 


student loses all points toward participation in a monthly 
pizza party any time he or she says “shut up” to another 
student. As a result, whenever students lose their points, 
they say “shut up” more often. What is the problem? 


a. The magnitude of the fine is too great. 

b. The points weren’t functioning as reinforcers. 
c. The students became more aggressive. 

d. The fine should have been larger. 


Hint: (See “Using Response Cost Effectively”) 
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NOTES 


I. 


ESSAY-TYPE QUESTIONS 
1. 


As Type II punishment, response cost can be classified 
and is defined by function. Specifically, if the future 
frequency of the punished behavior is reduced by the 
response-contingent withdrawal of a positive reinforcer 
then a response cost has occurred. However, if the removal 
of the reinforcer increases the level of the behavior or 

has no effect on it, response cost has not occurred. This 
distinction between application and effect is the key aspect 
in the definition of response cost. Why is it important to 
make a distinction between application and effect? 


Hint: (See “Response Cost Defined’) 
Why is it important to reinforce functional alternatives to a 
problem behavior when using Type II punishment? 


Hint: (See “Using Time-Out Effectively” and “Response 
Cost Methods”’) 


The term time-out from positive reinforcement first appeared in the behay- 
ioral literature in a monograph authored by Charles Ferster (1958). He 
described five experiments with chimpanzees and pigeons. “The basic 
paradigm used . . . ‘time out’ from positive reinforcement as the aversive 
event. To study the effect of the time out, a simple response, such as 
a pigeon’s peck at a circular disk, or a chimpanzee’s press of a toggle 
switch, was maintained when occasional pecks or presses were followed 
with food. To generate the aversive event, reinforcement was withheld in 
the presence of one stimulus but alternately continued in the presence of 
a second stimulus. When the rate of responding fell to zero in the pres- 
ence of the stimulus where reinforcement was discontinued, that stimulus, 
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. Suppose you are a third-grade teacher of a classroom 


with 25 students. Three of the students in the classroom 
are causing problems. The problems include failure to 
complete assignments and talking during independent 
seat-work. Pick a Type II punishment and pair it with a 
reinforcement program that you will use for the entire 
class. Please refer to Powell and Powell’s (1982) time-out 
implementation checklist. 


Hint: (See “Using Time-out Effectively” and “Using 
Response Cost Effectively’’) 


. List the undesirable aspects of negative punishment and 


describe how one might minimize those aspects. 


Hint: (See “Response Cost Considerations”) 


called the time-out stimulus, could then be used as an aversive event and 
its effect on the base-line performance measured” (p. 1). 


. See the Association for Applied Behavior Analysis International’s 


Statement on Restraint and Seclusion, 2010. 


. In some situations, removing unconditioned and conditioned reinforcers 


(e.g., food, free time) from a person is legally and ethically inappropriate. 
For instance, it would be inadvisable to remove minutes from mandated, 
structured leisure or free time using response cost. To avoid any poten- 
tial problems, practitioners should consider another method, or they must 
obtain permission from the local human rights review committee by justi- 
fying the rationale for removing this reinforcer using response cost. 


PAR T 


Antecedent Variables 


Parts 4 and 5 detailed the effects of various types of stimulus changes immediately following 
behavior. The two chapters in Part 6 address the effects of stimulus conditions and changes that take 
place prior to a behavior occurring. Behavior does not occur in an environmental vacuum. Every 
response is emitted in the context of a particular set of antecedent conditions, and these antecedent 
events play a critical role in motivation and learning. 


What people do at any particular moment is partly a function of what they want at that moment. 


In Chapter 16, Jack Michael and Caio Miguel describe motivating operations, environmental 
variables that (a) alter the momentary effectiveness of some stimulus, object, or event as a reinforcer 
(or punisher); and (b) alter the current frequency of all behaviors that have been followed by that 
form of reinforcement. 

Although the defining feature of reinforcement is an increase in the future frequency of 
behavior, reinforcement produces a second important effect. Chapter 17 details how stimuli 
immediately preceding a response, or present during reinforcement, acquire an evocative function 
over future occurrences of the behavior. The chapter describes how behavior analysts use differential 
reinforcement under changing antecedent stimulus conditions to achieve the desired degrees of 
stimulus discrimination and generalization. 
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Motivating Operations 


Jack Michael and Caio Miguel 


LEARNING OBJECTIVES 


¢ Define an establishing operation and provide characteristics of an establishing operation. 


e Discuss the recent movement to the term motivating operation. 


¢ Describe the difference between motivating operations and discriminative relations. 


¢ Define unconditioned motivating operations. 


¢ State various examples of UMOs as related to the human organism. 


¢ Define conditioned motivating operations. 


* State the different classifications of CMOs and how they operate. 
e State various examples of CMOs as related to the human organism. 


¢ Name some general implications for the use of motivating operations in the study of behavior analysis. 


We (Jack and Caio) just love Mexican food, especially 
tacos. Of course, we do not think about Mexican food 

all the time, only around lunch- or dinnertime—in other 
words, when we have not had food for a while, or when we 
are hungry. However, sometimes, even if we have already 
eaten, just the sight of our favorite Mexican restaurant, or 
maybe even a TV ad about Mexican food, makes us want 
some. At those times, we look for a good Mexican restau- 
rant and feel quite satisfied when we find one. When we get 
to the restaurant, the sight of tortilla chips makes us want 
salsa. We can eat salsa at any time, but it doesn’t sound 
good unless we are eating some tortilla chips. After eating 
the chips and salsa, all we can think about is taking a sip 
of that ice-cold margarita. And when the chips are gone, 
finding the waiter to get some more, and to order our tacos, 
becomes the most important thing in the world. 


he scenario above suggests that the value of certain 

stimuli may vary conditionally with other environmen- 

tal factors. Mexican food is not wanted all the time, and 
behaviors that produce it (e.g., driving to the restaurant) vary 
depending on someone’s motivation. From a behavior analytic 
perspective, if a certain behavior is not occurring, there is a 
chance that reinforcing consequences are not being produced. 
However, even if consequences are available, their value or 
effectiveness can vary over time. Thus, the concept of rein- 
forcement alone cannot properly explain a person’s motivation 
to act (Michael, 1993). Going to a restaurant and ordering food 
invariably produces food. However, the likelihood of going toa 
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restaurant may vary depending on one’s motivation to eat. When 
we say someone is motivated or wants to do something, we are 
assuming that (a) what is wanted would function as a reinforcer 
at that moment, and (b) the current frequency of any behavior 
that has previously been so reinforced will increase. This chapter 
discusses motivation from a behavior analytic perspective. 


DEFINITION AND CHARACTERISTICS 
OF MOTIVATING OPERATIONS 


Early in his career, Skinner (1938) used the term drive to 
describe changes in behavior that were a function of motiva- 
tional variables. ! Although the term appeared in Keller and 
Schoenfeld’s (1950) didactic treatment of Skinner’s work, 
these authors placed an emphasis on the specific environmen- 
tal operations (i.e., deprivation, satiation, and aversive stimu- 
lation) that are necessary to establish the drive (Miguel, 2013). 
Although not conforming to the same usage set forth by Keller 
and Schoenfeld, the term establishing operation (EO) was 
reintroduced by Michael (1982, 1993) to describe any environ- 
mental variable that momentarily alters (a) the effectiveness 
of some stimulus, object, or event as a reinforcer; and (b) the 
current frequency of all behavior that has been reinforced by 
that stimulus, object, or event. Subsequently, Laraway, Snyc- 
erski, Michael, and Poling (2003) proposed the inclusive term 
motivating operation (MO) to encompass the fact that these 
variables serve not only to strengthen, but also to weaken, the 
value of a consequence as a reinforcer.” The term setting event 
has also been used to describe some of the variables that may 


fit under the MO definition (see Box 16.1: “What’s a Setting 
Event?’’). 
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BOX 16.1 


What's a Setting Event? 


The term setting event appeared in the behavior analytic lit- 
erature (e.g., Bijou & Baer, 1961; Kantor, 1959) to describe 
a set of variables that may affect stimulus-response relation- 
ships (e.g., McLaughlin & Carr, 2005). More specifically, this 
description pertains to variables that are not discrete, such as 
the presence/absence of immediate events or objects, depriva- 
tion/satiation, or recent social interactions, all of which may 
differentially affect specific functions of stimuli (Leigland, 
1984; Wahler & Fox, 1981). An example would be sleep depri- 
vation affecting rates of crying (Bijou & Baer, 1961). 

The term setting event seems to describe the same set of 
environmental variables that we have classified as motivating 
operations (MOs). However, no clear definition or functional 


Thus, an MO can be defined as an environmental vari- 
able that has two effects: value-altering and behavior-altering 
effects. The value-altering effect is either (a) an increase in the 
reinforcing effectiveness of some stimulus, object, or event, in 
which case the MO is an establishing operation (EO); or (b) a 
decrease in reinforcing effectiveness, in which case the MO is 
an abolishing operation (AO). The behavior-altering effect is 
either (a) an increase in the current frequency of behavior that 
has been reinforced by some stimulus, object, or event, called 
an evocative effect; or (b) a decrease in the current frequency of 
behavior that has been reinforced by some stimulus, object, or 
event, called an abative effect. In addition to frequency, other 
aspects of behavior such as response magnitude, latency, and 
relative frequency (occurrence per opportunity) can be altered 
by an MO. These relations are shown in Figure 16.1. 

Food deprivation is an EO that increases the effective- 
ness of food as a reinforcer (value-altering effect) and evokes 
all behavior that has been reinforced by food (behavior-altering 
effect). In other words, after not having food for a while, food 
becomes a reinforcer, and behaviors such as looking for food 
are increased. Food ingestion (consuming food) is an AO that 
decreases the effectiveness of food as a reinforcer (value-altering 
effect) and abates all behavior that has been followed by food 
reinforcement (behavior-altering effect). In other words, after 
eating, the reinforcing effectiveness of food is decreased, and so 
is looking for food (along with other previously learned behav- 
iors that have produced food). These changes of behavior are said 
to be momentary. For instance, food deprivation (MO) increases 
the value of food, as well as all behaviors that have led to food, 
for as long as the organism is deprived. As food is obtained, 
the effectiveness of food deprivation as an MO is decreased 
(the animal becomes less hungry), as evidenced by a reduction 
in (a) the value of food as a reinforcer and (b) food-seeking 
behaviors. 

Although the term hunger has been used to describe the 
effects of food deprivation, the term is simply a description 
(i.e., tact) of physiological changes that may accompany food 


description of the term has appeared in the literature, and 
most descriptions of setting events are topographical rather 
than functional (Leigland, 1984; Nosik & Carr, 2015; Smith 
& Iwata, 1997). 

The MO concept, on the other hand, describes any vari- 
able that has two specific effects, namely, value-altering and 
behavior-altering effects. Thus, “a greater evidential require- 
ment exists to classify an event as an MO than to classify it as 
a setting event.” (Nosik & Carr, 2015, p. 221). We agree with 
Nosik and Carr that the MO concept, with its functional defini- 
tion and increasing evidence of its utility, is a better taxonomic 
option for describing the variables subsumed under the term 
setting event. 


deprivation (e.g., hunger pangs). Although these physiological 
sensations may prompt someone to get food, deprivation oper- 
ations have direct value-altering and behavior-altering effects 
(Miguel, 2013). Thus, terms such as hunger and thirst are unnec- 
essary (intervening) variables that may obscure a behavior anal- 
ysis of motivation. The effects of the MO related to food are 
shown in Figure 16.2. 

Painful stimulation is an EO that increases the effective- 
ness of pain reduction as a reinforcer and evokes all behavior 
that has been reinforced by pain reduction. So, a headache 
would make its reduction (or elimination) an effective reinforcer, 
and would also evoke behaviors, such as taking an aspirin, that 
have produced headache removal in the past. Of course, these 
two effects would only be observed while the organism is expe- 
riencing a headache, and for this reason we refer to the effects of 
the MO as being momentary. Conversely, a decrease in painful 
stimulation is an AO that decreases the effectiveness of pain 
reduction as a reinforcer and abates all behavior that has been 
followed by pain reduction. In other words, once the headache is 
gone, its elimination no longer functions as a reinforcer, and the 
likelihood of taking an aspirin is greatly reduced. The effects of 
the MO related to painful stimulation are shown in Figure 16.3. 

Most of the statements in this chapter about value-altering 
and behavior-altering effects refer to relations involving rein- 
forcement rather than punishment. It is reasonable to assume 
that MOs also alter the effectiveness of stimuli, objects, and 
events as punishers, with either establishing or abolishing 
effects; and also alter the current frequency of all behavior that 
has been punished by those stimuli, objects, or events, either 
abating or evoking such behavior. Later in the chapter we will 
consider MOs for punishment. 


Isolating Value- and Behavior-altering Effects 


The value-altering effect of an MO influences the potency 
of a reinforcer to increase the future frequency of behavior 
(Laraway, Snycerski, Olson, Becker, & Poling, 2014). An MO’s 
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Motivating Operations 


Value-altering Behavior- 
Effect altering Effect 


Reinforcer- 
establishing 
Effect 


An increase in the 
current 
effectiveness of 
some stimulus, 
object, or event as 
a reinforcer 


Figure 16.1 Motivating 


operations and their two defining 
effects. 


value-altering effect is observed when the manipulation of cer- 
tain environmental variables (e.g., 4, 8, and 12 hours of food 
deprivation) affects the speed with which an organism acquires 
new behaviors that produce the relevant consequence (e.g., 
food). In this way, an MO directly changes the potency of rein- 
forcers (or punishers) to increase and maintain (or decrease and 
eliminate) behavior (Laraway et al., 2014). 

The value of a reinforcer may also vary depending upon 
specific response requirements (DeLeon, Frank, Gregory, & 
Allman, 2009). One way to assess the potency of a specific 
reinforcer is to use a progressive ratio (PR) schedule in which 
response requirements increase after the delivery of each rein- 
forcer (Roane, 2008; see Chapter 13). Under this arrangement, 
an organism may respond under high schedule requirements 
to obtain one consequence, but not another, even when both 
consequences have been observed to increase behavior under 
low response requirements (Roane, Lerman, & Vorndran, 2001). 
This procedure may serve to measure the momentary value of 
a reinforcer whose effectiveness varies depending on the MO. 
Other potential measures of reinforcer potency may include 
choice preparations, and latency to respond during a discrete 
trial (Poling, Lotfizadeh, & Edwards, 2017). 

The behavior-altering effect is interpreted as a change in 
frequency due to the organism’s behavior contacting reinforce- 
ment, implying that this change occurs only after the reinforcer 
has been obtained. However, a strong relation exists between 
the level of the MO and responding under extinction—when 
no reinforcers are being received (Keller & Schoenfeld, 1950). 
Early research (Skinner, 1938) showed that rats that had learned 
to press a lever to obtain food, when exposed to an extinction 
condition at different levels of deprivation, pressed the lever at 


Reinforcer- Evocative Abative 
abolishing Effect Effect 
Effect 
An increase in the A decrease in the 
A decrease in the current frequency current frequency 
current of all behavior of all behavior 
effectiveness of that has been that has been 
some stimulus, reinforced by that reinforced by that 
object, or event as stimulus, object, stimulus, object, 
a reinforcer or event or event 
seas ee Se Sey 


different rates according to their levels of deprivation. Thus, dif- 
ferent deprivation levels differentially evoked responses that had 
been previously learned. Similarly, if a child’s tantrum has been 
followed by a parent’s attention, the duration and intensity of the 
tantrum may be correlated with the amount of attention that the 
child has received before the tantrum. 

The value-altering and behavior-altering effects of an MO 
occur simultaneously but are considered independent in the 
sense that one does not derive from the other. Food deprivation, 
for instance, increases behaviors that have been previously rein- 
forced by food prior to obtaining any food (evocative effect). 
Food deprivation would also increase the reinforcing potency 
of food (the establishing effect). Thus, food-seeking behaviors 
would increase further after contacting food (Michael, 2000). 
Unfortunately, once behavior contacts reinforcement, these two 
effects may become difficult to discern (Poling et al., 2017). 


Direct and Indirect Effects of Motivating 
Operations 


The behavior-altering effect of an MO can increase (or decrease) 
the frequency of behavior because of a direct evocative (or aba- 
tive) effect. The strength of the MO seems to be correlated with 
its capacity to directly evoke previously learned behavior. For 
example, when food deprivation is high, food-seeking behav- 
iors may occur in situations that differ from the situations in 
which those behaviors were reinforced in the past (Lotfizadeh, 
Edwards, Redner, & Poling, 2012). An individual who has 
not eaten for a few hours is likely to request a snack from an 
acquaintance who usually carries some in her backpack (SP), 
However, under extreme food deprivation, the individual might 


Food Deprivation 
(Establishing Operation) 


Reinforcer-establishing effect: 
An increase in the reinforcing 
effectiveness of food 
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Evocative effect: 
An increase in the current 
frequency of all behavior that has 
been reinforced by food 
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Food Ingestion 
(Abolishing Operation) 


Reinforcer-abolishing effect: 
A decrease in the reinforcing 
effectiveness of food 


Abative effect: 
A decrease in the current 
frequency of all behavior that has 
been reinforced by food 


Figure 16.2 Motivating operations 
related to food. 


Reinforcer-establishing effect: 
An increase in the reinforcing 
effectiveness of pain reduction 


Increase in Painful Stimulation 
(Establishing Operation) 


Evocative effect: 
An increase in the current 
frequency of all behavior that has 
been reinforced by pain reduction 


Reinforcer-abolishing effect: 
A decrease in the reinforcing 
effectiveness of pain reduction 


Decrease in Painful Stimulation 


(Abolishing Operation) 


Abative effect: 
A decrease in the current 
frequency of all behavior that has 
been reinforced by pain reduction 


Figure 16.3 Motivating operations 


ask strangers for a snack, even though there has been no pre- 
vious history of obtaining food from them. It seems that “the 
range of stimuli that evokes responding is wider when depri- 
vation is high than when it is lower” (Lotfizadeh et al., 2012, 
p. 98). In other words, strong MOs can directly evoke behavior in 
the absence of relevant discriminative stimuli or, at least, in the 
presence of untrained stimuli. Thus, the direct evocative effect 


related to painful stimulation. 


of the MO may be described as its capacity to evoke behavior 
irrespective of relevant discriminative stimuli.? It is important 
to remember that individuals do not need to be “aware” of the 
effects of reinforcement for it to affect behavior. Likewise, the 
effectiveness of MOs does not depend on individuals being 
able to describe the effects of MOs upon their behavior (see 
Box 16.2, “Do You Need to Know You Are Motivated?’). 
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BOX 16.2 


Do You Need to Know You Are Motivated? 


The variables responsible for behavior-altering effects are 
often misconstrued by assuming that individuals must under- 
stand (verbally describe) the situation, and then behave appro- 
priately as a result of this understanding. It is safe to assume 
that when pressing a lever to obtain water, a rat does not under- 
stand that lever presses have been reinforced by water in the 
past, nor that it should press the lever when water deprived. 
Similarly, humans do not need to verbally describe the situa- 
tion for the MO to affect behavior. The fact that the effects of 
reinforcement upon behavior are automatic suggests that the 
behavior that has led to reinforcement will be momentarily 


The MO indirectly affects the evocative or abative 
strength of relevant discriminative stimuli (SPs). Food depri- 
vation, for instance, would increase the discriminative control 
of any stimulus that has been correlated with the availability 
of food. Therefore, the friend who carries snacks in her back- 
pack would serve as a stronger SP that would evoke a request 
when food deprivation is high. Also, the stronger the headache 
(MO), the more likely it is that the sight of the aspirin (as an SP) 
would evoke the behavior of taking it. Note that the value of 
reinforcers present during discrimination training also depends 
upon a current MO (Laraway et al., 2003). In other words, 
the establishment of stimulus control over behavior not only 
depends on but also varies as a function of MOs. For instance, 
a child would never learn to discriminate between blue and red 
if no effective reinforcer were available for correct responses. To 
summarize, in the absence of an MO that establishes the value 
of the consequence as a reinforcer, discrimination learning is 
unlikely to occur, and a previously effective S? would also not 
evoke behavior. 

The value-altering effect also indirectly influences the 
reinforcing effectiveness of relevant conditioned stimuli. For 
example, food deprivation not only increases the capacity of 
food as a consequence to strengthen behavior (direct establish- 
ing effect), but also enhances the reinforcing effectiveness of 
all other stimuli that have been correlated with food, such as 
the smell of food, the sight of a restaurant, and so on. In the 
same way, a headache would not only establish its removal as 
an effective reinforcer, but also all stimuli that have been paired 
with pain reduction, such as an aspirin, a nap, a dark environ- 
ment, and so forth. The latter effect, a conditioned one, will be 
discussed in the section on conditioned MOs, below. Figure 16.4 
displays the different direct and indirect effects of the MO. 


Behavior-altering versus Function-altering 
Effects* 


Both MOs and SPs are antecedent variables that have 
behavior-altering effects. Antecedents can evoke or abate 
responses, but their occurrence does not alter the organism’s 
functional behavioral repertoire. Thus, as antecedent variables, 
MOs alter the current frequency of all behaviors relevant to that 


increased when the relevant MO is in effect. Thus, just as 
the effects of reinforcing consequences do not depend on the 
organism’s understanding of the contingency, neither do the 
effects of the MO upon behavior require that the organism be 
able to describe the relationship between behavior and con- 
sequence. Someone who spends a lot of time studying may 
not be able to identify the contingencies that have led to the 
establishment of this behavior or serve to momentarily evoke 
it. The frequency (or duration) of studying is still a function of 
environmental variables, whether the individual can describe 
them or not. 


MO. For example, a difficult task may serve as an establishing 
operation that momentarily increases all behaviors that in the past 
have produced removal of the task. Presenting a difficult task to a 
child who has learned that tantrums often result in task postpone- 
ment or removal is likely to produce a tantrum (i.e., evocative 
effect). Conversely, if the task is not presented, or made eas- 
ier (an abolishing operation), then the likelihood of tantrums is 
diminished (i.e., abative effect). However, as an intervention 
strategy, the removal of the task does not alter the child’s reper- 
toire; it only momentarily abates behavior that would otherwise 
occur if the motivating variable (difficult task) were in place. In 
other words, the child will not behave differently in the presence 
of difficult tasks solely as a result of antecedent manipulations.° 
Thus, reducing or eliminating the MO for a specific problem 
behavior is a temporary rather than permanent solution. When 
the MO is again in place, the problem behavior may re-occur 
(Michael, 2000). Nonetheless, eliminating environmental con- 
ditions (e.g., MOs) that contribute to the occurrence of problem 
behavior may be necessary,° especially when these conditions 
are impoverished or aversive (McGill, 1999). 

Consequences. can change the organism’s repertoire, 
enabling the organism to behave differently in the future. Thus, 
consequences have function-altering effects. These variables 
include reinforcers, punishers, and the occurrence of a response 
without its reinforcer (extinction procedure), or without its pun- 
isher (recovery from punishment procedure). Consequences alter 
the future frequency of whatever behavior immediately preceded 
those consequences. In the previous example, if task removal or 
reduction of task difficulty is made contingent upon the child 
asking for a break or help, the frequency of these behaviors 
would increase under similar conditions in the future. In other 
words, the child’s functional repertoire is changed as he or she 
can engage in two new responses: asking for a break or asking 
for help. 


DISTINGUISHING BETWEEN MOs AND SPs 


As described above, MOs and Ss are both antecedent vari- 
ables that alter the current frequency of some particular behay- 
ior (i.e., evoke). Each operant variable influences response 
frequency because of its relation to reinforcing or punishing 
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Direct Establishing Effect 
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Food deprivation directly establishes food as a reinforcer (value-altering effect) and evokes food-seeking behaviors (behavior-altering 
effect). Food deprivation indirectly establishes the effectiveness of all conditioned reinforcers correlated with food (value-altering effect) 
and indirectly affects the salience of the discriminative stimulus correlated with food (behavior-altering effect). 


Figure 16.4 Direct and indirect effects of a motivating operation (MO). 


consequences. An SP affects behavior because its presence has 
been correlated with the differential availability of an effec- 
tive reinforcer in the past. Differential availability means the 
relevant consequence has been available in the presence of the 
sp (i.e., reinforcement) and unavailable in its absence (i.e., 
extinction; see Chapter 24). 

For instance, if a food-deprived rat receives food for lever 
presses in the presence, but not absence, of a light, the light will 
eventually evoke lever presses. In this case, the light would be 
functioning as an SP. since food delivery was more available in 
its presence than in its absence. Note that although food was 
never delivered when the light was off (the S4 condition), food 
would have served as an effective reinforcer had it been deliv- 
ered when the light was off. In other words, food was valued 
(i.e., reinforcing) in both conditions, but available only when 
the light was on. The reduced frequency of lever presses during 
the S4 condition is due to extinction. 

An MO, in contrast, controls behavior because of its 
relation to the differential effectiveness of a reinforcer for that 
behavior. Differential effectiveness means that the relevant con- 
sequence has been effective in the presence of, and ineffective 
in the absence of, the MO. In the example above, the rat would 
press the lever only in the presence of the light (S?) when the 
MO is present (.e., food deprived) and not when the MO is 
absent (i.e., food satiated). When the MO is absent, behavior 
decreases not because the reinforcer 1s unavailable (extinction), 
but because it is no longer valuable or effective. 

Now imagine that the rat learns to press the lever to 
eliminate shock. In this case, pressing the lever produces pain 
reduction (or elimination) as a form of reinforcement. If pain- 
ful stimulation as an antecedent were to be considered an S?, 
pain reduction (the reinforcer) must have been available as a 


consequence in the presence of painful stimulation, but not in 
its absence. More importantly, like food in the previous exam- 
ple, pain reduction would have been effective as a reinforcer in 
both conditions, regardless of its availability. It is with respect 
to this requirement that some motivating variables fail to qual- 
ify as discriminative stimuli. Pain reduction could not serve as 
a reinforcer in the absence of pain as an antecedent.’ In other 
words, in the absence of painful stimulation, behavior has not 
decreased due to extinction, but because there is no possible 
valuable reinforcer. The organism is not motivated to engage in 
behavior to reduce pain, if no pain is being felt. 

In the previous example of light-on/light-off discrimination 
training, the reinforcer is available in the light-on condition and 
unavailable, yet still effective, in the light-off condition. In other words, 
the rat still “wants food” when the light is off but food is not available. 
The behavior occurs less frequently in the light-off (S4) condition 
because it no longer produces an effective reinforcer. In contrast, in 
the absence of food deprivation or painful stimulation, food or pain 
removal, respectively, is not an effective reinforcer. Hence, food depri- 
vation and painful stimulation qualify as MOs because they momen- 
tarily alter (a) the effectiveness of some stimuli, objects, or events as 
reinforcers (value-altering effect); and (b) the frequency of the behavior 
that has been reinforced by those stimuli, objects, or events (behavior- 
altering effect). 

Similarly, an itch does not serve as a discriminative stimu- 
lus that evokes scratching it. Although the reinforcer (removing 
the itch) is available in the presence of the itch and unavailable 
in the absence of the itch, this is not an effective reinforcer in the 
absence of the itch. In this sense, the itch is best interpreted as 
functioning as an MO that establishes its removal as an effective 
reinforcer, and momentarily evokes scratching. In other words, 
having an itch is what motivates you to scratch. 
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Many antecedent stimuli that have been considered to 
evoke behavior as S?s would be better classified as MOs. This 
is particularly the case with aversive stimuli. For example, a 
headache is an MO that directly establishes headache removal 
as an unconditioned reinforcer and indirectly establishes aspirin 
as a conditioned reinforcer (see Figure 16.5). A headache also 
evokes behaviors that have produced pain reduction (finding and 
taking aspirin) in the past. If aspirin is found in the bathroom but 
not in the kitchen, then the bathroom will function as an SP for 
the behavior of looking for aspirin because it has been correlated 
with availability of an effective reinforcer, while the kitchen will 
function as an S4 for looking for aspirin because it has been 
correlated with the unavailability of an effective reinforcer. 
In other words, for discrimination to take place, an individual 
needs to find aspirin in the bathroom, and not find aspirin in the 
kitchen, respectively, when needed (MO; see first two rows of 
Figure 16.5). In the absence of the headache (AO), one can still 
find aspirin in the bathroom, but there is no reason to look for 
it (available, but not a reinforcer), since there is no headache to 
be removed (unavailable non-reinforcer). Thus, in the absence 
of the headache, looking for an aspirin will not occur in either 
location (see bottom row of Figure 16.5). 

In applied settings, when a child is given an instruction 
to complete a difficult task, the reinforcer may be task com- 
pletion or termination. Additionally, the instruction would also 
acquire conditioned aversive properties due to its correlation 
with difficult tasks. Thus, the instruction could serve as an MO 
that establishes its removal as a reinforcer and evokes avoidance 
behavior (Carbone, Morgenstern, Zecchin-Tirri, & Kolberg, 
2010). In contrast, if task completion has produced praise as a 
reinforcer in the presence, but not the absence, of the instruction, 
then the instruction may be functioning as an S?. Other events 
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Figure 16.5 Distinguishing between S° and MO. 
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> Find Aspirin 


best classified as MOs rather than S?s will be discussed later 
in this chapter, most specifically in the section on conditioned 
motivating operations. 

Although behavior analysts use the three-term con- 
tingency (antecedent-behavior-consequence) to understand 
operant behavior, any given response may be evoked by a mul- 
titude of variables functioning as antecedents (i.e., multiple 
control). A child who enters the kitchen and asks for a cookie 
does so because in the past such behavior has produced rein- 
forcement under similar circumstances. These circumstances 
involve variables that establish cookies as reinforcers, as well 
as increase the likelihood of asking for them (food deprivation 
as an MO). However, the behavior is more likely to occur in 
environments where cookies are available (kitchen as an SP) 
and in the presence of an adult (also an SP) who can give the 
child a cookie (Skinner, 1957). In this case, food deprivation 
is correlated with increasing the effectiveness of cookies as 
reinforcers, while the kitchen and the adult are both correlated 
with the availability of cookies. 

To summarize, SPs alter the differential availability of a 
currently effective form of reinforcement for a particular type of 
behavior; MOs alter the differential reinforcing effectiveness of 
a particular type of environmental event. In nontechnical terms, 
an SP tells you that something you want is available; an MO 
makes you want something. 


UNCONDITIONED MOTIVATING 
OPERATIONS (UMOs) 


For all organisms there are events, operations, and stimulus 
conditions with unconditioned value-altering effects. In other 
words, these MOs establish or abolish the value of stimuli 
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Behavior 2 Reinforcer 
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as reinforcers in the absence of prior learning. For example, 
humans are born with the capacity to be affected by food rein- 
forcement as a result of food deprivation and by pain reduc- 
tion reinforcement as a result of pain onset or increase. Thus, 
food deprivation and painful stimulation are unconditioned 
motivating operations (UMOs).® 

MOs are classified as unconditioned based upon the 
unlearned aspect of their value-altering effects, since the behavior- 
altering effects of MOs are usually learned. Said another way, 
we are born with the capacity to be affected by food reinforce- 
ment as a result of food deprivation, but we learn the behaviors 
evoked by food deprivation—asking for food, going to where 
food is kept, cooking, and so forth. 

Deprivation of food, water, oxygen, activity, and sleep 
all have reinforcer-establishing and evocative effects (see 
Table 16.1). For instance, water deprivation momentarily estab- 
lishes the effectiveness of water as a reinforcer and evokes 
all behaviors that have produced water. By contrast, food and 
water ingestion, oxygen intake, being active, and sleeping have 
reinforcer-abolishing and abative effects (see Table 16.2). For 
instance, water ingestion abolishes the effectiveness of water as 
a reinforcer and abates all behaviors that have produced water. 

The passage of time since last sexual activity functions 
as a UMO relevant to sexual stimulation. This form of depriva- 
tion establishes the effectiveness of sexual stimulation as a rein- 
forcer and evokes behavior that has produced such stimulation. 
In contrast, sexual orgasm is a UMO that abolishes (weakens) 
the effectiveness of sexual stimulation as a reinforcer and abates 
(decreases the frequency of) behavior that has achieved that kind 
of reinforcement. In addition, tactile stimulation of erogenous 


TABLE 16.1 
and Evoke Relevant Behavior 


UMO 


Food deprivation 


Reinforcer-Establishing Effect 


Water deprivation 
Sleep deprivation 
Activity deprivation 


Oxygen deprivation® 
reinforcer 


Sex deprivation 
reinforcer 


Becoming too warm 
as a reinforcer 


Becoming too cold 
as a reinforcer 


Increase in painful 


stimulus as a reinforcer 


Increases effectiveness of food as a reinforcer 
Increases effectiveness of water as a reinforcer 
Increases effectiveness of sleep as a reinforcer 
Increases effectiveness of activity as a reinforcer 
Increases effectiveness of breathing (oxygen) as a 
Increases effectiveness of sex stimulation as a 
Increases effectiveness of temperature decrease 
Increases effectiveness of temperature increase 


Increases effectiveness of pain decrease 
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regions of the body seems to function as a UMO in making 
further similar stimulation even more effective as a reinforcer, 
and in evoking all behavior that in the past has achieved such 
stimulation. 

UMOs related to temperature changes include becoming 
uncomfortably cold and uncomfortably warm. Getting cold is a 
UMO that establishes getting warmer as a reinforcer and evokes 
behavior that has produced warmth, such as putting on a jacket 
or moving closer to a heat source. A return to a comfortable (i.e., 
nonaversive) temperature is a UMO that abolishes increased 
warmth as a reinforcer and abates behavior that has produced 
warmth. In this case, behaviors such as putting on a jacket are 
less likely to occur. Getting uncomfortably warm is a UMO that 
establishes a cooler temperature as an effective reinforcer and 
evokes behaviors that have resulted in a body-cooling effect. 
For example, getting warm may evoke the behavior of turn- 
ing on the air conditioning. A return to a normal temperature 
abolishes being cooler as a reinforcer and abates body-cooling 
behavior. In this case, turning on the air conditioning is less 
likely to occur. 

An increase in painful stimulation establishes pain reduc- 
tion as a reinforcer and evokes the (escape) behavior that has 
achieved such reduction. As described previously, a headache 
establishes its own removal as a reinforcer and evokes any 
behavior that has eliminated headaches, such as taking aspirin. 
A decrease in painful stimulation abolishes the effectiveness of 
pain reduction as a reinforcer and abates the behavior that has 
been reinforced by pain reduction. Thus, when a headache is 
gone, headache removal is no longer a reinforcer and taking an 
aspirin is less likely to occur. 


Unconditioned Motivating Operations (UMOs) That Increase Reinforcer Effectiveness 


Evocative Effect 


Increases current frequency of all behavior previously 
reinforced with food 


Increases current frequency of all behavior previously 
reinforced with water 


Increases current frequency of all behavior previously 
reinforced with being able to sleep 


Increases current frequency of all behavior previously 
reinforced with activity 


Increases current frequency of all behavior previously 
reinforced with being able to breathe 


Increases current frequency of all behavior previously 
reinforced with sexual stimulatio 


=) 


Increases current frequency of all behavior previously 
reinforced with becoming cooler 


Increases current frequency of all behavior previously 
reinforced with becoming warmer 


Increases current frequency of all behavior previously 
reinforced with a decrease in painful stimulation 


Tt is not actually oxygen deprivation that functions as a UMO, but rather the buildup of carbon dioxide in the blood as a result of not being able to excrete 


carbon dioxide because of not being able to breathe or because of breathing in air that is as rich in carbon dioxide as the exhaled air. 
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TABLE 16.2  Unconditioned Motivating Operations (UMOs) That Decrease Reinforcer Effectiveness 


and Abate Relevant Behavior 


UMO 


Food ingestion (after food 
deprivation) 


Reinforcer-Abolishing Effect 


Water ingestion (after 
water deprivation) 


Sleeping (after sleep 
deprivation) 
Being active (after activity 


deprivation) reinforcer 


Breathing (after not being 


able to breathe) as a reinforcer 


Orgasm or sexual stimula- 


tion (after sex deprivation) as a reinforcer 


Becoming cooler (after 


being too warm) decrease as a reinforcer 


Becoming warmer (after 


being too cold) increase as a reinforcer 


Painful stimulation 


decrease (while in pain) as a reinforcer 


In addition to establishing pain reduction as a reinforcer 
and evoking the behavior that has produced pain reduction, pain- 
ful stimulation may evoke aggressive behavior toward another 
organism.” Painful stimulation may function as a UMO that 
establishes events such as signs of damage as effective rein- 
forcers and evokes behaviors that have been reinforced by such 
signs. Skinner (1953) made this case in his analysis of anger and 
extended it to the emotions of love and fear.!° 

Although it is indisputable that both food deprivation/ 
satiation and painful stimulation/reduction have unlearned value- 
altering effects, restriction of and access to certain conditioned 
reinforcers also seem to affect their value. This is the case with 
social reinforcers, such as praise (Vollmer & Iwata, 1991). It could 
be argued that for infants, attention (e.g., touch, parent’s voices) is 
a form of unconditioned reinforcer whose value may be affected 
by deprivation and satiation. However, for most adults, attention 
functions as a conditioned reinforcer due to its relation to other 
forms of effective reinforcers (Michael, 2000). In this case, access 
and restriction to attention (or any other conditioned reinforcer) 
would function as a type of conditioned MO, as discussed below. 


MOs FOR PUNISHMENT 


MOs may also alter (increase or decrease) the punishing effec- 
tiveness of a stimulus, as well as the frequency (evoke or abate) of 
behaviors that have been punished by that stimulus. Thus, an EO 
related to punishment would have a punisher-establishing effect 
and an abative effect. For example, a migraine would estab- 
lish the effectiveness of bright light as a punisher and decrease 
the frequency of all behaviors that in the past have produced 
the bright light (e.g., opening the curtains, turning a light on). 
Some biological states, such as pregnancy, may establish cer- 
tain tastes or smells as punishers and decrease the occurrence of 


Decreases effectiveness of food as a reinforcer 
Decreases effectiveness of water as a reinforcer 
Decreases effectiveness of sleep as a reinforcer 
Decreases effectiveness of activity as a 
Decreases effectiveness of breathing (oxygen) 
Decreases effectiveness of sexual stimulation 
Decreases effectiveness of temperature 
Decreases effectiveness of temperature 


Decreases effectiveness of pain decrease 


Abative Effect 


Decreases current frequency of all behavior previously 
reinforced with food 


Decreases current frequency of all behavior previously 
reinforced with water 


Decreases current frequency of all behavior previously 
reinforced with sleep 


Decreases current frequency of all behavior previously 
reinforced with activity 


Decreases current frequency of all behavior previously 
reinforced with being able to breathe 


Decreases current frequency of all behavior previously 
reinforced with sexual stimulation 


Decreases current frequency of all behavior previously 
reinforced with becoming cooler 


Decreases current frequency of all behavior previously 
reinforced with becoming warmer 


Decreases current frequency of all behavior previously 
reinforced with a decrease in pain 


behaviors that have produced them. An AO related to punish- 
ment would have a punisher-abolishing effect and an evocative 
effect, in that behaviors that have been punished would occur 
again (assuming that a reinforcer contingency for the previously 
punished behavior is intact). Ingesting alcohol may decrease the 
effectiveness of social disapproval as a punisher and increase 
behaviors that have produced some social embarrassment 
(assuming those behaviors also produced some reinforcement 
in the past). 

Most punishers that affect humans are effective because 
of a learning history; that is, they are conditioned rather than 
unconditioned punishers. In the case of conditioned punishers 
that were established by having been paired with the removal (or 
reduced availability) of reinforcers (i.e., negative punishment), 
their effectiveness as conditioned punishers will depend on the 
same MOs that establish the value of the reinforcers that were 
removed. For instance, a parent’s disapproving look may func- 
tion as a conditioned punisher as a result of having been paired 
with restricted access to video games. The disapproving look 
will be most effective as a conditioned punisher when an MO for 
playing video games is in effect. If the child has already played 
enough video games for the day (i.e., is “‘satiated”), then the dis- 
approving look will be less effective in reducing behavior. Thus, 
social disapproval (e.g., a frown, a head shake, or vocal response 
such as “No” or “Bad”) that typically precedes withholding of a 
reinforcer will only function as an effective conditioned punisher 
when the MOs for the withheld reinforcers are in effect. How- 
ever, it is possible for a stimulus to be associated with several 
different punishers. In this case, this stimulus may function as 
a generalized conditioned punisher, whose effectiveness will be 
almost free from the influence of MOs. If the disapproving look 
has been associated with the loss of a wide range of reinforcers, 
including generalized reinforcers such as money, then the look 


may almost always function as a punisher, since at least one of 
the reinforcers will be valuable at any given moment. 

The same applies for the negative punishment procedures 
time-out and response cost (see Chapter 15). Time-out will only 
be effective when “time-in” serves as a reinforcer when the 
behavior is being punished. Solnick, Rincover, and Peterson 
(1977) compared the effectiveness of time-out when the time-in 
environment was either impoverished or enriched when trying 
to decrease self-injury and stereotypy in a child diagnosed with 
intellectual disabilities. The impoverished time-in condition 
looked no different from the child’s typical treatment session, 
in which correct responses (sorting blocks) were reinforced 
with praise, food, and toys. During the enriched time-in con- 
dition, the experimenters played music, introduced new toys, 
and increased social interaction by continuously prompting the 
participant to play with the toys. Time-out consisted of 90 sec 
of no access to reinforcers contingent upon problem behay- 
ior. Their results showed that time-out was effective when the 
time-in was enriched and generally ineffective when time-in 
was impoverished. This suggests that the effectiveness of 
a negative punishment procedure depends upon the MO for 
the reinforcer being eliminated. The same can be said about 
response cost, unless the events removed contingent upon 
behavior are effective as reinforcers at the time the procedure 
is implemented, response cost will not serve as an effective 
punishment procedure. 

In general, when observing a punishment effect, one 
must also consider the status of the reinforcer responsible for 
the occurrence of the punished behavior. A behavior is targeted 
for reduction only if it has been occurring and its occurrence is 
a function of reinforcement. For instance, disruptive behavior 
maintained by attention may increase in frequency when it 
has not received much attention (MO for reinforcement). Any 
punishment procedure targeting this disruptive behavior will 
have to be delivered when the MO for attention is in effect. 

Suppose a time-out procedure had been used to punish 
disruptive behavior. As mentioned earlier, only when the MO 
relevant to the reinforcer available (attention during time-in) was 
in effect should we expect time-out to function as punishment. 
And then only if those MOs were in effect should we expect 
to see the abative effect of the punishment procedure on dis- 
ruptive behavior (1.e., immediate reduction in behavior). In the 
absence of the MO for disruptive behavior (lack of attention), 
there would be no behavior to be abated. 

Although these complex behavioral relations have not 
received much attention in the conceptual, experimental, or 
applied behavior analytic literatures, they seem to follow nat- 
urally from existing knowledge of reinforcement, punishment, 
and motivating operations. Behavior analysts should be aware 
of the possible participation of these behavioral relations in any 
situation involving punishment.' ! 


MULTIPLE EFFECTS OF MOs 


Most environmental events have more than one effect upon 
behavior. It is important to recognize these various effects 
and to not confuse one effect with another (Skinner, 1953, 
pp. 204-224). Multiple effects are readily apparent in an animal 
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laboratory demonstration of an operant chain. A food-deprived 
rat pulls a cord hanging from the ceiling of the chamber. Pulling 
the cord produces an auditory stimulus such as a buzzer. In the 
presence of the buzzer, the rat presses a lever and receives a food 
pellet. The onset of the buzzer has two obvious effects: (1) It 
evokes the lever press, and (2) it increases the future frequency 
of cord pulling. The buzzer functions as an S? for the evoca- 
tive effect and as a conditioned reinforcer for the reinforcement 
effect. Both effects influence behavior in the same direction: 
an increase in the current frequency and an increase in future 
frequency of the lever press. 7 Similarly, a hungry person may 
open the fridge to find her roommate’s leftover Mexican food. 
The sight of the Mexican food evokes behaviors associated with 
eating it (e.g., getting it out of the fridge, heating it up), as well 
as possibly increasing the future frequency of opening the fridge. 

Similarly, if pressing the lever in the presence of the buzzer 
produces an electric shock, then the sound of the buzzer would 
function as a discriminative stimulus for punishment (S??). The 
buzzer would then have abative effects on the current frequency 
of lever pressing and function as a conditioned punisher that 
decreases the future frequency of cord pulling. Likewise, if the 
leftover food is spoiled, then it abates the behavior of eating it 
as well as decreases the behavior of opening the fridge to look 
for food. 

Like SPs, events that function as UMOs will typically 
have behavior-altering effects on the current frequency of a 
type of behavior and, when presented as consequences, will have 
function-altering effects on the future frequency of behavior 
that immediately preceded their onset. An increase in painful 
stimulation will, as an antecedent, have MO establishing effects 
that would increase the current frequency of all behavior that 
has alleviated pain and, as a consequence, would decrease the 
future frequency of whatever behavior immediately preceded 
the increased pain. 

As in the example above, events that have a UMO evoc- 
ative effect, such as painful stimulation, will also function as 
punishment for the response immediately preceding the onset of 
the event. However, this statement must be qualified for events 
that have such gradual onsets (like food or water deprivation) 
that they cannot easily function as consequences. In other words, 
even though food deprivation can evoke all behaviors that have 
produced food in the past, it would likely not be effective as 
a punishing consequence to decrease the future frequency of 
behavior. 

Events that have MO abolishing effects as antecedents may 
have reinforcing effects when presented as consequences for 
behavior. For example, food consumption momentarily decreases 
the reinforcing value of food as an effective reinforcer (abolish- 
ing effect), as well as momentarily decreases behaviors that have 
produced food in the past (abative effect). As a consequence, 
food consumption increases the future frequency of behaviors 
that have produced it (reinforcing effect). Thus, it is important 
to note that when edibles are used as consequences for increas- 
ing behavior (reinforcement effect), the repeated consumption of 
edibles decreases their value as an effective reinforcer (abative 
effect). Similarly, if an item is removed in an attempt to increase 
its value as a reinforcer (establishing effect), its removal may 
reduce the future frequency of whatever behavior immediately 
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preceded the removal (punishing effect). Tables 16.3 and 16.4 
illustrate the multiple effects of environmental events. 


CONDITIONED MOTIVATING 
OPERATIONS (CMOs) 


Motivating variables that alter the reinforcing effectiveness of 
other stimuli, objects, or events as a result of the organism’s 
learning history are called conditioned motivating operations 
(CMOs). As with UMOs, CMOs alter the momentary frequency 
of all behavior that has been reinforced by those other events. 


In commonsense terms, some environmental variables, as a 
product of our experiences, make us want something different 
from what we wanted prior to encountering those variables, and 
encourage us to try to obtain what we now want. For example, 
needing to enter a room with a locked door establishes the key 
to the lock as an effective reinforcer. A locked door is a CMO 
because its value-altering effect is a function of a learning his- 
tory involving doors, locks, and keys. 

There seem to be at least three kinds of CMOs, all of 
which are motivationally neutral events prior to their relation 
to another MO or to a form of reinforcement or punishment. 


TABLE 16.3 Contrasting the Behavior-altering (MO Evocative) and Function-altering (Punishing) Effects 


of Environmental Events 


Environmental Event 


Deprivation of food, water, 
Sleep, activity, or sex 


Oxygen deprivation 


Becoming too cold 


Becoming too warm 


Increase in painful stimulation 


Evocative Effect on Current Behavior 
as a UMO 


Increases the current frequency of all behavior 
that has been reinforced with food, water, 
sleep, activity, or sex 


Increases the current frequency of all behavior 
that has been reinforced with being able to 
breathe 


Increases the current frequency of all behavior 
that has been reinforced by becoming warmer 


Increases the current frequency of all behavior 
that has been reinforced by becoming cooler 


Increases the current frequency of all behavior 
that has been reinforced with pain reduction 


Function-altering Effect on Future Behavior 
as Punishment 


Should be punishment, but onset Is too gradual 
to function as a behavioral consequence 


The sudden inability to breathe decreases the 
future frequency of the type of behavior that 
preceded that instance of being unable to 
breathe 


Decreases the future frequency of the type of 
behavior that preceded that instance of becom- 
ing too cold 


Decreases the future frequency of the type of 
behavior that preceded that instance of becom- 
ing too warm 


Decreases the future frequency of the type of 
behavior that preceded that instance of painful 
stimulation 
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TABLE 16.4 Contrasting the Behavior-altering (MO Abative) and Function-altering (Reinforcing) Effects 


of Environmental Events 


Environmental Event 


Ingestion of food and water, 
sleeping, being active, or 
having sex 


Breathing (after not being able 
to breathe) 


Becoming cooler (after being 
too warm) 


Becoming warmer (after being 
too cold) 


Decrease in painful stimulation 


Abative Effect on Current Behavior 
as a UMO 


Decreases the current frequency of all behav- 
ior that has been reinforced with food, water, 
sleep, activity, or sex 


Decreases the current frequency of all behavior 
that has been reinforced with (oxygen) being 
able to breathe 


Decreases the current frequency of all behavior 
that has been reinforced by 
becoming cooler 


Decreases the current frequency of all behavior 
that has been reinforced by becoming warmer 


Decreases the current frequency of all behavior 
that has been reinforced with pain reduction 


Function-altering Effect on Future Behavior 
as Reinforcement 


Increases the future frequency of the type of 
behavior that immediately preceded the food or 
water ingestion, being able to sleep, being able 
to be active, or sexual activity 


The sudden ability to breathe increases the 
future frequency of the type of behavior that 
preceded that instance of ability to breathe 


Increases the future frequency of the type of 
behavior that preceded that instance of becom- 
ing cooler 


Increases the future frequency of the type of 
behavior that preceded that instance of becom- 
ing warmer 


Increases the future frequency of the type of 
behavior that preceded that instance of pain 
decrease 


SS). 


The distinction among these conditioned motivating operations 
is based upon how they were developed and how they affect 
behavior. A surrogate conditioned motivating operation (CMO) 
produces the same effects as another MO because of some type 
of pairing (i.e., a surrogate is a substitute or stand-in), a reflexive 
CMO alters a relation to itself (makes its own removal effec- 
tive as reinforcement), and a transitive CMO makes something 
else effective as reinforcement (rather than altering itself). 


Surrogate CMO (CMO-S) 


A CMO-S is a previously neutral stimulus that acquired its MO 
effects by being paired with a UMO. This process is similar 
to how respondent conditioned stimuli, operant conditioned 
reinforcers, and operant conditioned punishers acquire their 
functions by being paired with another behaviorally effective 
stimulus. 

For example, a restaurant may exert discriminative con- 
trol over several behaviors, including ordering food, since 
restaurants have been differentially correlated with the avail- 
ability of food. However, because we typically go to restaurants 
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after a period in which we have not had any food (i.e., food 
deprivation as a UMO), it is possible that after multiple pair- 
ings with food deprivation, the restaurant would function as a 
CMO-S, (a) establishing food as a reinforcer and (b) evoking 
behaviors that have produced food (e.g., ordering some). Fig- 
ure 16.6 exemplifies the establishment of the restaurant as a 
CMO-S. 

However, the existence of this type of CMO seems to be 
opposed to the organism’s best interest for survival (Mineka, 
1975). For example, ordering food when satiated, as in the 
example above, would not seem healthy. However, a num- 
ber of animal studies have shown that a stimulus paired with 
a reinforcer (e.g., food) under a high (e.g., 24 h) deprivation 
condition is preferred over a stimulus that was paired with the 
same reinforcer amount under a low (e.g., 12 h) deprivation 
condition (e.g., Lewon & Hayes, 2014; Vasconcelos & Urcuioli, 
2008). In other words, when two stimuli are paired with the 
same reinforcer, but under different MO conditions, subjects 
that are mildly deprived chose the stimulus correlated with high- 
deprivation levels. It is possible that in addition to the food 
deprivation (mild deprivation), the presence of the stimulus 


Reinforcer-establishing effect: 
An increase in the reinforcing 
effectiveness of food 


Food Deprivation 
(Unconditioned Establishing 


Operation) 


Evocative effect: 
An increase in the current 
frequency of all behavior that has 
been reinforced by food 


Food Deprivation Pairing 


(UMO) 


Restaurant 
(Neutral Stimulus) 


Reinforcer-establishing effect: 
An increase in the reinforcing 
effectiveness of food 


Restaurant 
(Conditioned Establishing 
Operation) 


Evocative effect: 
An increase in the current 
frequency of all behavior that has 
been reinforced by food 


Figure 16.6 Example of a surrogate 
CMO (CMO-S). 
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correlated with high deprivation acts as a CMO-S, which makes 
the subjects behave as if they are “hungrier” than they would 
be otherwise, and this might be why they tend to respond more 
frequently to the high-deprivation alternative. Durlach, Elliman, 
and Rogers (2002) have found that adults under the same MO 
condition chose a flavored and colored drink that had been cor- 
related with a high-value UMO for liquids (high-salt meal) over 
a drink that had been paired with a low-value UMO (low-salt 
meal), suggesting that the flavor and color acquired CMO-S 
properties. 

O’Reilly, Lancioli, King, Lally, and Dhornhnaill (2000) 
found that aberrant behavior (pushing, pinching, property 
destruction, and self-injury) displayed by two individuals with 
developmental disabilities was maintained by attention when 
their parents were interacting with a third person (diverted atten- 
tion). Given that parents provided low levels of attention when 
they were talking to a third person, diverted attention was paired 
with low attention (MO). This may have led to diverted attention 
functioning as a CMO-S. Figure 16.7 shows that for both par- 
ticipants, problem behavior that never occurred during the brief 
functional analysis did so when the diverted attention condition 
(CMO-S) was introduced. Providing attention noncontingently 
(abolishing operation) produced significant reductions in prob- 
lem behavior. Follow-up observations showed reduced levels of 
problem behavior. These results seem to suggest that previously 
neutral stimuli may acquire MO functions due to being paired 
with another effective MO. 

In a translational evaluation of the CMO-S, Lanovaz, 
Rapp, Long, Richling, and Carroll (2014) repeatedly paired a 
poster board with the presence of a condition (last 2 minutes of 


each session) known to function as an EO for stereotypy. After 
this pairing procedure, three out of four participants diagnosed 
with autism displayed higher rates of stereotypy in the pres- 
ence of the poster board. These results have important applied 
implications, as there may be many environmental features in 
a therapeutic setting that could acquire CMO-S control over 
problem behavior simply by being paired with another MO. If a 
toy known for evoking stereotypy is delivered after compliance 
for a specific instruction, this repeated pairing may establish 
the instruction as a CMO-S that would increase the value of a 
specific form of automatic reinforcement and evoke stereotypy 
(Lanovaz et al., 2014). 

The relationship between the CMO-S and an effective MO 
can generally be weakened by two kinds of MO unpairing: pre- 
senting the previously neutral stimulus (now CMO-S) without 
the effective MO or presenting the MO as often in the absence 
of the CMO-S as in its presence. In the examples above, if the 
instruction is no longer followed by the toy that usually evokes 
stereotypy, or a parent talking to a third person is no longer fol- 
lowed by low levels of attention, the value-altering and behavior- 
altering effects of the toy or diverted parental attention would 
be weakened. Similarly, if the toy known to evoke stereotypy 
is presented as often in the presence of the instruction as in its 
absence, or if differing levels of attention are associated with 
talking to a third person, their motivating effectiveness would 
be reduced. 

The CMO-S appears to play a role in the development of 
behaviors that do not seem to make much sense, such as seeing 
a restaurant after dinner and wanting to eat again or feeling the 
urge to urinate after seeing the sign for the restroom. Both the 
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Figure 1. Percentage of intervals with aberrant behavior for Ole and Lena during assessment, treatment evaluation, and follow-up sessions. 


Figure 16.7 Example of control by a motivating operation “a surrogate CMO (CMO-S)”. 


From “Using Brief Assessment to Evaluate Aberrant Behavior Maintained by Attention”, M. F. O'Reilly, G. E. Lancioni 
L. King, G. Lally, and O. N. Dhomhnaill, 2000, Reproduced with permission of John Wiley & Sons Inc. 


restaurant and the restroom may evoke behavior as a CMO-S, 


given previous pairings with food deprivation and a full bladder, 
respectively (Miguel, 2013). 


Reflexive CMO (CMO-R) 


Any stimulus that systematically precedes the onset of painful 
stimulation becomes a CMO-R, in that its own offset functions 
as a reinforcer, and its occurrence evokes any behavior that has 
produced such reinforcement in the past. For example, in a tradi- 
tional discriminated avoidance procedure,!* after some interval of 
time, the onset of an initially neutral stimulus is followed by the 
onset of painful stimulation—usually electric shock (see Box 16.3, 
“What’s an Aversive Stimulus?”). Some arbitrary response (one 
that is not part of the organism’s phylogenic repertoire), such 
as lever pressing, terminates the painful stimulation (the animal 
escapes the pain) and restarts the interval. If the same response 
occurs during the warning stimulus, it terminates that stimulus and 
the shock does not occur on that trial. The response at this phase 
of the procedure is said to have avoided the pain and is called an 
avoidance response. Many organisms learn to respond during the 
onset of a warning stimulus, and thus receive very few shocks. 

In this case, the warning stimulus functions similarly 
to the shock as an MO for the escape response, the reinforcer 
for which is shock termination. However, the capacity for the 
warning stimulus to establish its own termination as an effective 
reinforcer is a result of the organism’s learning history, involving 
the pairing of the warning stimulus and shock. In other words, 
the warning stimulus evokes the avoidance response as a CMO, 
just as the painful stimulation evokes the escape response 
as a UMO. 

The warning stimulus is not an SP correlated with the 
availability of the consequence, but rather an MO related to the 
reinforcing effectiveness of the consequence. Recall that a dis- 
criminative stimulus is related to the current availability of a 
type of consequence for a given type of behavior. Availability is 
defined by two components: (a) An effective consequence (ies 
one whose MO is currently in effect) must have followed the 
response in the presence of the stimulus; and (b) the response 
must have occurred without the consequence (which would 
have been effective as a reinforcer if it had been obtained) in 
the absence of the stimulus. The relation between the warning 
stimulus and consequence availability does not meet the second 
requirement. In the absence of the warning stimulus, there is 
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no effective consequence that could have failed to follow the 
response as an analog to the extinction responding that occurs in 
the absence of an S”. The fact that the avoidance response does 
not turn off the absent warning stimulus is like the unavailability 
of food reinforcement for a food-satiated organism. 

By contrast, a stimulus that has preceded some form of 
improvement may function as a CMO-R by (a) establishing its 
own offset as an effective punisher and (b) abating any behavior 
that has been so punished. For example, a positive evaluation from 
an employer may have been followed by a promotion or some 
sort of monetary bonus. This history would establish the offset of 
positive evaluations as a form of punishment and likely reduce the 
frequency of all behaviors that may have resulted in a bad eval- 
uation (e.g., being late). Although this relation is quite plausible, 
little research has seemed to focus on this type of CMO-R. 

The CMO-R plays an important role in identifying the 
negative aspects of many everyday interactions. Imagine that 
a stranger asks you where a particular building is located or 
asks for the time. The appropriate response is to give the infor- 
mation quickly or to say you don’t know. Typically, the person 
who asked will smile and say, “thank you”. Also, your question 
may be reinforced by the knowledge that a fellow human being 
has been helped (Skinner, 1957). In a sense, the question is an 
opportunity to obtain reinforcers that were not previously avail- 
able. However, the question also begins a brief period that can 
be considered a warning stimulus, and if a response is not made 
soon, a form of social worsening (awkwardness) will occur. The 
stranger may repeat the question, stating it more clearly or more 
loudly, and will certainly think you are socially inept if you do 
not respond quickly. You would also consider it inappropriate to 
provide no answer. Even when no clear threat for nonresponding 
is implied by the person who asked, our social history under 
such conditions implies a form of worsening for continued inap- 
propriate behavior. Many such situations probably involve a 
mixture of positive and negative components, but in those cases 
in which answering the question is an inconvenience (e.g., the 
listener is in a hurry), the stranger’s thanks is not a strong rein- 
forcer, nor is helping one’s fellow human being, and the CMO-R 
is probably the main controlling variable. 

Another example is when a person thanks another for 
doing a kindness of some sort. Saying “thanks” is evoked by 
the person’s performing the favor or the kindness. Performing 
the favor may be considered an SP in the presence of which 
one has learned to say “Thank you” and received reinforcement 


BOX 16.3 


What's an Aversive Stimulus? 


A stimulus may be called aversive because its contingent 
removal functions as reinforcement, because its contingent 
presentation functions as punishment, because it evokes 
behavior that in the past has terminated it as an MO, or 
because it has evocative effects with respect to certain 
smooth muscle and glandular responses (e.g., heart rate 


increase, adrenal secretion) (Michael, 1993). Thus, aversive 
stimulus does not imply a specific functional relation; it is 
simply a behavioral translation of common sense expressions 
for “unpleasant feeling,” “unpleasant states of mind,” and so 
on—a form of usage possibly fostered by the term's lack of 
specificity. 
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consisting of the other person saying, “You’re welcome.” How- 
ever, in many cases, ordinary courteous remarks may involve 
a CMO-R component. Consider the following scenario. Bill 
has his arms full, carrying something out of the building to his 
car. As he approaches the outer door, Danielle opens the door 
and holds it for Bill. Ordinarily, Bill would then smile and say, 
“Thanks.” The CMO-R component can be illustrated by sup- 
posing that Bill just walks out without acknowledging the favor. 
In such circumstances, it would not be unusual for Danielle to 
call out sarcastically, “You’re welcome!” Someone’s favor may 
function as a warning stimulus (a CMO-R) that has systemati- 
cally preceded some form of disapproval in the absence of some 
form of acknowledgement (saying “Thanks”). 

In the typical laboratory avoidance procedure, the response 
terminates the warning stimulus. In extending this analysis to the 
human situation, it must be recognized that the warning stim- 
ulus is not simply the event that initiated the interaction. In the 
previous example, the CMO-R is not the vocal request itself, 
which is too brief to be terminated. It is, instead, the more com- 
plex social stimulus situation consisting of having been asked 
and not having made a response during the time when such a 
response would be appropriate. The termination of that situation 
is the reinforcement for the response. Some social interactions 
with a stimulus—a facial expression, an aggressive posture—are 
more like the warning stimulus seen in the laboratory that can 
be terminated by the avoidance response, but most involve the 
more complex stimulus condition consisting of the request and 
the subsequent period described earlier. 

In applied behavior analysis, the CMO-R is a component 
of training procedures for teaching individuals with defective 
social and verbal repertoires. During early intensive behavioral 
intervention, for example, learners are typically asked questions 
or given verbal instructions, followed by verbal or physical 
prompts so they can respond correctly. This arrangement may 
function as CMO-Rs, which will be followed by further intense 
social interaction if they do not respond appropriately (e.g., error 
correction). Such questions and instructions may be function- 
ing primarily as CMO-Rs, rather than Ss related to the possi- 
bility of receiving praise or other positive reinforcers (Carbone 
et al., 2010). In other words, in these situations, it is likely that 
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Figure 16.8 Illustration of how an 
instruction may become a reflexive 
conditioned motivating operation (CMO-R). 


Instruction 
establishes its own 
removal as a reinforcer 


correct responses are partially maintained by negative rather 
than positive reinforcement. In addition to establishing their 
own removal as a reinforcer and evoking escape-maintained 
behaviors, these instructions may establish the value of activi- 
ties or items unrelated to the task at hand as conditioned rein- 
forcers. In this case, the instructions would function as transitive 
CMOs, as described in the following section (Michael, 2000). 
Figure 16.8 illustrates how, after preceding an aversive event, 
an instruction may become a CMO-R that would (a) establish 
its own removal as a conditioned reinforcer and (b) evoke any 
behavior that has produced such a reinforcer, such as compliance 
with the instruction. 

Behaviors maintained by negative reinforcement are under 
the influence of a variety of antecedent stimuli, including task 
difficulty as a CMO-R. Changing these tasks may abolish the rein- 
forcing effects of negative reinforcement and abate undesirable 
escape behaviors (Smith & Iwata, 1997). As an attempt to assess 
the function of problem behavior, as well as identify the specific 
features of the environment that served as EOs, McComas, Hoch, 
Paone, and E]-Roy (2000) exposed three children diagnosed with 
developmental disabilities to a typical experimental functional 
analysis, a descriptive assessment to generate hypotheses about 
possible antecedents for destructive behavior, and an analysis in 
which they tested the evocative effects of the hypothesized EOs. 
Although all participants engaged in problem behavior in the 
presence of task demands, the specific task features that evoked 
problem behavior (CMO-Rs) varied across participants. However, 
problem behavior by all participants was reduced simply by mak- 
ing specific modifications to the instructional methods, without 
modifying the demands. For one participant (Eli), problem behay- 
lor was evoked by a novel or difficult task (CMO-R). During the 
EO analysis, the authors compared the level of problem behay- 
ior when an instructional strategy was used (i.e., manipulables 
available during a math task) versus when the strategy was not 
used. Problem behavior rarely occurred when the strategy was 
used, suggesting that the establishing and evocative effects of task 
demands on escape-maintained problem behavior were eliminated 
by the addition of the instructional strategy. This effect may have 
been achieved by making the tasks less difficult and, thus, unpair- 
ing task demands with aversive properties. 


Instruction precedes 
aversive events (pairing) 


Aversive verbal 


and physical 
prompts 


Compliance eliminates the instruction 


Results from a sample participant are shown in Figure 16.9. 
Functional analysis (top panel) suggested that problem behavior 
consisted of escape from academic tasks. During the EO analysis 
and follow-up (middle panel), problem behavior did not occur 
when the instructional strategy was employed, compared to when 
it was not. Additionally, the participant was more compliant when 
the instructional strategy was available (bottom panel). 

Numerous treatments to weaken or abolish CMO-Rs that 
evoke problem behavior in the presence of task demands have 
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Figure 1. Percentage of intervals with shirt biting for Eli in the functional 


analysis (top panel) and EO analysis and follow-up (middle panel). Percentage of 
intervals with compliance is depicted in the bottom panel. Follow-up sessions begin in 
Session 5. The single carat indicates probe in novel setting, the double carat indicates 
probe with novel instructor, and the triple carat indicates probe with novel materials. 


Figure 16.9 Example of control by a motivating operation a 
reflexive CMO (CMO-R). 
From “Escape Behavior During Academic Tasks: A Preliminary Analysis 


of Idiosyncratic Establishing Operations” by J. J. McComas, H. Hoch, 
H., D. Paone, and D. El-Roy 2000, Reproduced with permission of John 


Wiley & Sons Inc. 
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been evaluated (see Carbone et al., 2010, for a review). These 
include programming competing reinforcers for compliance (e.¢., 
Piazza et al., 1997), pairing stimuli that evoke problem behay- 
ior with reinforcement by embedding preferred activities during 
task presentation (e.g., Kemp & Carr, 1995), reducing the aver- 
siveness of the task by using errorless procedures (Heckaman, 
Alber, Hooper, & Heward, 1998), systematically fading instruc- 
tional demands (Pace, Iwata, Cowdery, Andree, & McIntyre, 
1993), varying the tasks being presented (McComas et al. 2000; 
Winterling, Dunlap, & O’ Neill, 1987), increasing the pace of 
instruction to produce higher rates of reinforcement for appro- 
priate behavior (Roxburgh & Carbone, 2013), presenting highly 
preferred activities and low demands to neutralize CMO-Rs 
occurring prior to the session (e.g., Horner, Day, & Day, 1997), 
reducing the aversiveness of tasks by providing choices of 
activities and reinforcers during sessions (Dyer, Dunlap, & 
Winterling, 1990), interspersing easy tasks with difficult ones 
(Mace & Belfiore, 1990), gradually introducing novel tasks 
(Smith, Iwata, Goh, & Shore, 1995), and modifying task dura- 
tion based on whether participants engage in problem behavior 
early or late in the session (Smith et al., 1995). It may also be 
possible to weaken the CMO-R by unpairing it with the aversive 
stimulus it precedes (Kettering, Neef, Kelley, & Heward, 2018). 

It is important to note that negatively reinforced problem 
behaviors improve environmental conditions that are otherwise 
poor. Thus, it is the behavior analyst’s ethical duty to identify 
and modify these aversive variables (including CMO-Rs), rather 
than simply reduce problem behavior through some form of con- 
sequence manipulation such as escape extinction (Carbone et al., 
2010; McGill, 1999). 


Transitive CMO (CMO-T) 


When an environmental variable establishes the effectiveness 
of another event as a reinforcer or punisher, it is referred to 
as a CMO-T. This type of MO was previously described in 
the behavior analytic literature as an establishing stimulus 
(SE; Michael, 1982) or blocked-response CEO (Michael, 1988; 
also see Miguel, 2013). Conditioned reinforcers are dependent 
upon a variable functioning as a CMO-T. Thus, all variables 
that function as UMOs also function as CMO-Ts for the stimuli 
that are conditioned reinforcers because of their relation to the 
relevant unconditioned reinforcer. For example, not only is 
food deprivation a UMO that momentarily (a) establishes the 
value of food as an unconditioned reinforcer and (b) evokes 
all behaviors that have produced food in the past; food depri- 
vation is also a CMO-T that momentarily (a) establishes the 
value of all conditioned reinforcers paired with food (e.g., an 
attentive server in a restaurant, a menu, utensils) and (b) evokes 
all behaviors that have produced these conditioned reinforcers 
in the past.!4 

Consider the simple operant chain described earlier: 
A food-deprived rat pulls a cord that turns on a buzzer. In the 
presence of the buzzer sound, the rat then presses a lever and 
receives a food pellet. Food deprivation as a UMO makes food 
effective as an unconditioned reinforcer, a relation that requires 
no learning history. Food deprivation as a CMO-T also makes 
the buzzer sound effective as a conditioned reinforcer, which 
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clearly requires a learning history. Thus, food deprivation is 
a UMO with respect to the reinforcing effectiveness of food, 
and a CMO-T with respect to the reinforcing effectiveness of 
the buzzer sound. Conversely, food ingestion is a UMO that 
abolishes the reinforcing effectiveness of food and the buzzer 
sound. 

In the Mexican restaurant, we may need to call the waiter 
to get his attention prior to ordering our tacos. In this scenario, 
food deprivation functions as a UMO that establishes food as 
an unconditioned reinforcer, and as a CMO-T that establishes 
the sight of the waiter as a conditioned reinforcer. As in the rat 
example above, food ingestion would decrease the reinforcing 
values of both food and the waiter. 

Other UMOs, such as painful stimulation, also establish 
the reinforcing value of conditioned reinforcers. As mentioned 
above, as a UMO, a headache would establish its own removal 
as an unconditioned reinforcer. However, as a CMO-T, a head- 
ache also establishes the reinforcing value of other stimuli 
(e.g., aspirin) that have served to reduce headaches. 

The relationship between UMOs and conditioned rein- 
forcers cannot be overlooked by clinicians or educators who 
may make extensive use of conditioned reinforcers, includ- 
ing generalized ones, to strengthen academic or other socially 
significant behaviors. Studies have shown that the value of gen- 
eralized conditioned reinforcers in the form of tokens decreases 
when participants have access to the primary and conditioned 
reinforcers paired with the tokens (Ivy, Neef, Meindl, & Miller, 
2016; Moher, Gould, Hegg, & Mahoney, 2008). Conversely, 
when participants have restricted access to back-up reinforc- 
ers, tokens are more effective reinforcers, as evidenced by an 
increased rate of responding. In practice, these data suggest 
that the value of conditioned and generalized conditioned rein- 
forcers (e.g., tokens) depends upon the MO relevant to their 
back-up reinforcers. Thus, it seems reasonable to assume that 
the effectiveness of generalized reinforcers would be increased 
when paired with a greater number of back-up reinforcers 
(Skinner, 1957, p. 54). More specifically, satiation may be 
prevented by pairing the putative generalized reinforcer with a 
variety of back-up reinforcers, such as food and drink that are 
under control of dissimilar MOs (Moher et al., 2008). Money, 
for instance, seems almost completely free from MO control, 
given that it has been paired with a multitude of effective rein- 
forcers related to several different MOs. Thus, it is likely that 
one or more of these MOs will be in effect when money is deliv- 
ered contingent on behavior, making its effectiveness almost 
impossible to abolish. 

The reinforcing effectiveness of many (probably most) 
conditioned reinforcers is not only altered by relevant UMOs, 
as described above, but also dependent on other stimulus 
conditions. This is why the effectiveness of conditioned rein- 
forcers is often said to be “context” dependent. When the con- 
text is not appropriate, the stimuli may be available but are not 
accessed because they are not effective as reinforcers. A change 
to an appropriate context will evoke behavior that has been 
followed by those stimuli, which are now effective as condi- 
tioned reinforcers. The occurrence of the behavior is not related 
to the availability, but rather to the value, of its consequence. 


For example, flashlights are usually available in home settings 
but are not accessed until a power failure makes them valuable. 
In this sense, the power failure (the sudden darkness) evokes the 
behavior that has obtained the flashlight in the past (rummaging 
around in a particular drawer). However, when the light is back 
on, the value of the flashlight as a reinforcer is abolished, and 
any behaviors that led to the flashlight are momentarily abated. 
The motivating nature of this CMO-T relation is not widely rec- 
ognized, and the evocative variable (the sudden darkness) in this 
example is usually interpreted as an SY 

Consider a workman disassembling a piece of equipment, 
with his assistant handing him tools as he requests them.!> The 
workman encounters a slotted screw that must be removed and 
requests a screwdriver. The sight of the screw evoked the request, 
the reinforcement for which is receiving the tool. Prior to an anal- 
ysis in terms of the CMO-T, the sight of the screw would have 
been considered an S? for the request, but such screws have not 
been differentially related to the availability of reinforcement for 
requests for tools. In the typical workman’s history, assistants 
have provided requested tools irrespective of the stimulus condi- 
tions in which the request occurred. The sight of the screw is more 
accurately interpreted as a CMO-T for the request, not as an SP. 

The fact that several S?s are involved in this complex situ- 
ation makes the analysis somewhat difficult. The screw is an SP 
for unscrewing movements (with a screwdriver in hand) or for 
selecting a screwdriver rather than another tool. The workman’s 
verbal request for a screwdriver, although evoked by the sight 
of the slotted screw as a CMO-T, is dependent on the assistant’s 
presence as an S”. The offered screwdriver is also an S for the 
workman reaching to grasp it. The critical issue here, however, 
is the screw's role in evoking the request, and this is a motivating 
rather than a discriminative relation (Michael, 1982). 

Another common example involves a stimulus related 
to some form of danger that evokes some relevant protective 
behavior. Imagine a night security guard is patrolling an area and 
hears a suspicious sound. He pushes a button on his phone that 
signals another security guard, who then activates his phone and 
asks if help is needed (which reinforces the first guard’s call). 
The suspicious sound is not an S? in the presence of which the 
second security guard’s response is more available (the second 
guard would respond to the first guard’s call whether or not the 
first guard heard the sound), but rather a CMO-T in the pres- 
ence of which second guard’s response is more valuable. S?s 
are involved, however. A ringing phone is an S? in the presence 
of which one has activated the phone, said something into the 
receiver, and been reinforced by hearing a response from another 
person. Answering phones that are not ringing has typically not 
been so reinforced.!® (Note, incidentally, that the effect of the 
danger signal is not to evoke behavior that produces its own ter- 
mination, as with the CMO-R, but rather behavior that produces 
some other event, in this case, the sound of the security guard’s 
colleague offering to help.) 

Figure 16.10 shows an example of sudden darkness func- 
tioning as a CMO-T that establishes a flashlight as a conditioned 
reinforcer, as well as evoking the behavior of looking for a flash- 
light. Note that sudden darkness may also function as a CMO-R 
that establishes its own removal as a reinforcer. !’ 
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Sudden darkness as a CMO-R 


Sudden 


darkness flashlight 


Sudden darkness as a CMO-T 


Look for a Find 
flashlight 


Turn No more 
flashlight on darkness 


Figure 16.10 Illustration of a transitive conditioned motivating operation (CMO-T). 


The distinction between an SP and a CMO-T hinges on 
the relation between reinforcer availability and the presence or 
absence of the stimulus. If the reinforcer is more available in 
the presence than in the absence of the stimulus, the stimulus is 
an SP; if the reinforcer is equally available in the absence and 
presence of the stimulus, the stimulus is a CMO-T. Screwdrivers 
have typically been available to the workman in the absence as 
well as in the presence of screws. The response by the security 
guard’s colleague has been just as available in the absence as in 
the presence of a suspicious noise. 

A CMO-T evokes behavior because of its relation to the 
value of a consequence rather than to the availability of a con- 
sequence. The two forms of behavioral control, the SP and the 
CMO-T, which are so different in origin, would be expected to 
differ in other important ways. This is an example of termino- 
logical refinement in behavior analysis, not a discovery of new 
empirical relations. The value of this refinement will be found in 
the improved theoretical and practical effectiveness of behavior 
analysts whose verbal behavior has been affected by it. 


Teaching Mands with CMO-Ts. Mand training is an essential 
aspect of language programs for individuals with severely defi- 
cient verbal repertoires (see Chapter 18). For such individuals, 
manding does not spontaneously arise from tact and receptive 
discrimination training (i.e., listener behavior). The learner has 
to want something, make an appropriate verbal response, and be 
reinforced by receiving what was wanted. This procedure places 
the response under control of the relevant MO. The manipulation 
of CMO-Ts allows the teacher to contrive a situation to get the 
learner to want something that can be a means to another end 
(e.g., obtaining information by asking a question). Any stimulus, 
object, or event can be a basis for a mand simply by arranging 
an environment in which that stimulus can function as a condi- 
tioned reinforcer. Thus, if a pencil mark on a piece of paper is 
required for an opportunity to play with a favorite toy, a mand 
for a pencil and for a piece of paper can be taught. 

The manipulation of CMO-Ts is common practice when 
teaching mands to individuals with disabilities (e.g., Alwell, Hunt, 
Goetz, & Sailor, 1989; Betz, Higbee, & Pollard, 2010; Goetz, 
Gee, & Sailor, 1985; Lechago, Carr, Grow, Love, & Almason, 


2010; Shillingsburg, Valentino, Bowen, Bradley, & Zavatkay, 
2011). The typical procedure, termed interrupted-chain (Hall & 
Sundberg, 1987) involves withholding an item the learner needs 
to complete a previously learned task. This manipulation func- 
tions as a CMO-T, increasing the value of the item as a rein- 
forcer, which creates the ideal condition for teaching the learner 
to mand for it. In a mand training study, Shillingsburg, Bowen, 
Valentino, and Pierce (2014) exposed three children with dis- 
abilities to two scenarios. One scenario involved varying the 
placement of a preferred item in one of nine different colored 
(or animal) cups. A stimulus that signaled the preferred item 
(e.g., an empty Skittles bag) was then placed next to the par- 
ticipant. When the participant asked for the item (e.g., “May 
I have a Skittle?”), the experimenter responded by saying that 
the item was under one of the cups, increasing the value of the 
information for the location of the item (which cup) as a condi- 
tioned reinforcer. The second scenario involved giving a highly 
preferred item to one of three adults when the participant was not 
looking. The stimulus that signaled the presence of the preferred 
item was presented again, and when the child asked for it, the 
therapist said that one of the other adults had it, without saying 
whom. This statement served as a CMO-T that increased the 
value of the information about who had the item as a conditioned 
reinforcer. The authors used a vocal prompt delay to teach partic- 
ipants to ask, “which cup?” and “who has it?” in scenarios | and 
2, respectively. These CMO-T trials were alternated with trials in 
which the information was not valued as a reinforcer (abolishing 
operation; AO) to guarantee that behavior came under control 
of the EO, versus some other aspect of the environment. During 
AO trials, the therapist provided either the information about 
the location of the item or the name of the person who had it. In 
addition to the emission of mands, the authors also collected data 
on whether participants selected the correct cup or approached 
the correct adult to obtain the preferred item. All participants 
manded using “which” and “who” more often during EO rather 
than AO conditions, and approached the preferred items during 
both conditions, suggesting that the interrupted chain procedure 
was successful in placing mands under functional control of 
CMO-Ts. Figure 16.11 shows the results for the “which” ques- 
tions. None of the participants manded for information during 
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Figure 1. Cumulative record of independent responses for manding “which?” (left) and cumulative record for approach behavior (right) 


during baseline and posttraining probes for lan, Jeb, and Jen. 


Figure 16.11 Example of control by a motivating operation a transitive CMO (CMO-T). 


From “Mands for Information Using “Who?” and “Which?” in the Presence of Establishing and Abolishing Operations” 
by M, A. Shillingsburg, C. N. Bowen, A. L. Valentino, and L. W. Pierce, 2014, Reproduced with permission of John Wiley 


& Sons Inc. 


baseline (left panel), although each selected the correct container 
during the AO condition during which the information on the 
location of the item was provided (right panel). After training, 
participants manded only during the EO condition (left panel). 
Additionally, after receiving the information during the EO 
condition, participants started to select the correct containers 
to access the preferred item (right panel). Similar results were 
obtained for the “who” questions. Chapter 18 provides additional 
details about using CMO-Ts during mand training. 


RELEVANCE OF MOs TO THE GENERALITY 
OF TREATMENT EFFECTS 


In the applied area, the reinforcer-establishing effect of MOs 
seems to be well understood, as clinicians may temporarily 
withhold edibles, tangibles, and even social attention to make 
them more effective as reinforcers for behaviors being taught 
(e.g., Taylor et al., 2005). However, these target behaviors will 
not occur in future or novel circumstances, even if well learned 
and part of the learner’s repertoire, unless relevant MOs are 
in effect (Fragale et al., 2012; O’Reilly et al., 2012). Fragale 
and colleagues have shown that after teaching three children 
diagnosed with autism to mand for preferred items and testing 
for generalization across settings, the newly acquired mands 
occurred almost exclusively when items had been restricted 
prior to the session and seldom occurred when participants could 
access the items prior to sessions. These results show that even 
if well established, a behavior may not occur in the future in a 
new setting if the MO is absent. Thus, it is possible that general- 
ization and maintenance failures are not only a function of novel 
(generalization) stimuli being dissimilar to those present during 
training, but also due to different MOs. Thus, when program- 
ming for generalization, clinicians must do so not only across 
different discriminative stimuli (Stokes & Baer, 1977) but also 


across different MOs (Miguel, 2017). In summary, for a newly 
acquired response to be generalized and maintained, its relevant 
MO must also be in effect. 


RELEVANCE OF MOs TO APPLIED 
BEHAVIOR ANALYSIS 


Applied behavior analytic research has shown the effects of MOs 
on a multitude of behaviors across a variety of conditions. MOs 
have affected the results of preference assessments (Gottschalk, 
Libby, & Graff, 2000; McAdam et al., 2005) and functional 
analyses (Fischer, Iwata, & Worsdell, 1997; Worsdell, Iwata, 
Conners, Kahng, & Thompson, 2000), engagement with activi- 
ties (Klatt et al., 2000; Vollmer & Iwata, 1991), within-session 
responding (Roane, Lerman, Kelley, & Van Camp, 1999), social 
initiations (Taylor et al., 2005), the rate of acquisition of target 
behaviors (Ivy et al., 2016), and the occurrence of challenging 
behaviors (O’ Reilly et al., 2007). 

Gottschalk and colleagues (2000), for instance, com- 
pared the results of preference assessments of four children 
with autism under three conditions. In the control condition, 
participants had access to four edible items in premeasured 
portions, 3 times, 24 hr prior to the assessment. In the satiation 
condition, the same access to premeasured portions was pro- 
vided, but the participants had free access to one of the stimuli 
for 10 min prior to the assessment. Finally, in the deprivation 
condition, participants had the same premeasured portions, but 
were deprived of one stimulus for 48 hr prior to the assess- 
ment. Results showed that across all participants, approach 
responses were higher during deprivation conditions. These 
results were replicated with leisure items with both typically 
developing children and individuals with intellectual disabili- 
ties (McAdam et al., 2005) and suggest that results of prefer- 
ence assessments are highly dependent on current MOs. 


As an attempt to evaluate the effects of MOs on the 
results of functional analyses, Worsdell et al. (2000) exposed 
six individuals with profound intellectual disabilities who 
engaged in self-injury to an experimental functional analysis 
(FA), followed by another FA in which the MO and the rein- 
forcer for problem behavior were present or absent. During 
the EO condition, the experimenter ignored all but problem 
behavior, while in the AO condition, the experimenter deliv- 
ered noncontingent reinforcement. These conditions were 
associated with either the presence or the absence of continu- 
ous reinforcement. The authors reported high rates of problem 
behavior for all participants during the condition in which the 
EO and reinforcement were present, and almost no responding 
when both the EO and reinforcement were absent, or when the 
EO was present and reinforcement was absent. These results 
are consistent with the conceptualization that for behavior 
to occur, it must be followed by an effective reinforcer, and 
suggest that MOs may differentially affect the results of FAs. 

As previously discussed, in addition to affecting results of 
commonly used behavior analytic assessments, MOs have been 
shown to directly affect the rate of acquisition of novel behaviors 
reinforced with primary, conditioned, or generalized reinforcers 
(e.g., Ivy et al., 2016; Lang et al., 2009; Moher et al., 2008; 
O’Reilly et al., 2009), as well as the frequency of behaviors that 
have already been acquired (e.g., Klatt et al., 2000). Behavior 
analysts have also directly manipulated MOs to reduce problem 
behaviors (e.g., Rapp, Cook, McHugh, & Mann, 2017) and teach 
new skills (Lechago et al., 2010). 

Research on MOs falls within the three broad categories 
previously described by Iwata, Smith, and Michael (1991): 
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SUMMARY 


Definition and Characteristics of Motivating 
Operations 


1. A motivating operation (MO) (a) alters the effectiveness 
of some stimulus as a reinforcer, the value-altering effect; 
and (b) alters the current frequency of all behavior that 
has been reinforced by that stimulus, the behavior-altering 
effect. 


2. The value-altering effect is either (a) an increase in the 
reinforcing effectiveness of some stimulus, in which 
case the MO is an establishing operation (EO); or (b) a 
decrease in reinforcing effectiveness, in which case the 
MO is an abolishing operation (AO). 


3. The behavior-altering effect is either (a) an increase in the 
current frequency of behavior that has been reinforced by 
some stimulus, called an evocative effect; or (b) a decrease 
in the current frequency of behavior that has been rein- 
forced by some stimulus, called an abative effect. 


4. The value-altering effect of an MO changes the potency 
of reinforcers (or punishers) to increase and maintain (or 
decrease and eliminate) behavior 
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demonstrating the effects of MOs on behavior, using MOs to 
clarify results of assessments, and manipulating MOs to increase 
or decrease behavior. This ever-growing literature has had much 
impact on our practice. It allowed behavior analysts to recog- 
nize that the three-term contingency is dependent upon a fourth 
variable that modulates the effectiveness of reinforcement, and 
hence the evocative strength of discriminative stimuli. A clear 
understanding of these motivational variables seems critical for 
interpreting assessment results and designing effective behavior 
analytic interventions. 

As we go about our daily lives, we may clearly identify 
how MOs influence our own behavior. Revisiting the anecdote 
about our obsession with Mexican food, we see that food depri- 
vation as a UMO, or, more specifically, not having had Mexican 
food for some time, increases its value as a reinforcer, and the 
frequency of previous behaviors that have produced it, such as 
going to our favorite Mexican restaurant. Food deprivation as a 
CMO.-T establishes the sight of the restaurant as a conditioned 
reinforcer, increasing any behavior that preceded it (e.g., driving 
there) and making us happy when we see it. When at the restau- 
rant, the sight of tortilla chips may increase the value of salsa as 
a reinforcer, as they tend to covary (CMO-S). The spiciness of 
the salsa establishes its own removal as a reinforcer (CMO-R), 
which makes us drink water or any other liquid (e.g., too many 
margaritas). When the chips are gone (CMO-T), we call the 
waiter to bring us more. And even if we have already eaten, the 
sight of the restaurant alone (CMO-S) may get us to eat more. 
Given the ubiquity of motivational variables in our lives, a com- 
plete understanding of human behavior demands an analysis of 
the critical roles of MOs. 


5. The behavior-altering effect is a change in frequency 
due to the organism’s behavior contacting reinforcement 
or punishment. This change occurs only after the 
consequence has been obtained. 


6. The value-altering and behavior-altering effects of an MO 
occur simultaneously but are considered independent in 
the sense that one does not derive from the other. How- 
ever, once behavior contacts reinforcement, these two 
effects may not be discernible. 


7. The behavior-altering effect of the MO can directly 
increase or decrease the frequency of behavior. It can also 
indirectly affect the evocative or abative strength of rele- 
vant discriminative stimuli. 


8. The value-altering effect of the MO can directly alter the 
reinforcing (or punishing) effectiveness of the relevant 
consequence. It can also indirectly alter the reinforcing 
(or punishing) effectiveness of any (conditioned) stimulus 
correlated with the relevant consequence. 


9, SPs and MOs alter the current frequency of behavior 
(behavior-altering effect). 
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10. Reinforcers, punishers, and response occurrence with- 
out consequence alter the future frequency of behavior 
(function-altering effect). 


Distinguishing Between MOs and Ss 


11. An SP controls a type of behavior because it has been 
related to the differential availability of an effective 
reinforcer for that type of behavior. This means that the 
relevant consequence has been available in the presence 
of, and unavailable in the absence of, the stimulus. Most 
variables that qualify as motivating operations fail to meet 
this second SP requirement because in the absence of the 
variable, there is no MO for the relevant reinforcer, and 
thus no reinforcer unavailability. 


12. A useful contrast is that S?s are related to the differential 
availability of a currently effective form of reinforcement for 
a particular type of behavior, whereas MOs are related to the 
differential reinforcing effectiveness of a particular event. 


13. In nontechnical terms, an S? tells you that something you 
want is available; an MO makes you want something. 


Unconditioned Motivating Operations (UMOs) 


14. For all organisms there are events, operations, and stimu- 
lus conditions with unconditioned value-altering effects. 
In other words, these MOs establish or abolish the value of 
stimuli as reinforcers in the absence of prior learning. 


15. The main UMOs for humans are those related to depri- 
vation and satiation with respect to food, water, oxygen, 
activity, and sleep; and those related to sexual reinforce- 
ment, comfortable temperature conditions, and painful 
stimulation. For each variable, there are two MOs, one with 
an establishing operation (EO) and one with an abolishing 
operation (AQ). Also, each variable has an evocative effect 
and an abative effect. Thus food deprivation is an EO and 
has evocative effects on relevant behavior, and food inges- 
tion is an AO and has abative effects on relevant behavior. 


MOs for Punishment 


16. MOs alter (increase or decrease) the punishing effective- 
ness of a stimulus, as well as the frequency (evoke or abate) 
of behaviors that have been punished by that stimulus. 


17. An EO related to punishment would have a punisher- 
establishing effect and an abative effect. 


18. An AO related to punishment would have a punisher- 
abolishing effect and an evocative effect, in that behaviors 
that have been punished would occur again (assuming 
that a reinforcer contingency for the previously punished 
behavior is intact). 


19. In the case of conditioned punishers that were established 
by having been paired with the removal (or reduced avail- 
ability) of reinforcers (i.e., negative punishment), their 
effectiveness as conditioned punishers will depend on the 


same MOs that establish the value of the reinforcers that 
were removed. 


20. Social disapproval, time-out from reinforcement, and 
response cost are stimulus conditions that usually function 
as punishment because they are related to a reduction in 
the availability of some kinds of reinforcers. The MOs for 
those forms of punishment are the MOs for the reinforcers 
that are being made less available. 


Multiple Effects of MOs 


21. Most environmental events have more than one effect 
upon behavior. 


22. Events that function as UMOs will typically have 
behavior-altering effects on the current frequency of a type 
of behavior and, when presented as consequences, will 
have function-altering effects on the future frequency of 
behavior that immediately preceded their onset. 


23. Events that have MO evocative effects, such as painful 
stimulation, will also function as punishment for the 
response immediately preceding the onset of the event. 


24. Events that have MO abolishing effects, such as food con- 
sumption, may have reinforcing effects when presented as 
consequences for behavior. 


Conditioned Motivating Operations (CMOs) 


25. Motivating variables that alter the reinforcing effectiveness 
of other stimuli, objects, or events, but only as a result of 
the organism’s learning history, are called conditioned 
motivating operations (CMOs). As with the UMOs, they 
also alter the momentary frequency of all behavior that has 
been reinforced (or punished) by those other events. 


26. The surrogate CMO (CMO-S) is a stimulus that acquires 
its MO effectiveness by being paired with another MO, 
and has the same value-altering and behavior-altering 
effects as the MO with which it was paired. 


27. The CMO-S appears to play a role in the development of 
behaviors that do not seem to make much sense, such as 
seeing a restaurant after dinner and wanting to eat again 
or feeling the urge to urinate after seeing the sign for the 
restroom. Both the restaurant and the restroom may evoke 
behavior as a CMO-S, given previous pairings with food 
deprivation and a full bladder, respectively. 


28. A stimulus that acquires MO effectiveness by preced- 
ing some form of worsening or improvement is called a 
reflexive CMO (CMO-R). It is exemplified by the warning 
stimulus in a typical escape-avoidance procedure, which 
establishes its own offset as reinforcement and evokes all 
behavior that has accomplished that offset. 


29. A stimulus that has preceded some form of improvement 
may also function as a CMO-R by (a) establishing its own 
offset as an effective punisher and (b) abating any behavy- 
ior that has heen so punished. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


ea 


The CMO-R has usually been interpreted as an S?. The 
CMO-R fails to qualify as an S?, however, because in 
its absence there is no MO for a reinforcer that could be 
unavailable and thus no reinforcer unavailability. 


The CMO-R identifies a negative aspect of many everyday 
interactions that otherwise would be interpreted as a 
sequence of opportunities for positive reinforcement. 

One example is a request for information, which initiates 
a brief period during which a response must be made to 
terminate a period of increased social awkwardness. 


The CMO-R is often an unrecognized component of pro- 
cedures used in teaching effective social and verbal behav- 
ior. Learners are asked questions or given instructions, 
which are followed by further intense social interaction if 
they do not respond to them appropriately. The question or 
instruction may be functioning more as a warning stimu- 
lus, as a CMO-R, than as an S? related to an opportunity 
for receiving praise or other positive reinforcers. 


An environmental variable that establishes (or abolishes) 
the reinforcing effectiveness of another stimulus and 
evokes (or abates) the behavior that has been reinforced by 
that other stimulus is a transitive CMO, or CMO-T. 


Variables that function as UMOs also function as transi- 
tive CMOs for the stimuli that are conditioned reinforcers 
because of their relation to the unconditioned reinforcer. 
Food deprivation (as a UMO) establishes as a reinforcer 
not only food but also (as a CMO-T) all of the stimuli that 
have been related to obtaining food (e.g., the utensils with 
which one transports food to the mouth). 


The reinforcing effectiveness of many conditioned 
reinforcers not only is altered by relevant UMOs but also 
may be dependent on other stimulus conditions. Those 
stimulus conditions, as CMO-Ts, then also evoke behavior 
that has obtained the conditioned reinforcers. 


A CMO-T evokes behavior because of its relation to the 
value of a consequence rather than to the availability of a 
consequence. 


The manipulation of CMO-Ts is common practice when 
teaching mands to individuals with disabilities. The 


KEY TERMS 


abative effect 

abolishing operation (AO) 
behavior-altering effect 

conditioned motivating operation (CMO) 
establishing operation (EO) 


evocative effect 


function-altering effect 
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typical procedure, termed interrupted-chain, involves 
withholding an item the learner needs to complete a pre- 
viously learned task. This manipulation functions as a 
CMO-T, increasing the value of the item as a reinforcer, 
which creates the ideal condition for teaching the learner 
to mand for it. 


Relevance of MOs to the Generality 
of Treatment Effects 


38. 


ab) 


An acquired behavior may not occur in the future in 

a new setting if the MO is absent. Thus, it is possible 
that generalization and maintenance failures are not 
only a function of novel (generalization) stimuli being 
dissimilar to those present during training, but also due 
to different MOs. 


When programming for generalization, clinicians must do 
so not only across different discriminative stimuli but also 
across different MOs. For a newly acquired response to be 
generalized and maintained, its relevant MO must also be 

in effect. 


Relevance of MOs to Applied Behavior Analysis 


40. 


41. 


42. 


43. 


motivating operation (MO) 
reflexive conditioned motivating 
operation (CMO-R) 
reinforcer-abolishing effect 
reinforcer-establishing effect 


surrogate conditioned motivating 
operation (CMO-S) 


Applied behavior analytic research has shown the effects 
of MOs on a multitude of behaviors across a variety of 
conditions. MOs have affected the results of preference 
assessments and functional analyses, engagement with 
activities, within-session responding, social initiations, the 
rate of acquisition of target behaviors, and the occurrence 
of challenging behaviors. 


Research on MOs allowed behavior analysts to recognize 
that the three-term contingency is dependent upon a fourth 
variable that modulates the effectiveness of reinforcement, 
and hence the evocative strength of the discriminative 
stimuli. 


A clear understanding of MOs seems critical not only for 
interpreting and assessing results but also for designing 
effective behavior analytic interventions. 


Given the ubiquity of motivational variables in our lives, 
a complete understanding of human behavior demands an 
analysis of the critical roles of MOs. 


transitive conditioned motivating 
operation (CMO-T) 


unconditioned motivating operation 
(UMO) 


MO unpairing 


value-altering effect 
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MULTIPLE-CHOICE QUESTIONS 


IE 


. The term 


alters the effectiveness of some object or event 
as a reinforcer and the current frequency of all behavior 
that has been reinforced by that stimulus, object, or event. 


a. Discriminative stimulus 

b. Establishing operation 

c. Behavior-altering effects 

d. Value-altering effects 

Hint: (See “Definition and Characteristics of Motivating 
Operations”’) 

has been suggested to replace the - 
term establishing operation (EO). 

a. Motivating operation 

b. Evocative effect 

c. Abative effect 

d. Abolishing operation 


Hint: (See “Definition and Characteristics of Motivating 
Operations”’) 


. The effect that can produce either an increase or decrease 


in the reinforcing effectiveness of some stimulus, object, 
or event is the: 


a. Behavior-altering effect 
b. Abative effect 

c. Evocative effect 

d. Value-altering effect 


Hint: (See “Definition and Characteristics of Motivating 
Operations”) 


. The effect that can produce either an increase or decrease in 


the current frequency of behavior that has been reinforced 
by some stimulus, object, or event is the: 


a. Behavior-altering effect 
b. Abative effect 

c. Evocative effect 

d. Value-altering effect 


Hint: (See “Definition and Characteristics of Motivating 
Operations”) 


. Motivating operations (MOs) and Ss are both 


a. Antecedent variables that alter the future frequency of 
some behavior 


b. Consequent variables that alter the current frequency of 
some behavior 


c. Antecedent variables that alter the current frequency 
of some behavior 


d. Consequent variables that alter the future frequency of 
some behavior 


Hint: (See “Distinguishing Between MOs and SP”) 


6. 


10. 
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control(s) a type of behavior because it has 
been related to the differential availability or an effective 
reinforcer for that type of behavior. 


a. Motivating operations (MOs) 

b. Unconditioned motivating operations (UMOs) 

c. Conditioned motivating operations (CMOs) 

d. Discriminative stimulus (SP) 

Hint: (See “Distinguishing Between MOs and Ss”) 
is (are) related to the differential 


reinforcing effectiveness of a particular type of 
environmental event. 


a. Motivating operations 

b. Discriminative stimulus 

c. Motivating variables 

d. Discriminative operations 

Hint: (See “Distinguishing Between MOs and SPs”) 


. This type of motivating operation has value-altering 


effects that are not learned. 

a. Conditioned motivating operations 
b. Unconditioned motivating operations 
c. Surrogate motivating operations 

d. Reflexive motivating operations 


Hint: (See “Unconditioned Motivating Operations 
(UMOs)”) 


- Which of the following is an example of a UMO as it 


relates to humans? 


a. Getting paid for a job every two weeks 


b. Putting on a sweater to go outside 
c. Sleeping after a week of not getting any sleep 


d. Playing with your children after they have been in 
school all day 


Hint: (See “Unconditioned Motivating Operations 
(UMOs)’) 


The type of motivating operation that accomplishes 
what the original motivating operation it was paired 
with did is a 

a. Surrogate CMO 

b. Transitive CMO 

c. Reflexive CMO 

d. Substantive CMO 


Hint: (See “Conditioned Motivating Operations 
(CMOs)”) 


11. 


12. 


ESSAY-TYPE QUESTIONS 


This type of motivating operation alters a relation to itself 
by acquiring MO effectiveness by preceding a worsening 
or improvement. 


a. Surrogate CMO 

b. Transitive CMO 

c. Reflexive CMO 

d. Substantive CMO 

Hint: (See “Conditioned Motivating Operations (CMOs)”) 
This type of motivating operation makes something 


else effective as a reinforcer because of its relation or 
association with an unconditioned reinforcer. 


a. Surrogate CMO 

b. Transitive CMO 

c. Reflexive CMO 

d. Substantive CMO 

Hint: (See “Conditioned Motivating Operations (CMOs)”) 


1. Define and discuss the characteristics of an establishing 


emmcoesesEEANS 


operation, or motivating operation (MO), and the recent 
movement to changing terminology. 


Hint: (See “Definition and Characteristics of Motivating 
Operations’) 


. Discuss the similarities and differences between 


motivating operations (MOs) and discriminative 
relations. 


Hint: (See “Distinguishing Between MOs and Sead 


. Define and discuss examples of unconditioned motivating 


operations (UMOs) related to the human organism. 


Hint: (See “Unconditioned Motivating Operations 
(UMOs)”) 


. Given the unconditioned motivating operation 


(UMO) of food deprivation for an organism, 
describe the reinforcer-establishing and evocative 
effect that accompanies the UMO. Suppose food iS 


NOTES 


le 


Ds 
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Skinner (1938) used the term drive to express the relationships between 
operations other than reinforcement, such as deprivation, and changes in 
behavior. Even though Skinner found the term useful to describe these 
functional relations, he acknowledged it to be an inferred state between 
these operations and behavior and noted that the term was “not actually 
required in a descriptive system” (p. 368). 

For an account of the history of the concept, see Miguel (2013). 

The direct evocative effect suggests that the MO evokes behavior directly, 
in the absence of an S?. However, we must be careful not to ignore that 


13. 


14. 


———— ae 
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This type of motivating operation has particular 
implications for the training of language in individuals 
with little or no speech. 


a. Surrogate CMO 

b. Transitive CMO 

c. Reflexive CMO 

d. Substantive CMO 

Hint: (See “Conditioned Motivating Operations (CMOs)”) 
The continued study of motivating operations is most 


important to understanding in what field of applied 
behavior analysis? 


a. The three-term contingency 

b. How people learn 

c. Why people learn 

d. How to best refine the terminology 

Hint: (See “Relevance of MOs to Applied Behavior Analysis”) 


subsequently ingested by the organism. Now describe the 
reinforcer-abolishing and abative effects. 

Hint: (See “Unconditioned Motivating Operations 
(UMOs)” and chapter examples) 


. Discuss the evocative effect on current behavior as a 


UMO and the function-altering effect on future behavior 
as punishment given the environmental variable of 
becoming too cold. 


Hint: (See “Unconditioned Motivating Operations 
(UMOs)” and chapter examples) 


. Define conditioned motivating operations (CMOs) and the 


various types of CMOs, and how each of these types operates. 
Hint: (See “Conditioned Motivating Operations (CMOs)’) 


. What are some implications for the use of motivating 


operations (MOs) in the study and future of behavior analysis? 


Hint: (See “Relevance of MOs to Applied Behavior 
Analysis’) 


Neen e eee RR 


one of the defining features of an operant is the stimuli in the presence of 
which it occurs (Catania, 2013). 


_ Michael (1986, 2004) used the terms “repertoire-altering effect” and 


“function-altering effect” to classify operations (e.g., reinforcement) that 
would change the future probability of an organism’s behavior, as well as 
alter the function of stimuli. 


_ Even though this is an example of manipulating an MO (i.e., task diffi- 


culty), the same argument applies to the manipulation of SPs. If problem 
behavior is more likely to occur in the presence of Dad (SP) and not in the 
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presence of Mom (S4), removing Dad from the room (eliminating the SP) 
would only temporarily reduce (abate) the frequency of problem behavior. 
When Dad returns, the child is likely to misbehave again. 

An individual’s quality of life may be threatened by socially impoverished 
and/or extremely aversive environments, which may lead to the develop- 
ment of attention and escape-maintained problem behaviors. In addition to 
being an ethical necessity, improving the environment would reduce current 
problem behavior and help prevent its future occurrence. 

It is true that pain reduction as a form of reinforcement is unavailable in 
the absence of pain, and also, food reinforcement is, in a sense, unavailable 
in the absence of food deprivation, but this is not the kind of unavailability 
that occurs in discrimination training and that develops the evocative effect 
of an SP (Michael, 1982). 

The terms unconditioned and conditioned modify MOs in the same way 
they modify respondent-eliciting stimuli and operant reinforcers and pun- 
ishers. The effects of unconditioned motivating operations, like uncon- 
ditioned eliciting stimuli for respondent behavior and unconditioned 
reinforcers and punishers, are not dependent on the organism’s learning 
history. The effects of conditioned motivating operations, like the effects 
of conditioned eliciting stimuli and conditioned reinforcers and punishers, 
are dependent on a learning history. 

Some of this aggression may be elicited by pain functioning as a respondent 
unconditioned stimulus (Ulrich & Azrin, 1962). 


. Fora discussion of Skinner’s approach to emotional predispositions in the 


general context of MOs, see Michael (1993, p. 197). 
Itis sometimes argued that behavioral principles are too simplistic for analyz- 
ing complex human behavior, and therefore some nonbehavioral—usually 
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cognitive—approach is required. It may be that the behavioral repertoire 
of the people who make this argument is too simple for the task. However, 
behavioral principles themselves are not so simple, as can be seen from 
the preceding effort to understand the effects of punishment MOs, or the 
learned functions of painful stimulation. 

The sound of the buzzer will also function as a respondent conditioned 
eliciting stimulus for smooth muscle and gland responses typically elic- 
ited by food in the mouth, and will condition such responses to any other 
stimulus that is present at that time—an instance of higher order classical 
conditioning (see Chapter 2)—but the current chapter discusses operant 
relations only. 

The term discriminated avoidance arose so that this type of procedure could 
be distinguished from an avoidance procedure in which an exteroceptive 
stimulus change does not precede the shock itself (called avoidance without 
a warning stimulus, non-discriminated avoidance, or free-operant avoid- 
ance). See Chapter 12. 

The evocative effect of CMO-Ts is not easily confused with the evocative 
effect of an S?. If one can see food deprivation as an MO (rather than an 
SP) with respect to the behavior that has been reinforced with food, then 
its function as an MO (rather than an S$”) with respect to behavior that has 
been reinforced with the various food-related conditioned reinforcers is an 
easy extension. 

This scenario was first described in Michael (1982). At that time CMO-T 
was called an establishing stimulus, or S". 

See Chapter 2, Box 2.2, “When the Phone Rings.” 

If removal of darkness is to be considered an unconditioned reinforcer, then 
sudden darkness would establish the value of its removal as a UMO. 


CHAPTER 17 


Stimulus Control 


LEARNING OBJECTIVES 


e Define and provide examples of stimuli and stimulus classes. 


¢ Define and provide examples of stimulus control. 


e Define and provide examples of establishing operations. 


¢ Define and provide examples of stimulus generalization and stimulus discrimination. 


e Explain how differential reinforcement is used to establish stimulus control (i.e., discrimination training procedures). 
e Explain how stimulus control and stimulus generalization are used to produce concept formation. 
¢ Describe how to use response and stimulus prompts to establish stimulus control. 


° Describe how to use response and stimulus prompt fading to transfer stimulus control to the relevant stimulus. 


n addition to selecting and maintaining operant behavior, 

reinforcement confers controlling function to antecedent 

stimuli whose presence or absence coincides with the organ- 
ism’s differential contact with it. Drivers step on the brakes 
when approaching stop signs, red traffic lights, and street- 
crossing pedestrians more often than they do in the absence 
of those stimuli. Bilingual speakers switch languages to match 
the language of their listeners. Baseball pitchers throw fast- 
balls, curves, or change-ups, depending on signals from their 
catchers. 

Control by antecedent stimuli plays a fundamental role in 
social, educational, treatment, and therapeutic environments. It 
can help us understand how behaviors considered appropriate in 
one context are inappropriate when emitted in another context. 
Loud talking is appropriate on the playground, but not in the 
classroom. Arriving 15 to 20 minutes late is appropriate for a 
party, but not for a job interview. Some behaviors considered 
inappropriate by parents, teachers, and society in general are not 
behavior problems per se. The problem is emitting behaviors at 
times, in places, or in circumstances deemed inappropriate by 
others. This represents a problem of stimulus control, and it is a 
major concern for applied behavior analysts. 

This chapter describes basic concepts and processes for 
understanding stimulus control, methods for developing stim- 
ulus control, and techniques for transferring stimulus control 
from contrived antecedents to naturally occurring stimuli. 


STIMULUS CONTROL: BASIC CONCEPTS 
AND PROCESSES 
This section describes basic concepts and processes essential to 


understanding stimulus control: stimulus discrimination, stimu- 
lus generalization, faulty stimulus control, conditional stimulus 


control, respondent eliciting and operant discriminative func- 
tions of antecedent stimuli, and discriminative and motivational 
functions of antecedent stimuli. 


Stimulus Discrimination 


In a basic laboratory demonstration of operant conditioning, a 
rat is placed in an experimental chamber and given the oppor- 
tunity to press a lever. Following each press the rat receives a 
food pellet, and the rate of lever pressing increases. This sim- 
ple demonstration of reinforcement as a two-term contingency 
(operant response — reinforcement) seldom occurs outside of 
laboratory research (Pilgrim, 2015). 

Manipulating the presence of antecedent stimuli in concert 
with the availability of reinforcement makes the contingency 
more complex and demonstrates how an organism’s behavior 
can come under control of the environmental context. For 
example, a buzzer sounds occasionally and the rat receives a 
food pellet after lever presses when the buzzer is sounding, but 
lever presses in the absence of the buzzer sound produce no food 
pellets. Under these experimental conditions, the buzzer sound 
is called a discriminative stimulus ( SP, pronounced “ess-dee’’) 
and its absence is called stimulus delta (S4, pronounced “ess- 
delta”). After some experience with this arrangement, the rat 
will press the lever more often in the presence of the buzzer 
sound than in its absence. 

This three-term contingency (discriminative stimulus > 
operant response — reinforcement) is “considered the basic unit 
of analysis in the analysis of operant behavior” (Glenn, Ellis, & 
Greenspoon, 1992, p. 1332). Behavior that occurs more often 
in the presence of an SP than in its absence is said to be under 
stimulus control. Technically, stimulus control occurs when the 
rate, latency, duration, or magnitude of a response is altered in the 
presence of an antecedent stimulus (Dinsmoor, 1995a, 1995b). 
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A stimulus acquires control only when responses emitted in the 
presence of that stimulus produce more reinforcement than do 
responses in the absence of stimulus. When the S? signals a 
rich schedule of reinforcement and the S“ signals a zero chance 
of reinforcement (i.e., extinction), the organism will eventually 
respond only in the presence of the SP and emit no responses 
in the S4 condition. In such cases, stimulus discrimination is 
achieved. 

As introduced in Chapter 2, an SP evokes behavior because 
its presence has been correlated with the differential availabil- 
ity of an effective reinforcer. That is, responses emitted in the 
presence of the S? have produced reinforcement in the past, and 
responses emitted in its absence (S4) have resulted in no rein- 
forcement (extinction) or reinforcement of lesser quality or ona 
leaner schedule than the reinforcement available in the presence 
of the SP (Michael, 2004). The development of stimulus control 
does not require an all-or-nothing face-off of reinforcement ver- 
sus extinction conditions. While taking a break from working 
on this chapter by raking leaves, one of the authors of this text 
experienced an everyday example of stimulus control created 
by an S4 condition that entailed less reinforcement than the SP 
condition. 


After raking several big piles of leaves from the front lawn 
to the curb, where the city vacuums them up for compost, 

I dropped the rake and took off my jacket. I picked up the 
rake and started on a new patch of lawn. But this time, 
each response with the same sweeping motion of the rake 
that was so efficient moments before moved fewer leaves. 
I looked down and saw I was holding the rake with its tines 
facing up (S4) instead of down (SP). From then on, I held 
the rake tines down! 


Stimulus Generalization 


In contrast to stimulus discrimination, stimulus generalization 
refers to the extent to which stimuli other than the S? acquire 
stimulus control over the behavior. Stimuli sharing simi- 
lar physical properties with the SP are most likely to acquire 
evocative function. Stimulus generalization and discrimination 
are relative concepts. Stimulus generalization reflects a loose 
degree of stimulus control, whereas stimulus discrimination 
indicates a tight degree of control. Stimulus generalization can 
be observed when a young child who has been reinforced for 
Saying “daddy” in the presence of her father says “daddy” in 
the presence of her father’s two brothers. Further conditioning 
in the form of differential reinforcement—praise or other 
reinforcers for “daddy” in the presence of the child’s father or 
photos and other likenesses of him and no reinforcement for 
“daddy” in the presence of her two uncles—will sharpen the 
degree of stimulus control to one stimulus class, the child’s 
father and likenesses of him. 

Stimulus generalization occurs with new stimuli that 
share similar physical dimensions with the controlling anteced- 
ent stimulus. For instance, if a behavior has a history of produc- 
ing reinforcement in the presence of a blue stimulus, stimulus 
generalization is more likely with a lighter or darker color of 
blue than with a red or yellow stimulus. Also, stimulus gener- 
alization is likely when the new stimuli have other elements 
(e.g., size, shape) in common with the controlling stimulus. 


A student whose behavior has produced reinforcement for 
making a response to a circle is more likely to make the same 
response to an oval shape than to a triangular shape. 

A stimulus generalization gradient graphically depicts 
the degree of stimulus generalization and discrimination by 
showing the extent to which responses reinforced in one stim- 
ulus condition are emitted in the presence of untrained stimuli. 
The slope of the gradient indicates the degree of generalization. 
When the slope of the gradient is relatively flat, little stimulus 
control is evident. However, an increasing slope of the gradient 
shows more stimulus control. The gradients are typically pre- 
sented as rate, count (i.e., total number), or percentage of total 
number across all stimuli (see Figure 17.1). 

Behavior analysts have used several procedures to pro- 
duce stimulus generalization gradients. The classic technique of 
Guttman and Kalish (1956) provides a representative example. 
Their technique is important because many prior researchers 
had obtained stimulus generalization gradients by conditioning 
groups of subjects on the same stimulus value and then test- 
ing them individually, each with a different stimulus value. 
Obviously, this type of technique cannot demonstrate the degree 
of stimulus control for individual subjects. Guttman and Kalish 
provided a method of acquiring gradients for each subject and 
laid the foundation for greater understanding of the principles 
governing stimulus control. 

Guttman and Kalish (1956) reinforced pigeons on a vari- 
able interval (VI) 1-min schedule for pecking a disk illumi- 
nated with a light source appearing yellow-green to humans 
(i.e., a wavelength of 550 my). After the rate of pecking the 
yellow-green disk had stabilized, the pigeons were tested 
under extinction conditions on the original stimulus and a ran- 
domized series of 11 different wavelengths never presented 
during training as the test for stimulus generalization. 

Stimulus generalization occurs with responses to a new 
stimulus after a response has been conditioned in the presence of 
another similar stimulus. If responding produces reinforcement 
during testing for stimulus generalization, it cannot be clear 
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1998, Journai of Applied Behavior Analysis, 31, p. 481. Reproduced with 
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whether any responses to a new stimulus after the first response 
represent generalization or if the responses are a function of the 
reinforcement schedule. Guttman and Kalish (1956) avoided this 
problem of confounding their results by testing for generaliza- 
tion during extinction. 

Lalli, Mace, Livezey, and Kates (1998) reported an excel- 
lent applied example of assessing and displaying a stimulus 
generalization gradient. They used a stimulus generalization 
gradient in an assessment of the relation between the physical 
proximity of an adult and the self-injurious behavior (SIB) of a 
10-year-old girl with severe intellectual disabilities. The results 
presented in Figure 17.1 show that the percentage of total SIB 
across sessions became progressively lower as the distance 
between the therapist and the girl increased. 


Faulty Stimulus Control 


Faulty stimulus control occurs when a behavior comes under 
the restricted control of an irrelevant antecedent stimulus. 
Instructional materials students can complete with 100% 
accuracy without attending to the critical stimulus features 
of the material are widely used in primary and secondary 
classrooms. Figure 17.2 is an example of a worksheet likely 
to engender faulty stimulus control. Students need not read 


This is very slick. 


Jim will go on the hill with his sled. 
Lucy has a big hole in her sleeve. 

It is time for me to go to sleep. 
The dog has my slipper. 
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anything but the highlighted word at the end of each sentence 
to determine which picture goes with it. Other examples 
include a science vocabulary worksheet that can be completed 
without reading the definitions—the student simply matches 
the number of letters in each term with the number of spaces 
provided next to the definitions, and a language arts activity 
in which students construct compound words by drawing lines 
between the color-matched boxes (e.g., base and ball are in 
blue boxes, bath and tub in green boxes) without reading the 
words and thinking about which ones go together (Heward, 
2003). Because students' answers on such poorly designed 
materials are under the stimulus control of irrelevant features 
(e.g., color, size, position on the page), the materials provide 
no meaningful practice with the knowledge or skills they were 
intended to teach. 

Vargas (1984) suggested that a “Yes” answer to any of 
the following questions indicates that an instructional mate- 
rial is likely to result in faulty stimulus control over students’ 
behavior: 


1. Can students use pictures or diagrams instead of text to 
complete an exercise? 


2. Does highlighting or physical layout give away answers, 
making it unnecessary to read through an assignment? 


Figure 17.2 Example of 
instructional materials likely to 
engender faulty stimulus control. 


From “What Are Your Exercises Teaching: 
An Analysis of Stimulus Control in Instruc- 
tional Materials,” by J. S. Vargas, 1984. 
In W. L. Heward, T. E. Heron, D. S. Hill, & 
J. Trap Porter (Eds.), Focus on Behavior 
Analysis in Education (p. 132). Copyright: 
Charles E. Merrill Publishing Company. 
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3. Are students able to answer questions on a passage with- 
out reading it? 

4. Do all of the problems on a page require the same pro- 
cess for solution, making it unnecessary to discriminate 
between strategies? 


5. Are the questions Jabberwocky comprehension ques- 
tions—that is, can they be answered using grammatical 
cues alone? (p. 130) 


Applied behavior analysts who work in schools should 
seek to detect and eliminate faulty stimulus control in instruc- 
tional materials as their roles permit. 


Conditional Stimulus Control 


In a simple stimulus discrimination, only one antecedent stim- 
ulus controls the response. Holding a cup under the tap on a 
water cooler and pushing the blue lever is a simple discrimi- 
nation by a person wanting cold water (see Figure 11.3). The 
blue lever is an SP that signals the availability of cold water. 
A child touching a circle from an array of different shapes when 
her teacher says, “Touch the circle” is a simple discrimination. 
Most stimulus control in everyday life is more complex than the 
simple discriminations described by the three-term contingency: 
SP>R— Sr, Obtaining reinforcement often depends on dis- 
criminating the environmental context provided by multiple 
stimuli. 

In a conditional discrimination, the response that will 
produce reinforcement in the presence of a particular stimu- 
lus depends on (is conditional on) the presence or absence of 
other stimuli. Pushing the blue (or red) lever when a friend asks, 
“Cold (or hot) water please,” is a conditional discrimination. 
The friend’s request is the conditional stimulus, and the color 
of the water tap lever is the discriminative stimulus. Conditional 
discrimination is evidenced in a child’s correct response to the 
request “Point to the circle that is the same color as your shirt.” 
The color of the child’s shirt is the conditional stimulus that 
makes the same-colored circle the discriminative stimulus. 


Respondent Eliciting and Operant Discriminative 
Functions of Antecedent Stimuli 


At first glance, stimulus control of operant behavior appears 
similar to the control of respondent behavior by a conditioned 
stimulus. Discriminative stimuli and conditioned stimuli are 
antecedent stimuli whose presence increases the frequency of 
some behavior. Applied behavior analysts, however, need to 
distinguish between the evocative function of SPs for operant 
behavior and the eliciting function of conditioned stimuli for 
respondent behavior. This is a crucial distinction for understand- 
ing antecedent control of operant behavior. 

In a typical laboratory demonstration of respondent con- 
ditioning, the experimenter presents food to a dog. The food 
is an unconditioned stimulus that elicits the unconditioned 
response—salivation. The experimenter then introduces a buzzer 
sound (a neutral stimulus). The buzzer sound does not elicit 
salivation. Then after several trials of pairing the buzzer sound 
with food delivery, the buzzer becomes a conditioned stimulus 


that elicits salivation (a conditioned response) in the absence of 
food (an unconditioned stimulus). 

Laboratory experiments on operant and respondent condi- 
tioning have demonstrated consistently that antecedent stimuli 
can acquire control over a behavior. A buzzer sounds, and a 
rat presses a bar. A buzzer sounds, and a dog salivates. Despite 
the similarities, the bar press is an operant behavior, salivation 
is a respondent behavior, and the manner in which the SP and 
the conditioned stimulus acquire their controlling functions 
is very different. An S? acquires its controlling function for 
operant behavior by being paired with stimulus changes that 
occur immediately following behavior. Conversely, a condi- 
tioned stimulus acquires its controlling function for respondent 
behavior by being paired with other antecedent stimuli that elicit 
the behavior (i.e., an unconditioned stimulus or a conditioned 
stimulus). 


Discriminative and Motivational Functions 
of Antecedent Stimuli 


Distinguishing the nature of antecedent control of operant behav- 
ior can be difficult. Discriminative stimuli and establishing oper- 
ations share two important similarities: Both occur before the 
behavior of interest and both have evocative functions. To evoke 
a behavior means to occasion it or call it up. The question often 
becomes, “Was a particular response evoked by a discriminative 
stimulus (sP ), an establishing operation (EO), or both?” 

In some situations, an antecedent stimulus change evokes 
a response and the functional relation appears to be an SP. 
For example, in a typical shock—escape procedure an animal 
is placed in an experimental chamber. Shock is administered 
until a response terminates the shock for a designated period. 
Then the shock is reintroduced until it is again terminated by a 
response, and so on. An experienced animal escapes the shock 
immediately. In such a situation, some would say that the shock 
serves as an SP. The shock, an antecedent stimulus, evokes a 
response that is negatively reinforced (the shock is removed). 
In this situation, however, shock does not function as an S?. To 
repeat an important point, an SP evokes behavior because its 
presence has been correlated with the differential availability of 
an effective reinforcer. That is, responses emitted in the presence 
of the SP have produced reinforcement in the past and responses 
emitted in its absence (SA) have not. 

Even though the animal receives reinforcement by ter- 
minating the shock, the absence of shock does not signal the 
availability of less frequent reinforcement. Before the response 
can be reinforced, the shock must be on. In this example, the 
onset of shock is an EO because it changes what functions as 
reinforcement (shock termination) rather than signaling the 
differential availability of reinforcement (Michael, 2000). Often 
the apparent SP effect does not have a history of effective dif- 
ferential reinforcement correlated with the altered frequency of 
response. These situations are probably related to motivating 
operations (MOs) rather than stimulus control. 

The following scenario puts the laboratory examples into 
an applied context: A teacher asks a student to begin working on 
an academic task. The student refuses to comply, disrupting the 
class, and the teacher withdraws the request. Later, the teacher 


again requests an academic response, and the cycle of noncom- 
pliance and disruptive behavior and removal of the request con- 
tinues. Although the teacher’s request is an antecedent stimulus 
that evokes aggression that is negatively reinforced (the request 
is removed), it is not an SP. Like the onset of shock in Michael’s 
laboratory example, the teacher’s request is an EO that evokes 
the aggressive behaviors rather than an S?. It makes no sense to 
talk about an SP evoking the aggressive behavior in the absence 
of the request (McGill, 1999), just as it makes no sense to talk 
about an SP evoking a response in the absence of the shock. The 
laboratory rat is not motivated to escape shock in the absence of 
shock, and the student does not “want” to escape his teacher’s 
task requests in the absence of those task requests. An anteced- 
ent stimulus functions as an S? only when in its presence a spe- 
cific behavior produces reinforcement, and in the absence of 
that stimulus the same behavior does not produce reinforcement 
(Michael, 2000). 

The laboratory and classroom examples can be changed 
to show the difference between the evocative functions of EOs 
and stimulus control. The experimental conditions could change 
so that a buzzer sounds at different periods throughout the ses- 
sion, the shock would be removed only when a response is made 
while the buzzer is sounding, and a response would not produce 
reinforcement in the absence of the buzzer (i.e., the shock would 
not be removed). Under these conditions the buzzer would func- 
tion as an cer and stimulus control would be demonstrated. 

The classroom conditions could change so that two teach- 
ers work with the student. One teacher terminates task requests 
when the student is noncompliant; the other teacher does not 
remove the requests. In this arrangement, the presence of the 
first teacher signals the availability of negative reinforcement; 
the second teacher signals extinction. The teacher who termi- 
nates the requests to work would become an SP that evokes 
noncompliant, disruptive behaviors from the student; the sec- 
ond teacher would become an S* for noncompliance. In these 
modified examples, the characteristics of the antecedent control 
would be different in the presence and absence of the buzzer and 
the other teacher, and the buzzer and the other teacher would be 
correlated with an increased frequency of reinforcement. 

An understanding of the SP and MO evocative functions 
will improve technological descriptions of antecedent control 
and the understanding of behavior change (Laraway, Snycerski, 
Michael, & Poling, 2001). Ultimately, these understandings will 
produce greater effectiveness in education and treatment (see 
also Chapter 16). 


DEVELOPING STIMULUS CONTROL 


This section describes the major processes required for the 
development of stimulus control. 


Stimulus Discrimination Training 


The basic procedure for stimulus discrimination training 
entails a multiple schedule (Chapter 13) with antecedent stimu- 
lus conditions representing each component schedule. Responses 
in the presence of one stimulus condition (SP) are reinforced, 
and responses in the presence of the other stimulus (S4) are not 
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reinforced. When this procedure is applied appropriately and 
consistently, responding in the presence of the SP will come to 
exceed responding in the presence of the S*. Often, over time, 
the participant will learn not to respond in the presence of the S*. 

Applied behavior analysts often describe the conven- 
tional procedures for discrimination training, with differential 
reinforcement as alternating conditions of reinforcement and 
extinction, meaning that a response produces reinforcement in 
the SP condition but not in the S* condition. However, to clarify 
and stress and important point: the S4 is used not only to show 
a condition of zero reinforcement (extinction) but also to denote 
a condition that provides a lesser amount or quality of reinforce- 
ment than the S? condition (Michael, 2004). 

Maglieri, DeLeon, Rodriguez-Catter, and Sevin (2000) 
used discrimination training as part of an intervention to 
decrease the food stealing of a 14-year-old girl with Prader-Willi 
syndrome, a serious medical condition usually correlated with 
obesity and food stealing. During discrimination training, a 
teacher showed the girl two containers of cookies. One container 
contained a warning label, the S*, and the other container had 
no warning label, the S’. The teacher told the girl that she could 
eat cookies only from the container without the warning label. 
The teacher asked the girl in the presence of the two contain- 
ers, “Which cookies can you eat?” If the girl answered that she 
could eat the cookies from the container without the warning, 
the teacher let her eat one cookie from the container. This dis- 
crimination training procedure decreased stealing food from the 
containers marked with the warning label. 

Researchers have used multiple schedules to help learners 
develop stimulus control of social approach responses (e.g., 
Cammilleri, Tiger, & Hanley, 2008; Grow, LeBlanc, & Carr, 
2010; Torelli, Lloyd, Diekman, & Wehby, 2016). Although 
teaching students to recruit teacher attention can increase 
teacher praise and instructional assistance and improve stu- 
dent academic performance (see Teach the Learner to Recruit 
Reinforcement in Chapter 30), classrooms are busy places and 
teacher attention is not continuously available. Students must 
be taught not only how to recruit teacher attention but also 
how to discriminate when recruiting is appropriate (Alber & 
Heward, 2000). 

Cammilleri et al. (2008) used a classwide multiple- 
schedule procedure to bring young children’s requests to 
teachers under stimulus control in the three primary class- 
rooms. In two to four 5-min sessions per day throughout the 
study, the teachers randomly alternated wearing a green lei or 
a red lei. During baseline, differential consequences were not 
paired with each lei. The teachers responded to a student’s social 
approach and provided academic assistance as needed. During 
the multiple-schedule condition, the green lei (SP) signaled that 
teacher attention was available, and the red lei (S4) signaled 
teacher attention was unavailable. Prior to each session during 
the multiple-schedule condition, the teacher described the con- 
tingency associated with the relevant lei: “While I am wearing 
the green lei, I will be able to answer your questions” or “While 
Iam wearing the red lei, I will not be able to answer your ques- 
tions” (p. 301). Student requests during the multiple-schedule 
condition were maintained during desirable periods and mini- 
mized during undesirable periods (see Figure 17.3). 
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Figure 17.3 Social approach rates during baseline and 
multiple-schedule conditions across three elementary 
classrooms. 

“Developing Stimulus Control of Young Children’s Requests to Teachers: 
Classwide Applications of Multiple Schedules”, A. P. Cammilleri, 

J. H. Tiger, and G. P. Hanley, 2008, Reproduced with permission of 

John Wiley & Sons Inc. 


Teaching Conditional Discriminations 


Skinner conducted early research on complex stimulus control 
with an experimental procedure called matching-to-sample. 
Dinsmoor (1995b) described Skinner’s procedure in the follow- 
ing way. A pigeon was presented with three horizontal keys to 
peck. The middle key was illuminated with a color to start the 
trial. A peck on the illuminated key turned it off and lighted two 
side keys. One of the side keys was the same color as the sample 
color illuminated on the middle key. A peck on the side key with 
the same color as the sample key produced reinforcement. Error 
responses were not reinforced. 

Skinner’s matching-to-sample procedure included the 
three-term contingency: 


Sorctiicteee> Response —————_» Reinforcement 
Side key with 
color 


Key peck Grain 


This basic contingency, however, is incomplete because 
of constraints from the environmental context. The contex- 
tual events operating on the three-term contingency become 
conditional discriminations (Sidman, 1994; Spradlin & Simon, 
2011). The color of the middle key in Skinner’s procedure is the 
conditional stimulus. The three-term contingency is effective 
only when it matches the sample stimulus. Reinforcement is 
conditional on the context of discriminative stimuli other than 
the S?; that is, the three-term contingency itself comes under 


contextual control. Conditional discriminations operate at the 
level of a four-term contingency: 


Conditional sample ——> sP _____ Response —> Reinforcement 
Color of middle key ——> Side ——-» Key peck —> Grain 

key same 

color as 

middle key 


Sean Key peck —> 


Side key different 
color than middle key 


To start a matching-to-sample trial, the participant will 
make a response (called the observing response) to present the 
sample stimulus (i.e., the conditional sample). The comparison 
stimuli (i.e., the discriminative events) are presented usually 
after removing the sample stimulus, but not always, and pro- 
vide for an effective three-term contingency and other non- 
effective three-term contingencies. One comparison stimulus 
will match the conditional sample. A response that selects the 
matched comparison and rejects the nonmatched comparisons 
will produce reinforcement. Figure 17.4 presents an example 
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Figure 17.4 An example of the observing response, the 
conditional sainple, the discriminative events, and the 
comparison match during a matching-to-sample trial. 


of the observing response, the conditional sample, the discrimi- 
native events, and the comparison match. Responses that select 
the nonmatching comparison stimuli are not reinforced. During 
conditional discrimination training, the same selection must be 
correct with one conditional stimulus, but incorrect with one 
or more other sample stimuli. Most matching-to-sample appli- 
cations use a correction procedure with error responses. One 
correction procedure has the learner respond to the same sam- 
ple and comparison stimuli until a correct response has been 
reinforced. Error correction procedures and the random posi- 
tioning of the comparison stimuli control for position respond- 
ing. Numerous examples of using the matching-to-sample 


procedure to teach conditional discriminations are described 
in Chapter 19. 


Teaching Concepts 


The preceding section on discrimination training describes how 
an antecedent stimulus can acquire evocative control over a 
response, meaning that the behavior occurs more frequently in 
the presence of the stimulus than in its absence. A discrimination 
training procedure might be used to teach a preschool student 
to name the primary colors. For instance, to teach the color red, 
the teacher could use a red object, such as a red ball, as the sp 
condition and a non-red object, such as a yellow ball, as the 
S4 condition. The teacher could position both balls randomly 
in front of the student, direct the student to name and point to 
the red ball, and reinforce correct responses, but not incorrect 
responses. After a few trials the red ball would acquire stimulus 
control over the student’s response, and the student would reli- 
ably differentiate the red ball from the yellow ball. This simple 
discrimination training, however, is unlikely to meet the instruc- 
tional objective of identifying the color red. The teacher may 
want the student to learn not only to discriminate between red 
balls and balls of different color but also the concept of redness. 
A concept is defined by a set of shared features found in each 
example of the concept (Layng, 2017). 


That is, every example of a concept shares certain must 
have features with all other examples of the concept. In 
addition to these must have features, the examples have 
other can have features, which other examples of the con- 
cept may or may not have. . . . Thus, the shared must have 
features are what define something as an example of a 
concept and do not change from example to example. The 
other nondefining features are those the example can have, 
often changing from example to example, but that don’t 
define the example as an instance of the concept. (Layng, 
2017; pp. 12) 


Terms such as concept formation or concept acquisition 
imply for many people some hypothetical construct or men- 
tal process. Yet acquiring a concept is clearly dependent on 
responses in the presence of antecedent stimuli and the conse- 
quences following those responses. When given examples and 
nonexamples of a concept not presented during instruction, a 
learner who understands the concept can: 


¢ Distinguish examples of the concept from similar nonex- 
amples that lack one or more of the defining must have 
features (discrimination). 
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¢ Identify examples of the concept across a wide range of 
varying examples that include nondefining can have fea- 
tures (stimulus generalization). (adapted from Layng, 2017) 


Concept formation is a behavioral outcome of stimulus 
generalization and discrimination (Keller & Schoenfeld, 1950; 
Zentall, Galizio, & Critchfield, 2002). Concept formation is a 
complex example of stimulus control that requires both stimu- 
lus generalization within a class of stimuli and discrimination 
between classes of stimuli. An antecedent stimulus class is a 
set of stimuli that share a common relationship. All of the stim- 
uli in an antecedent stimulus class will evoke the same operant 
response class or elicit the same respondent behavior. This evoc- 
ative or eliciting function is the only common property among 
the stimuli in the class (Cuvo, 2000). For example, consider a 
stimulus class for the concept red. A red object is called red 
because of a particular conditioning history. This conditioning 
history of differential reinforcement will evoke the response red 
to light waves of different wavelengths, from light red to dark 
red. These different shades of the color red that will evoke the 
response red share a conditioning history and are included in 
the same stimulus class. Shades of red (e.g., very light red) that 
do not evoke the response red are not members of that stimulus 
class. Therefore the concept of redness requires stimulus gener- 
alization from the trained stimuli to many other stimuli within 
the stimulus class. If the preschool student described earlier had 
acquired the concept of redness, he would be able to identify the 
red ball and, without specific training or reinforcement, choose 
a red balloon, a red toy car, a red pencil, and so on. 

In addition to stimulus generalization, a concept requires 
discrimination between members and nonmembers of the 
stimulus class. For example, the concept of redness requires 
discriminating between red and other colors and irrelevant 
stimulus dimensions such as shape or size. The concept begins 
with discrimination between the red ball and the yellow ball but 
results in discriminating a red dress from a blue dress, a red toy 
car from a white toy car, and a red pencil from a black pencil. 

Discrimination training is fundamental to teaching con- 
ceptual behavior. Antecedent stimuli representative of a group 
of stimuli sharing a common relationship (i.e., the stimulus 
class) and antecedent stimuli from other stimulus classes must 
be presented. Before a concept can be acquired, the teacher must 
present exemplars of what the concept is (i.e., the SP condi- 
tion) and what the concept is not (i.e., the S4 condition). This 
approach holds true for all conceptual development, even for 
highly abstract concepts (e.g., honesty, patriotism, justice, free- 
dom, sharing). It is also possible to acquire a concept through 
vicarious discrimination training and differential reinforcement. 
A verbal definition of a concept, with examples and nonexam- 
ples of the concept, may be sufficient for concept formation 
without additional direct training. 

Authors of children’s literature often teach concepts 
vicariously, such as good and bad, honest and dishonest, cou- 
rageous and cowardly. For example, consider the story of an 
owner of a mom-and-pop grocery store who wanted to hire a 
young person to work in the store. The job included sweeping 
the floor, bagging groceries, and keeping the shelves neat. The 
owner wanted an honest person to work for her, so she decided 
to test all applicants to see whether they were honest. The first 
young person who applied was given the opportunity to try the 


444 Part6 * Antecedent Variables 


Antecedent Stimuli —---—————> Response —----——> Reinforcement 


50% 
1/2 

Divided evenly 
5 


Half ————————> Reinforcement 


Figure 17.5 Example of an arbitrary stimulus class: 
Antecedent stimuli with different physical forms that 
evoke the same response, half. 


job before the owner made the commitment to hire him. But 
before the applicant came to work, the owner hid a dollar bill 
where she knew the young person would find it. At the end of 
the test period the owner asked the applicant how he had liked 
working in the store, whether he wanted the job, and whether 
anything surprising or unusual had happened to him. The appli- 
cant replied that he wanted the job and that nothing surprising 
had happened to him. The grocer told the first applicant that she 
wanted to consider others who had also applied. The second 
applicant worked a test period with the same results as the first 
person. He did not get the job. The third young person to work 
for the grocer was sweeping the floor, found the dollar bill, 
and took it immediately to the grocer. The third applicant said 
he turned in the dollar bill in case one of the customers or the 
grocer had dropped it. The grocer asked the applicant whether 
he liked the job and wanted to work for her. The young person 
replied that he did. The grocer told the applicant that he had the 
job because he was an honest person. The grocer also let him 
keep the dollar bill. 

The previous children’s story presents exemplars of honest 
and dishonest behavior. The honest behavior was rewarded (i.e., 
the honest person obtained the job), and the dishonest behavior 
was not rewarded (i.e., the first two applicants did not get the 
job). This story might vicariously teach a certain concept of 
honesty. 

Stimuli that make up a class may function within fea- 
ture stimulus classes and arbitrary stimulus classes (Mcllvane, 
Dube, Green, & Serna, 1993; Catania, 2013: Spradlin & Simon, 
2014). Stimuli in a feature stimulus class share common phys- 
ical forms (e.g., topographical structures) or common relative 
relations (e.g., spatial arrangements). Feature stimulus classes 
include an infinite number of stimuli and constitute a large por- 
tion of our conceptual behavior. For instance, the concept ‘dog’ 
is based on a feature stimulus class. The common physical forms 
of all dogs will be members of that stimulus class. A young 
child, through differential reinforcement, will learn to differ- 
entiate dogs from horses, cats, cows, and so on. Physical forms 
provide common relations for many feature stimulus classes, 
such as stimulus classes that evoke the responses book, table, 
house, tree, cup, cat, rug, onion, and car. A relational, or relative, 
relation exists among stimuli in other feature classes. Examples 
of these feature stimulus classes based on relative relations are 
found with concepts such as bigger than, hotter than, higher 
than, on top of, and to the left of. 

Stimuli composing an arbitrary stimulus class evoke the 
same response, but they do not share a common stimulus fea- 
ture (i.e., they do not resemble each other in physical form, nor 


do they share a relational relationship). For example, a teacher 
could form an arbitrary stimulus class using the stimuli 50%, 
1/2, divided evenly, .5, and a shaded circle (see Figure 17.5). 
Following training, each of these stimuli of different physical 
forms will evoke the same response, half. Green bean, aspara- 
gus, potato, and corn could be developed into an arbitrary stim- 
ulus class to evoke the response vegetable. English language 
students learn to associate vowels with the arbitrary class of the 
letters A, E, I, O, U, and sometimes Y. 

The development of concepts and complex verbal relations 
plays an important role in parenting, caregiving, education, and 
treatment. Applied behavior analysts need to consider different 
instructional procedures when teaching concepts and complex 
verbal relations that produce feature stimulus classes and 
arbitrary stimulus classes. A common instructional procedure 
used with feature stimulus classes is to differentially reinforce 
responses to is (S?) and is not (S4) examples of the concept. 
Broad generalization is common with feature stimulus classes; 
depending on the functioning level of the participants, a few 
trained examples may be sufficient to develop the concept. 
Stimulus generalization, however, is not a characteristic of 
arbitrary stimulus classes. Applied behavior analysts have devel- 
oped arbitrary stimulus classes by using matching-to-sample 
procedures to create stimulus equivalence among arbitrary 
stimuli (see Chapter 19). 


Using Response and Stimulus Prompts 


Prompts are supplementary antecedent stimuli used to occa- 
sion a correct response in the presence of the natural SP that 
will eventually control the behavior. Applied behavior analysts 
give response and stimulus prompts before or during the per- 
formance of a behavior. Response prompts operate directly on 
the response to cue a correct response. The three major forms 
of response prompts are verbal instructions, modeling, and 
physical guidance. Stimulus prompts operate directly on the 
antecedent task stimuli to cue a correct response in conjunction 
with the critical S°. 


Verbal Instructions 


Applied behavior analysts use functionally appropriate verbal 
instructions as supplementary response prompts. Verbal response 
prompts occur frequently in almost all training contexts in the 
forms of vocal verbal instruction (e.g., oral, telling) and nonvocal 
verbal instruction (e.g., written words, manual signs, pictures). 

Teachers often use vocal verbal instructional prompts. 
Suppose a teacher asks a student to read the sentence “Plants 


need soil, air, and water to grow.” The student reads, “Plants 
need... Plants need... Plants need... ” The teacher could 
use any number of verbal prompts to occasion the next word. 
She might say, “The next word is soil. Point to soil and say 
soil.” Or she might use a rhyming word for soil. For another 
example, Adkins and Mathews (1997) taught in-home caregiv- 
ers to use vocal verbal response prompts to improve voiding 
procedures for two adults with urinary incontinence and cog- 
nitive impairments. The in-home caregiver checked for dryness 
each hour or every | or 2 hours between 6:00 AM and 9:00 pM. 
The caregiver praised dryness, asked the adult to use the toilet, 
and provided assistance as needed when the adult was dry at 
the regularly scheduled check. This simple response prompt 
procedure (1.e., asked the adult to use the toilet), which was 
introduced following a baseline condition, produced in one of 
the adults a mean 22% reduction of grams of urine collected 
per day in wet diapers during the 2-hour prompted voiding con- 
dition, and a mean 69% reduction during the 1-hour prompt 
condition. The second adult received only the |-hour prompted 
voiding condition, which resulted in a mean 55% reduction of 
urine collected per day. 

Krantz and McClannahan (1998) and Sarokoff, Taylor, 
and Poulson (2001) used nonvocal verbal instructional response 
prompts in the form of embedded scripts to improve the sponta- 
neous social exchanges of children with autism. Examples of the 
embedded scripts in the children’s photographic activity sched- 
ules included look, watch, and let’s eat our snack. In another 
example of nonvocal verbal instructional response prompts, 
Wong, Seroka, and Ogisi (2000) developed a checklist with 
54 steps to prompt self-assessment of blood glucose level by 
a diabetic woman with memory impairments. This participant 
followed the checklist sequence and checked off each step as 
she completed it. 


Modeling 


Applied behavior analysts can demonstrate or model the 
desired behavior as a response prompt. Modeling can effec- 
tively prompt behaviors especially for learners who have 
already learned some of the component behaviors required for 
the imitation. Modeling is an easy, practical, and successful 
way for a coach to show a player an appropriate form for shoot- 
ing a basketball through a hoop when the player already can 
hold the ball, raise it over his head, and push the ball away from 
his body. Few teachers would use modeling to teach a child 
with severe disabilities to tie her shoes if she could not hold 
the laces in her hands. In addition, attending skills are import- 
ant. The learner must observe the model to enable imitation 
of the performance. Finally, modeling as a response prompt 
should be used only with learners who have already developed 
imitative skills. The use of models to develop a wide range 
of new behaviors has been demonstrated repeatedly. Chapter 
21 provides a detailed discussion of modeling, imitation, and 
observational learning. 


Physical Guidance 


Physical guidance is a response prompt applied most often 
with young children, learners with severe disabilities, and older 
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adults experiencing physical limitations. Using physical guid- 
ance, the teacher partially guides the student’s movements or 
physically guides the student throughout the entire movement 
of the response. 

Hanley, Iwata, Thompson, and Lindberg (2000) reported 
the use of physical guidance to help participants with profound 
intellectual disabilities manipulate leisure items. Conaghan, 
Singh, Moe, Landrum, and Ellis (1992) used physical guid- 
ance to prompt adults with intellectual disabilities and hearing 
impairments to use manual signs. When a participant made an 
error in sign production, the teacher physically guided the per- 
son’s hands to prompt a correct response. In another example, 
a personal trainer worked with three older adults with severe 
disabilities, osteoporosis, and arthritis. The trainer physically 
guided the participants’ arm movements whenever they did 
not begin independent pushes with a dumbbell or stopped the 
pushes before reaching their exercise criterion (K. Cooper & 
Browder, 1997). 

Physical guidance can be an effective response prompt, 
but it is more intrusive than verbal instruction and modeling. 
It requires direct physical contact between the teacher and the 
student, making precise assessment of student progress diffi- 
cult. Even though some learners require physical guidance, such 
response prompts provide little opportunity for the student to emit 
the behavior without the direct assistance of the teacher. Another 
possible problem is that some learners resist physical touch. 


Stimulus Prompts 


Applied behavior analysts frequently use movement, position, 
and redundancy of antecedent stimuli as stimulus prompts. For 
example, movement prompts can help a learner discriminate 
between a penny and a dime by pointing to, tapping, touching, 
or looking at the coin to be identified. In the coin discrimination 
task, the teacher could use a position prompt and place the cor- 
rect coin closer to the student. Redundancy prompts occur when 
one or more stimulus or response dimensions (e.g., color, Size, 
shape) are paired with the correct choice. For instance, a teacher 
might use a color mediation procedure of associating a numeral 
with a color, then link the name of a color to an answer for an 
arithmetic fact (Van Houten & Rolider, 1990). 

Practitioners must be aware that there is a risk of prompt 
dependence with the applications of response and stimulus 
prompts (e.g., Grow & LeBlanc, 2013). Prompt dependence 
results from a failure to transfer stimulus control from the 
prompt to the desired stimuli. The basic research foundations 
and procedures for the transfer of stimulus control are presented 
later in this chapter. 


Factors Affecting the Development 
of Stimulus Control 


Applied behavior analysts establish stimulus control with fre- 
quent differential reinforcement of behavior in the presence and 
absence of the S? condition. Effective differential reinforcement 
requires the consistent use of consequences that function as rein- 
forcers. Additional factors such as preattending skills, stimulus 
salience, masking, and overshadowing also will affect the devel- 
opment of stimulus control. 
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Preattending Skills 


The development of stimulus control requires certain prereq- 
uisite skills. For academic or social skills, the student should 
engage in orienting behaviors appropriate to the SPs in the 
instructional setting. Such preattending skills include looking 
at the instructional materials, looking at the teacher whenever 
a response is modeled, listening to oral instructions, and sitting 
quietly for short periods. Teachers should use direct behavioral 
interventions to specifically teach preattending skills to learners 
who have not developed these skills. Learners must emit behav- 
iors that orient their sensory receptors to the appropriate S? for 
the development of stimulus control. 


Stimulus Salience 


The salience of a stimulus (i.e., the prominence of the stimulus 
in the learner’s environment) influences attention to it and ulti- 
mately the development of stimulus control (Dinsmoor, 1995b). 
For example, Conners and colleagues (2000) included salient 
cues (e.g., specific room color, specific therapist) into the con- 
text of multielement functional analyses (see Chapter 27). Their 
results suggested that salient cues facilitated the efficiency of 
functional analyses by producing faster and clearer outcomes 
in the presence of the salient cues than in the absence of them. 

Some stimuli have more salience than others, depend- 
ing on the sensory capabilities of the individual, the past his- 
tory of reinforcement, and the context of the environment. For 
instance, a student may not attend to words written on the black- 
board because of poor vision, or to the teacher’s oral directions 
because of poor hearing, or to curriculum materials because of 
past failures to learn, or to teacher directions because of focusing 
attention on a toy in the student’s desk. 

Catania (2013) clarified the common use of the term 
salience (as we have used it here) when he stated that “salience 
isn’t a property of a stimulus; it is actually a property of the 
organism’s behavior with respect to that stimulus” (p. 140). 
We can say this another way: Although a behavior analyst can 
change various aspects (e.g., size, position, color) of a stimulus 
in an effort to increase its salience, whether or not salience is 
achieved is determined by the learner’s differential responding 
(or lack thereof) to those changes. 


Overselective Stimulus Control 


In overselective stimulus control (also called stimulus over- 
selectivity) the range of discriminative stimuli or stimulus fea- 
tures controlling behavior is extremely limited (Lovaas, Koegel, 
& Schreibman, 1979). Some children with autism or intellectual 
disabilities focus on a minute feature of an object or a person 
rather than the whole (Dube et al., 2016; Ploog, 2010). For exam- 
ple, if shown a guitar for the first time, a child might focus on the 
sound hole and not consider anything else about the instrument, 
such as its size, shape, other parts, or even the sound it makes. 
This selective stimulus control interferes with the child’s under- 
standing of what a guitar is—the totality of its parts and function. 
Overselective stimulus control hinders a child’s ability to interpret 
relevant meaning from the environment and to learn new concepts 
and academic and language skills. 


Several studies have demonstrated that teaching learners 
to emit more elaborate observing responses before making a 
selection response can improve the performance of learners in 
matching-to-sample activities (Doughty & Hopkins, 2011; Dube 
& Mcllvane, 1999; Gutowski & Stromer, 2003). The elaborated 
observing response used by Farber, Dickson, and Dube (2017) 
to improve the performance of participants with overselective 
stimulus control was described as follows: 


In typical matching procedures, the sample stimuli are 
presented first, the student makes an observing response to 
the samples, and then an array of comparison stimuli is pre- 
sented, one of which is correct in relation to the samples. 
Observing responses to sample stimuli may be classified 

as nondifferential or differential. With nondifferential 
observing responses, the response to samples is the same on 
every trial and the only discrimination required is between 
the presence versus absence of the stimuli. By contrast, 
differential observing responses (DORs) include behavioral 
requirements that verify discrimination of stimulus features 
that differ among the samples. For example, if the sample 
stimuli are pictures, one type of DOR would be to name the 
pictures aloud (e.g., Constantine & Sidman, 1975). (Farber 
et al., 2017, p. 88) 


Stimulus Blocking and Overshadowing 


Stimulus blocking and overshadowing are methods for decreas- 
ing the salience of stimuli (Dinsmoor, 1995b). In stimulus 
blocking (sometimes called masking), even though one stimulus 
has acquired stimulus control over behavior, a competing stim- 
ulus can block the evocative function of that stimulus (Didden, 
Prinsen & Sigafoos, 2000; Seraganian & vom Saal, 1969). For 
example, a student may know the answers to a teacher’s ques- 
tions, but will not respond in the presence of the peer group, 
which in this example is competition of different contingencies 
of reinforcement, not just an antecedent stimulus that makes 
it harder to “attend to” the relevant SP. In overshadowing, 
the most salient component of a compound stimulus arrange- 
ment controls responding and interferes with the acquisition of 
stimulus control by the more relevant stimulus. For example, 
Dittlinger and Lerman (2011) found that pictures in text delayed 
acquisition of sight word reading by children with developmen- 
tal disabilities, suggesting that pictures overshadowed the text 
and may have interfered with learning. Fading in the pictures 
using a least-to-most prompting hierarchy and a text-to-picture 
matching response requiring students to attend to both the text 
and the picture have proved effective against overshadowing 
(Richardson et al., 2017). 

Applied behavior analysts need to recognize that block- 
ing and overshadowing can hinder the development of stimulus 
control and apply procedures that reduce these effects. Examples 
of reducing the influence of blocking and overshadowing include 
(a) rearranging the physical environment (e.g., lowering the 
window shade, removing distractions, changing seating assign- 
ments), (b) making instructional stimuli appropriately intense 
(e.g., providing a rapid pace of instruction, many opportunities 
to respond, an appropriate level of difficulty, and opportunities 


to set goals), and (c) consistently reinforcing behavior in the 
presence of the instructionally relevant discriminative stimuli. 


TRANSFERRING STIMULUS CONTROL 


Applied behavior analysts should provide response and stim- 
ulus prompts as supplementary antecedent stimuli only during 
the acquisition phase of instruction. With the reliable occur- 
rence of behavior, applied behavior analysts need to transfer 
stimulus control from the response and stimulus prompts to the 
naturally existing stimulus. Applied behavior analysts transfer 
stimulus control by gradually fading stimuli in or out, gradually 
presenting or removing antecedent stimuli. Eventually, the nat- 
ural stimulus, a partially changed stimulus, or a new stimulus 
will evoke the response. Fading response prompts and stimulus 
prompts is used to transfer stimulus control from the prompts to 
the natural stimulus, and also to minimize the number of error 
responses occurring in the presence of the natural stimulus. For 
a review of response-prompt and stimulus-prompt fading proce- 
dures, see Cenghar, Budd, Farrell, and Fienup (2018). 


Basic Research Foundations 


Terrace’s (1963a, b) influential research on the transfer of stim- 
ulus control using fading and superimposition of stimuli pro- 
vides classic examples of transferring stimulus control. In these 
studies, Terrace taught pigeons to make red-green and vertical— 
horizontal discriminations with a minimum of errors. His use of 
techniques for gradually transferring stimulus control was called 
errorless learning. To teach a red—green discrimination, Terrace 
presented the S4 (red light) at the beginning of discrimination 
training, before the SP (green light) had stimulus control over the 
pigeon’s responses. The initial introduction of the red light was 
with low illumination and for brief time intervals. During succes- 
sive presentations of the stimuli, Terrace gradually increased the 
intensity of the red light and the duration it was illuminated until 
it differed from the green light only in hue. With this procedure, 
Terrace taught pigeons to discriminate red from green with only 
a minimum number of errors (responses to the SS). 

Terrace further demonstrated that stimulus control 
acquired with red and green lights could be transferred to verti- 
cal and horizontal lines with a minimum number of errors (ies 
responses in the presence of the S4). His procedure consisted 
of first superimposing a white vertical line on the green light 
(sP ) and a white horizontal line on the red light (SA). Then the 
pigeons were given several presentations of the two compound 
stimuli. Finally, the amplitude of the red and green lights was 
reduced gradually until only the vertical and horizontal lines 
remained as stimulus conditions. The pigeons showed almost 
perfect transfer of stimulus control from the red—green lights 
to the vertical-horizontal lines. That is, they emitted responses 
in the presence of the vertical line (SP) and seldom responded in 
the presence of the horizontal line (SA). 

Following Terrace’s work, other pioneer researchers 
(e.g., Moore & Goldiamond, 1964) produced landmark studies 
showing that the transfer of stimulus control with few incorrect 
responses was possible with human learners, which provided 
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the foundations for developing effective procedures to transfer 
stimulus control from response prompts to natural stimuli in the 
applied context. 


Transferring Stimulus Control from Contrived 
Response Prompts to Naturally Existing Stimuli 


Wolery and Gast (1984) described four procedures for transfer- 
ring stimulus control from response prompts to natural stimuli. 
They describe these procedures as most-to-least prompts, grad- 
uated guidance, least-to-most prompts, and time delay. 


Most-to-Least Prompts 


The applied behavior analyst can use most-to-least response 
prompts to transfer stimulus control from response prompts to 
the natural stimulus whenever the participant does not respond 
to the natural stimulus or makes an incorrect response. To 
apply most-to-least response prompts, the analyst physically 
guides the participant through the entire performance sequence, 
and then gradually reduces the amount of physical assistance 
provided as training progresses from trial to trial and session 
to session. Customarily, most-to-least prompting moves from 
physical guidance to visual prompts to verbal instructions, and 
finally to the natural stimulus without prompts. 


Graduated Guidance 


The analyst provides physical guidance as needed, but using 
graduated guidance she will immediately start to fade out the 
physical prompts to transfer stimulus control. Graduated guid- 
ance begins with the applied behavior analyst following the par- 
ticipant’s movements closely with her hands, but not touching 
the participant. The analyst then increases the distance of her 
hands from the participant by gradually changing the location 
of the physical prompt. For example, if the analyst used phys- 
ical guidance for a participant’s hand movement in zippering 
a coat, she might move the prompt from the hand to the wrist, 
to the elbow, to the shoulder, and then to no physical contact. 
Graduated guidance provides the opportunity for an immediate 
physical prompt as needed. 


Least-to-Most Prompts 


When transferring stimulus control using least-to-most 
response prompts, the applied behavior analyst gives the par- 
ticipant an opportunity to perform the response with the least 
amount of assistance on each trial. The participant receives 
greater degrees of assistance with each successive trial without 
a correct response. The procedure for least-to-most prompting 
requires the participant to make a correct response within a set 
time limit (e.g., 3 seconds) from the presentation of the natural 
SP. If the response does not occur within the specified time, the 
applied behavior analyst presents the natural SP and a response 
prompt of least assistance, such as a verbal response prompt. If 
after the same specified time limit (e.g., another 3 seconds) the 
participant does not make a correct response, the analyst gives 
the natural SP and another response prompt, such as a gesture. 
The participant receives partial or full physical guidance if the 
lesser prompting does not evoke a correct response. 
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Applied behavior analysts using the least-to-most response 
prompt procedure present the natural S? with the same time limit 
for an unprompted correct response each trial. For example, 
Heckaman, Alber, Hooper, and Heward (1998) used instructions, 
nonspecific verbal prompts, modeling, and physical prompting 
in a least-to-most 5-sec response prompt hierarchy with four 
students with autism. 

Based on the results of two studies in which they com- 
pared the effectiveness of most-to-least and least-to-most 
prompting on the acquisition of solitary play skills by children 
with autism, and previous research, Libby, Weiss, Bancroft, 
and Ahearn (2008) recommended the following guidelines for 
selecting prompting techniques: 


* Most-to-least should be the default prompting technique 
when a child’s learning history is unknown. 

¢ If errors have been found to evoke problem behavior or 
impede learning, most-to-least or most-to-least with time 
delay is preferable. (Time delay is described in the fol- 
lowing section.) 

¢ Least-to-most may be preferable for students who have 

shown rapid skills acquisition with this technique. 

In all cases, progress should be monitored to ensure errors 

do not impede learning. (adapted from Libby et al., 2008) 


Time Delay 


Touchette (1971) discovered that inserting small increments 
of time between presentation of the instructional stimulus 
and a response prompt across trials resulted in the learner 
“anticipating” the correct response. A time delay procedure 
begins with simultaneous presentation of the target stimulus 
(e.g., picture of a dog) and response prompt (e.g., teacher says, 
“dog”). After the student has responded correctly for several 
trials, the teacher inserts a delay between the instructional 
stimulus and the response prompt until the student emits the 
unprompted correct response. Time delay is considered an 
“errorless learning” technique because students often make few 
or no errors as control of responding shifts from the contrived 
prompt to the instructional stimulus. 

Two variations of time delay are progressive time delay 
and constant time delay. A progressive time delay procedure 
starts with simultaneous presentation of the natural stimulus 
and the response prompt (i.e., 0-sec delay). Usually, a teacher 
will use several 0-sec trials before extending the time delay. The 
number of 0-sec trials will depend on the task difficulty and 
the functioning level of the participant. Following the simulta- 
neous presentations, the teacher gradually and systematically 
extends the time delay, often in 1-sec intervals. With constant 
time delay, after the student has responded correctly to several 
0-sec delay trials, presentation of the response prompt follows 
the instructional stimulus by a predetermined and fixed delay 
(usually 3 or 4 seconds) for all subsequent trials. 

Time delay has been used to teach a variety of skills, such 
as sight word reading (Knight et al., 2003); spelling (Stevens & 
Schuster, 1987); numeral identification (Ault et al., 1988); and 
chained tasks such as food preparation, purchasing, vocational, 
and leisure skills (Dogoe & Banda, 2009). Time delay has been 
used effectively by peer tutors (Hughes & Fredrick, 2006) and 
parents (Dipipi-Hoy & Jitendra, 2004). Heckaman et al. (1998) 


reported that implementing progressive time delay immedi- 
ately reduced rates of disruptive behavior associated with a task 
previously taught with least-to-most prompting procedure. For 
reviews of research on time delay, see Browder, Ahlgrim-Delzell, 
Spooner, Mims, and Baker (2009) and Dogoe and Banda (2009). 


Transferring of Stimulus Control with Fading 
Techniques 


The preceding sections focused on response prompts that do not 
change the task stimuli or materials. The stimulus control shap- 
ing procedures presented here modify the task stimuli or mate- 
rials gradually and systematically to prompt correct responses. 
Supplementary stimulus conditions are faded in or faded out to 
transfer stimulus control from the stimulus prompt to the natural 
stimulus. Stimulus control shaping can be accomplished with 
stimulus fading and stimulus shape transformations (MclIlvane 
& Dube, 1992; Sidman & Stoddard, 1967). 


Stimulus Fading 


Stimulus fading involves highlighting a physical dimension 
(e.g., color, size, position) of a stimulus to increase the like- 
lihood of a correct response. The highlighted or exaggerated 
dimension is faded gradually in or out. The following examples 
of (a) writing the uppercase letter A and (b) writing the answer 
9 to an arithmetic problem are illustrations of stimulus fading. 
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Krantz and McClannahan (1998) faded out scripts (i.e., the 
words Look and Watch me) embedded in photographic activity 
schedules. The embedded scripts prompted the social exchanges 
of children with autism. The words Look and Watch me were 
printed with 72-point font and bold letters on white 9-cm note 
cards. Krantz and McClannahan began fading out the words 
by removing one third of the script card, then another third. 
Sometimes, during the script fading, portions of letters were 
still shown on the card, such as part of an o in Look. Finally, the 
scripts and cards were removed. 

The treatment of a feeding disorder reported by Patel, 
Piazza, Kelly, Ochsner, and Santana (2001) provides an example 
of fading stimuli in and out. Severe food selectivity is a com- 
mon problem for children with feeding disorders. For example, 
some children find highly textured foods aversive because they 
evoke a gagging response. Patel and colleagues faded in Carna- 
tion Instant Breakfast (CIB) and then milk to water in treating a 
6-year-old boy with pervasive developmental feeding disorder. 
The boy would drink small amounts of water. The researchers 
began the fading in procedure by adding 20% of the CIB packet 
to 240 ml of water. Following three sessions of drinking the 20% 
mixture, more CIB was gradually added to the water, initially 
by 5% and later 10% increments. The researchers then faded in 
milk to the CIB and water mixture after the boy would drink one 
CIB packet with 240 ml of water. Milk was gradually added to 


the CIB/water mixture in 10% increments as the water was faded 
out (e.g., 10% milk and 90% water, plus one packet of CIB; then 
20% milk and 80% water). 

Applied behavior analysts have used the superimposition 
of stimuli with stimulus fading. In one instance, the transfer 
of stimulus control occurs when one stimulus is faded out; in 
another application, one stimulus is faded in as the other stimulus 
is faded out. The research by Terrace (1963a, b) demonstrating 
the transfer of stimulus control from a red—green discrimination 
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to a vertical—-horizontal discrimination shows the superimpo- 
sition of two specific classes of stimuli and the fading out of 
one stimulus class. The lines were superimposed on the colored 
lights; then the lights were gradually faded out, leaving only 
the vertical and horizontal lines as the discriminative stimuli. 
Figure 17.6 provides an applied example of the superimposi- 
tion and stimulus fading procedures used by Terrace. The figure 


shows a series of steps from an arithmetic program to teach 
a 


{- 


aH 


so 


FR 


7-2= 


MARK 


Lae 


Figure 17.6 Illustration of two classes of superimposed stimuli with one class then faded out. 


From Addition and Subtraction Math Program with Stimulus Shaping and Stimulus Fading, by T. Johnson, 1973, 
unpublished project, Ohio Department of Education. Reprinted by permission. 
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The other frequently used procedure fades in the natural 
stimulus and fades out the stimulus prompt. Figure 17.7 illus- 
trates this superimposition procedure, in which the prompt is 
faded out and the natural stimulus8 +5 = _is faded in. 


Stimulus Shape Transformations 


The procedure for stimulus shape transformations uses an initial 
stimulus shape that will prompt a correct response. That initial 
shape is then gradually changed to form the natural stimulus, 
while maintaining correct responding. The shape of the stim- 
ulus prompt must change gradually so that the student contin- 
ues to respond correctly. In teaching word identification, using 


Figure 17.7 Illustration of superimposition and stimulus 
fading to fade in the natural stimulus and fade out the 
stimulus prompt. 

From Addition and Subtraction Math Program with Stimulus Shaping and 
Stimulus Fading, by T. Johnson, 1973, unpublished project, Ohio Depart- 
ment of Education. Reprinted by permission. 


stimulus shape transformations could include the following 
steps (Johnson, 1973): 


Bp of Cav ¢.4,r car 


In summary, a variety of procedures exist for transferring 
stimulus control from response and stimulus prompts to natural 
stimuli. Although fading, superimposition, and stimulus shaping 
procedures are often successful in transferring stimulus con- 
trol, preparing the necessary stimulus materials for some tasks 
requires considerable time and technical skill, and may be costly. 
Practitioners should always consider the most effective and 
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efficient approach to changing behavior. A study related to binge 
drinking by college students is informative in this regard. Metz, 
Kohn, Schultz, and Bettencourt (2017) compared the effec- 
tiveness of three training procedures (stimulus fading, super- 
imposition, and verbal feedback) on college students’ pouring 
standard, 12-ounce servings of beer. Four of five participants in 
the stimulus fading group reached criterion (poured within 10% 
deviation of 12 oz) without additional training; three of four 
participants in the superimposition group required and received 


SEROUS SESE 


cUMnRey ———. 


Stimulus Control: Basic Concepts and Processes 


1. Behavior that occurs more often in the presence of 
a stimulus (discriminative stimulus, S”) than in its 
absence (stimulus delta, Se) is said to be under stimulus 
control. 


2. When an S? signals a rich schedule of reinforcement and 
a corresponding S‘ signals a zero chance of reinforcement 
(i.e., extinction), the organism will eventually respond 
only in the presence of the SP. In such cases, stimulus 
discrimination is complete. 


3. Stimulus generalization refers to the extent to which 
stimuli other than the SP acquire stimulus control 
over the behavior. Stimuli sharing similar physical 
properties with the SP are most likely to acquire 
evocative function. 


4. A stimulus generalization gradient graphically depicts 
the degree of stimulus generalization and discrimination 
by showing the extent to which responses reinforced in 
one stimulus condition are emitted in the presence of 
untrained stimuli. 


5. Stimulus generalization reflects a loose degree of stimulus 
control, whereas discrimination indicates a relatively tight 
degree of stimulus control. 


6. In a conditional discrimination, the response that will 
produce reinforcement in the presence of a particular 
stimulus depends on (is conditional on) the presence or 
absence of other stimuli. 


7. An SP acquires its controlling function for operant 
behavior by being paired with stimulus changes that 
occur immediately following behavior. Conversely, a 
conditioned stimulus acquires its controlling function 
for respondent behavior by being paired with other 
antecedent stimuli that elicit the behavior (1.e., an 
unconditioned stimulus or conditioned stimulus). 


8. Distinguishing the evocative functions of discriminative 
stimuli and establishing operations can be difficult. An S? 
evokes behavior because its presence has been correlated 
with the differential availability of an effective reinforcer. 
EO evokes behavior because it changes what functions as 
reinforcement rather than signaling the differential avail- 
ability of reinforcement. 
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additional stimulus fading training. But three of four partic- 
ipants in the verbal-feedback-only group achieved criterion 
on their first set of posttraining pours. The authors concluded 
that “verbal feedback may be a relatively easy and effective 
method if used with actual pouring” (p. 29). We believe prac- 
titioners should keep in mind this adage: Just because you can 
do something doesn’t mean you should, and always consider 
the simplest, most efficient methods of behavior change before 
designing and implementing a complex intervention. 


Developing Stimulus Control 


9. Stimulus discrimination training entails reinforcing 
responses in the presence of the SP and not reinforcing 
responses in the presence of the so 


10. Behavior analysts often teach and study conditional discrim- 
inations with a matching-to-sample procedure entailing a 
four-term contingency of observing response, the conditional 
sample, the discriminative events, and the comparison match. 


11. A concept is defined by a set of shared features found in 
each example of the concept. 


12. Concept formation is a complex example of stimulus control 
that requires both stimulus generalization within a class of 
stimuli and discrimination between stimulus classes. 


13. An antecedent stimulus class is a set of stimuli that share 
a common relationship. All of the stimuli in the class will 
evoke the same operant response class, or elicit the same 
response in the case of respondent behavior. 


14. Stimuli in a feature stimulus class share common physical 
forms (e.g., topographical structures) or common relative 
relations (e.g., spatial arrangements). 


15. Stimuli composing an arbitrary stimulus class evoke the 
same response, but they do not share a common stimulus 
feature (i.e., they do not resemble each other in physical 
form, nor do they share a relational relationship). 


16. Prompts are supplementary antecedent stimuli used to 
occasion a correct response in the presence of an SP that 
will eventually control the behavior. 


17. Response prompts (e.g., verbal instructions, modeling, 
physical guidance) operate directly on the response. 
Stimulus prompts (e.g., movement, position, redundancy) 
operate directly on antecedent task stimuli to cue a correct 
response in conjunction with the critical SP. 


18. Applied behavior analysts establish stimulus control with 
frequent differential reinforcement of behavior in the pres- 
ence and absence of the S? condition. Effective differential 
reinforcement requires the consistent use of consequences 
that function as reinforcers. Additional factors such as pre- 
attending skills, stimulus salience, overselective stimulus 
control, stimulus blocking, and overshadowing will also 
affect the development of stimulus control. 
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Transferring Stimulus Control 23. With least-to-most prompting, the learner is given the 
opportunity to perform the response with the least amount 
of assistance on each trial. The teacher provides increas- 
ingly greater degrees of assistance with each successive 
trial, as needed. 


19. Terrace (1963a, b) conducted influential basic research 
using fading and superimposition of stimuli to transfer 
stimulus control. 


20. Fading response prompts and stimulus prompts can 24. Time delay procedures begin with simultaneous presen- 
transfer stimulus control from the prompts to the natural tation of the target stimulus and response prompt. After 
stimulus and minimize errors by the learner. the student has responded correctly for several trials, the 


teacher inserts a delay between the instructional stimulus 


21. In the most-to-least response prompting procedure, the 
and the response prompt. 


analyst physically guides the participant through the 


entire performance sequence and then gradually reduces 25. Stimulus fading involves exaggerating a dimension 
the amount of physical assistance provided as training of a stimulus to increase the likelihood of a correct 
progresses from trial to trial and session to session. response. The exaggerated dimension is faded gradually 
22. Graduated guidance begins with the applied behavior ogo 
analyst following the participant’s movements closely with 26. Stimulus shape transformations begin with an initial 
her hands, but not touching the participant. The analyst then stimulus shape that will prompt a correct response. The 
increases the distance of her hands from the participant by initial shape is then gradually changed to form the natural 
gradually changing the location of the physical prompt. stimulus, while maintaining correct responding. 
KEY TERMS 
antecedent stimulus class least-to-most response prompts stimulus delta (S4) 
arbitrary stimulus class most-to-least response prompts stimulus discrimination 
concept overselective stimulus control stimulus discrimination training 
concept discrimination overshadowing stimulus fading 
constant time delay progressive time delay stimulus generalization 
discriminative stimulus (S?) response prompts stimulus generalization gradient 
errorless learning stimulus blocking stimulus prompts 
feature stimulus class stimulus control time delay 
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MULTIPLE-CHOICE QUESTIONS ae 


1. When a group of stimuli all evoke the same response (for c. A picture of a chair with no arms, a chair with arms, 
example, when one sees a picture of a Border collie, a and a rocking chair 
Doberman pinscher, and a Labrador retriever, one says 


‘ ainaniies d. A picture of a bus, a car, and a motorcycle 
“that’s a dog’’), the group of stimuli is referred to as: 


Hint: (See “Developing Stimulus Control”) 

a. A response class 

3. Which of the following illustrates an example of stimulus 
control? 


b. A stimulus class 
c. An arbitrary class . : 
a. Joanne occasionally runs the stop sign near her house 


d. A response class because there are rarely any cars at the intersection 


Hint: (See “Developing Stimulus Control”) b. Frank flaps his hands almost constantly all day long. 


2. Which of the following is an example of an arbitrary c. Joe hits his teacher every time and only when she asks 
stimulus class? him to wash his hands. 
a. The following women’s restroom signs: the word woman, d. Molly uses a Kleenex to wipe her nose when it is run- 
a pictogram of a woman, and a painting of a mermaid ning and often when it is not running. It is a bad habit 
b. A picture of a brick house, a stick built house, and she has. 


a log cabin Hint: (See “Introduction to the chapter”) 


4. Which of the following statements is true? 


a. When a dog has been trained to salivate when a bell 


rings by repeatedly pairing meat powder and a bell, this 
1s an example of stimulus control. 


b. When an individual has been trained to say the word 
“red” when he is shown a card that says RED (but not 
when he has been shown a card that says GREEN), 
this is an example of stimulus control. 


c. Antecedent stimuli rarely acquire control over human 
behavior. 


d. The first two statements are true. 
Hint: (See “Respondent Eliciting and Operant Discrimina- 


tive Functions of Antecedent Stimuli’’) 


. A teacher makes a request of a student. The student emits 
aggressive behavior immediately following the request. 
The teacher removes the request in order to avoid getting 
hit. The teacher’s request: 


a. Was clearly inappropriate 

b. Is a discriminative stimulus for the problem behavior 
c. Is a motivating operation for the problem behavior 

d. Is a stimulus delta for problem behavior 

Hint: (See “Respondent Eliciting and Operant Discrimina- 
tive Functions of Antecedent Stimuli’’) 

. Motivating operations and discriminative stimuli: 

a. Have similar evocative effects on behavior 

b. Are considered synonymous 

c. Occur after the behavior of interest 

d. All of these 

Hint: (See “Discriminative and Motivational Functions of 


Antecedent Stimuli’) 


. Amanda, a typically developing two-year-old girl, has a 
father who is in the Marines. The last time her father was 
home, he was in uniform. Now, every time she sees a man 
in uniform, she says, “Daddy!” This is an example of: 


a. Stimulus control 

b. Stimulus equivalence 
c. A response class 

d. Stimulus generalization 


Hint: (See “Stimulus Generalization’) 


. Stimulus generalization is more likely to occur: 


a. When stimuli share more similar properties than when 
stimuli are very different from one another. 


b. When the practitioner tightly controls the training 
procedures and stimuli used during training (.e., using 
few different stimuli during training). 


c. If time delay is used as part of the training procedure. 
d. If reflexivity is observed. 


Hint: (See “Stimulus Generalization’) 


9: 


10. 


11. 


12. 


13. 
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Stimulus discrimination is acquired by: 


a. Reinforcing responses in the presence of a stimulus 
delta and withholding reinforcement in the presence of 
the discriminative stimulus. 


b. Reinforcing responses in the presence of both discrimi- 
native stimuli and stimulus deltas. 


c. Reinforcing responses in the presence of the discrim- 
inative stimulus and withholding reinforcement in the 
presence of the stimulus deltas. 


d. Thinning reinforcement so that no reinforcement is 
required for any antecedent stimuli. 


Hint: (See “Developing Stimulus Control’) 
Stimulus discrimination involves the behavioral 
principles of: 

a. Prompts and extinction 

b. Reinforcement and extinction 

c. Form and function of behavior 

d. Concepts and stimulus equivalence 

Hint: (See “Developing Stimulus Control”) 

In order to form the concept of “vehicles,” an individual 
must be able to: 

a. Have an IQ of at least 100 

b. Have a strong grasp of concrete operations first 


c. Discriminate vehicles from nonvehicles and generalize 
across all motorized methods of transporting people 


d. Discriminate vehicles from nonvehicles 
Hint: (See “Developing Stimulus Control”) 
Which of the following is a concept that can be taught 


using stimulus discrimination and generalization 
procedures? 


a. Fruit 

b. Under 

c. Integrity 

d. All of these 

Hint: (See “Developing Stimulus Control”) 

The difference between a response prompt and a stimulus 
prompt 1s: 

a. A response prompt operates directly on the response, 


while a stimulus prompt operates on the antecedent 
task stimuli. 
b. Stimulus prompts are easier to fade than response 
prompts. 
A response prompt involves changing the required 
response, while a stimulus prompt requires changing 
the antecedent task stimuli. 


Cc 


° 


d. Response prompts are verbal, while stimulus prompts 
are nonverbal. 


Hint: (See “Using Response and Stimulus Prompts”) 
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14. 


15. 


Rahul keeps forgetting to push the Total button on the cash 
register at his place of employment. His job coach wants to 
insert a prompt to help him remember to total customers’ 
orders. She wants to use a stimulus prompt. Which of the 
following is an example of a stimulus prompt? 


a. When Rahul reaches the end of the order, the job coach 
gives him a nudge on the elbow to get his hand moving 
toward the Total button. 


b. When Rahul reaches the end of the order, the job coach 
says, “Don’t forget to push Total.” 


c. The job coach places a sign on the cash register that 
says “Don’t forget to push Total.” 


d. The job coach places a bright pink sticker on the 
total button. 


Hint: (See “Using Response and Stimulus Prompts”’) 


Stimulus and response prompt fading is used to: 


a. Transfer stimulus control from the prompt to the natural 
antecedent cue. 


seman cnoremmennocunenecrsans 


ESSAY-TYPE QUESTIONS 


1. 


Explain the basic teaching procedure for helping develop a 
stimulus class. 


Hint: (See “Developing Stimulus Control’) 


. By what mechanisms do antecedents acquire control over 


respondent and operant behaviors? 


Hint: (See “Respondent Eliciting and Operant Discrimina- 
tive Functions of Antecedent Stimuli”) 


. Brian hits his teacher when he is asked to brush his 


teeth. However, he never hits his mother when she 
asks him to brush his teeth. His mother used to have 
this problem (of Brian hitting her), but she ignored 

it and made him continue brushing his teeth. His 
teacher, however, gets frustrated when Brian hits 

her and usually lets him out of the toothbrushing 

task (because she thinks it is not her job to get beaten 
up over doing a task that his parents should have 
responsibility for anyway). In this scenario, what is 
the discriminative stimulus and what is the motivating 
operation for Brian’s hitting? 

Hint: (See “Discriminative and Motivational Functions of 
Antecedent Stimuli’’) 


16. 


b. Thin the reinforcement schedule for responding 
correctly. 


c. Establish stimulus equivalence. 
d. Promote stimulus generalization. 


Hint: (See “Transferring Stimulus Control’) 


Most-to-least prompt fading involves: 


a. Gradually decreasing the time between the presence 
of the antecedent cue to the prompt from most time to 
least time. 

b. Gradually changing the form or intensity of the prompt 
from most intense to least intense. 

c. Gradually decreasing exaggerated dimensions of the 
stimulus prompt. 

d. Gradually changing the shape of the prompt to look 
more like the natural antecedent cue. 


Hint: (See “Transferring Stimulus Control’) 


. Compare and contrast stimulus generalization and stimulus 


discrimination. 


Hint: (See “Stimulus Generalization’) 


. Explain (or use diagrams to illustrate) how you could teach 


an individual to say “ball” in the presence of balls, but not 
in the presence of other stimuli (say bats, blocks, paper). 


Hint: (See “Developing Stimulus Control!’”) 


. Give an example of a concept and delineate what stimuli 


a learner might need to generalize across and what stimuli 
the learner might need to discriminate. 


Hint: (See “Developing Stimulus Control’’) 


. Swapna often forgets to pick up her dirty clothes in the 


morning and to put them in the clothes hamper. Give an 
example of each type of response prompt you could use 
to get her to put her clothes in the hamper. Also, give an 
example of how you could use stimulus prompts to help her. 


Hint: (See “Using Response and Stimulus Prompts”) 


. Give an example of how you could use stimulus fading to 


help a learner draw a circle. 


Hint: (See “Transferring Stimulus Control”) 


PAR T 


Verbal Behavior 


Chapter 18 is devoted to a distinguishing feature of the human behavioral repertoire. Verbal behavior 
is the principal phenomenon that makes humans interesting and the vehicle with which we express 
that interest. Verbal behavior makes progress possible from one generation to the next and enhances 
the development of sciences, technologies, literatures, and the arts. Building on Skinner’s (1957) 
conceptual analysis, Mark Sundberg presents verbal behavior within the context of typical human 
development, emphasizing the form and function of language, the defining characteristics of verbal 
behavior, the distinction between topography-based and selection-based verbal behavior, elementary 
verbal operants and listener discriminations, private events, generative verbal learning, and 
assessment and intervention programs for children with developmental disabilities. 
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Verbal Behavior 


Mark L. Sundberg 


LEARNING OBJECTIVES 


¢ Differentiate between formal properties and functional properties of language. 


¢ Define verbal behavior. 
e Define a verbal operant. 


e Explain why Skinner suggested that the listener's role is less significant than typically assumed. ; 
e Using a given example, determine the classification of verbal operants and explain how these terms can be used in 


the analysis of complex verbal behavior. 


¢ Identify and discuss three or more functional units of verbal behavior that contribute to our understanding of 


multiple control. 
e Define an autoclitic relation. 


Identify and discuss how viewing language as a learned behavior involving a social interaction between speakers and 


listeners, with the verbal operants as the basic units, changes how clinicians and researchers approach and ameliorate 


human problems related to language. 


Jason was a 5-year-old boy diagnosed with autism. He 
could not talk, echo sounds, or imitate actions. He often 
screamed to get his needs and wants known. The imme- 
diate priorities were to teach Jason to communicate and 
to reduce his screaming behavior. Jason liked a variety of 
foods and toys and was taught to ask for them using sign 
language. He learned his first prompt-free sign (“apple”) 
within 10 minutes of intensive teaching, and his second 
sign (“Play-Doh”) about an hour later. In addition, his 
screaming behavior was reduced to a near-zero level 
during that first teaching session. After 4 months of inter- 
vention, Jason acquired 43 signs, learned to echo and 
imitate others, and began to speak. Within 2 years Jason 
could vocally identify over 300 items and actions, and 
his problem behaviors were minimal. The intervention 
program used with Jason was based on applied behavior 
analysis and Skinner’s (1957) analysis of verbal 
behavior. 


his chapter contains two major sections. The first 

describes the core elements of Skinner’s (1957) analysis 

of verbal behavior, and the second describes some appli- 
cations of that analysis to language assessment and intervention 
for persons with autism or other intellectual disabilities. 


™The author thanks Dave Palmer, Bob Ryan, Hank Schlinger, Carl Sundberg, 
and Cindy Sundberg for their comments on an earlier version of this chapter. 
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SKINNER’S (1957) ANALYSIS 
OF VERBAL BEHAVIOR 


B. F. Skinner spent over two decades developing his analysis of 
language. He began the project a few years after graduating from 
Harvard University in 1931 and finished it in 1957. His main 
premise was that language constitutes learned behavior acquired 
and maintained by environmental contingencies. Reflecting on 
his work on language, Skinner (1978) wrote “Verbal behav- 
ior... will, I believe, prove to be . . . my most important 
work” (p. 122). Skinner considered language an important 
topic because verbal behavior is central to many critical aspects 
of human behavior (e.g., language acquisition, relationships, 
intelligence, academics). 

It has been said that Verbal Behavior (1957) is Skinner’s 
most difficult book to understand. Michael (1984) noted that 
a reader must first understand the basic concepts and princi- 
ples of behavior analysis before taking on Verbal Behavior. 
However, a more problematic obstacle noted by Michael was the 
complexity of the subject matter being analyzed. Linguists have 
tangled with understanding human communication for centuries. 
Skinner’s work can contribute to their efforts in many ways, 
beginning with a behavioral distinction between linguistic form 
and linguistic function. 


Form and Function of Language 


The formal properties of language involve the topography of a 
verbal response, and the functional properties involve the causes 
of that response. For example, a child’s cry could be the form 
of verbal response, and why she cried would be the function. 


Skinner’s (1957) account of language considers both of these 
elements but differs from traditional treatments in that he provides 
a behavioral analysis of the functional properties. He identifies 
the distinction between form and function early in his book: 


Our first responsibility is simple description: what is the 
topography of this subdivision of human behavior? Once 
that question has been answered in at least a preliminary 
fashion we may advance to the stage called explana- 
tion: what conditions are relevant to the occurrences of 
the behavior—what are the variables of which it is a 
function? (p. 10). 


Form 


To account for both form and function, two separate but inter- 
locking classification systems are necessary. The formal proper- 
ties of language can be classified by a description of the verbal 
utterances emitted (e.g., speech, sign language, text, icons). For 
example, with speech it is common to measure the topography 
of what was said by (1) phonemes: the individual speech sounds 
that compose a word; (2) morphemes: the units with an individual 
piece of meaning; (3) lexicon: the total collection of words that 
make up a given language; (4) syntax: the organization of words, 
phrases, or clauses in sentences; and (5) grammar: the adher- 
ence to established conventions of a given language. The formal 
description of language also involves the classification of words 
by their grammatical forms or parts of speech, specifically, nouns, 
verbs, adjectives, adverbs, prepositions, determiners, conjunc- 
tions, and interjections (Barry, 1998; Brinton & Brinton, 2010). 
The traditional classification system is based on vocal response 
forms; thus certain adjustments are necessary for accommodating 
nonvocal response forms. For example, sign language involves 
cheremes rather than phonemes (Stokoe, Casterline, & Crone- 
berg, 1965), and selection-based icon systems (¢.g., iPad commu- 
nication devices) have no clear parallel to phonemes. 


Function 


The second type of classification system involves the different 
functions that a word may have. For example, the same word 
(e.g., book) can have different meanings (e.g., something to read 
versus making a hotel reservation). Skinner suggests that the 
meaning of a word is not to be found in the form of a response, 
a cognitive processing system, a physical referent, struc- 
tural regularity, or a person’s genetic endowment, but rather 
in the immediate and historical environmental contingencies 
that evoke and consequate speaker and listener behavior. The 
primary purpose of his book Verbal Behavior was to identify 
those environmental contingencies. 


Defining Verbal Behavior 


Identifying exactly what constitutes language has eluded linguists 
for centuries. Skinner (1957) argued that what is commonly 
referred to as communication and language is learned speaker 
and listener behavior that is acquired, maintained, and extended 
by the same types of environmental variables that control non- 
verbal behavior (e.g., reinforcement, stimulus control, motivating 
operations). Skinner preferred the term verbal behavior over the 
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term language because verbal behavior emphasizes the indi- 
vidual speakers and listeners rather than the practices of whole 
verbal communities (e.g., the English language). 

In defining verbal behavior, Skinner (1957) placed the 
focus on an analysis of the immediate and historical contingen- 
cies that affect speakers and listeners. Accordingly, he defined 
verbal behavior “as behavior reinforced through the media- 
tion of other persons” (p. 2), but these other persons “must be 
responding in ways which have been conditioned precisely in 
order to reinforce the behavior of the speaker” (p. 225). For 
example, if a parent asks a child, “Can you let the cat out?” the 
bekavior of the child (listener) is necessary to produce an open 
door; thus, the receipt of the reinforcer is mediated by the behav- 
ior of the child (the listener). But the child must have a training 
history to successfully mediate reinforcement in this way (e.g., 
the child understands his! parent’s words). Nonverbal behavior 
could produce the same reinforcer by the parent opening the 
door herself, but the reinforcement for this behavior is not medi- 
ated through another person. 

Skinner’s definition of verbal behavior was not meant to 
solve the linguist’s query regarding the beginnings of “true” 
language (e.g., Hayes, Barnes-Holmes, & Roche, 2001; Hockett, 
1960; Malott, 2003; Normand, 2009; Palmer, 2004). Michael 
(in Michael & Malott, 2003) noted that “Skinner’s definition of 
verbal behavior . . . is only intended to identify [that] the topic of 
his consideration in the book on verbal behavior . . . can be nar- 
rowed to what is traditionally recognized as the verbal field” (p. 
116). By treating language as behavior emitted by speakers and 
listeners, the focus of the analysis and its applications is on the 
contingencies responsible for that behavior. 


Speaker and Listener 


The behaviors of a speaker and a listener are controlled by dif- 
ferent but interlocking contingencies. These contingencies and 
the behaviors they establish and evoke are simple at first, but 
rapidly grow to enormous complexity. Skinner ( 1957) provides 
separate accounts of speaker and listener behavior and terms 
their interactions verbal episodes (p. 38). In a verbal episode, 
a speaker emits any type of verbal response in any form 
(speech, sign language, eye contact), and a listener (1) serves 
as an audience for a speaker, (2) provides reinforcement for 
a speaker, and (3) responds in specific ways to the speaker’s 
verbal behavior. The roles of speaker and listener alternate in 
an exchange, and usually involve covert speaker and listener 
behavior as well. These speaker and listener repertoires will be 
defined, exemplified, and elaborated on throughout this chapter. 
However, it is valuable to first make a distinction between 
verbal responses based on response form and those based on 
stimulus selection. 


Topography-based and Selection-based 
Verbal Behavior 


Verbal behavior is not restricted to spoken words. Any form 
of operant behavior can acquire a verbal function (e.g., sign 
language, fingerspelling, icon selection, Braille, facial expres- 
sions, intonation). Skinner’s broader view of linguistic response 
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forms expands the options for nonvocal persons in need of 
augmentative and alternative communication (AAC). In an 
effort to provide clarity to the issue of verbal response forms, 
Michael (1985) suggested that a distinction be made between 
topography-based verbal behavior (e.g., speech, sign language, 
text) and selection-based verbal behavior (e.g., tablet-based 
icon-selection programs, Picture Exchange Communication 
System [PECS], voice output devices). 

Michael (1985) states that in topography-based (TB) 
verbal behavior “[t]he wnit of verbal behavior can be described 
as an increased strength of a distinguishable topography given 
some specific controlling variable” (p. 1). This category of verbal 
behavior can include speech, sign language, writing, fingerspell- 
ing, and other response forms in which the listener is affected 
by a speaker’s specific response topography. For example, if a 
child wants to play with an iPad and signs iPad, that response 
form informs the listener of the relevant motivation, allowing for 
the delivery of specific reinforcement (the iPad). Had the child 
signed “cookie,” that topography would inform the listener that 
a different motivator and consequence are involved. 

In selection-based (SB) verbal behavior, Michael (1985) 
states that “the unit of verbal behavior can be described as an 
increased control of the pointing [or selecting] response by a 
particular stimulus as a result of a different stimulus . . . or estab- 
lishing operation” (p. 1). For example, the same “iPad,” request 
exemplified above could also occur by selecting the iPad icon 
from a comparison array. The selected icon also informs the 
listener of the relevant motivation and allows for the delivery of 
specific reinforcement. In SB verbal behavior, what is conveyed 
to the listener is the information on the stimulus selected, as 
opposed to TB verbal behavior, where the listener’s behavior is 
controlled by the topography of a speaker’s response. 

Michael’s distinction between TB and SB verbal behay- 
ior has provided valuable conceptual and practical guidance 
for understanding a number of important behavioral issues 
(for a review, see Shafer, 1993). For example, the distinc- 
tion can facilitate the selection and implementation of AAC 
systems for children with severe language delays (e.g., 
Charlop-Christy, Carpenter, Le, LeBlanc, & Kellet, 2002; 
Frost & Bondy, 1994; McGreevy, Fry, & Cornwall, 2012; 
Sundberg, 1993a; Tincani, 2004). In addition, research shows 
that TB systems may produce types of generative learning 
as observed in stimulus equivalence and relational framing 
more efficiently than SB systems (Jennings & Miguel, 2017; 
Lowenkron, 1998; Pérez-Gonzdélez, Salameh, & Garcia- 
Asenjo, 2018; Polson & Parsons, 2000; C. T. Sundberg & 
Sundberg, 1990; C. T. Sundberg, Sundberg, & Michael, 2018; 
Wraikat, C. T. Sundberg, & Michael, 1991). 


Behavioral Cusps 


When an infant learns to walk, it changes her life forever. Once 
standing and walking occur, new stimuli, motivators, and rein- 
forcers are available and they immediately begin to evoke and 
shape new behaviors. Rosales-Ruiz and Baer (1997) termed 
these significant human developmental changes behavioral 
cusps, and defined them as “any behavior change that brings the 
organism’s behavior into contact with new contingencies that 


have even more far-reaching consequences” (p. 533). A cusp 
represents the beginning of a new level of learning for a child. 
Rosales-Ruiz and Baer (1997) elaborate on their definition by 
noting, “What makes a behavior change a cusp is that it exposes 
the individual’s repertoire to new environments, especially new 
reinforcers and punishers, new contingencies, new responses, 
new stimulus controls and new communities of maintaining or 
destructive contingencies. When some or all of those events 
happen the individual’s repertoire expands . . . and perhaps that 
leads to some further cusps” (p. 534). 

There are many types of cusps; some are minor, but some 
are especially important for human development because their 
far-reaching learning effects can change the course of a child’s 
development. Some of a child’s most important early cusps 
involve motor behavior (e.g., a pincer grasp, crawling, walk- 
ing), social behavior (e.g., joint attention, adults as conditioned 
reinforcers), and language acquisition (e.g., imitating, echoing, 
listening, manding) (Bosch & Fuqua, 2001; Bosch & Hixson, 
2004; Greer & Ross; 2008; Hixson, 2004; Hixson, Reynolds, 
Bradley-Johnson, & Johnson, 2010; Rosales-Ruiz & Baer, 1997; 
Staats, 1996). 

It is important to distinguish between the acquisition of a 
cusp and the facilitative effects of the cusp itself. In acquisition, 
the elements of a cusp are gradually established, but the new 
consequences produced by the cusp only become available when 
the cusp is achieved. For example, approximations to crawling 
are gradually shaped, but only when crawling is successful does 
its far-reaching consequences begin to affect an infant’s behav- 
ior in a more powerful way (e.g., the automatic reinforcement 
obtained by crawling to a different room). Cusps can be valu- 
able to a child, especially when they are combined with other 
cusps and produce even more generative cusps (Greer & Ross, 
2008; Hixson et al., 2010; Michael, Palmer, & Sundberg, 2011; 
Rosales-Ruiz & Baer, 1997). For example, once a child can emit 
echoic, tact, listener, and self-listener behaviors, new mands and 
tacts can be acquired incidentally with no formal training or 
reinforcement history (Horne & Lowe, 1996; Pérez-Gonzilez, 
Pastor, & Carnerero, 2014). 


The Elementary Verbal Operants 
and Listener Discriminations 


A behavioral analysis of expressive and receptive language 
involves identification of the antecedents and consequences 
that control the different types of speaker and listener behavior. 
Skinner (1957) identified three separate sources of antecedent 
control for speaker behavior: (1) motivating variables, (2) non- 
verbal discriminative stimuli (SPs), and (3) verbal SPs. These 
three sources of antecedent control and their related history 
of consequences provide the framework for distinguishing 
between five different types of expressive language (duplic, 
mand, tact, intraverbal, codic). Michael (1982) termed these 
basic components of Skinner’s taxonomy the elementary 
verbal operants. Receptive language involves the behav- 
ior of the listener, and can be behaviorally characterized as 
mainly nonverbal listener discriminations controlled by verbal 
SP, (e.g., a teacher says, “touch the red balloon” and the child 
touches the red balloon, not the blue one). However, much of 
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Figure 18.1 The elementary verbal operants and listener discriminations. 


Antecedent Variables Behavior 


SPEAKER REPERTOIRES 
(Expressive Language) 


Listener-mediated Consequences 


Motivating operation (value of being 
pushed is high) 


Nonverbal S° (seeing snow) 


Verbal S° with formal similarity to the 
response (hearing “bear”) 


Verbal S° with no formal similarity, but with 
point-to-point correspondence (seeing “dog” 
written or hearing “dog”) 


Verbal S° without point-to-point correspon- 
dence (hearing “What time?”) 


Mand (saying “push”) 


Tact (saying “snow”) 


Duplic (echoic, motor imitation, 
copying text) (saying “ bear”) 


Codic (textual, taking dictation) 
(saying “dog” or spelling “dog”) 


Intraverbal (saying “10 AM”) 


Specific reinforcement (being pushed) 


Generalized S' (e.g., praise, approval) 


Generalized S' 


Generalized S' 


Generalized S' 


LISTENER DISCRIMINATION 
(Receptive Language) 


Antecedent Variables Behavior 


Consequence 


Verbal S? and a nonverbal S° ("Give mea 
pen” along with the pen) 


a listener’s behavior is also verbal (e.g., covert self-talk) and 
should be treated as verbal behavior (Schlinger, 2008a). The 
elementary verbal operants and listener discriminations will 
be briefly defined and exemplified below and in Figure 18.1, 
followed by a more in-depth treatment of each type of speaker 
and listener repertoire. 


Mand 


When a person asks for something to drink, for directions to a post 
office, or to be left alone, she is usually manding. Mands allow 
a speaker to get her needs and wants known to a listener. The 
mand relation is a type of verbal behavior in which the form of a 
response (words, signs, gestures, icon exchanges, etc.) is under the 
functional control of a motivating operation (MO) and a history of 
specific reinforcement (Figure 18.1). For example, a mand can be 
observed when a child wants to be pushed on a swing and emits 
the word “push” and an adult then pushes her on the swing. The 
reinforcement for a mand is specific to the MO; that is, being 
pushed by adults is what the child wants, not just claps and cheers. 


Tact 

When a child names her friends in the room while looking at 
them, she is usually tacting. Tacts involve a speaker’s ability to 
verbally identify aspects of the physical environment. The tact 
relation (Figure 18.1) is a type of verbal behavior in which the 
form of a response is under the functional control of a nonverbal 
discriminative stimulus (SP) and a history of conditioned rein- 
forcement (S") (e.g., seeing snow and saying “snow’’). 


Duplic 

There are three types of duplic relations: echoic, motor 
imitation, and copying text. They all have the same defining 
features but have unique topographical features. Repeating 


Nonverbal operant behavior (listener 
hands the speaker a pen) 


Generalized S' 


spoken words exemplifies the echoic, imitating sign language 
exemplifies motor imitation, and copying written words exem- 
plifies copying text. 


Duplic: Echoic. The echoic relation (Figure 18.1) is a type of 
duplic verbal behavior in which the form of a vocal response is 
under the functional control of an auditory verbal SP that has 
formal similarity between the stimulus and the response product, 
and has a history of generalized S' (Figure 18.1). Formal similar- 
ity occurs when the controlling verbal stimulus and the response 
product are in the same sense mode (e.g., both are auditory), and 
the stimulus and the response product resemble each other in the 
physical sense of resemblance (e.g., hear “bear,” say “bear’’). 


Duplic: Motor Imitation. Motor imitation (relating to sign lan- 
guage) (Figure 18.1) is a type of duplic verbal behavior in which 
the form of a motor response is under the functional control of 
4 visual verbal S that has formal similarity and a history of 
generalized S' (e.g., parent signs “car” and the child signs “‘car”). 


Duplic: Copying Text. Writing letters and words that match other 
letters and words constitutes copying text (fingerspelling and 
Braille can also be copied). Copying textual stimuli has the same 
defining features as echoic and motor imitation, except the response 
form is written, fingerspelled, or Braille punched. Copying text 
(Figure 18.1) is a type of verbal behavior in which the form of a 
transcriptive response is under the functional control of a textual 
verbal SP that has formal similarity and a history of generalized 
(e.g., writing down an address observed on a website). 


Codic 


There are two types of codic relations: textual and tak- 
ing dictation. Textual behavior for speech is emitting vocal 
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responses controlled by written verbal SPs, while taking dicta- 
tion is emitting transcriptive and spelling behaviors controlled 
by vocal verbal S?s. Similar codic relations can occur with sign 
language and Braille. 


Codic: Textual. The textual relation (Figure 18.1) is a type 
of verbal behavior in which the form of a response is under 
the functional control of a verbal S? and a history of gener- 
alized S', but has no formal similarity between the stimulus 
and the response product. However, there is a broader type of 
match between the verbal stimulus and the verbal response that 
Skinner terms point-to-point correspondence. Point-to-point 
correspondence occurs when the parts of a verbal stimulus 
correspond to the parts of a verbal response but do not physi- 
cally match each other (e.g., see “D-O-G,” then saying “dog”). 


Codic: Taking Dictation. When someone gives you a name and 
you write it down, you convert spoken words to written words 
(spelling). Although somewhat dated, but nonetheless accurate, 
Skinner’s term taking dictation (Figure 18.1) describes this 
type of verbal behavior, in which the form of a response is under 
the functional control of a verbal S? with point-to-point corre- 
spondence between the verbal stimulus and the verbal response 
product, and a history of generalized S', but has no formal sim- 
ilarity (e.g., hearing “dog,” then typing “D-O-G”). 


Intraverbal 


Answering questions, telling stories, and recalling memories are 
usually intraverbal behaviors. Our intraverbal repertoires allow 
us to verbally respond to verbal stimuli we encounter or produce 
ourselves. The intraverbal relation (Figure 18.1) is a type of 
verbal behavior in which the form of a response is under the 
functional control of a verbal S? that does not have point-to-point 
correspondence with the verbal stimulus (e.g., receiving the text 
“What time?” and texting back “104m”). There is also a relevant 
history of generalized S' involved. 


Listener Discriminations 


An important component of language involves a listener’s abil- 
ity to discriminate between verbal stimuli emitted by speakers, 
and to act upon those stimuli, even when the speaker and listener 
are the same person. Listener discriminations are commonly 
referred to as receptive language. Listener discriminations 
(Figure 18.1) can be defined as a verbal S? that evokes a cor- 
responding nonverbal response (or class of responses) due to a 
history of generalized S' (e.g., someone asks for a pen and you 
reach for a pen). 


Matching-to-Sample (MTS) 


MTS procedures teach a child to attend to and organize the var- 
ious stimuli that affect his sensory systems (e.g., sights, sounds, 
tastes). A generalized matching repertoire (e.g., placing novel pic- 
tures of various exemplars of a category together) can facilitate the 
establishment of equivalent and nonequivalent relations between 
various classes of stimulus events (Chapters 19 and 20). Match- 
ing skills are valuable in language instruction for severely lan- 
guage-delayed children and adults. For example, a child who can 


match pictures of dogs, and discriminate them from other animals, 
and then learns to call one of the pictures a “dog,” may be able 
to correctly tact novel pictures of a dog, without further training. 


Classifying the Verbal Operants 


An important component of making use of Skinner’s analysis 
of verbal behavior is the ability to classify a verbal response as 
a mand, tact, intraverbal, and so on. Classifying the verbal rela- 
tions allows us to more precisely identify, organize, and account 
for the specific variables that might control a given utterance. 
Classification can also assist in identifying fine distinctions 
between variables that may prove to be of interpretive or clini- 
cal value (Palmer, 2016). Figure 18.2 presents a flow chart that 
can be used to narrow down the sources of control for a sample 
of verbal behavior. Verbal classification can be accomplished 
by asking the following questions regarding possible controlling 
variables for a target response. 


1. Does an MO control the response form? If yes, then the 
response is at least part mand. 


2. Does an S? control the response form? If yes, then: 


3. Is the S? nonverbal? If yes, then the response is at least 
part tact. 


4. Is the SP verbal? If yes, then: 


5. Is there formal similarity between the verbal SP and the 
response? If yes, then the response is duplic (echoic, imi- 
tative, or copying text). If not, then: 


6. Is there point-to-point correspondence between the verbal 
SP and the response? If yes, then the response is codic 
(textual or taking dictation). If not, then the response is at 
least part intraverbal. 


THE VERBAL OPERANTS AND LISTENER 
BEHAVIOR IN MORE DETAIL 


Motivating Operations and the Mand 


Motivation is an important source of antecedent control of 
human behavior. Much of what we do each day is controlled 
by MOs (Chapter 16). Unlearned MOs, such as those related to 
food, water, warmth, and sleep, are straightforward and need to 
be accommodated each day. In contrast, learned MOs, such as 
those related to money, information, travel, relationships, and 
so on, are often more complicated in part due to their variability 
and uniqueness to each individual. 

Both types of MOs can evoke nonverbal and verbal 
behaviors. In addition, their related histories of specific reinforce- 
ment can alter the functions of other MOs and S?s (Schlinger & 
Blakely, 1987, 1994). For example, on a hot day when a person 1s 
shopping and has not had anything to drink for a while, the value 
of water might gradually increase to the point where nonverbal 
behavior is evoked (e.g., looking for a drinking fountain). The 
history of specific reinforcement related to the MO can also alter 
the functional properties of other stimuli, such as establishing 
a drinking fountain as an SP that immediately evokes approach 
behavior. If water is located and a sufficient amount is consumed, 
the value of water and its evocative strength weakens via satiation, 


Controlling Variables 


UMO/CMO 
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| ——»> Yes —————_—__> Mand 
NO 

Nonverbal Stimulus > Yes —+» Tact 
No 


| 


Point-to-Point 
Correspondence 


Verbal Stimulus t—» Yes ——» 


Yes 


— >» No ————_> IIntraverbal 


Formal Similarity —» Yes -——————> Duplic 


Echoic 
Imitation 


Copying text 


No — > Codic 


thus an abolishing operation (AO), and the functional properties 
of the fountain change to an S4 (one stops searching for drinking 
fountains). 

The same MO (water deprivation) can also evoke verbal 
behavior (e.g., asking for water), but a listener is necessary to 
mediate (deliver) the reinforcement, and to complete the verbal 
episode. In this case, a specific history of reinforcement from 
listeners alters the functional properties of MOs, as well as the 
functional properties of potential listeners, establishing some 
listeners as SPs for manding behavior (e.g., a waiter) and others 
as S4. When the mand is successful, it results in the same spe- 
cific reinforcement and AO achieved through nonverbal behavior 
that involves obtaining water, with the only difference being how 
the water was obtained. When MOs evoke a verbal response that 
produces specific reinforcement mediated by listeners, Skinner 
(1957) termed this verbal relation a mand (Figure 18.1). 


Specific Reinforcement and the Mand 


The defining consequences for the mand constitute what Skinner 
(1957) terms specific reinforcement (p. 56). Specific reinforce- 
ment involves reinforcement that is established by its relation 
to a specific MO (e.g., water deprivation establishes water as 
specific reinforcement). When sufficient specific reinforcement 
is provided, it abolishes the MO. For example, if there is an MO 
at strength for finding one’s car keys, the specific reinforce- 
ment is locating the keys. This specific reinforcement also alters 


Textual Figure 18.2 Verbal behavior classification 


Taking dictation chart. 


the function of any number of contextual stimuli (e.g., drawers 
become SPs for opening and searching behaviors). Finding the 
keys has multiple effects; it abolishes the MO and simultane- 
ously functions as reinforcement, strengthening the particular 
behavior that was evoked by the MO (e.g., searching, mand- 
ing). Specific reinforcement can be contrasted with generalized 
S' characteristic of the other verbal operants. Generalized S' 
acquires its strengthening effect by its relation to a variety of 
MOs and relevant reinforcement history, and, as a result, is not 
dependent on any single MO (Michael, 2004). 


Classifying Mand Relations 


Classifying the mand relation largely depends on talking pre- 
cisely about its two defining features: (1) MOs and (2) specific 
reinforcement. Mands can be classified by identifying the dif- 
ferent types of MOs that control manding behavior and by the 
different ways mands affect the behavior of the listener and 
produce specific reinforcement. Michael and colleagues (e.g., 
Laraway, Snycerski, Michael, & Poling, 2003; Michael, 1982, 
1993, 2004; Chapter 16) distinguish between two main types 
of MOs: unconditioned motivating operations (UMOs) and 
conditioned motivating operations (CMOs). UMOs involve 
types of motivation that are innate to humans, and do not 
require a learning history to affect behavior, while CMOs are 
the direct result of a learning history. In general, a distinc- 
tion could be made between (1) MOs related to deprivation 
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and specific reinforcement established by that deprivation, 
and (2) MOs related to the removal of aversive stimulation 
and the specific negative reinforcement related to that con- 
dition. There are different types of UMOs and CMOs that 
evoke mands, each of which has different behavioral effects 
(Chapter 16). 


Mands Evoked by Unconditioned Motivating 
Operations (UMOs) 


Michael and Miguel (Chapter 16) identified nine UMOs, six of 
which are related to deprivation (food, water, sleep, activity, 
oxygen, and sex) and three of which are related to aversive stim- 
ulation (the aversive conditions of becoming too hot, becom- 
ing too cold, and an increase in painful stimulation). Each of 
these MOs (e.g., sleep deprivation) can evoke learned nonver- 
bal behavior (e.g., going to bed) and/or verbal behavior (e.g., 
a mand), and the specific reinforcement related to the MO can 
alter the functional effects of other variables as well (e.g., a sofa 
functions as an SP evoking an approach response). 


Deprivation-based UMOs. Some of an infant’s first mands 
are evoked by unlearned deprivation (Bijou & Baer, 1965; 
Skinner, 1957). What begins as a human biological process 
soon becomes learned operant behavior. Food deprivation, for 
example, may initially elicit crying as a biological (respondent) 
event, but through the process of reinforcement (i.e., feeding 
the infant), crying becomes an operant (Bijou & Baer, 1965; 
Novak & Pelaez, 2004; Schlinger, 1995). Each of the different 
deprivation-based UMOs may initially evoke crying, but due to 
the effects of differential specific reinforcement, the response 
topographies begin to shift (e.g., a hunger cry sounds different 
from a tired cry). Parents are often able to discriminate between 
different forms of crying by about the 8th week of an infant’s 
life (Brazelton & Sparrow, 2006). Eye contact can also serve a 
similar verbal function for an infant in that MOs and specific 
reinforcement can establish differential looking behavior that 
acquires a mand function (Isaksen & Holth, 2009). 


Aversive Stimuli as UMOs. Three UMOs are related to 
unlearned aversive stimulation (too hot, too cold, and painful 
stimulation), and it is their removal that functions as specific 
negative reinforcement. Michael and Miguel (Chapter 16) 
explain that “painful stimulation establishes pain reduction as 
a reinforcer and evokes the (escape) behavior that has achieved 
such reduction” (p. 381). For example, if a child experiences a 
burn, the behavior evoked by this aversive UMO can be non- 
verbal (e.g., turning on a cold water faucet), or it can be verbal 
(e.g., “Mommy!”). Mands controlled by aversive stimulation 
also occur early in language development and, like mands 
related to deprivation, participate in many aspects of human 
development (e.g., language acquisition, bonding, intellectual 
development). Identifiable mands of any type are valuable to 
caregivers in that a recognized response form can help a care- 
giver identify a child’s private MOs and the desired specific 
reinforcement. Thus, early manding allows reliable access to a 
child’s inner world. 


Mands Evoked by Conditioned Motivating 
Operations (CMOs) 


CMOs are more varied than UMOs and can change frequently. 
Michael and Miguel (Chapter 16) define a CMO as “motivating 
variables that alter the reinforcing effectiveness of other stimuli, 
objects, or events, but only as a result of the organism’s learn- 
ing history” (p. 393). For example, a low battery notice on a 
phone increases the value of a power cord and evokes behavior 
that involves a history of obtaining that specific reinforce- 
ment. Michael and Miguel (Chapter 16) identify three types of 
CMOs: transitive (CMO-T), reflexive (CMO-R), and surrogate 
(CMO-S), each of which can evoke manding behavior. 


Parts of Speech. Another way to classify some CMO effects is 
by the parts of speech. For example, a CMO-T related to nouns 
often involves a stimulus change (e.g., encountering a parking 
meter) that alters the value of objects (e.g., coins). Verbs often 
involve movement, and a stimulus change (e.g., being put in a 
wagon) can increase the value of an event (e.g., being pulled). 
Similar types of CMO-Ts can be identified for many adjectives, 
adverbs, prepositions, etc., as well as for verbal information, 
help, and so on. For those in need of language intervention, there 
are many ways in which these types of CMO-Ts can be captured 
or created for instructional purposes.” 


Multiple Control 


Any instance of human behavior is usually a function of multi- 
ple variables operating simultaneously. The task of the behavior 
analyst is to identify the relevant sources of control responsible 
for a behavior of concern, and Skinner’s analysis of multiple 
control can be of help in this process. Skinner (1957) begins his 
treatment of multiple control by stating, “Two facts emerge 
from our survey of the basic functional relations in verbal behav- 
ior: (1) the strength of a single response may be, and usually is, 
a function of more than one variable and (2) a single variable 
usually affects more than one response” (p. 227). Michael et al. 
(2011) suggested that Skinner’s two effects be termed (1) con- 
vergent multiple control and (2) divergent multiple control. In 
convergent multiple control, a single response is controlled 
by more than one antecedent variable. For example, a mand is 
usually controlled by both an MO (e.g., for information) and 
an SP (e.g., an audience). In divergent multiple control, a sin- 
gle antecedent variable controls more than one response. For 
example, the verbal stimulus “Who was Darwin?” could evoke 
any number of different verbal responses from a listener turned 
speaker, such as “natural selection,” “HMS Beagle,” or “the 
Galapagos Islands.” 


Convergent Control Involving Multiple MOs. Multiple types 
of motivation can participate in any verbal episode. MOs can 
combine and enhance an evocative effect, or compete with each 
other for the control of behavior (e.g., Michael et al., 2011; 
Sundberg, 2013). For example, when MOs related to exercise 
and social interaction combine, they may evoke a mand like “Do 
you want to go to the gym?” MOs also commonly compete with 
each other. For example, an MO related to food deprivation and 


social embarrassment may compete to evoke either “Yes, I’d like 
another piece of cake” or “No, thank you.” This convergence of 
antecedent MO variables on behavior often produces emergent 
effects (both respondent and operant) quite different from the 


effects of each element of the compound arrangement operating 
independently. 


Convergent Control Involving MOs and S?s. MOs typically 
participate with SPs in evoking our day-to-day behavior. As with 
multiple MOs, combining MOs with S?s can produce an addi- 
tive or competitive effect on behavior. For example, early mand 
training for nonverbal children consists of combining echoic 
or imitative prompts with MOs and salient nonverbal stimuli. 
MOs and S"s can also compete with each other. For example, 
a parent may have strong instructional control over a child’s 
behavior, except when he is playing video games. SPs can affect 
MO control as well, as when a verbal S? (e. g., a question about 


sports) disrupts control by a current CMO-R (e.g., studying for 
an exam). 


Mand Control. An MO may be insufficient to control behav- 
ior on its own, but when combined with other antecedents, the 
MO successfully participates in evoking behavior. For exam- 
ple, an MO regarding helping people may not evoke behavior 
until other stimuli, such as a person falling, are combined with 
the incipient MO. The weakness of an MO related to helping 
people could be demonstrated by seeing someone fall and not 
offering help. Palmer (2016) suggests that there are many cir- 
cumstances in which the “antecedent is, by itself, insufficient 
to evoke the relevant response” (p. 100), but when combined 
with other variables it evokes behavior. In his discussion on 
intraverbal relations, Palmer suggests the term intraverbal 
control be used to identify this supplementary type of control 
as it relates to verbal Ss. A similar case could be made for 
mand control where an MO by itself is insufficient to evoke 
behavior, but, when combined with other variables, an evoca- 
tive effect occurs. 


Private Events 


Most types of MOs are private; that is, only one person can 
experience their effects. Skinner (1953) termed these types of 
covert variables private events and stressed that despite their 
inaccessibility, private events cannot be neglected as causal vari- 
ables in the interpretation of behavioral events (Skinner, 1945, 
1953, 1974). Skinner (1953) explains: “With respect to each 
individual . . . a small part of the universe is private. We need 
not suppose that events which take place within an organism's 
skin have special properties for that reason. A private event may 
be distinguished by its limited accessibility but not, so far as we 
know, by any special structure or nature” (p. 257). 

Many events occur within our bodies and affect how we 
behave. For example, verbal behavior can easily become covert. 
A speaker becomes her own listener and responds to her own 
verbal behavior much in the same way she responds to the verbal 
behavior of others (Horne & Lowe, 1996). We can and do talk 
to ourselves and it helps us solve problems, comprehend what 
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others say, edit what we might say, recall past events, entertain 
ourselves, and so on. Although these private verbal events will 
always be inaccessible to others, under some circumstances they 
can be indirectly measured (Palmer, 2011). 


Nine Types of Mands Distinguished 
by Listener Behavior 


Skinner suggests that some mands can be classified by the differ- 
ential mediating behavior of the listener and the reinforcement that 
strengthens that listener behavior. He offers nine types of mands 
(Figure 18.3): requests, commands, prayer, questions, advice, 
warning, permission, offer, and call. For example, in making the 
distinction between a request and a command, Skinner suggests 
that a request is emitted to a listener who is motivated to reinforce 
a speaker (e.g., a waiter). The specific reinforcement provided 
by the listener is the specific item requested, and reinforcement 
for the listener is some form of generalized S' (e.g., “Thanks!”). 
A command, in contrast, involves an unwilling listener and the 
verbal stimulus needs to be supplemented with aversive stimuli. 
The specific reinforcement provided by the listener is compli- 
ance with the speaker’s command. The listener is reinforced by 
the removal of the threat—thus a type of negative reinforcement. 
Skinner provides a similar treatment for the other types of mands 
he identified (Figure 18.3). 


Emergent Mand Relations 


Learning to mand is a powerful behavioral cusp in early child 
development (Bijou & Baer, 1965: Greer & Ross, 2008; 
Rosales-Ruiz & Baer, 1997; Sundberg & Michael, 2001). 
A child’s mand repertoire typically grows quickly (due to MO 
and specific reinforcement effects) and often does so with lit- 
tle formal instruction. Once a child has acquired basic echoic, 
mand, tact, and listener repertoires, the combination of these 
cusps can produce emergent mands in a number of different 
ways. For example, once a child acquires mand frames (e.g., 
“I want_X_”) and can tact objects and actions, new tacts can 
be inserted into the mand frame by direct training, incidental 
exposure, or relational framing (e.g., Hall & Sundberg, 1987; 
Horne & Lowe, 1996; Pérez-Gonzdlez, Pastor, & Carnerero, 
2014; Rosales & Rehfeldt, 2007). A more advanced and quite 
generative mand cusp involves mands for information (e.g., 
“What’s that?” “Where’s Mommy?”). A child’s ability to ask 
questions plays a major role in vocabulary growth by gen- 
erating new tacts, echoics, mands, intraverbals, and listener 
behaviors. 


Nonverbal Stimulus Control and the Tact 


The distinction between the mand and the tact is based on the 
distinction between MOs and nonverbal SPs as antecedents, 
and specific and nonspecific reinforcement as consequences. 
Nonverbal Ss, like MOs, can evoke both nonverbal and verbal 
behaviors. For example, we can respond to rain nonverbally 
by getting an umbrella, or we can respond to rain verbally by 
talking about the rain. When the nonverbal properties of the 
physical environment (¢.g., visual or tactile stimulation of rain) 
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Figure 18.3 Types of mands distinguished by listener behavior. 


Mand Type_ | Speaker’s MO 

Request MO for action on the part of a 
willing listener 

Command MO for action on the part of an | 
unwilling listener 

Prayer MO to generate an emotional 
disposition 

Question MO for verbal action 

Advice 


MO for the listener to enjoy 

consequences 

MO to inform the listener of a 
possible aversive stimulus 


Warning 


Specific Reinforcement for | Reinforcement for Listener 

the Speaker 

Listener complies 

Listener complies 

Emits praying behavior 

Verbal information provided 
reinforcement 


Listener takes advice 


Listener following warning 


Behavior 


Generalized conditioned 
reinforcement 


Remove aversive threat 


Improves emotional condition 


Generalized conditioned 


Accesses reinforcement and avoids 
punishment 


Avoiding aversive stimulus 


Experiences something new 


evoke a verbal response that is followed by generalized S', 
Skinner (1957) termed this verbal relation a tact (Figure 18.1). 

Michael (2004) defines nonverbal stimuli as environmental 
changes (e.g., offset to onset of a stimulus) that affect our 
receptor systems (e.g., vision, hearing, vestibular). When a 
particular nonverbal stimulus (e.g., rain) evokes a particular 
response, and that response is immediately followed by rein- 
forcement, stimulus control develops. Skinner (1957) identi- 
fied two general types of nonverbal stimulus control that work 
together as a form of convergent multiple control to evoke ver- 
bal behavior: (1) the audience that “controls a large group of 
responses” (p. 81) and (2) “nothing less than the whole of the 
physical environment—the world of things and events which a 
speaker is said to ‘talk about’” (p. 81). 


Classifying Tact Relations 


Classifying tacts can be accomplished by identifying the dif- 
ferent types of nonverbal Ss that control responses, and by 
the different ways these verbal responses affect the behavior of 
the listener. In addition, nonverbal S?s commonly interact with 
other nonverbal Ss, verbal SPs, and MOs. Several ways to clas- 
sify various nonverbal SPs and tacting will be briefly described. 


Different Sensory Receptors 


Tacts can be classified by the sensory receptor affected by a 
nonverbal stimulus. Visual stimuli have functional properties 
that are different from auditory stimuli, tactile stimuli, olfactory 
stimuli, and so on. For example, a child may emit the tact “ice 
cream truck” under the auditory nonverbal stimulus of hearing 
the truck’s music, but not be able to emit the same words under 
the visual nonverbal stimulus of seeing the truck itself. 


Offer MO to get an unwilling listener to Listener participates 
participate 

Permission MO to remove an aversive stimulus | Aversive stimulus removed 

Call MO for a listener's attention Listener emits attending 


behavior wt 


Behave without negative ramifications 


Reinforcement from additional 
speaker behavior 


Parts of Speech 


Tacts can be classified by the parts of speech (e.g., nouns, 
verbs, adverbs) because a unique type of nonverbal stimu- 
lus control is often involved for each part. For example, the 
nonverbal stimuli that evoke many nouns are typically static 
objects (e.g., dog, house), while the nonverbal stimuli that evoke 
verbs often involve movement (e.g., jumping, opening). Many 
adjectives involve a more complex arrangement of nonver- 
bal stimuli in which the same properties can occur in differ- 
ent objects (e.g., green, large, heavy), and these properties are 
the source of stimulus control, rather than the object itself. For 
many prepositions, the objects themselves are irrelevant to the 
discrimination; rather, their spatial relation to each other is the 
relevant source of nonverbal control (e.g., next to, on, over). 
The nonverbal stimulus control involved in other parts of speech 
(e.g., pronouns, adverbs) can be analyzed in a similar manner. 


Multiple Nonverbal Stimuli 


Most nonverbal circumstances involve multiple nonverbal stim- 
uli. At a basic level, a single stimulus (e.g., dog) comprises mul- 
tiple nonverbal stimuli (e.g., a tail, paws, fur). Most encounters 
with the environment involve multiple nonverbal SPs of many 
types (e.g., objects, actions, relations). For example, the tact 
“The dog is begging,” upon observing a dog, contains multi- 
ple nonverbal stimuli, each of which participates in evoking 
the response. The different parts of speech in the sentence are 
controlled by a different nonverbal source of stimulus control. 
The noun “dog” is controlled by the nonverbal object, the verb 
“begging” is controlled by the dog’s actions, and the verb “is” 
is a relational autoclitic informing the listener of the coordi- 
nated relation between the dog (noun) and begging (verb). 
When a sentence contains other parts of speech (e.g., adjectives, 


pronouns) additional nonverbal SPs participate in evoking a 
response. Thus, what may appear to be a single utterance with 


a single “meaning” is actually multiple responses controlled by 
multiple variables. 


Convergent Multiple Control Involving 
Nonverbal Ss, Verbal S°s, and MOs 


Nonverbal Ss commonly participate in verbal episodes with 
other types of antecedent variables. For example, nonverbal and 
verbal Ss are involved when a child is asked, “What shape is 
this paper?” Saying “rectangle” is multiply controlled by the 
actual (nonverbal) shape of the paper and by the (verbal) spoken 
word “shape.” If the question was “What color is this paper? a 
different response would be evoked. This combination of con- 
trolling variables makes the response part tact and part intra- 
verbal, and both sources of control are required to be correct. 
Nonverbal Ss can also share control with MOs. For example, 
an MO related to a cut finger and the nonverbal SP of a Band- 
Aid box evoke selection behavior. 


Tact Control 


Multiple control can also play a role when the nonverbal anteced- 
ent is, by itself, insufficient to evoke the relevant response 
(Palmer, 2016). For example, if a person cannot correctly 
remember the name of someone she is talking with, supplemen- 
tal verbal stimuli (e.g., discussion of a shared work experience) 
may help to evoke recall. Extending Palmer’s (2016) concept of 
intraverbal control to situations in which the nonverbal stimu- 
lus is insufficient to evoke the relevant response can be termed 
tact control, suggesting that the nonverbal stimulus is not the 
primary source of control, but rather a participant in a multiple 
configuration of antecedent variables. 


Private Nonverbal Stimulus Control 


Nonverbal stimuli can arise within the skin of a speaker and 
evoke verbal behavior in the form of a tact. For example, if one 
tightens her stomach muscles, this response produces a nonver- 
bal response product that can be tacted through the kinesthetic 
receptors. That is, one can feel her muscles move and tact this 
movement, although no one else can. Visual imagining can 
also provide nonverbal SPs that can be tacted (e.g., Horne & 
Lowe, 1996; Miguel, 2018; Skinner, 1974) and thus provide a 
sometimes valuable supplemental source of stimulus control for 
behavior for intraverbal behavior. For example, Kisamore, Carr, 
and Leblanc (2011) demonstrated how intraverbal behavior 
could be improved by teaching children to emit visual imagining 
behavior (e.g., imagining a farm when asked to name some farm 
animals). Visual imagining can also participate in bidirectional 
naming relations described in a later section (Horne & Lowe, 
1996; Miguel, 2018). 

A child first learns to tact her own overt behavior (e.g., “I'm 
dancing”) before learning to tact her own covert behavior (e.g., 
“I feel tired”). A verbal community establishes self-awareness 
in several ways (Skinner, 1953, 1974). For example, observable 
overt collateral behavior can inform the community of a private 
stimulus (e.g., a child limping), and these circumstances can 
be used to teach the child to tact the private event for herself. 
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Another way a child learns a tact controlled by private nonver- 
bal stimulation is when the private stimuli are accompanied by 
observable external stimuli (e.g., blood from a cut). The adults 
can tact the public stimuli (as well as collateral behavior) that are 
highly correlated with the private stimuli, and provide and teach 
an appropriate response form (e.g., “ouch”) when the private 
stimulus is present, thus allowing for echoic-to-tact transfer. 
Private events can become quite complicated when emotions, 
MOs, verbal S?s, nonverbal Ss, and other variables interact 
with each other and the current context and reinforcement 
contingencies. 


Tact Extension 


Once a tact relation has been established, the tact response can 
occur under novel stimulus conditions through the process of 
stimulus generalization. Skinner (1957) identifies four different 
levels of generalization based on the degree to which a novel 
stimulus shares the relevant or defining features of the original 
stimulus. The four types of tact extension are generic, meta- 
phorical, metonymical, and solecistic. In a generic tact exten- 
sion, the novel stimulus shares all of the relevant or defining 
features of the original stimulus. For example, if a child learns 
to tact “tree” in the presence of a maple tree, when shown an 
oak tree the child correctly tacts it as “tree.” When the child 
encounters a novel nonverbal stimulus that has some, but not all, 
of the defining features of tree (e.g., a bush) this antecedent may 
also evoke the response “tree,” and exemplifies a metaphorical 
tact extension. 

Another type of generalization to novel stimuli is a met- 
onymical tact extension, in which a novel stimulus does not 
share any of the defining features of the original stimulus, but 
some irrelevant feature acquires stimulus control. For example, 
a child may have a tendency to call a swing a “tree” if there 
was a swing on the tree, and it was the swing that acquired 
stimulus control over the response “tree” due to a history of rein- 
forcement. The fourth type of novel tacting, termed solecistic 
extension, may occur when the property that acquires control 
has no clear connection to the defining properties. For example, 
there appears to be no connection between tacting the nonver- 
bal stimulus shoe as a “tree,” although these types of solecistic 
extensions can occur. 


Tact Abstraction 


Tact abstraction involves learning to tact a property (e.g., 
color, size, shape) regardless of the object itself. For example, 
the color red can appear as part of a wide range of nonverbal 
stimuli (e.g., red cars, red clothing, red fruit). Several of the 
parts of speech involve abstractions, such as adverbs (e.g., 
fast, slow) and prepositions (e.g., near, above), where the 
properties of movement and location, respectively, must 
acquire control over behavior regardless of the context in 
which that property appears. 


Reinforcement Controlling the Tact 


In addition to the different types of antecedent variables that 
distinguish the mand from the tact, different types of reinforce- 
ment define each operant as well (Braam & Sundberg, 1991; 
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Saunders & Sailor, 1979; Stafford, Sundberg, & Braam, 1988). 
The mand relation involves specific reinforcement, while the 
tact relation involves generalized S'. Skinner (1957) identified 
five different types of generalized S' for tacting provided by 
listeners; three are delivered by a second person serving as a lis- 
tener: (1) educational reinforcement, (2) escape from or avoid- 
ance of aversive stimulation, (3) extending the listener’s contact 
with the environment; and two are self-delivered by the speaker 
acting as her own listener: (4) automatic reinforcement, and (5) 
a history of contiguous or correlated usage. 


Educational Reinforcement 


A child’s ability to correctly identify objects and events in her 
physical environment is an important aspect of language devel- 
opment, and parents and other caregivers take special effort to 
establish this behavior. Skinner (1957) suggests that the type 
of generalized S' involved can be identified as educational 
reinforcement, and he defined it as “reinforcement supplied 
primarily because it establishes and maintains a particular form 
of behavior in the speaker” (p. 84). For example, teaching a child 
to tact body parts involves antecedents, behavior, and conse- 
quences (a discrete trial), with differential reinforcement explic- 
itly provided for purposes of instruction. It could be considered 
contrived reinforcement contingencies; nonetheless, these con- 
tingencies are valuable for establishing a child’s early tacting 
vocabulary or for repairing a child’s impaired tact repertoire. 


Escape from or Avoidance of Aversive Stimulation 


The removal of something that is aversive to a person can func- 
tion as a type of negative reinforcement, thus strengthening 
behavior. If a threatening tone of voice accompanies a nonverbal 
stimulus (e.g., a harsh “What is it?” in a teaching trial), it might 
be that the removal of the aversive stimulus is a more power- 
ful consequence than social praise (e.g., Carbone, Morgenstern, 
Zecchin-Tirri, & Kolberg, 2008). 


Extending the Listener’s Contact 
with the Environment 


The main reinforcement for tacting is the action that listeners 
take as a function of the speaker’s tact. Michael (1991) explains 
this effect in the following manner: “As a result of the speaker’s 
tact, the listener is able to do something that he could not have 
done without the speaker’s tact, and for this reason provides 
generalized S (p. 35). For example, if a driver (as a listener) 
cannot see an oncoming car, a passenger’s (as a speaker) tact 
“There’s a car coming” extends the driver’s contact with the 
physical environment, and allows him to behave more effec- 
tively. This reinforcement is not contrived and produces effec- 
tive listener behavior, and the listener provides reinforcement 
for that reason (e.g., “Thanks”). 


Automatic Reinforcement 


Environmental contingencies can affect behavior without the 
immediate involvement of other people. For example, the 
reinforcement obtained from swimming may emanate from 
the behavior itself—that is, it feels good. Skinner (1957) used the 
term “automatic” to identify circumstances in which a behavior 


is evoked, shaped, maintained, or weakened by environmental 
variables occurring without direct manipulation by other peo- 
ple. Vaughan & Michael (1982) note that Skinner’s usage of 
automatic was simply to counteract “any tendency to restrict the 
concept of reinforcement to those occasions upon which it has 
been deliberately arranged by another person or group” (p. 218). 
Skinner (1957) also discussed automatic punishment (p. 375), 
automatic extinction (p. 164), automatic consequences (p. 442), 
automatic shaping (p. 58), automatic MO control (p. 220), and 
automatic stimulus control (p. 416). In short, all the basic prin- 
ciples of behavior can operate on a person automatically, and 
all automatic contingencies have the same defining properties 
as non-automatic contingences. Thus, antecedents and conse- 
quences that are not deliberately arranged can affect behavior, 
often in a manner more efficient than directly manipulated 
contingencies (Palmer, 1996; Skinner, 1968). 

Among the various types of automatic contingencies, 
automatic reinforcement has received the most attention from 
behavioral researchers. Automatic reinforcement can occur 
in two ways: (1) where the reinforcement emanates from the 
response product of the behavior (e.g., infant babbling, singing 
songs, mannerisms, emitting a clever phrase) and (2) where the 
reinforcement is provided by the behavior’s reinforcing effects 
emanating from the physical environment (e.g., turning on a 
light, finding a missing phone). Both types of automatic rein- 
forcement can produce an increase in the behavior they follow 
without involvement from another person. 

The automatic reinforcement concept has many appli- 
cations to language instruction and problem behaviors, along 
with a growing body of empirical research supporting its use 
(e.g., Chapter 27; Shillingsburg, Hollander, Yosick, Bowen, & 
Muskat, 2015; Stock, Schulze, & Mirenda, 2008; Sundberg, 
Michael, Partington, & Sundberg, 1996; van Harran, 2015; Voll- 
mer, 1994). In addition, Donahoe and Palmer (1994) and Palmer 
(1996, 1998) describe how automatic reinforcement participates 
in the development of more complex aspects of verbal behav- 
ior. For example, these authors suggest that a child’s grammar 
produces automatic-reinforcement when it sounds like grammar 
emitted by others in their verbal community, but is automati- 
cally punished when it sounds odd or unusual. Palmer (1996) 
described this effect as achieving parity with others. Several 
studies have demonstrated how automatic reinforcement partic- 
ipates in establishing and refining grammatical and syntactical 
skills (e.g., Ostvik, Eikeseth, & Klintwall, 2012; Wright, 2006). 


A History of Contiguous or Correlated Usage 


Each speaker has a unique and private history of reinforcement 
relevant to her own verbal behavior. This history is brought to 
bear on current environmental contingencies, and the two sets 
of variables interact. For example, if a friend is talking about 
a recent bike trip to Nova Scotia, and you as a listener have 
also biked around Nova Scotia, a number of covert verbal 
behaviors and nonverbal imagery may come to strength. The 
response products of these covert verbal behaviors prepare you 
for your upcoming speaker role. “Nova Scotia” as a verbal SP 
may covertly evoke “Halifax” or “Immigration Station” as a 
self-intraverbal. A covert visual image of flowers along the road 
may also function as a nonverbal S” that evokes “lupines” as a 


self-tact. These effects occur because of a history of reinforce- 
ment in which these different stimuli occurred together either 
contiguously or simply correlated (Palmer, 2016). Skinner 
terms this effect contiguous usage, noting, “In general there 
is an advantage if responses appropriate to a current situation 
are strong” (p. 86). These responses are self-strengthened by 
the automatic reinforcement they produce. In addition, their 
occurrences evoke speaker behavior that increases the probabil- 


ity that the listener will provide generalized S' (e.g., “I enjoyed 
the lupines too”). 


Generative Learning 


Typically developing children demonstrate an explosion of lan- 
guage skills between the ages of 2 and 3 (Brazelton & Sparrow, 
2006; Greer & Ross, 2008; Hart & Risley, 1995). During that 
period, a child begins to emit and comprehend new words daily, 
and vocabulary size can grow from 100 to 1000 words. A child 
may combine and expand on these new words in novel ways 
and in novel contexts, often without direct training or a history 
of reinforcement (e.g., a toddler who shocks an audience with a 
swearword). Young children seem to learn new words on their 
own. However, some children, especially those with autism, 
struggle with or fail to make this leap in language acquisition. 
Some emergent verbal behavior can be explained by 
stimulus and response generalization, but others involve 
a process of generalization that goes under the rubric of 
generative learning. Generative learning involves a behav- 
ioral effect where previously acquired skills enable or accel- 
erate the acquisition of other skills, without dependence on 
direct teaching or a history of reinforcement (Alessi, 1987; 
Horne & Lowe, 1996; Staats, 1996; Stewart, McElwee, & 
Ming, 2013). There are several productive lines of research 
on generative learning, each of which has contributed to our 
understanding of this important behavioral effect (e.g., Becker, 
1986; Engelmann & Carnine, 1982; Goldstein, 1983; Greer 
& Ross, 2008; Hayes et al., 2001; Hixson, 2004; Horne & 
Lowe, 1996; Johnson & Layng, 1994; Koegel & Koegel, 1988; 
Lowenkron, 1998; Malott, 2003; Rosales-Ruiz, & Baer, 1997; 
Sidman, 1994; Staats, 1996). The concept of behavioral cusps 
(Rosales-Ruiz & Baer, 1997) can help organize the topic of 
generative learning, especially as it applies to early language 
acquisition (Greer & Ross, 2008). In addition, the work of Staats 
(1996) on cumulative-hierarchical learning can help to provide 
a conceptual foundation for generative learning (Hixson et al., 
2010). The following section will describe several ways that 
generative learning can produce emergent tact relations. 


Emergent Tact Relations 


Stimulus Equivalence 

Sidman (1971) and Sidman & Cresson (1973) showed that 
combining matching-to-sample (MTS), echoic, and listener 
skills generated emergent MTS, tact, and textual relations. 
Sidman termed this effect stimulus equivalence (Chapter 19). 
These researchers showed that equivalence can be established 
by using conditional discrimination procedures to directly teach 
a single AB relation between a word (stimulus A) and an object 
(stimulus B), then teach a BC relation between the same object 
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(stimulus B) and a different stimulus (stimulus C), and then test 
for the emergence of the untrained relations (i-e., BA/CB symme- 
try tests and AC/CA transitivity tests). When emergence occurs 
with both symmetry and transitivity, equivalence has occurred 
(Sidman & Tailby, 1982). Pilgrim (Chapter 19) describes the 
emergent tact and textual responses in Sidman’s early work as 
follows: “The exciting result was that these boys not only named 
the pictures correctly, but also the written words; that is they read 
the words orally where they had not been able to do so prior to the 
teaching phases” (p. 456). Since Sidman’s initial work, equiva- 
lence procedures have proved an effective method for producing 
a host of generative learning effects (Chapter 19). 


Recombinative Generalization 


Goldstein (1983) defined recombinative generalization as 
“differential responding to novel combinations of stimulus 
components that have been included previously in other stimu- 
lus contexts” (p. 280). For example, if a child is taught to tact 
“oreen circle” and “red square” and then without direct training 
the child is able to correctly tact “green square” and “red circle,” 
then recombinative generalization has occurred. Matrix training 
is an example of a commonly used procedure that can produce 
recombinative generalization, and studies have demonstrated its 
effectiveness in generating emergent tact relations (e.g., Axe & 
Sainato, 2010; Frampton, Wymer, Hansen, & Shillingsburg 2016; 
Kohler & Malott, 2014; Pauwels, Ahearn, & Cohen, 2015). 


Common Bidirectional Naming (C-BiN) 


Horne & Lowe (1996) suggested that one of the most far- 
reaching achievements in early language acquisition occurs 
when the echoic, tact, listener, and self-listener skills combine 
to establish a new higher order verbal cusp they term common 
bidirectional naming. Once a child can easily echo new words, 
and has acquired the basic prerequisite repertoires for C-BiN, a 
new word acquired as listener can immediately generate speaker 
behavior (a tact) without further training; or if the word is first 
acquired as a tact, it can immediately generate a listener rela- 
tion without further training (consistent with symmetry and 
mutual entailment). A reinforcement history of naming experi- 
ences (multiple exemplar instruction) and self-echoic mediation 
establishes bidirectional relations between words and objects or 
events, and fuses the speaker and listener repertoires (Greer & 
Ross, 2008; Horne & Lowe, 1996; Miguel, 2016, 2018). The 
naming cusp is far-reaching, so much so that Greer and Longano 
(2010) assert the following: “Naming appears to be the source 
of the explosion in language development” (p. 73). In addition, 
C-BiN teaching procedures have shown effectiveness in produc- 
ing generative learning for children with disabilities who have 
failed to acquire this important form of learning (e.g., Fiorile & 
Greer, 2007; Lee, Miguel, Darcey, & Jennings, 2015). 


Joint Control 


Lowenkron (1984, 1988, 1989) demonstrated that emergent 
speaker and listener behavior could be generated by a type 
of multiple control he termed joint control. Joint control also 
involves the combination of verbal skills (e.g., echoic and 
tact repertoires) and occurs when two separately established 
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antecedents that evoke the same response topography arise 
simultaneously. For example, when searching for a specific ref- 
erence from a list of references, a self-echoic (e.g., “Guttman 
and Kalish”) is rehearsed (overtly or covertly) until a textual 
stimulus evokes the same response form (e.g., reading “Guttman 
and Kalish”). The physical similarity of the two response forms 
generates a nonverbal SP regarding that similarity and evokes an 
emergent autoclitic tact (e.g., “that’s it”). The response product 
of the autoclitic tact, along with the textual stimulus, sets up 
the conditional discrimination that evokes the emergent listener 
behavior (e.g., copying and pasting the reference). Once joint 
control is acquired as a generalized repertoire (a higher-order 
cusp), its generative effects can allow for, or accelerate, a host 
of complex speaker and listener behaviors (e.g., Causin, Albert, 
Carbone, & Sweeney-Kerwin, 2011; Lowenkron, 1998, 2006; 
C. T. Sundberg et al., 2018; Tu, 2006). 


Relational Frame Theory (RFT) 


Hayes et al. (2001) extended Sidman’s equivalence framework 
to nonequivalent relations such as comparison, opposition, 
and difference (Chapter 20). Generative learning opportunities 
that can produce emergent tacts are abundant when equivalent 
and nonequivalent relations interact in novel contexts. Once 
acquired as a generalized operant, relational responding func- 
tions as a higher-order cusp capable of generating a number 
of far-reaching derived effects (e.g., Dougher, Augustson, 
Markham, Greenway, & Wulfert, 1994; Hayes et al., 2001; 
Stewart et al., 2013). In RFT, framing is established through a 
history of multiple exemplar training and combined in various 
ways dependent on the current context. This history and the 
current contingencies provide the foundation for the establish- 
ment of derived relational responding. 

A defining feature of derived relational responding is 
the transformation of stimulus functions, which includes 
all types of transfer (e.g., mutual entailment, combinatory 
entailment, transformation of function) (Dymond & Rehfeldt, 
2000; Hayes et al., 2001). Unique to RFT is the treatment of 
the transformation of psychological function (e.g., Dougher 
et al., 1994; Dymond & Barnes, 1994). In this type of trans- 
formation, “[i]f one stimulus in a mutual or combinatorially 
entailed relation is given a direct psychological meaning, then 
the remaining stimuli may acquire this psychological func- 
tion. For example, if ‘lemons’ is in an equivalence class with 
actual lemons in a context that selects taste as the relevant 
function, talking of lemons can be associated with salivation 
or puckering” (Hayes, Gifford, & Ruckstahl, 1998, p. 289). 
For individuals with language delays, the transformation of 
function and related relational framing activities can provide 
teaching tools for establishing generative networks of stimulus 
relations that can be emitted, expanded, and merged under a 
variety of contexts (e.g., Dixon, 2016). To learn more about 
derived relational responding, see Chapter 20. 


Verbal Behavior Controlled by Verbal S°s 


Verbal S”s constitute the third type of antecedent control for 
language, joining MOs and nonverbal SPs. as sources of control 
for both speaker and listener behavior. We commonly respond to 


the words of others, and to our own words as a speaker or a lis- 
tener. We have conversations, surf the Internet, read, write, and 
so on. Skinner (1957) defines a verbal stimulus as “the product 
of earlier verbal behavior” (p. 65). That is, verbal responses pro- 
duce a response product. For example, vocal speech produces 
auditory stimuli, sign language produces visual stimuli, and 
Braille dots produce tactile stimuli. These response products can 
in turn have a discriminative or motivative function evoking the 
behavior of listeners, including the speaker as her own listener. 

Verbal SPs have the same causal status as nonverbal S?s 
in that both types of Ss acquire discriminative control over 
behavior through a history of differential reinforcement. Ver- 
bal SPs have unique effects and characteristics not found with 
nonverbal S?s (Sundberg, 2016a). For example, verbal SPs can 
be portable, private, and free from environmental support. We 
can talk about being in Italy without being in Italy, recite the 
numbers to open a combination lock, solve problems, or think 
through an important decision, all in the absence of the rele- 
vant physical stimuli. The verbal response products of this talk 
may function as automatic S', as well as SPs or MOs evoking 
other behaviors. Nonverbal SPs by contrast, are restricted to 
the immediate physical environment (with the exception of 
imagining), and are far less portable or available to a speaker 
across environments. 

Six types of elementary verbal operants are controlled 
by verbal Ss: echoic, motor imitation (relating to sign 
language), copying text, textual, taking dictation, and intraverbal 
(Figure 18.1). All of these verbal operants, except the intraverbal, 
can be classified as duplic or codic. 


The Duplic Relations: Echoic, Motor 
Imitation, and Copying Text 


Duplics are the least complex of the verbal operants but play a 
significant role in linguistic and academic development. This 
category of verbal operants involves verbally copying verbal 
stimuli. Verbal stimuli can occur in many forms (e.g., spoken 
words, written words, gestures, sign language, fingerspelling, 
Braille dots). These can be organized into three distinguishable 
types: echoic (auditory match), motor imitation as it relates to 
sign language (visual match), and copying text (e.g., copying 
configurations of written letters, fingerspelling, or Braille dots). 


Echoic. A child’s ability to echo the phonemes and words of 
others is essential for learning to identify objects and actions. 
In the presence of a duck a parent might say, “That’s a duck, 
can you say duck?” If the child can respond “duck,” then the 
parent says “Right!” Eventually, the child learns to name a 
duck without the echoic prompt. Echoic behavior also plays a 
critical role in more complex forms of verbal behavior, such 
as incidental verbal learning, derived relational responding, 
joint control, bidirectional naming, and problem solving (e.g., 
Hayes et al., 2001; Horne & Lowe, 1996; Lowenkron, 2006; 
Palmer, 2012; Schlinger, 2008a, 2008b). 


Motor Imitation. Some types of motor imitation constitute 
nonverbal behavior (e.g., following others to get in a line), 
while others can have the same functional properties as echoic 


relations. Motor imitation is a valuable tool for teaching sign 
language to persons who are deaf, or hearing but unable to 
produce speech. For example, a child may learn to imitate the 
sign for popcorn while popping corn, then later emit the sign 
appropriately without an imitative prompt. 


Copying Text. Textual verbal stimuli can be written, finger- 
spelled, or punched in Braille dots. Learning to copy these 
stimuli can facilitate the development of codic verbal behavior 
(e.g., learning to read, write, and spell). For example, a child 
often first learns to make letters by tracing and copying them, 


which can facilitate the process of independently writing, spell- 
ing, and reading. 


The Codic Relations: Textual and Taking Dictation 


Codic relations are more complicated than duplic relations 
because of the lack of formal similarity between the verbal S? 
and the verbal response. Michael (1982) notes that the term 
codic “is meant to suggest the kind of relation seen in a formal 
code, where one stimulus is said to stand for another stimulus 
that it does not resemble” (p. 2). In codic relations, additional 
speaker behaviors are required to learn to code and convert a 
textual stimulus into a vocal stimulus, and vice versa. 


Textual. Textual behavior is reading without an implication 
that the reader understands what is being read. Understanding 
what is read is typically identified as reading comprehension 
and involves a number of functionally separate verbal and 
nonverbal operants (e.g., intraverbal behavior, manding, tact- 
ing, listener discriminations). For example, saying “Chicago” 
upon seeing the written word “Chicago” is usually textual 
behavior. But understanding that Chicago is a city on Lake 
Michigan is not textual behavior (it is more likely intraverbal 
behavior). Textual behavior can occur with sign language as 
well. The Stokoe Notation System was designed as a form 
of written sign language (although not widely used). It has 
55 written cheremes that are analogous to the 42 vocal pho- 
nemes of English (Stokoe et al., 1965). An arrangement of 
written cheremes can be read with signs in a manner similar 
to speech because they have point-to-point correspondence 
to a specific sign. 


Taking Dictation. Taking dictation consists of writing and spell- 
ing words that are spoken. The multiple repertoires involved in 
learning this code include not only the manual production of 
letters (written, typed, fingerspelled, or punched) but also the 
spelling of words. As with textual, the code aspect of this reper- 
toire usually requires additional instruction. Taking dictation can 
occur with sign language as well. This relation involves seeing 
a sign and writing the corresponding cheremes. 


The Intraverbal 

The intraverbal relation is different from the other types of 
verbal behavior evoked by verbal SPs in that no point-to-point 
correspondence exists between the verbal stimulus and the 
verbal response (Figure 18.1). Young children typically emit 
intraverbal responses in the form of singing songs, telling 
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stories, describing activities, and so on. At a more advanced 
level, intraverbal behavior is a key component of many import- 
ant aspects of human behavior (e.g., academics, social inter- 
action, entertainment). As with the other verbal operants, the 
form of an intraverbal response can involve speech, signs, 
written words, and so on. And as with the mand and tact, to 
understand the intraverbal, one must understand its controlling 
variables. At least four different types of verbal discrimina- 
tions can be involved in intraverbal relations (Figure 18.4): 
(1) simple verbal, (2) compound verbal, (3) verbal conditional, 
and (4) verbal function-altering. 


Classifying Verbal Discriminations 


Intraverbal relations can be classified by identifying the 
type of verbal stimulus control that evokes the intraverbal 
response. In general, each of the verbal discriminations pre- 
sented below involve increasingly more complex types of 
verbal stimulus control. 


Simple Verbal Discrimination 


The first type of intraverbal behavior acquired by typically 
developing children usually involves simple verbal discrim- 
inations, where a single-component word or phrase evokes 
a nonmatching verbal response. For example, upon hearing 
“Ready, set... ,” a child says “go,” or hearing “Buzz...” a 
child says “‘lightyear.” These early intraverbal responses may 
be initially multiply controlled by MOs and other supplemen- 
tary variables (e.g., echoics, tacts), but soon intraverbal stimulus 
control is established. 


Compound Verbal Discrimination 


A compound verbal discrimination involves convergent mul- 
tiple control where two or more verbal SPs collectively evoke 
a response. However, the response evoked by the compound 
stimulus is different from the response evoked by each verbal 
SP when presented in isolation (Eikeseth & Smith, 2013). For 
example, hearing “yellow” might intraverbally evoke any num- 
ber of responses (e.g., “color,” “school bus”), and “fruit” might 
also evoke any number of responses (e.g., “apple,” “orange”), 
but when combined as “yellow fruit” they evoke a more specific 
response (e.g., “banana,” “lemon”). In a compound verbal dis- 
crimination, the individual words do not change the function 
of each other; they are parts of a larger stimulus configuration, 
similar to the way individual nonverbal stimuli (e.g., tail, paws) 
make up objects (e.g., a dog). In the next type of verbal stimulus 
control presented, the individual words in the antecedent do 
affect each other. 


Verbal Conditional Discrimination (VC?) 


Verbal stimulus control becomes more complicated when 
verbal stimuli interact with each other in the same antecedent 
configuration. For example, a conditional discrimination can 
arise between verbal stimuli in an utterance and more behavior is 
required to emit intraverbal responses (i.e., questions get harder). 
A verbal conditional discrimination occurs when one verbal 
stimulus alters the evocative effects of another verbal stimulus 
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Figure 18.4 Four types of verbal discriminations. 


Definition 


Type of Discrimination 


evoke behaviors, but when they 


Verbal Conditional 


same antecedent configuration. 


Verbal Function-altering A verbal stimulus that alters the 


within the same antecedent configuration, and that stimulus 
evokes the response (Axe, 2008; Catania, 2013; Devine, Carp, 
Hiett, & Petursdottir, 2016; Eikeseth & Smith, 2013; Kisamore, 
Karsten, & Mann, 2016; Michael et al., 2011; Sundberg, 2016a; 
Sundberg & C. A. Sundberg, 2011). For example, in the verbal 
antecedent “Who is not going to dinner?” the reinforcement his- 
tory related to the conditional word “not” alters the evocative 
effects of the phrase “going to dinner,” changing it to a differ- 
ent verbal SP that evokes a corresponding intraverbal response. 
A VC? effect can be demonstrated by showing a change in the 
evocative effect when the verbal stimulus is altered to ““Who is 
going to dinner?” 


Verbal Function-altering Effect 


Verbal stimuli can alter the functional effects of immediate 
or future S?s and MOs and, accordingly, change a listener’s 
behavior. The immediate effects involve straightforward operant 
and respondent conditioning, but the changes in future behavior 
are more complex. For example, being told “The bridge is out, 
turn left at the 7-Eleven and there will be another one in 5 miles” 
can alter the functional effects of stimuli encountered in the future 
and evoke verbal and nonverbal behavior at that time (e.g., tacting 
the 7-Eleven, turning left). Skinner (1957) described this type of 
effect as conditioning the behavior of the listener (p. 357) through 
instruction. Schlinger and Blakely (1994) and Schlinger (2008b) 
suggested the term verbal function-altering be used to describe 
the effect that occurs when verbal stimuli immediately condition 
the behavior of the listener and produce relatively permanent 
changes in the behavioral function of other stimuli and MOs 
occurring immediately or later in time The future behavior of 


Simple A single-component verbal stimulus 
that evokes a response. 
Compound A verbal stimulus that involves two 


or more S°s that each independently 
both occur in the same antecedent 
configuration, a different S° is generated. 


A verbal stimulus that alters the evocative 
effects of another verbal stimulus in the 


function of other stimuli or motivating 
operations (Schlinger & Blakely, 1994). 


Example 


Speaker: Saying “meow” after hearing “A kitty 
SaySeee 

Listener: When asked to “jump” a child emits 
jumping behavior. 

Speaker: Saying “blue” after hearing “Red, white, 
amd ee 

Listener: When asked to “clap fast,” and “clap 
slow,” “walk fast,” and “walk slow” the 
corresponding nonverbal behavior is emitted. 


Speaker: Saying “spoon” and “soap,” respectively, 
when asked, “What do you eat with?” and “What 
do you wash with?” 

Listener: Pointing to spoon and soap when asked 
the same questions presented above. 


Speaker: Singing at the right time after hearing 
“When | call your name, sing your part.” 

Listener: After hearing “When the doorbell rings, 
get your father,” the correct behavior is emitted at 
the right time. 


a listener can involve any speaker, listener, or respondent rela- 
tion. Schlinger and Blakely (1987, 1994) suggested that the use 
of Skinner’s (1969) term rule-governed behavior be reserved for 
these types of verbal function-altering effects. 

The conditioning of a speaker’s future intraverbal behavior 
would first involve the initial listener conditioning and then 
an encounter with the newly established verbal stimulus. For 
example, if a peer says to a teenage friend, “When your mother 
asks where did you go, tell her the library.” Upon encountering 
the mother’s question later, it evokes “the library” as instructed. 
Under other circumstances, hearing “Where were you?” would 
have different evocative effects (e.g., “track practice”). However, 
it is important to note that “this does all not happen in the naive 
speaker or listener; it is the end result of a long process of verbal 
conditioning” (Skinner, 1957, p. 360). 


Convergent Multiple Control Involving 
Verbal S°s, Nonverbal S°s, and MOs 


Verbal SPs not only combine with each other as a type of 
multiple control but also commonly participate with nonverbal 
Sos MOs, and respondent relations in the control of behavior. 
As with other examples of convergent multiple control, the 
effect of combining multiple variables can be additive or dec- 
remental. On the one hand, verbal S?s can provide a welcome 
addition to nonverbal stimuli, such as the information provided 
by a tour guide while viewing an historical site. On the other 
hand, verbal S?s can weaken, block, or distort control by other 
nonverbal stimuli, verbal stimuli, and MOs. For example, when 
someone talks during a movie scene the nonverbal action or 
dialog might be missed. 


Intraverbal Control 


A distinction can be made between verbal S?s that depend on 
multiple control with nonverbal S?s and MOs, and those that 
do not. Those that do not require multiple control, and have the 
relevant reinforcement history, can be classified as an intraver- 
bal operant (e.g., upon hearing “What’s 6 X 8” a child immedi- 
ately says “48”). However, some verbal stimuli only supplement 
other more critical antecedents, but nonetheless play a causal 
role in evoking behavior. Palmer (2016) recommends that “In 
cases in which the verbal antecedent is, by itself, insufficient 
to evoke the relevant response, we should speak of intraverbal 
control, usually as one of a number of concurrent controlling 
variables” (p. 99). For example, a common practice in tact train- 
ing for children with language delays is to hold up an object and 
say, “What is it?” The verbal stimulus “What is it?” by itself is 
insufficient to evoke the correct response, but, along with the 
presented object, it can be an important supplemental S? that 
helps to evoke a response from a child learning tacts. 


Private Verbal Stimulus Control 


Verbal stimuli, like MOs, are free from the support of the phys- 
ical environment; thus they can be portable and private. We can 
and do talk to ourselves, often to our benefit (e.g., self-editing, 
problem solving, thinking) (see Box 18.1). Our own covert verbal 
behavior provides an abundant supply of verbal stimuli (some- 
times too much). These verbal stimuli are no different in function 
than overt verbal stimuli, except only one person can experience 
the SP or MO effect. Covert verbal stimuli can combine with 
other variables, including overt variables. For example, while 
traveling through an airport and looking for the right terminal and 
gate, passengers may engage in a series of overt or covert verbal 
behavior that assists them in navigating various transitions. This 
verbal behavior can involve self-echoics (e.g., “Gate B31”), self- 
intraverbals (e.g., “B31 is in a different terminal”), self-textual 
(e.g., reading the gate signs), and joint control, all of which can 
participate as causal variables that ultimately evoke going to the 
right gate. Covert verbal SPs have received little experimental 
attention due to the privacy issue; nonetheless, they can play an 
important role in the interpretation of complex behavior (Moore 
2008; Palmer, 2011; Schlinger, 2008a; Skinner, 1974; C. T. 
Sundberg et al., 2018). 


Reinforcement Controlling the Duplic, 
Codic, and Intraverbal Relations 


Generalized S' provided by listeners (e.g., praise, approval) 
establishes verbal stimulus control over verbal behavior, in the 
same way it does for nonverbal stimulus control. Four of the five 
types of reinforcement identified for the tact relation apply to the 
duplic, codic, and intraverbal relations as well: (1) educational 
reinforcement, (2) termination of aversive stimuli, (3) automatic 
reinforcement, and (4) a history of contiguous or correlated 
usage. The form of reinforcement that does not apply was identi- 
fied as reinforcement from extending the listener’s contact with 
the physical environment, and that form is unique to the nonver- 
bal stimulus control. However, Skinner identifies an additional 
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type of generalized S' unique to verbal stimulus control; this 
type can be termed facilitative reinforcement. 


Facilitative Reinforcement. Skinner (1957) states that verbal 
behavior “may be reinforced because it helps in the acquisition 
of other types of verbal operants” (p. 67). Facilitative reinforce- 
ment functions as a type of automatic reinforcement, like that 
observed in emitting a chain of verbal behaviors in problem 
solving. The successful acquisition of the next step in the chain 
is facilitated by an earlier step. Such facilitative reinforcement 
plays a causal role in more complex behavioral relations such 
as joint control and BiN. For example, repeating instructions 
(e.g., turn left at the 7-Eleven) while driving can facilitate the 
emergence of joint control when the 7-Eleven is encountered 
(Lowenkron, 1998). 


Emergent Intraverbal Relations 


Research has demonstrated that intraverbal relations can 
emerge from previously acquired tact, intraverbal, and lis- 
tener repertoires, and even from direct observation (e.g., 
Allan, Vladescu, Kisamore, Reeve, & Sidener, 2014; Braam & 
Poling, 1983; Devine et al., 2016; Grannan & Rehfeldt, 2012; 
Pérez-Gonzalez & Garcia-Asenjo, 2016; Pérez-Gonzalez, 
Garcia-Asenjo, Williams, & Carnerero, 2007; Petursdottir, 
Carr, Lechago, & Almason, 2008; Smith et al., 2016). There 
are also ways that new intraverbal behavior can emerge from the 
combinations of existing verbal repertoires (DeSouza, Fisher, & 
Rodriguez, 2019; Horne & Lowe, 1996). 


Intraverbal Bidirectional Naming (I-BiN) 


Horne and Lowe (1996) suggested that emergent intraverbal 
relations could also be produced through intraverbal bidirec- 
tional naming (I-BiN) procedures (Miguel, 2016). In I-BiN a 
bidirectional relation is established between at least two ver- 
bal stimuli (e.g., “cat” and “animal”). I-BiN occurs when AB 
training on a unidirectional intraverbal relation automatically 
produces a reverse BA intraverbal relation, and vice versa. 
For example, after learning the unidirectional intraverbal rela- 
tion between “cat” as the verbal S? and “animal” as the verbal 
response, without further training a child can emit “cat” as the 
response when “animal” is the verbal SP. The components of 
I-BiN consist of a combination of echoic, self-echoic, listener, 
self-listener, and tact repertoires working together. Once estab- 
lished, I-BiN, like C-BiN, can function as a higher-order gen- 
eralized verbal operant that can produce powerful generative 
effects (e.g., Horne & Lowe, 1996; Greer & Ross, 2008; Jen- 
nings & Miguel, 2017; Miguel, 2016, 2018; Pérez-Gonzalez 
et al., 2018). 

Simple I-BiNs can be established by teaching a child a 
relation between two verbal stimuli (e.g., “shoes and socks”), 
and then teaching the child to emit overt or covert self-echoic 
repetitions of the trained intraverbal relation (e.g., “shoes and 
socks, shoes and socks”). When repeated, the basic compo- 
nents of the reverse intraverbal are contained contiguously 
in the phrase (i.e., the word “shoes” now follows the word 
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BOX 18.1 


| Shouldn't Have Opened My Big Mouth: Skinner's Analysis of Self-Editing 


Some people may say things that are offensive, socially inap- 
propriate, or untruthful. Others may make frequent verbal 
mistakes, tangle their words, or digress repeatedly. Skinner 
devoted three chapters to self-editing (Chapters 15 to 17) in his 
book Verbal Behavior (1957). He emphasized that a speaker is 
also a listener, including a listener of her own verbal behavior 
and may, to her benefit, self-edit that verbal behavior before 
or soon after it is emitted. Self-editing is extra work for a 
speaker and occurs due to a history of reinforcement or pun- 
ishment from listeners. However, Skinner notes that in many 
cases punishment may play a larger role in self-editing than 
reinforcement. 

Some types of language and social problems can be 
related to self-editing deficits. Skinner (1957) suggests two 
behavioral distinctions be made when considering self-editing 
problems. The first is between a speaker who cares about what 
she says and a speaker who does not care about what she says. 
Caring involves a motivating operation (MO) that is related 
to a history of reinforcement and punishment from listeners. 
The same speaker may care that her verbal behavior obtains 
a positive listener response from one audience, but not from 
another audience. Caring can also involve MOs related to neg- 
ative responses from listeners. A speaker may care that her 
words hurt others. 

A second distinction can be made between a speaker 
who is aware of what she says and its effect on listeners and 
a speaker who is unaware of what she says or its effects. This 
distinction is based on the degree to which a speaker can tact 
aspects of her own verbal behavior (e.g., that certain words 
might be offensive) and aspects of her listener’s behavior 
(e.g., noticing a nonattentive listener), and behave accordingly. 
Three examples of self-editing problems based on these dis- 
tinctions are presented, along with some comments regarding 
intervention. 


Offensive Verbal Behavior 


Some individuals are aware of the offensive, hurtful, or socially 
negative things they say to people, but they may not care. They 
may demonstrate weak MOs for positive reactions from listen- 
ers, strong MOs for negative reactions from listeners, or strong 
MOs for social reinforcement from peers for their offensive 
behavior. People who say negative things to other people may 
not seek treatment because they do not feel anything is wrong. 


“socks”), and this response is followed by direct or automatic 
reinforcement (Vaughan & Michael, 1982). When this reverse 
intraverbal emerges without training, this I-BiN effect is consis- 
tent with symmetry and mutual entailment (e.g., Horne & Lowe, 
1996; Miguel & Petursdottir, 2009; Pérez-Gonzalez et al., 2007; 
Petursdottir & Haflidadéttir, 2009). Once simple bidirectional 


It may be hard to change this behavior because of the difficulty 
in controlling the relevant contingencies, especially the MOs. 


Weak Verbal Repertoires 


Some individuals are aware of their difficulties in express- 
ing themselves. They might be insecure about what they say, 
tangle their words, or struggle with public speaking. Individ- 
uals experiencing these problems typically care about what 
they say, but they may have weak intraverbals, fail to tact 
listener behavior, or have interfering MOs and emotional 
responses. These individuals are more likely to seek treatment 
(e.g., Toastmasters, public speaking courses). Skinner (1957, 
pp. 405-417) describes several techniques for teaching a will- 
ing participant to edit her own verbal behavior (e.g., rehearsal 
with familiar audience, self-prompts, practice self-listening, 
practice tacting listener behavior, change the audience, change 
the motivation). 


High Rate of Verbal Behavior 


Some people may be unaware of the effects of their words on 
listeners, but they do care (a strong MO) that people respond 
positively. These individuals may engage in a high rate of 
talking, dominate a conversation, or provide excessive detail 
regarding events or their topic of interest. Typically, pun- 
ishment reduces this behavior, but there may not be enough 
punishment, or other listeners intermittently reinforce the 
behavior. These speakers may be automatically reinforced by 
their own verbal behavior, have weak listener repertoires, or 
have weak MOs for the listener’s point of view. Individuals 
experiencing these types of editing problems may not seek 
treatment because they are unaware that a problem exists, 
although intervention could be successful. 


Summary 


Skinner’s three chapters on self-editing provide a behavioral 
analysis of many types of self-editing issues and problems. 
We may open our big mouths, then wish we had not, because 
we failed to edit our verbal behavior before it was emitted. 
Self-editing requires extra work for a speaker and may not 
occur for that reason. Skinner (1957) offers many practical 
suggestions for clinicians and others interested in understand- 
ing or improving self-editing repertoires. 


relations are established between words, new words can be 
added to the class (e.g., “feet” “shirt’”) and ever more forms 
of emergent intraverbal behavior can occur (e.g., saying “feet” 
upon hearing “shoes go on your... ”). 

When a stimulus change affects one member of an I-BiN 
class, its effects can transfer to all members of the class. 


For example, ifa child has acquired the C-BiNs “dangerous” and 
“spider,” and then learns the I-BiN “spiders are dangerous,” this 
new verbal stimulus can produce a generative verbal function- 
altering effect (Schlinger & Blakely, 1994) for the whole 
stimulus class of spiders. For example, later when the child 
sees a spider, if she tacts the spider the response product of 
the tact (i.e., hearing the word “spider’’) evokes the intraverbal 
response “dangerous,” bringing with it a history of listener 
behavior. Furthermore, the nonverbal stimulus of the spider 
is present when reinforcement is delivered, and as a result the 
tact of the spider as “dangerous” may also emerge. The new 
tact and intraverbal relations involving the word “dangerous” 
are likely members of several different equivalent and non- 
equivalent classes, and their effects can transfer throughout 
those classes (Chapter 20). For example, the previously neutral 
spider may now function as a conditioned stimulus and elicit 
respondent behaviors or as a CMO-R that evokes avoidance 
behaviors or as a nonverbal S? that evokes the emergent tact 
“dangerous.” 

The verbal function-altering effect can also transfer to any 
stimulus that is in the same equivalence class as spiders (e.g., 
mosquitoes, ants, crickets) and similar respondent, avoidant, and 
tacting behavior can emerge. What makes this transformation of 
function important to generative language is that it demonstrates 
how an emergent tact relation can produce an emergent intra- 
verbal relation, further demonstrating the far-reaching effects 
of combining verbal cusps (Greer & Ross, 2008; Hayes et al., 
2001; Horne & Lowe, 1996; Rosales-Ruiz & Baer, 1997). 

There is a growing body of research on more complex 
I-BiNs and their generative effects (e.g., Carp & Petursdottir, 
2015; Greer & Ross, 2008; Jennings & Miguel, 2017; Ma, 
Miguel, & Jennings, 2016; Pérez-Gonzalez et al., 2018; Santos, 
Ma, & Miguel, 2015). For example, Jennings and Miguel (2017) 
examined the effects of intraverbal and tact training on the emer- 
gence of stimulus equivalence. These researchers first taught 17 
adult participants to tact birds (A), trees (B), and reptiles (C). 
Then the experimenters taught them unidirectional intraverbal 
relations between specific sets of stimuli (e.g., “The tree for 
Cardinal [A1] is Buckeye [B1] and the reptile for Buckeye [B1] 
is Black Racer [C1]”). Participants were then administered a 
visual-visual MTS test and an intraverbal test to assess perfor- 
mance for equivalence class formation. In addition, participants 
experienced four novel sets of stimuli to assess for generaliza- 
tion. The results for 4 of the 17 participants are presented in 
Figure 18.5. The closed circles, squares, and triangles represent 
three types of matching-to-sample tasks that constitute equiva- 
lence. Baseline data on equivalence show that these participants 
responded at chance level. However, following tact and intraver- 
bal training, all 4 participants demonstrated nearly 100% correct 
performance on all equivalence measures. In addition, all the 
participants demonstrated emergent intraverbal behavior that 
participated in establishing the equivalence relations. In total, 
13 of the 17 participants demonstrated the emergence of stimu- 
lus equivalence. These authors concluded that the participant's 
equivalence performance was established by the participant’s 
own C-BiN and I-BiN verbal behavior. 
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LISTENER BEHAVIOR 


Skinner’s Verbal Behavior (1957) was primarily about the 
behavior of the speaker, but Skinner argued that a complete 
account of language must include both speaker and listener 
behavior. In contrast, linguists tended to ignore the speaker and 
were mainly concerned with the listener (Skinner, 1978). Grad- 
ually however, Skinner’s book began to affect linguistic theory 
(Andresen, 1991). In a letter Skinner sent to me in 1989 (see 
Box 18.2) he muses, “It is quite amazing how linguists are com- 
ing around to the position of my book. They had to get around 
to the behavior of the speaker sooner or later, having spent 
centuries on how verbal behavior is understood by the listener.” 
Despite Skinner’s focus on the speaker, Schlinger (2008b) 
noted the following: “Skinner did not neglect the listener. In 
fact, he made frequent mention of the listener throughout the 
book—the word listener occurs 793 times compared to 893 
instances of the word speaker” (p. 312). In addition two chap- 
ters of Verbal Behavior are primarily devoted to analyses of 
listener behavior (Chapters 6 and 7). 


Different Roles of a Listener 


A listener is an active participant in verbal exchanges and 
serves many roles in this capacity. Skinner (1957) makes a dis- 
tinction between the nonverbal and verbal duties of a listener. 
The nonverbal duties include serving as an audience (an SP or 
MO) and providing reinforcement for speakers. Reinforcement 
for a speaker typically involves some kind of listener action 
directly controlled by the speaker’s words. Listener action 
can be nonverbal (e.g., a child gets his shoes when asked to 
do so) or it can be verbal (e.g., a child says “Can you help 
me?’”’). When a listener takes verbal action, she now becomes 
a speaker, and her behavior should be interpreted as such (e.g., 
a mand). Speakers and listeners reverse roles, often rapidly in 
verbal exchanges, and usually speakers and listeners are also 
self-listeners of their own verbal behavior. In any given verbal 
exchange, all of these repertoires interact with each other and 
with the current context and contingencies. The interaction 
between speakers and listeners (including self-listeners) con- 
stitutes what Skinner termed a verbal episode, and can be 
considered the basic speaker-listener unit of verbal behavior. 
The roles of the listener and the elements of a verbal episode 
will be described in more detail. 


Mediator of Reinforcement 


One role of a listener is to consequate a speaker’s behavior, 
which can occur in a number of ways. Eye contact, head nods, 
social praise, smiles, and so on can all function as generalized 
S'. But listeners do more than just directly reinforce a speaker 
for emitting verbal behavior; they reinforce the speaker’s behav- 
ior by acting on her verbal behavior. For example, if a speaker 
says, “Can you hand me the remote?” the listener’s nonverbal 
behavior of passing over the remote functions as specific rein- 
forcement for the speaker’s verbal behavior and abolishes the 
MO. A listener may also ignore or punish a speaker’s verbal 
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From “Training Intraverbal Bidirectional Naming to Establish Generalized Equivalence Performance” by A. M. Jennings and C. F. Miguel, 
The Journal of the Experimental Analysis of Behavior, 108, p. 279. Reproduced with permission of John Wiley & Sons, Inc. 


behavior, likely resulting in an extinction burst or decrease in 
responses. These types of consequences delivered by listeners 
shape and maintain speaking behavior, and help bring it under 
SP and MO control. 


Serving as an Audience 


The listener also serves as a nonverbal SP for a speaker’ s 
behavior. Skinner (1957) devotes an entire chapter to the role 
of the listener as an audience, and in doing so identifies three 
subdivisions of an audience. The first and largest subdivision 
is the language spoken (e.g., an English-speaking audience 
evokes English words). The second level involves special 
audiences who evoke differential responses. For example, one 
might use the term “reward” when talking with parents and 


laypersons but the term “reinforcement” when talking with 
behavior analysts. The third and most extensive role of the 
audience is to “select a subject matter” (Skinner, 1957, p: 175). 
We talk about different topics with different people. For exam- 
ple, a discussion involving sports may be reinforced by one 
member of a speaker’s verbal community, but punished by 
another, and each audience acquires multiple and generative 
discriminative functions. 


Understanding a Speaker 


The most complex aspect of a listener’s behavior involves 
what can be identified as understanding what a speaker says. 
Understanding on the part of a listener can be demonstrated by 
nonverbal or verbal behaviors. Nonverbal understanding can be 
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B. F. Skinner's Letter To Mark Sundberg 


BOX 18.2 


HARVARD UNIVERSITY 
DEPARTMENT OF PSYCHOLOGY 


WILLIAM JAMES HALL 
33 KIRKLAND STREET 


CAMBRIDGE, MASSACHUSETTS 02138 


August 7, 1989 


Dr. Mark L. Sundberg 


Dear Mark: 


Thanks for the copy of the new edition. I thought there were some excellent things in it. I like the 
review of Bruner. It is quite amazing how the linguists are coming around to the position of my book. 
They had to get around to the behavior of the speaker sooner or later, having spent centuries on how 


verbal behavior is understood by the listener. 


The book you cited must have been my Recent Issues in the Analysis of Behavior, not anything to do 


especially with verbal behavior. 


Things are going well here. I am transcribing and editing a draft of my paper at ABA for the Journal 


of Behaviorology. 


All the best, 


B.F Skinner 


B. F. Skinner’s Letter To Mark Sundberg HARVARD UNIVERSITY DEPARTMENT OF PSYCHOLOGY. Used with permission 


demonstrated by a verbal stimulus that evokes a corresponding 
nonverbal response. For example, in standard receptive lan- 
guage training, a child may be shown an array of buildings and 
asked, “Where is the post office?” The nonverbal selection of 
the picture of a post office would demonstrate a type of under- 
standing. Verbal understanding can be demonstrated by a verbal 
stimulus that evokes an appropriate verbal response from the 
listener (who is now a speaker). For example, if a tourist asks a 
passerby for the location of a post office, and the passerby says, 
“About three blocks down, on the left,” understanding what the 
tourist asked is demonstrated by the intraverbal behavior emitted 
by the passerby. 

The same four types of verbal discriminations previously 
described for intraverbal relations can be used as a framework 
to identify various types of nonverbal listener discriminations 
(Figure 18.4). In addition, there is a fifth type of verbal dis- 
crimination that is specific to listener behavior commonly 
termed an auditory conditional discrimination (e.g., Saunders 
& Green, 1999; Sidman et al., 1982). Each of these types of lis- 
tener discriminations demonstrates how multiple control in the 
form of a conditional discrimination between verbal and non- 
verbal stimuli is an important aspect of listener understanding. 


Speaker and Listener Verbal Episodes 


Figure 18.6 presents an example of a verbal episode in which 
a speaker in a restaurant asks a waiter, “May I have some 
bread?” The sources of antecedent control are multiple: (1) an 
MO related to bread (MO)), (2) the general context (the restau- 
rant) (nonverbal Shay (3) a receptive audience (the waiter) 
(nonverbal S>.,): if the waiter asks, “Do you need anything 
else?” these (4) verbal stimuli are relevant (verbal Sep) and if 
bread is visible, that (5) nonverbal stimulus (nonverbal SP) 
could also participate in the antecedent configuration. For the 
listener (the waiter), his behavior is controlled by the speaker's 
words, “May I have some bread?” (verbal SP,). The speaker’s 
words can have multiple divergent effects on listener behavior. 
They can evoke a verbal response (€.g., “Yes, of course”) and 
a nonverbal response (e.g., the waiter delivering bread), but 
only the delivery of bread functions as the ultimate specific 
reinforcement for the speaker, and changes the function of the 
MO to an abolishing operation (AO). Generalized S' in this 
example, although encouraging, would not satisfy the MO. 
There can be other variables involved in this episode as well 
(e.g., listener MOs, respondent relations, covert behaviors). 
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Figure 18.6 A verbal episode between a speaker and a listener. 


This interaction could also generate new MOs and Ss (e.g., 
butter, a glass of wine). 

Figure 18.6 also demonstrates how the same word can 
have different meanings for speakers and listeners, and why 
separate accounts are necessary. In this example, the same 
word “bread” and its nonverbal referent are involved for each 
party, but the word “bread” appears in different positions in 
the individual four-term contingencies affecting the speaker 
and listener. For the speaker, “bread” is a response, but for the 
listener, “bread” is a stimulus. In addition, the word has differ- 
ent functions for each participant. For the speaker, “bread” is 
a response in a mand relation, but for the listener “bread” is a 
verbal SP that evokes a nonverbal discrimination. Thus, what the 
word “bread” means for the speaker is not the same as what 
the word “bread” means for the listener, despite having the same 
formal properties and nonverbal linguistic referent. 


Emergent Listener Relations 


A listener is an active responder, and new listener relations can 
emerge through the process of stimulus and response general- 
ization. In addition, new relations can emerge through gener- 
ative processes produced by the combinations of previously 
acquired skills. Several studies have demonstrated that novel 
listener behaviors can emerge from direct training on speaker 
relations (e.g., Byrne, Rehfeldt, & Aguirre, 2014; Cuvo & Riva, 
1980; Ingvarsson, Cammilleri, & Macias, 2012: Lowe, Horne, 
Harris, & Randle, 2002; Petursdottir & Carr, 2011; Petursdottir 
& Haflidadéttir, 2009). For example, Ingvarsson et al. (2012) 
showed that direct intraverbal training was sufficient to generate 
emergent listener discriminations for children with autism. 


Common Bidirectional Naming (C-BiN) 


The merging of speaker and listener cusps is the foundation 
of BiN. Once a child has acquired the basic echoic, tact, lis- 
tener, and self-listener cusps, the prerequisite repertoires for 
C-BiN are established and a novel word acquired (or observed) 
as a tact can immediately generate a listener relation without 
further training (i.e., symmetry, mutual entailment). Several 
studies have demonstrated this type of bidirectional naming 
and its effects on emergent listener behavior (e.g., Gilic & 
Greer, 2011; Greer & Ross, 2008; Horne & Lowe, 1996; 
Miguel & Kobari-Wright, 2013). 


Conditioning the Behavior of the Listener 


Emergent listener behavior can also occur through the behay- 
ior of a speaker who provides instructions (rules) that condi- 
tion the immediate or future behavior of a listener (Schlinger, 
2008b; Skinner, 1957, 1969). A speaker’s words can alter the 
functional properties of other words, objects; and events for 
a listener, but not until those stimuli are encountered (Sch- 
linger & Blakely, 1994). For example, if a speaker says, “Don’t 
go to Bill’s Restaurant—it’s horrible,” any stimulus in that 
equivalence relation can acquire an emergent evocative effect 
if encountered. Bill’s Restaurant may now evoke avoidant 
behavior or the tact “horrible.” In addition, the restaurant, its 
owner, or patrons may now evoke behavior in equivalence 
relations with horrible things, which might include aggression 
or property destruction (e.g., vandalism). These transforma- 
tion of function effects demonstrate the generative effect of 


instructions, and show the potential impact that words can have 
on behavior. 


AUTOCLITIC VERBAL BEHAVIOR 


In the process of acquiring language, a child learns to listen 
to the words of others. Along the way, the child also learns to 
listen and react to her own words. She may self-edit, qualify, 
order, negate, or elaborate on her words. This is an important 
and complex secondary type of verbal behavior that is shaped 
and reinforced by listeners in a speaker’s verbal community. 
For example, a supervisor may pause briefly before saying to a 
staff member, “That was good.” What might be the behavioral 
effects of the pause? Does it have any special meaning to a 
listener? The pause can produce a verbal stimulus that accom- 
panies what the listener hears, and it can change the way a lis- 
tener reacts to what he hears. Speakers manipulate their own 
verbal behavior, often accidentally or unknowingly, because 
such behavior is reinforced by listeners. Skinner terms this 
secondary verbal behavior autoclitic behavior. 

The autoclitic relation involves “two systems of 
responses, one based upon the other. The upper level can 
only be understood in terms of its relation to the lower level” 
(Skinner, 1957, p. 313). The upper level may be described as 
a speaker’s verbal behavior about her own lower level ver- 
bal behavior. Michael (1991, 1992) suggested that these two 
levels of responses be identified as primary verbal behavior 
as the lower level and secondary autoclitic behavior as the 
upper level. A primary verbal response constitutes any of the 
verbal operants (e.g., mand, tact, intraverbal, textual), and a 
secondary autoclitic response is controlled by “some feature 
of the primary verbal operant or by its controlling variables” 
(Michael, 1992, p. viii). For example, a primary response such 
as “The ice is solid” may be accompanied by a secondary 
response, “I think,” that informs the listener of the tenuous 
nature of the primary tact. Autoclitic behavior is helpful to 
listeners who may prompt a speaker to identify the sources 
of control for what she says, as in “Did you test the ice?” The 
autoclitic response tells the listener more than just the primary 
response on its own, and listeners reinforce speakers for this 
extra information. 

Michael (1991) suggests that not all of a speaker’s sec- 
ondary verbal behavior has an autoclitic function. Some of a 
speaker’s response to her own verbal behavior consists simply 
of more primary verbal behavior. For example, covert verbal 
behavior regarding an upcoming meeting may evoke self-intra- 
verbals or self-mands regarding that meeting. Michael (1991) 
identifies this type of response to one’s own verbal behavior as 
simple secondary verbal behavior. There are several ways to 
distinguish between secondary autoclitic responses and simple 
secondary responses. For example, a secondary autoclitic 
response cannot affect behavior on its own; it must involve 
some aspect of a primary response. Simple secondary behavior, 
requires a strict temporal relation between the primary response 
and the secondary response, with the primary response always 
occurring first and generating the relevant verbal S?s that evoke 
the secondary verbal behavior. The autoclitic relation differs in 
that the primary response and secondary autoclitic response can 
occur in any order, or simultaneously (e.g., “I think that’s the 
right exit,” “That’s the right exit, I think,” or slowly emitting 
“right’”). Peterson and Ledoux (2014) suggested that secondary 
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autoclitic relations be further classified as autoclitic mands and 
autoclitic tacts. 


Autoclitic Mand 


The autoclitic mand is a type of secondary autoclitic behay- 
ior that has the same defining features as a primary mand 
(responses controlled by an MO and specific reinforcement), 
except the autoclitic mand is secondary verbal behavior that 
enjoins a listener to take some specific action regarding the 
primary response, and autoclitic behavior is reinforced by the 
listener’s action. For example, a speaker who says, “Believe 
me, they are wrong,” is blatantly manding to the listener with 
“believe me” to verbally accept as fact, without challenge, her 
statement “They are wrong.” The autoclitic mand can take any 
form, accompany any type of verbal interaction, and range 
from subtle to obvious. 


Autoclitic Tact 


An autoclitic tact is a type of secondary verbal behavior that is 
controlled by some nonverbal feature of the primary response 
or its controlling variables, and the autoclitic response informs 
the listener of that feature. This behavior is reinforced by lis- 
teners who provide generalized S'. For example, a speaker 
may say, “I’m sure it’s safe,” where the secondary autoclitic 
response “I’m sure ” is a tact of the strong source of con- 
trol for “it’s safe.” Skinner (1957) identified several types of 
autoclitic tacts. A descriptive autoclitic tact is controlled by 
the nonverbal properties of the antecedent events and their 
relation to the primary response. There are several types of 
descriptive autoclitics. For example, an autoclitic tact of 
strength could be observed in a speaker who might have a 
tendency to say, “That’s the right house” (a primary response), 
and covertly tacts the strength of the nonverbal stimulus con- 
trol (a secondary autoclitic response) and tags on, “I’m sure,” 
before “That’s the right house.” “I’m sure” is a descriptive 
autoclitic tact that informs the listener of the strength of the 
primary relation and as a result alters the listener response to 
“That’s the right house.” The listener reinforces the speaker’ s 
behavior because he is benefited by this additional information. 
Quantifying autoclitic tacts inform the listener of the 
range of application of a primary response. For example, 
a speaker may tag on the autoclitic tact “some” to the 
primary tact “Behavior problems are caused by attention.” 
The autoclitic “some” informs the listener that the whole 
statement that follows is restricted in its range. If “all” had 
been tagged on to the primary response, the listener would 
behave differently. Relational autoclitic tacts inform the lis- 
tener of agreement between aspects of the primary responses. 
For example, possessive relations indicated by an ’s can be 
tagged to a noun, as in the “supervisor's recommendation,” 
and function as a tact of the source of control for the 
recommendation, and as a tact of the agreement between the 
source and the recommendation. Skinner suggested that auto- 
clitic relations play a causal role in complex verbal activities 
involving grammar, syntax, composition, grouping, ordering, 
punctuation, predication, and so on (e.g., Palmer, 2016). 
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Intraverbal Autoclitic Frames 


An intraverbal autoclitic frame facilitates the relation between 
words in a given utterance by providing order, agreement, 
grouping, and composition of larger units of verbal behavior. 
Moore (2008) describes an autoclitic frame as “conventionally 
prescribed formats of sequences for emitting verbal behav- 
ior, as in subject-verb-object or actor-action-object sentences. 
These frames may become very generalized, such that they 
come to be applied in many other situations than the origi- 
nal” (p. 208). Skinner (1957) provides the following example 
of an autoclitic frame: 


If (a boy) has acquired a series of responses such as 

the boy’s gun, the boy’s shoe, and the boy’s hat, we may 
suppose that the partial frame the boy’s__ is available 
for recombination with other responses. The first time 

the boy acquires a bicycle, the speaker can compose a new 
unit the boy’s bicycle. ... The relational aspects of the 
situation strengthen a frame, and specific features of the 
situation strengthen the responses fitted into it” (p. 336). 


Palmer (2016) notes “many of the structural properties of 
verbal behavior that so excite the linguist arise, at least in part, 
from the prevalence of intraverbals and intraverbal control. In 
particular, autoclitic frames and grammatical tags are largely 
intraverbal” (p. 101). Speakers must acquire these intraverbal 


autoclitic frames and tags because they play an important role 
in the effective execution of verbal behavior. Very little research 
on autoclitic frames is available, but some has begun to emerge 
(e.g., Luke, Greer, Singer-Dudek, & Keohane, 2011; Martins, 
Hiibner, Gomes, Pinto Portugal, & Treu, 2015; Speckman, 
Greer, & Rivera- Valdes, 2012). 


APPLICATIONS OF SKINNER’S (1957) 
ANALYSIS OF VERBAL BEHAVIOR 


One of the most developed applications of Skinner’s work 
can be found in the behavioral treatment of autism or other 
types of intellectual disabilities.4 Linguistic problems are often 
the primary handicapping conditions of those diagnoses, and 
effective language assessment and intervention strategies are 
essential for treatment programs. Joe Spradlin was the first to 
publish an application of the concepts from Verbal Behavior 
to the communication needs of the intellectually disabled. He 
created the Parsons Language Sample (1963), which was an 
assessment tool based on the verbal operants. Spradlin was 
also instrumental in the early research and development of 
language intervention programs based on verbal behavior (e.g., 
Spradlin, 1966). Jack Michael was one of the first professors to 
offer a full course using Skinner’s (1957) book Verbal Behav- 
ior, and he spearheaded the further development of Skinner’s 
analysis of verbal behavior (see Box 18.3). 


BOX 18.3 


Jack Michael and Skinner's Analysis of Verbal Behavior 


Jack Michael began teaching behavior analysis in 1955 at the 
University of Kansas. He used Skinner’s (1953) Science and 
Human Behavior as the textbook for the course. In that book, 
Skinner mentioned his upcoming book Verbal Behavior in a 
footnote (p. 210). Jack contacted Skinner regarding the book, 
and Skinner sent him an early version of Verbal Behavior. Jack 
began to incorporate verbal behavior content into his courses. 
After the book became available, Jack developed a full course 
on verbal behavior and taught that course nearly every aca- 
demic year until his retirement in 2003. He maintained, like 
Skinner (1953), that an analysis of verbal behavior was essen- 
tial for a complete account of complex human behavior (e.g., 
Michael, 1984). 

One of Jack’s primary goals as a teacher was to impart 
to students the verbal repertoires necessary to precisely use 
the concepts and principles of behavior analysis to analyze 
behavior, including verbal behavior, in any context (Michael, 
1995, 2004). In the process of teaching others, he was con- 
stantly advancing his own understanding of Skinner’s writings 
while refining various concepts along the way (e.g., motivat- 
ing operations, positive and negative reinforcement, automatic 
reinforcement, topography-based and selection-based verbal 
behavior, duplic and codic relations). 

Applied research on verbal behavior was important 
to Jack, but it was slow in developing, despite Skinner’s 


(1957) prediction that “[t]he formulation is inherently 
practical and suggests immediate technological applica- 
tions at almost every step” (p. 12). In 1976, Jack started 
offering a graduate course at Western Michigan Univer- 
sity titled “Verbal Behavior Applications.” That course 
inspired many students, especially those needing theses 
or dissertations, to Conduct verbal behavior research. The 
Midwestern Association for Behavior Analysis (MABA) 
convention provided contingencies to present the research, 
and a verbal community arose that eagerly supported verbal 
behavior research. 

Jack and W. Scott Wood were instrumental in starting 
the Verbal Behavior Special Interest Group (VB SIG). They 
co-chaired the first meeting, which was held at the 1977 
MABA convention. Skinner attended and participated in the 
discussions, as did many other prominent behavior analysts. 
It was suggested that an outlet for verbal behavior research 
and other instructional material was needed. Under Jack’s 
direction, the VB SIG established a newsletter, the VB-NEWS, 
which ultimately became the journal The Analysis of Verbal 
Behavior. Thanks to Jack’s relentless dedication to understand- 
ing, teaching, advancing, and applying Skinner’s analysis of 
verbal behavior, we now have a robust body of conceptual 
and empirical research, and many highly successful clinical 
applications. 


APPLICATIONS TO LANGUAGE ASSESSMENT 
AND INTERVENTION 


Behavior analysis provides three major contributions to 
language assessment and intervention programs for individu- 
als with special needs: (1) applied behavior analysis technol- 
ogy, (2) a behavioral analysis of human development, and (3) a 
behavioral analysis of language. 


Applied Behavior Analysis (ABA) Technology 


Behavioral technology provides teachers, speech-language 
pathologists (SLPs), parents, and others with the necessary 
procedures and strategies to assess and teach language skills 
to those who need formal instruction (e.g., Leaf & McEachin, 
1999; Lovaas, 1987, 2003; Maurice, Green, & Luce, 1998: 
Romanczyk & McEachin, 2016). Effective use of behavioral 
procedures requires specific training and supervision. For exam- 
ple, staff must know when and how to use reinforcement, extinc- 
tion, prompting, fading, and so on. The content of the current 
book describes the essential features of ABA, as do a number of 
other behavior analysis textbooks (e.g., Malott & Shane, 2013; 
Martin & Pear, 2015; Miltenberger, 2016; Sulzer-Azaroff, 
Mayer, & Wallace, 2013). 


A Behavioral Analysis of Human Development 


The development of language and social skills of typically 
developing children provides a valuable guide and framework 
for working with children with language delays. Knowing what 
skills to teach and when those skills appear in typical language 
development is an important component of establishing and 
sequencing a language intervention program. Behavior analy- 
sis has a long history of child development research, beginning 
with the books by Bijou and Baer (1961, 1965, 1967). Their 
work provides a behavioral analysis of how children learn and 
develop skills throughout their early childhood. There is a ded- 
icated journal (Behavioral Development Bulletin) and numer- 
ous extensions and applications of a behavioral approach to 
human development (e.g., Greer & Koehane, 2005; Morris et 
al., 1982; Novak & Pelaez, 2004; Rosales-Ruiz & Baer, 1997; 
Schlinger, 1995). 


A Behavioral Analysis of Language 


Skinner’s (1957) analysis of verbal behavior provides a frame- 
work for language assessment and intervention. A verbal behav- 
ior assessment program involves identifying the strength of the 
elementary verbal operants, listener skills, self-listener skills, 
multiply controlled responding, and speaker-listener interactions 
in a developmentally sequenced fashion. Intervention is based 
on the assessment results and focuses on establishing missing 
or impaired speaker or listener skills through the use of ABA 
teaching procedures. Several publications are now available 
that provide detail on how to apply Skinner’s analysis of verbal 
behavior in working with individuals who have language delays 
(e.g., Barbera & Rasmussen 2007; Carr & Miguel 2012; Dixon, 
2014; Greer & Ross, 2008; LeBlanc & Dillon, 2009; McGreevy 
et al., 2012; Partington, 2006; Schramm, 2011; Sundberg, 2014, 
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2016b; Sundberg & Michael, 2001; Sundberg & Partington, 
1998; Weiss & Demiri, 2011). 


The Importance of Language Assessment 


Parents and pediatricians usually monitor a child’s acquisition 
of language, noting important milestones while watching for 
discrepancies. Language delays are associated with a variety 
of diagnoses (e.g., autism, communication disorders) and early 
identification is valuable. If a child cannot communicate with 
words, negative behavior (e.g., tantrums, aggression, social 
withdrawal) may serve the communication function. When a 
chiid’s language skills fail to develop, or are impaired in some 
way, professional assistance should be obtained, such as ser- 
vices from an SLP, behavior analyst, or psychologist. 

A language assessment for a person with language delays 
has three primary goals: (1) to identify the nature of the delay 
or problem, (2) to compare the person’s performance with 
expected norms, and (3) to provide guidance for an appropri- 
ate intervention program if necessary. Each type of speaker 
and listener behavior needs to be assessed independently and 
in various combinations (e.g., C-BiNs, equivalent and non- 
equivalent relations) to obtain baseline levels of performance 
and to identify the direction of an intervention program. The 
goal is to determine what a child can and cannot do (e.g., 
mand, echo, listen, tact, demonstrate generative learning). The 
value of assessing each of the speaker and listener domains, 
and the interaction between them, is that it is common to 
encounter language-delayed children who can emit a word as 
one type of verbal behavior (e.g., a tact) but not as another 
(e.g., a mand). 

A variety of standardized language assessments are avail- 
able, but none provide measures of all of the verbal operants 
and listener skills (Esch, LaLonde, & Esch, 2010). However, 
several criterion-referenced language assessments are available 
that do measure each of the verbal operants and listener reper- 
toires (e.g., Dixon, 2014; McGreevy et al., 2012; Partington, 
2006; Partington & Sundberg, 1998; Spradlin, 1963; Sundberg, 
1983, 2014; Sundberg & Partington, 1998; Sundberg et al., 
1979). These assessments are framed by the elementary verbal 
operants, listener behavior, various combinations of speaker 
and listener behavior, and perceptual behavior. In the following 
sections, early language assessment and intervention strategies 
will be described for each of the individual speaker and listener 
domains. 


Mand Assessment 


Assessing a child’s mand repertoire involves identifying the 
degree to which MOs control verbal responses. The mand clas- 
sification system previously described can guide the assessment 
process. Specifically, the distinctions between the types of MOs 
(i.e., UMO, CMO-T, CMO-R) and their evocative effects on ver- 
bal behavior can be assessed. For example, does the child emit 
a mand when deprivation UMOs are at strength (e.g., saying 
“milk” when she wants milk) or when aversive stimuli function 
as UMOs (e.g., when it is cold, saying “Let’s go in”). A similar 
process is necessary for learned MOs such as the CMO-T (e.g., 
manding for a missing toy) and the CMO-R (e.g., manding to fix 
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a broken toy). In addition, the delivery of specific reinforcement 
should abolish (AO) the CMO-R (e.g., the toy is fixed) and this 
effect should be assessed. 

Mands can also be assessed across MOs relevant to the 
different parts of speech (e.g., MOs related to action, location, 
possession) and in social interactions between speakers and 
listeners (e.g., manding to peers). Also, Skinner’s nine types 
of mands (Figure 18.3) can provide a further quantification 
of the different MOs that control mands for a specific person. 
Mands involving multiple variables should also be assessed, 
such as those involving multiple MOs (e.g., manding “I want 
scissors and Play-Doh’), MOs accompanied by nonverbal SPs 
(e.g., “I want that red one”), MOs accompanied by verbal SPs 
(e.g., manding for a specific video game after a salesperson’s 
pitch), and MOs accompanied by both verbal and nonverbal 
SPs (e.g., asking a waiter for bread). 


Barriers to Manding 


An important component of an intervention program is an anal- 
ysis of the language, learning, and social barriers demonstrated 
by a child during verbal interactions with others (Sundberg, 
2014). The mand is more susceptible to interference from bar- 
riers than are the other verbal operants because of the mand’s 
relation to MOs and specific reinforcement. These variables 
are powerful and can quickly evoke and shape negative behav- 
ior. For example, a child’s scream may immediately evoke eye 
contact and attention from an adult, and as a result screaming 
behavior increases and comes under MO and S? control. These 
naturally occurring contingencies may be more efficient than 
the carefully nurtured and programmed contingencies offered 
in a structured intervention program. There are several com- 
mon barriers involving MOs and specific reinforcement, such 
as demand weakening the MO, quick satiation, competing MOs 
and SPs, response forms inconsistent with the MO, and gener- 
alized CMO-Rs dominating behavior. 


Mand Intervention 


If a toddler fails to develop a mand repertoire, specific interven- 
tion may be necessary. Without words, signs, or icons as mands, 
the probability of negative behavior (e.g., tantrums) functioning 
as mands increases. The initial focus of an intervention program 
for nonverbal learners is usually on both establishing acceptable 
mand forms (preferably speech, but signs and icon selection 
can also be effective) and reducing existing forms of negative 
behaviors as mands. 


Response Form 


An important aspect of mand training for a nonverbal child is to 
establish a reliable response form that can produce the best pos- 
sible effects on a listener. In early mand training there are three 
possible choices: speech, sign language, and icon-selection sys- 
tems (in some special cases, written words can also be used as 
a fourth option). The primary focus should be on speech as a 
response form. However, if a child fails to acquire speech, sign 
language and icon-selection systems can work as mand forms 
(e.g., Frost & Bondy, 1994; Sundberg & Partington, 1998). Deter- 
mining which form is most appropriate for an individual child 


depends on a variety of factors that should be carefully considered 
(Bondy, 2012; Carbone, Sweeney-Kervin, Attanasio, & Kasper, 
2010; Sundberg, 1993a; C. T. Sundberg & Sundberg, 1990; 
Tincani, 2004). 


Using MOs for Instructional Purposes 


Teaching children with absent or impaired mand repertoires 
requires special skills on the part of caregivers. In addition to 
basic ABA knowledge, adults must be able to identify and con- 
trol the different MOs and related specific reinforcement as part 
of their teaching skills. If the relevant MO is not at strength, 
early mand training is impossible. MOs are often difficult to 
control because they can vary in strength across time, be fleet- 
ing, have an instant or gradual onset or offset, and combine 
or compete with other MOs and SPs. In addition, too high a 
response requirement placed on a child may quickly weaken an 
MO (Alling & Poling, 1995). 

Two general methods are used for identifying and con- 
trolling the value change of MOs for mand training: (1) captur- 
ing the MO value change as it occurs naturally and (2) creating 
the MO by generating some change that alters the value of a 
consequence (Michael, 1993; Sundberg, 1993b, 2013). Captur- 
ing motivation in the natural environment can be accomplished 
by observation of collateral behaviors that may be evoked by an 
MO (e.g., looking at a certain toy). Collateral behaviors provide 
insight as to the presence of a private MO, and thus an oppor- 
tunity to teach a new mand to a child (e.g., “ball’”’). Creating 
motivation requires an adult to make some direct change (e.g., 
presenting a desired iPad without turning it on) that increases 
the value of some other change (e.g., turning on the iPad). The 
onset of this MO value change can be used to establish a new 
mand relation (e.g., “iPad on”). 


Capturing and Creating UMOs 


The deprivation-based UMOs (e.g., food, drinks) are often an 
easier place to start for early or difficult learners because of 
the powerful evocative effects of these MOs (e.g., teaching 
“cracker” or “juicé™ as an early mand). UMOs can be captured 
for mand training purposes. For example, if a child reaches for 
a cracker, capture that MO and conduct a mand teaching trial. 
UMOs can also be created. For example, give the child some 
salty foods to increase the value of drinks as reinforcement. 
These conditions are also optimal for the implementation of 
pairing procedures to facilitate the development of additional 
behaviors, sp... and forms of S‘ (e.g., Esch, Carr, & Grow, 2009; 
Rader et al., 2014; Stock et al., 2008). 


Capturing and Creating CMO-Ts 


Capturing CMO-Ts in a child’s natural environment can be 
accomplished by observing the events that increase the value of 
some object or event. For example, after an adult blows bubbles 
with a wand, a child demonstrates excitement while watching 
the bubbles float and pop. After they pop, the value of the wand 
and specific adult behavior increases, and that is the time to con- 
duct a mand trial. Additional ways to capture MOs include the 
observation of established chains of behavior that are interrupted 
(e.g., missing toy piece), routines that are difficult (putting on 


a jacket), and preferences (e.g., a certain cup). The pioneering 
work of Hart and Risley (1975) on incidental teaching exempli- 
fies these teaching strategies. 

{ CMO.-Ts can be created by caregivers for language instruc- 
tion as well (e.g., Albert, Carbone, Murray, Hagerty, & Swee- 
ney-Kerwin, 2012; Hall & Sundberg, 1987; Landa, Hansen, & 
Shillingsburg, 2017; Oleson & Baker, 2014). For example, if a 
child likes to cut Play-Doh with scissors, then the presentation 
of Play-Doh may increase the value of scissors, demonstrated 
by the child’s collateral behavior of reaching for scissors. The 
adult could block access to the scissors to control the MO and 
teach the mand. Establishing these mands can be accomplished 
by the use of prompting, fading, differential reinforcement, and 
transfer procedures (Sundberg & Partington, 1998). 


Parts of Speech. The parts of speech provide a developmen- 
tal framework for more advanced mand training. For example, 
in addition to mands for edible and tangible reinforcement, a 
child needs to learn mands for actions (verbs), for properties of 
objects (adjectives), for relations between objects (prepositions), 
and so on. All of these mands are controlled by different MOs, 
and each particular MO should be at strength at the moment of 
teaching. 


Teaching Additional Types of Mands 


Further direction for mand training can come from Skinner’s list 
of nine types of mands (Figure 18.3). For example, a question 
is a mand controlled by an MO related to verbal information 
and specifically reinforced by the receipt of that information. 
Mands for information contribute to the burst in vocabulary 
growth commonly observed in toddlers between the ages of 
2 and 3. These mands involve an increase in the value of the 
names of things and events (e.g., “What’s that?”) or the loca- 
tion of things and events (e.g., “Where’s Buzz?”), while later 
questions may involve MOs for the function of things and events 
(e.g., “How do you turn it on?”) or the cause of events (e.g., 
“How did it break?”). A growing body of research has provided 
support for the role of MOs in question asking and has demon- 
strated a variety of effective teaching procedures (e.g., Endicott 
& Higbee, 2006; Lechago, Carr, Grow, Love, & Almason, 2010; 
Lechago & Low, 2015; Shillingsburg, Gayman, & Walton, 
2016; Sundberg, Loeb, Hale, & Eigenheer, 2002). 


Multiple Control 

MOs frequently share control with other MOs, verbal S?s, and 
nonverbal SPs as types of convergent multiple control. Multiple 
sources of control can be quite useful for establishing mands. 
A child’s early manding may be controlled by MOs along with 
echoic, intraverbal, or nonverbal prompts, and through the pro- 
cess of differential reinforcement for higher quality respond- 
ing, the response can be freed from these additional sources of 
control. 


Teaching Extended Mands 

Skinner (1957) identified several ways that mands general- 
ize across MOs and Ss, and how the same MO can control 
different response forms (Miguel, 2017). Recent research 
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has demonstrated methods to establish new mand forms for 
the same MO by using procedures from relational frame 
theory (e.g., Hayes et al., 2001; Murphy & Barnes-Holmes, 
2009; Rosales & Rehfeldt, 2007). For example, Murphy & 
Barnes-Holmes (2009) established multiple types of derived 
mands involving “more” and “less” for three participants with 
autism. The study was conducted in the context of a board 
game involving the quantity of tokens required to be success- 
ful (more or less). These authors showed that a new mand 
response form could be established with a supplemental condi- 
tional discrimination training procedure that allowed for novel 
stimuli to become equivalent with existing mand forms. That 
is, the participants learned, without formal training, a new way 
to mand for “more” or “less.” 


Tact Assessment 


Assessing a tact repertoire involves determining the degree to 
which nonverbal stimuli control verbal behavior. This is a sizable 
task given that the whole of the physical environment contains 
a massive number of nonverbal stimuli. The different parts of 
speech provide a frame for tact assessment because each part 
often demonstrates a different type of nonverbal stimulus control. 
Many nouns involve nonverbal SPs consisting of static objects, 
and many verbs involve nonverbal SPs consisting of movement, 
while many adjectives and adverbs involve nonverbal SPs gen- 
erated by the properties of objects and events, and relations 
between them. A similar analysis can be made for the other parts 
of speech (e.g., prepositions, pronouns). In addition, an assess- 
ment of C-BiN relations can identify the strength of this higher 
order verbal operant (e.g., Greer & Ross, 2008; Miguel, 2016). 


Sensory Receptors 


Tacts can also be assessed across the different sensory receptors 
affected by nonverbal stimuli (e.g., visual, auditory, tactile)— 
for example, testing the tact of a cat when the target is a visual 
stimulus (e.g., a cat) versus an auditory stimulus (e.g., hearing 
a cat meow). 


Multiple Control 


Tacting becomes more complex when multiple variables are 
involved. For example, a noun-verb tact such as “The water’s 
spilling out” involves at least two sources of stimulus control: 
One is a nonverbal SP related to the water, and the other is a non- 
verbal SP related to the movement of the water. Important infor- 
mation regarding a tact repertoire can be obtained by sampling 
the arrangement of various nonverbal stimuli corresponding to 
the different parts of speech (e.g., possessive-adjective-noun, 
subject-verb-noun). In addition, nonverbal SPs can combine 
with MOs and verbal S?s as types of convergent multiple con- 
trol. For example, a child may emit the name of a video game 
she sees due to both an MO and a nonverbal SP, making the 
response part mand and part tact. 

An important aspect of tact assessment involves identi- 
fying the degree to which nonverbal stimulus generalization 
occurs. The four types of tact extensions provide a framework 
for assessing a person’s skill in this area. In addition, the ability 
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to tact private events is a component part of a tact repertoire and 
can be measured by the occurrence of collateral behavior (e.g., 
limping) or public accompaniment (e.g., a bruise). 


Tact Intervention 


Learning to tact features of the physical environment is a cor- 
nerstone of early language acquisition. Once a tact repertoire is 
acquired, it can benefit a child in many ways. For example, tact- 
ing can facilitate listener behavior and the acquisition of other 
forms of verbal behavior such as mands, intraverbals, C-BiNs, 
equivalence classes, and relational frames (Horne & Lowe, 
1996; Jennings & Miguel, 2017; Petursdottir & Carr, 2011; 
Sundberg, 2015). If a tact repertoire does not begin to develop 
for a toddler, intervention may be necessary. Teaching a child 
to tact is more straightforward than teaching a child to mand, 
because the antecedents that control the tact are more observ- 
able, measurable, quantifiable, and controllable than MOs. In 
addition, tacts are consequated by easily manageable and por- 
table generalized S's, whereas the specific reinforcement of the 
mand requires having the right reinforcer at hand. 

The goal of a tact intervention program is to establish a 
fluent and generative tact repertoire that is functional for the child, 
and integrated with the other speaker and listener repertoires. A 
tact program for a child depends on the results of an assessment 
of her existing tact repertoire, along with the other verbal operants 
and related skills, as well as possible barriers to tact acquisition 
(Sundberg, 2014). Once these data are obtained, the tact classi- 
fication system, the different parts of speech, and ABA teaching 
procedures can provide a framework and guide for an interven- 
tion program. The response form for tacts can be spoken words, 
signs, written stimuli (including Braille), and, under some circum- 
stances, icon selection. Tacts should also be established across 
the different sensory receptors affected by nonverbal stimuli (e.g., 
tactile, gustatory). A number of language curricula are available 
that contain the core elements of a tact intervention program (e.g., 
Barbera & Rasmussen 2007; Leaf & McEachin, 1999; Lovaas, 
2003; Maurice et al., 1996; Sundberg & Partington, 1998). 


Multiple Control 


Toddlers begin to emit two-word utterances involving novel 
combinations of nouns and verbs, and other parts of speech, 
around 2 years of age (e.g., Brazelton, & Sparrow, 2006). For a 
child who fails to acquire noun-verb combinations or other types 
of multiple responses (e.g., adjective-noun), teaching the skill 
involves arranging multiple forms of nonverbal stimulus control 
(e.g., a ball rolling down a ramp) while prompting the child to 
tact the object and its movement (e.g., “ball rolling”), fading 
prompts, and reinforcing the highest quality responses. These 
types of multiple tacts, once acquired, should be generalized, 
extended, and framed with the child’s other verbal and nonver- 
bal activities. When a child’s verbal repertoire reaches the point 
where novel multiple antecedent stimuli evoke corresponding 
multiple responses, a new cusp has been achieved. 


True Language 


Also around 2 years of age is the developmental period at which 
“true” language has been said to occur (e.g., Hayes et al., 2001; 


Hockett, 1960; Horne & Lowe, 1996; Malott, 2003). Interestingly, 
that is also about the developmental point where C-BiN (Horne 
& Lowe, 1996), equivalence relations (Sidman, 1994), relational 
framing (Hayes et al., 2001), and joint control (Lowenkron, 
1998) begin to emerge. True language may be the point at which 
these powerful behavioral cusps begin to demonstrate their 
generative effects 


Tacting Private Events 


Teaching a child with language delays to tact stimuli arising 
within her body can be difficult. A child’s collateral behavior is 
the most common method for identifying the presence of a pri- 
vate event. Facial expressions, pauses, body movement, partic- 
ular words, and certain phrasing have the power to “give away” 
a private event such as an eagerness, unwillingness, or apathy. 
Collateral behavior, such as a child holding her stomach, can 
be used as a teaching opportunity to establish the tact “sick” 
because the relevant sources of private control are present, 
allowing for the use of transfer of stimulus procedures. Public 
stimuli can also accompany a private event (e.g., blood) and 
provide the observer with a teaching opportunity. 


Duplic and Codic Assessment 


Assessing the strength of verbal and nonverbal behavior con- 
trolled by verbal S”s can be organized by the technical distinctions 
between the duplic, codic, intraverbal, and listener relations. The 
duplic repertoires (echoic, motor imitation, copying text) are the 
simplest of the verbal operants to assess because a finite number 
of units compose each of these atomic repertoires (Alessi, 1987; 
Palmer, 2012). For example, measuring the strength of an early 
echoic repertoire can be accomplished with a one-page form (Esch, 
2014). Codic relations (textual and taking dictation) are also easy 
to assess but involve additional skilis such as letter discriminations 
and phonetics. The intervention section only contains duplic appli- 
cations. A presentation of the codic applications (reading, writing, 
and spelling) is beyond the scope of the current chapter. 


Duplic Intervention 


The ability to duplicate the actions and sounds of others plays 
a critical role in the acquisition of language, social behavior, 
and generative learning capabilities (e.g., Baer, Peterson, & 
Sherman, 1967; Greer & Ross, 2008; Horne & Lowe, 1996; 
Lovaas, 2003; Lowenkron, 1998; Palmer, 2012; Poulson, 2003). 
For example, imitating an adult playing peek-a-boo allows for 
specific back-and-forth interactions between the infant and 
adult. These interactions can generate direct and automatic rein- 
forcement that further establishes other aspects of an imitative 
repertoire. A generalized imitative repertoire is an important 
behavioral cusp because it allows a child to easily learn new 
behaviors without direct teaching or a reinforcement history 
(e.g., how to put on socks or connect train tracks). Motor imi- 
tation can also have a linguistic function (relating to sign lan- 
guage or gestural communication), and when it does, it can be 
classified as duplic behavior and serve a verbal function that is 
parallel to the echoic relation. If duplic behavior fails to develop 
for an infant-toddler, intervention may be necessary. 


Teaching Motor Imitation (Relating 
to Sign Language) 


Many nonvocal children with disabilities have been successful 
in acquiring sign language as a response form when motor 
imitation is easier to establish than echoic skills. The first goal 
of a formal program is to teach the child to copy simple motor 
behaviors emitted by others. Establishing this repertoire can 
be accomplished with standard ABA procedures. For children 
who need further assistance, another method is to place an 
imitative response within a mand frame, that is, combine the 
motor SP with an MO and specific reinforcement for train- 
ing purposes. For example, if a child demonstrates a strong 
MO for music, an imitative trial (e.g., tapping an arm as the 
sign “music’’), along with physical prompts, can be conducted 
while that MO for music is strong, allowing for a momentarily 
strong form of specific reinforcement to be utilized as a teach- 
ing tool (Sundberg & Partington, 1998). 


Teaching Echoic Behavior 


A child’s ability to echo words on command is one of the most 
important measures of her potential for language acquisition 
(e.g., Lovaas, 1977). The echoic is valuable because it can be 
used to establish mand, tact, intraverbal, and listener skills. 
However, a person’s inability to echo words presents a major 
barrier to future language development, and thus is often a 
key component of early intervention programs (e.g., Leaf & 
McEachin, 1999; Lovaas, 2003; Ross & Greer, 2003; Sund- 
berg & Partington, 1998). An echoic repertoire can be estab- 
lished with the use of ABA teaching procedures. However, 
for those who struggle with learning to echo others, the mand 
frame procedure described for imitation training can also be 
used with echoic behavior. 


Automatic Reinforcement 


Echoic behavior can also be established by pairing a word 
with existing forms of reinforcement. For example, if just prior 
to pushing a child on a swing, the word “swing” is emitted 
by an adult, the word “swing” may eventually become an Se 
Research has shown that this pairing procedure can increase 
the rate of a child’s vocal play and result in the emission of 
targeted sounds under echoic control (e.g., Esch et al., 2009; 
Rader et al., 2014; Stock et al., 2008; Sundberg et al., 1996). 
Children who are having difficulty acquiring an echoic rep- 
ertoire may benefit from a combination of these procedures, 
as well as the use of sign language or a selection-based icon 


system. 


Listener Assessment 


A listener assessment can be framed by the parts of speech and 
the five types of verbal discriminations previously described 
for listeners. In addition, an assessment of self-listener skills is 
essential (Esch, Esch, McCart, & Petursdottir, 2010; Greer & 
Ross, 2008). Measures should also be taken to assess a child’s 
ability to serve as an audience, mediate reinforcement, and 
demonstrate emotional listener responding. 
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Simple Verbal Discriminations 


The assessment of the strength of simple verbal discriminations 
for a listener involves determining if verbal stimuli reliably 
evoke a corresponding nonverbal response. For example, when 
asked to “walk” the child walks, and when asked to “run” the 
child runs. 


Auditory Conditional Discriminations 


An auditory conditional discrimination is more complex because 
it involves multiple stimuli and multiple responses. The stimuli 
include both a verbal SP (e.g., “Touch shoe”) and a nonverbal 
SP (e.g., a picture of a shoe) in an array of other stimuli. The 
responses include both scanning the array of nonverbal stimuli 
and emitting a nonverbal section response (e.g., pointing to the 
shoe). The assessment of this repertoire would involve sampling 
various combinations of the parts of speech in increasing levels 
of verbal and nonverbal complexity (e.g., Dunn & Dunn, 2007; 
Sundberg, 2014). 


Compound Verbal Discriminations 


The assessment of listener behavior controlled by compound 
verbal stimuli involves determining if multiple words collec- 
tively generate a new SP in a listener discrimination task. For 
example, if the target is to teach a child to discriminate between 
red and green vegetables and red and green fruits presented in 
an array, first test each element individually (e.g., “Can you find 
some red things?” “Can you find some fruits?”), then present 
combinations of the target verbal stimuli (e.g., “Can you find a 
green vegetable?) to determine if the compounding of the stim- 
uli evokes a more specific selection response. 


Verbal Conditional Discriminations 


In a VC? involving listener behavior, two separate conditional 
discriminations are required. The first occurs in the antecedent 
event where one verbal stimulus alters the evocative effects of 
another verbal stimulus in the same antecedent configuration. For 
example, in a listener discrimination involving the verbal stimulus 
“Which one can’t you wear?” and an array of nonverbal stimuli 
(e.g., shirt, ball, computer), the reinforcement history related to 
the word “can’t” alters the discriminative function of the word 
“wear,” allowing it to function as a new SP that participates in 
a second conditional discrimination involving the comparison 
array. In that second conditional discrimination, the shirt is 
established as an S4 and the ball and computer are established as 
SPs. Contrasting “can’t” with “can” embedded in the phrase will 
determine if a VC” is occurring. The assessment of a listener’s 
VCP repertoire can be quite complex, but would involve sampling 
various combinations of the parts of speech in increasing levels of 
verbal and nonverbal complexity (e.g., Devine et al., 2016; Smith 
et al., 2016; Sundberg & C. A. Sundberg, 2011). 


Verbal Function-altering Effect 


For a listener, the function-altering future effect involves 
a change in the evocative effect of a nonverbal stimulus 
encountered at some point in the future (Schlinger & Blakely, 
1994). For example, when a child is given a certain instruction 
by a parent involving a future nonverbal event, such as “When 
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the commercial comes on, go brush your teeth,” the effects of 
the commercial as a nonverbal SP can only be determined later 
when a commercial comes on. The assessment of this reper- 
toire would involve presenting a variety of these types of verbal 
instructions and nonverbal contexts. 


Listener Intervention 


If an infant or toddler fails to demonstrate early listener skills, 
there are a variety of intervention programs and materials that 
have produced successful results (e.g., Leaf & McEachin, 1999; 
Lovaas, 2003; Maurice et al., 1996). Early listener discrimina- 
tion skills should be established simultaneously with tacting 
skills (e.g., Petursdottir & Carr, 2011). For example, while 
teaching a child to tact a shoe, also ask the child to select a 
shoe from an array. This mixed verbal behavior teaching format 
(Sundberg & Partington, 1998) consists of interspersing speaker 
and listener teaching trials and can help establish generative 
C-BiN repertoires (Greer & Ross, 2008; Horne & Lowe, 1996). 

Learning to be a listener involves the capability to dis- 
criminate between words (or signs, text, etc.), especially when 
they appear in novel combinations, in different nonverbal envi- 
ronments, and under different MOs. The five types of verbal 
discriminations can provide some organization to the process 
of teaching a child with autism or other intellectual disabilities 
to comprehend words and sentences as a listener. 


Simple Verbal Discriminations 


Teaching simple verbal discriminations to a listener involves 
establishing different nonverbal behaviors under the control of dif- 
ferent verbal stimuli. For example, teaching a child to clap when 
someone says “clap” and to jump when someone says “jump” 
involves a discrimination between words. Establishing these skills 
can be accomplished with ABA teaching methodology. 


Auditory Conditional Discriminations 


A common listener training procedure (also termed receptive 
labeling) involves asking a child to identify objects, actions, 
properties, locations, and so forth, by emitting some form of 
selection behavior (e.g., upon hearing “Where’s the nurse?” a 
child selects a picture of a nurse from a comparison array). A 
critical component of developing auditory conditional discrim- 
inations is the configuration of the array containing the target 
item or activity. Initially, the array should be clear and simple 
and contain distinct items (e.g., a banana and a cat). Once a 
child acquires these beginning discriminations, training should 
systematically progress to more complex verbal stimuli and 
arrays. As with the tact, the different parts of speech can serve 
as a classification and curriculum guide for listener conditional 
discrimination training, and a number of teaching materials and 
programs are available for developing this repertoire (e.g., Leaf 
& McEachin, 1999; Lovaas, 2003). 


Compound Verbal Discriminations 


Establishing compound verbal discriminations for a listener 
involves the arrangement of different parts of speech into single 
phrases and the manipulation of corresponding nonverbal arrays 
(e.g., Devine et al., 2016; Eikeseth & Smith, 2013). Teaching 


these types of verbal and nonverbal listener discriminations 
involves presenting the combination of individual words that are 
already in a child’s listener repertoire. For example, a child may 
be able to respond as a listener to “wiggle” by wiggling, and 
“fingers” by showing an adult his fingers, but when asked “Can 
you wiggle your fingers?” the child cannot respond correctly. To 
teach this skill, imitative prompts can be used and faded, with 
the goal of establishing “wiggle fingers” as a new compound 
verbal SP that evokes finger wiggling. Eventually new nouns 
and verbs can be added to the instruction (e.g., wiggle toes, 
point your finger). Matrix training provides an efficacious and 
generative format for establishing these types of discriminations. 


Verbal Conditional Discriminations 


Teaching a child to emit vcs as a listener can be 
accomplished with a procedure identified as listener responding 
by function, feature, and class (LRFFC) (DeSouza et al., 2019; 
Smith et al., 2016; Sundberg, 2014; Sundberg & Partington, 
1998). In LRFFC training, a child is asked to identify nonverbal 
stimuli when given verbal S”s based on the things that one can 
do with an object (e.g., ride it, wear it), its features or attributes 
(e.g., wheels, sleeves), or its class or category (e.g., vehicles, 
clothing). Elements of these different verbal stimuli can be 
combined and contrasted to establish the necessary VCPs. For 
example, given a scene of a farm, an adult asks a child a series 
of LRFFC questions such as “Which animal eats hay?,” “Who 
grows the hay?,” “Which animal lays eggs?,” and “Who cooks 
eggs?” The different parts of speech can be used as a framework 
for LRFFC training procedures. A major advantage of this type 
of listener training is that it establishes VCs that can also be 
used to teach intraverbal behavior. 


Verbal Function-altering Effect 


Teaching a child to respond to verbal stimuli that change the 
effects of future nonverbal stimuli can be accomplished by grad- 
ually increasing the delay between the initial verbal stimulus and 
contact with the targeted nonverbal event. For example, if the 
target relation involves telling a child, “When your father gets 
here show him your award,” training could begin with a brief 
delay between the instruction and the arrival of the father, and 
then gradually increase the delay. Once acquired, new instruc- 
tions along with new nonverbal contexts should be introduced 
to develop this advanced form of listener behavior. 


Intraverbal Assessment 


Intraverbal relations are perhaps the most difficult of the elemen- 
tary verbal operants to assess and quantify. Although early intra- 
verbal behavior emitted by children is usually straightforward, 
intraverbal relations quickly become more complex as multiple 
verbal stimuli and multiple responses participate in verbal episodes 
with listeners. The four types of verbal discriminations, along with 
the parts of speech, and multiple control involving nonverbal S?s 
and MOs, can provide a framework for intraverbal assessment 
(Devine et al., 2016; Sundberg, 2014, 2016a; Sundberg & C. A. 
Sundberg, 2011). In addition, an assessment of I-BiN repertoires 
can identify the strength and effectiveness of this higher order ver- 
bal operant (Greer & Ross, 2008; Miguel, 2016). 


Simple Verbal Discriminations 


Many examples of simple intraverbal discriminations can be 
sampled during an assessment (e.g., songs, fill-in-the-blank 
games, common associations). Performance on these types of 
verbal tasks can be compared to the performances of typically 
developing children for a developmental perspective.” 


Compound Verbal Discriminations 


The assessment of a speaker’s intraverbal behavior controlled 
by a compound verbal stimulus is similar to the process for the 
listener, except without the presence of nonverbal stimuli. For 
example, when the question “Can you tell me a red fruit?” is pre- 
sented without any pictures and a child says “apple,” and when 
asked, “Can you tell me a red vegetable?” she says “pepper,” 
this constitutes a measure of intraverbal behavior controlled by 
a compound verbal S?. 


Verbal Conditional Discriminations 


The assessment process for intraverbal behavior controlled 
by verbal conditional discriminations is similar to the process 
for the listener, except the strength of responding is tested in 
the absence of nonverbal stimuli. Assessing this repertoire 
involves the use of contrasting verbal stimuli inserted into 
intraverbal autoclitic frames. For example, ask a child ques- 
tions that require conditional intraverbal responding, such as 
“What can you wear on your feet” along with “What can you 
wear on your head,” “What grows on your head,” “What grows 
in a garden?” and so on. 


Verbal Function-altering Effect 


Verbal S?s can also transform the functional effects of verbal 
stimuli encountered in the future. The assessment of a child’s 
ability to demonstrate these effects requires the targeted intra- 
verbal to be observed later in time. For example, give the child 
a verbal instruction such as “When your uncle asks you about 
your award, tell him you studied hard for it.”” Then, when the 
uncle comes in and asks the targeted question, score the child’s 
response. The assessment of this repertoire would involve pre- 
senting a variety of these types of verbal instructions and future 
measures of their effects. 


Intraverbal Intervention 


Prior to acquiring an intraverbal repertoire, a toddler has already 
acquired a sizable collection of mands, tacts, and listener skills. 
An intraverbal repertoire is typically the last of the initial verbal 
operants acquired (codic skills are acquired later). Some of the 
first intraverbal relations are simple phrases, songs, fun fill-ins, 
and common associations. A child’s intraverbal repertoire usu- 
ally expands rapidly between the ages of 2 and 3. This growth 
corresponds to increases in the complexity of the tact and lis- 
tener discrimination repertoires, thus providing a child with 
content to intraverbally talk about. The four types of verbal 
discrimination (Figure 18.4) and the parts of speech provide 
organization for early intraverbal training. In addition, the devel- 
opment of intraverbal behaviors by typically developing chil- 
dren can offer a frame of reference for an intraverbal program 


Chapter 18 * Verbal Behavior 485 


(Devine et al., 2016; Kisamore et al., 2016; Sundberg, 2014; 
Sundberg & C. A. Sundberg, 2011). 


Simple Verbal Discriminations 


Teaching beginning intraverbal behavior to a child involves 
establishing verbal behavior under the control of simple verbal 
stimuli. For example, an adult who is playing peek-a-boo may 
say “peek a...” and then implement a brief delay before saying 
“boo” to evoke the intraverbal response “boo” from the child. 
Fill-in-the-blank phrases (“You ride a... ”) are usually easier 
than WH questions (“What can you ride?”’) because fill-ins often 
just involve a simple verbal discrimination, whereas WH ques- 
tions ultimately involve verbal conditional discriminations. The 
primary goal at this early intraverbal level is to break the tar- 
geted verbal responses free from echoic control, while teaching 
the child to verbally discriminate among the many verbal stimuli 
she may encounter. In addition, the fill-in to WH question pro- 
cedure helps to establish early I-BiN repertoires (Greer & Ross, 
2008; Horne & Lowe, 1996). 


Compound Verbal Discriminations 


Teaching intraverbal behavior controlled by compound ver- 
bal stimuli is similar to the procedure previously described 
for a listener discrimination, except a verbal response is 
required. For example, the words “brother” and “name” may 
independently evoke certain behaviors, but when combined 
to “brother’s name” a new stimulus occurs and by prompting, 
fading, and reinforcing a specific response under these circum- 
stances (e.g., “Cory”) compound verbal stimulus control can 
be established. 


Verbal Conditional Discriminations 


When considering an intervention program targeting the estab- 
lishment of VCs, it is important to ensure that the child demon- 
strates the prerequisite speaker and listener skills (DeSouza 
et al., 2019; Sundberg & C. A. Sundberg, 2011). For example, 
prior to being able to intraverbally talk about hot and cold, it 
is essential that the tact and listener repertoires (e.g., LRFFC) 
regarding hot and cold are established. If a child can demon- 
strate these prerequisite skills, and this repertoire has been tar- 
geted as a priority, one can turn to a sizable body of research 
and clinical applications on intraverbal responses controlled by 
multiple verbal stimuli (for reviews see Aguirre, Valentino, & 
LeBlanc, 2016; Petursdottir, 2018). 

For example, Kisamore et al. (2016) demonstrated the 
effects of three different procedures for establishing multiply con- 
trolled intraverbals with seven participants with autism. The first 
procedure involved a standard prompt delay and error correction, 
but this procedure was effective for only three of the participants. 
Two of the participants benefited from the requirement to 
echo the question first, termed differential observing response 
(DOR), and two participants required other modifications of the 
procedures. Figure 18.7 shows the results for the two participants 
who failed with prompt delay and error correction, but benefited 
from the echoic DOR procedure. The data for both participants 
showed that intraverbal responding during the prompt delay 
and error correction was variable and did not meet criterion. 
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Figure 18.7 Results for the two participants who failed with prompt delay and error correction (PD&EC), but 
benefited from the differential observing response (DOR) procedure. The open circles show acquired intraverbal 
responses across conditions, and the bars in the right panel show the number of errors across conditions. 


From Teaching Multiply Controlled Intraverbals to Children And Adolescents with Autism Spectrum Disorders” by A. N. Kisamore, 
A. M. Karsten, & C. C. Mann, © 2016. Reproduce with permission of John Wiley & Sons Inc., 


However, with introduction of the DOR procedure, responding 
increased to mastery performance. 


Verbal Function-altering Effect 


The goal in establishing delayed intraverbal behavior is to 
teach a child how to respond to specific verbal Ss that might 
occur later in time. An instructional program would involve 
known intraverbal relations and altering the temporal distance 
between the first and second presentation of the target verbal 
SP. For example, if the goal is a future intraverbal interaction in 


a restaurant, initial conditioning might involve telling a child, 
“When the waiter asks you, ‘What do you want to order?’ you 
tell him chicken nuggets”, then engage in role playing involving 
a second adult and gradually increase the time between each 
adult’s verbal SP presentation. 


Autoclitic Assessment 


The following prerequisite skills are necessary to benefit 
from autoclitic instruction: (1) A speaker’s existing verbal 
skills are sufficient to support secondary verbal behavior, 


(2) a speaker demonstrates self-listening skills, and (3) a 
speaker can tact aspects of her own verbal and nonverbal 
behavior. Developmental milestones can be helpful for iden- 


tifying when these types of repertoires emerge (generally 
around 24 to 36 months of age). 


Autoclitic Mands 


Secondary autoclitic mands begin to emerge in a speaker’s mand 
repertoire after primary mands have been acquired. For exam- 
ple, does a child put extra stress on certain words, add tags of 
immediacy, or pause before saying something? If these second- 
ary verbal behaviors are controlled by some MO regarding how 
a listener should respond to a primary response, along with the 


relevant history of specific reinforcement, they can be classified 
as autoclitic mands. 


Autoclitic Tacts 


The assessment of autoclitic tacts can reveal if a speaker’s 
behavior is controlled by her own verbal behavior and is modi- 
fied due to its special effects on listeners (e.g., emitting an auto- 
clitic of strength, such as “I KNOW it is mine,” or an autoclitic 
of weakness, such as “I THINK it is mine’’). These types of auto- 
clitic tacts begin to demonstrate capabilities with verbal com- 
plexities such as self-editing, covert verbal behavior, and verbal 
self-awareness (Greer & Ross, 2008; Skinner, 1957, 1974). A 
child’s ability to emit tacts controlled by these various types of 
nonverbal S?s can be compared to that of typically developing 
children for a developmental perspective. 


Autoclitic Intervention 


Establishing an autoclitic repertoire can be difficult due to the 
private nature of the relevant controlling variables. In addition, 
all autoclitic relations involve multiple controlling variables, 
multiple responses, and a speaker serving as her own listener. 
Teaching autoclitic mands involves capturing or creating the 
relevant MOs and using ABA teaching procedures to establish 
a secondary mand relation. For example, in social interaction 
with peers a louder voice or a certain phrase may function as 
an autoclitic mand to prompt a listener to attend better to what 
the speaker says. Establishing these types of autoclitics can be 
accomplished with basic ABA procedures, including modeling 
and role playing. 

Teaching an autoclitic tact is complicated by the fact that 
the target nonverbal sP must originate from some aspect of the 
child’s own words, and it is that SP that should evoke the child’s 
autoclitic response. The task for the adult is to bring the child’s 
verbal behavior under the control of some specific aspect of the 
child’s own ongoing verbal behavior. This is not an easy task. 
For example, a child must first be emitting primary responses in 
order to generate stimuli to secondarily tact, but a child’s own 
words may be transient, private, multiple, or completely missed 
by the child. They also may have to compete with other words, 
ongoing listener behavior, current MOs, and any number of 
variables that may emerge in verbal exchanges. Two procedures 
for establishing autoclitic tacts will be described. As with the 
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autoclitic mand, capturing or creating the relevant controlling 
variables can provide the necessary elements to establish auto- 
clitic tacts. 


Capturing Autoclitic Tact Teaching Opportunities 
in the Natural Environment 


Once a repertoire of primary verbal operants is established, 
autoclitic tacts can emerge in typically developing children 
without formal instruction. This is because listeners, often 
unintentionally, provide differential reinforcement for auto- 
clitic behavior. Listeners also prompt speakers for autoclitic 
expansions of their primary response (e.g., “Where did you 
hear that?”). Certain settings and activities may be more 
conducive to evoking primary verbal behavior, such as play 
activities, arts and crafts, or outdoor events and games. Often, 
autoclitic behavior is already occurring in the context of ongo- 
ing verbal behavior, and if an adult can identify its presence, 
the adult can provide reinforcement for that behavior. For 
example, upon observing a wave in a water pool, a child emits 
the response, “That was SO big”—a response that appears to 
contain a descriptive autoclitic. An adult can reinforce the 
additional autoclitic behavior by drawing attention to the “SO” 
part (e.g., “Wow that was the biggest one!”). 


Formal Arrangement of Autoclitic 
Tact Contingencies 


If a child demonstrates the prerequisite skills identified above, 
she may be prepared to learn to emit autoclitic behavior through 
direct teaching procedures (e.g., multiple exemplar instruction). 
For example, if the targeted autoclitic relation involves a child 
informing the listener of the sense mode that is affecting her 
primary tact, she must first learn to discriminate between see- 
ing things, hearing things, smelling things, and so on. A formal 
teaching procedure could begin with a primary response evoked 
with a standard tact trial to set up the necessary conditions for 
tacting the nonverbal source of control for that response (¢.g., 
visual or auditory). When the stimulus is visual, the echoic 
prompt “I see the ball” can be used to prompt a response 
(at this point “I see” is not autoclitic; it is a primary echoic 
operant). When the stimulus is auditory (e.g., the sound of a ball 
bouncing), the echoic prompt “I hear the ball” can be provided. 
Both types of echoic prompts need to be faded, trials presented 
randomly, and stimulus control transferred to the type of sensory 
antecedent that controls the primary tact “ball.” It is important 
to transfer what is acquired through direct instruction to nat- 
ural environment contingencies where sensory information is 
valuable to the listener (e.g., hide-and-seek games, emergency 
situations). Similar procedures could be conducted for the many 
other types of descriptive autoclitics, as well as for quantifying 
and relational autoclitics. 

Autoclitics are an important aspect of a verbal repertoire, 
but the private nature of the controlling contingencies make them 
difficult to study. Fortunately, a growing body of autoclitic research 
is available (e.g., Hiibner, Austin, & Miguel, 2008; Lowenkron, 
1998, 2006; Luke et al., 2011; Palmer, 2011; Sidener & Michael, 
2006; Speckman et al., 2012; C. T. Sundberg et al., 2018). 
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Figure 18.8 The results for Andrea show her absent or weak mands during baseline; the open 
squares represent the specific target words (e.g., “colored pencil”), and the closed diamonds 
represent indirect vocal words as mands (e.g., “I don’t know what to do,” “OK”). The first 
condition was ineffective in evoking the target mand, but the second condition involving 
various types of verbal prompts such as a contingency specifying stimulus (CSS), produced 
successful manding and reduced inappropriate mands. 


Republished with permission of Association for Behavior Analysis International, from Teaching Mands to Older 
Adults with Dementia, C. R. Oleson and J. C. Baker, 2014, The Analysis of Verbal Behavior, 30, p. 122. Permission 
conveyed through Copyright Clearance Center, Inc. 


THE LOSS OF VERBAL BEHAVIOR 


Skinner’s analysis of VB can serve as a guide for language 
assessment and intervention for an individual experiencing a 
verbal loss, much in the same way it does for initial language 
acquisition. Verbal repertoires can weaken significantly, as 
observed in cases of dementia, Alzheimer’s disease, aphasia, 
and traumatic brain injury (Baker, LeBlanc, & Raetz, 2008; 
Gross, Fuqua, & Merritt, 2013; Palmer, 1991; Sundberg, 
2016b; Sundberg, San Juan, Dawdy, & Arguelles, 1990; Trahan, 
Donaldson, McNabney, & Kahng, 2014). An adult with dementia 
may, for example, experience memory loss, engage in off-topic 
verbalizations, or neglect important aspects of daily living (e.g., 
Bourgeois, 1993; LeBlanc, Raetz, & Feliciano, 2011). 

In his discussion of aphasia and the loss of verbal behavior, 
Skinner (1957) points out the following: “The order of damage 
seems to follow the order of ‘difficulty’ deducible from the avail- 
ability of a minimal repertoire. Textual and echoic behavior often 
survive (unless relevant sensory defects are involved) while intra- 
verbals and tacts appear to be most vulnerable” (p. 219). Identify- 
ing the strength of each verbal operant and the functional sources 
of control for an adult’s impaired verbal behavior can provide 
direction for an intervention program. For example, data regard- 
ing the nature of a person’s loss can be obtained by assessing 
intraverbal repertoires regarding recent and past events, mands 


for information, tacting complex stimuli, and the occurrences of 


odd or irrelevant VB (e.g., Gross et al., 2013; Sundberg, 2014). 

The body of research on applying Skinner’s analy- 
sis of verbal behavior to problems related to the loss of lan- 
guage is expanding (e.g., Dixon, Baker, & Sadowski, 2011; 
LeBlanc et al., 2011; Oleson & Baker, 2014). For example, 
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SUMMARY 
Skinner’s (1957) Analysis of Verbal Behavior 


1. Skinner argues that language constitutes learned 
speaker and listener behavior that is acquired, main- 
tained, and extended by environmental contingencies of 
reinforcement. 


2. A given utterance is comprised of formal properties (the 
words spoken) and functional properties (why those words 
were spoken). 


3. The formal properties of language involve the topography 
and structure of verbal responses usually classified in 
terms of the parts of speech, clauses, and sentences. 


4. The functional properties of language involve the causes 
of a response often referred to as semantics, or the mean- 


ing of words. 


5. From a behavioral perspective, the meaning of a word can 
be found in the immediate and historical antecedents and 
consequence that evoke that word. 
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Oleson & Baker (2014) examined manding problems experi- 
enced by older adults with dementia. These authors suggested 
that “[t]raining mands could not only lead to an improved qual- 
ity of life by providing individuals with dementia the ability 
to access desired items and activities . . . but may also reduce 
aggression and other challenging behavior” (p. 115). In Ole- 
son and Baker’s (2014) study, two participants with dementia 
experienced a mand training condition consisting of contriving 
an MO (e.g., providing paper and no colored pencils), verbal 
prompts, fading, specific reinforcement, and error correction. 
The results revealed that the procedure was effective with one 
participant, but results were inconsistent with the second par- 
ticipant. Figure 18.8 shows the results for the successful partic- 
ipant, Andrea, who acquired two mands and partially acquired 
a third. In the mand baseline condition, Andrea emitted indi- 
rect vocal words as mands (e.g., “I don’t know what to do,” 
“OK”), as indicated by the closed triangles, but rarely emitted 
the specific target words (e.g., “colored pencil”), as indicated 
by the open squares. The first condition consisted of contriv- 
ing the EO alone, which was ineffective in evoking the target 
mand. However, the second condition involving the contrived 
MO plus an indirect supplemental verbal prompt resulted in 
successful manding and reduced inappropriate mands. These 
data are similar to the results reported for Jason in the opening 
vignette, where Jason’s inappropriate mand was replaced by an 
acceptable response form through a basic mand training proce- 
dure. Many other teaching procedures have proven effective for 
children with autism or other intellectual disabilities that can be 
modified and used for individuals experiencing a loss of verbal 
behavior. 


6. By treating language as learned behavior emitted by speak- 
ers and listeners, the focus of the analysis and its applica- 
tions is on the contingencies responsible for that behavior. 


7. Skinner (1957) defined verbal behavior “as behavior rein- 
forced through the mediation of other persons” (p. 2), but 
those other persons “must be responding in ways which 
have been conditioned precisely in order to reinforce the 
behavior of the speaker” (p. 225). 


8. Language involves an interaction between a speaker and 
a listener, including when a speaker is her own listener. 
Skinner terms the basic unit of analysis for this interaction 
a verbal episode. 


9. Ina verbal episode, a speaker emits any type of verbal 
behavior (e.g., mand, tact, intraverbal), in any form (e.g., 
speech, sign language, eye contact). 


10. In a verbal episode, a listener (1) serves as an audience for 
a speaker, (2) provides reinforcement for a speaker, and 
(3) responds in specific ways to the speaker's behavior. 
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The roles of speaker and listener alternate in verbal epi- 
sodes, and usually involve covert speaker and listener 
behavior as well. 


Verbal behavior is not restricted to spoken words. Any 
form of operant behavior can acquire a verbal function 
(e.g., sign language, fingerspelling, icon selection, Braille, 
facial expressions, intonation). 


In topography-based verbal behavior, each verbal relation 
involves a distinguishable response form given some spe- 
cific controlling variable. 


In selection-based verbal behavior, the response form is 
the same for each verbal relation (e.g., pointing, selecting); 
what is conveyed to the listener is the stimulus indicated. 


A behavioral cusp is a form of generative learning in 
which the acquisition of one skill or specific set of skills 
enables the acquisition of other skills without direct teach- 
ing or a history of reinforcement. A cusp can allow access 
to new MOs, stimuli, reinforcers, and punishers, which in 
turn generate new behavior. 


The basic components of a speaker’s verbal repertoire are 
called the elementary verbal operants (e.g., mand, tact, 
intraverbal). 


The mand is a verbal operant in which the form of the 
response is under the functional control of motivating 
operations (MOs) and a history of specific reinforcement. 


The tact is a verbal operant in which the form of the 
response is under the functional control of a nonverbal S? 
and a history of generalized S'. 


There are three types of dupic relations: echoic, motor imi- 
tation, and copying text, and two types of codic relations: 
textual and taking dictation. 


The echoic (duplic) is a verbal operant in which the form 
of the response is under the functional control of a verbal 
SP that has formal similarity with a verbal response and a 
history of generalized S". 


Formal similarity occurs when the controlling antecedent 
stimulus and the response product (a) share the same sense 
mode (i.e., both stimulus and response are visual, auditory, 
or tactile) and (b) physically resemble each other. 


The intraverbal is a verbal operant in which the form of 
the response is under the functional control of a verbal S? 
that does not have point-to-point correspondence to the 
verbal response, but does have a history of generalized S'. 


Point-to-point correspondence between the stimulus and 
the response or response product occurs when the begin- 
ning, middle, and end of a verbal stimulus matches the 
beginning, middle, and end of a verbal response. 


The textual relation (codic) is a verbal operant in which 
the form of the response is under the functional control of 
a written verbal S? that does not have formal similarity, 
but does have point-to-point correspondence and a history 
of generalized S". 
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Taking dictation (codic) is a verbal operant in which the 
form of the response is under the functional control of a 
verbal SP that controls a written, typed, Braille punched, 
or Fingerspelled response. Like the textual relation, there 
is no formal similarity, but there is point-to-point corre- 
spondence and a history of generalized S". 


Listener discriminations (commonly referred to as recep- 
tive language) can be defined as a verbal SP that evokes a 
corresponding nonverbal response (or class of responses) 
due to a history of generalized S'. 


The generalized ability to match nonverbal sample stimuli 
to nonverbal comparison stimuli can facilitate the estab- 
lishment of equivalent and nonequivalent relations. 


Classifying verbal relations allows us to more precisely 
identify, organize, and account for the specific variables 
that might be responsible for a given utterance. 


The Verbal Operants and Listener Behavior in 
more Detail 
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Mands can be classified by identifying the different types 
of MOs that control manding behavior (e.g., UMOs, 
CMOs), and by the different ways mands affect the behav- 
ior of the listener and produce specific reinforcement. 


The defining consequences for the mand constitute what 
Skinner identifies.as specific reinforcement. Specific rein- 
forcement involves reinforcement that is established by its 
relation to a specific MO (e.g., water deprivation estab- 
lishes water as specific reinforcement). 


Several types of MOs are related to aversive stimulation 
(e.g., too hot, loud noises, demands), and their removal 
functions as specific negative reinforcement. 


When sufficient specific positive or negative reinforcement 
has been provided, it abolishes the MO (e.g., satiation). 


. Any instance of human behavior is usually a function of 


multiple variables operating simultaneously. There are two 
types of multiple control: convergent multiple control and 
divergent multiple control. 


In convergent multiple control, a single response is con- 
trolled by more than one antecedent variable. 


In divergent multiple control, a single antecedent variable 
controls more than one response. 


It is common to have multiple MOs and SPs operating 
simultaneously within a single verbal episode. 


The involvement of multiple antecedent variables can 
enhance or weaken an evocative effect. 


A private event is only felt by one person. However, pri- 
vate events can have immediate behavioral effects (e.g., 
MOs, covert verbal SPs), and should be taken into account 
for a complete interpretation of behavior. 


Public accompaniment occurs when a publicly observable 
stimulus reliably occurs with a private stimulus. 
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Collateral responses are publicly observable behaviors that 
reliably occur with a private stimulus. 


Once a child has acquired basic echoic, mand, tact, and lis- 
tener repertoires, the combination of these cusps can pro- 
duce emergent mands and generative learning in a number 


of different ways (e.g., tact to mand transfer, incidental 
mand acquisition). 


Classifying tacts can be accomplished by identifying the 
different types of nonverbal S?s that control responses, 


and by the different ways these verbal responses affect the 
behavior of the listener. 


Nonverbal S?s commonly participate in verbal episodes 
with other types of antecedent variables (e.g., CMO-Ts, 
verbal SPs). 


Once a tact relation has been established, the tact response 
can extend to novel stimulus conditions through the pro- 
cess of stimulus generalization. 


In generic tact extension, the novel stimulus shares all of 
the relevant or defining features associated with the origi- 
nal stimulus. 


In metaphorical tact extension, the novel stimulus shares 
some, but not all, of the relevant features of the original 
stimulus. 


In metonymical tact extension, the novel stimulus shares 
none of the relevant features of the original stimulus 
configuration, but some irrelevant but related feature has 
acquired stimulus control. 


In solecistic tact extension, a stimulus property has no 
clear connection to the original stimulus, but the novel 
stimulus has acquired control through some type of rein- 
forcement history. 


Listeners provide five different types of generalized S' for 
tacting: (1) educational reinforcement, (2) escape from or 
avoidance of aversive stimulation, (3) extending the listener’s 
contact with the environment, (4) automatic reinforcement, 
and (5) a history of contiguous or correlated usage. 


Skinner (1957) used the term “automatic” to identify 
circumstances in which behavior is evoked, shaped, 
maintained, strengthened, or weakened by environmental 
variables occurring without direct manipulation by other 
people. 


Automatic reinforcement can occur when the reinforce- 
ment is provided by the response product emanating from 
the behavior. 


Automatic reinforcement can occur when the reinforce- 
ment is provided by the behavior’s reinforcing effects 
emanating from the physical environment. 


Generative verbal learning is a behavioral effect where 
previously acquired speaker and listener skills enable or 
accelerate the acquisition of other speaker and listener 
skills, without dependence on direct teaching or a history 
of reinforcement. 
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Once a child has acquired basic echoic, tact, listener, and 

matching-to-sample repertoires, the combination of these 
cusps can produce emergent tacts in a number of different 
ways (e.g., stimulus equivalence, recombinative general- 

ization, common bidirectional naming, joint control, and 

relational framing). 


Stimulus equivalence occurs when, for example, the direct 
teaching of an AB relation between a word (stimulus A) 
and an object (stimulus B), and the direct teaching of a 
BC relation between the same object (stimulus B) and 

a different stimulus (stimulus C), can generate several 
untrained speaker and listener relations (BA/CB sym- 
metry and AC/CA transitivity). When emergence occurs 
with both symmetry and transitivity, equivalence has 
occurred (Chapter 19). 


Recombinative generalization is a type of multiple control 
involving learning to respond to novel combinations of 
antecedent variables as both a speaker and a listener. 


Bidirectional naming (BiN) is a higher order verbal cusp 
consisting of the fusing of the speaker and listener reper- 
toires in bidirectional relations. In common BiN, a new 
word acquired as listener can generate a tact relation with- 
out further training, and a new word acquired as a tact can 
generate a listener relation without further training. 


Joint control involves the combination of verbal skills 
(e.g., echoic and tact) and occurs when two separately 
established antecedents that evoke the same response 
topography arise simultaneously and produce an emergent 
SP that evokes behavior. 


Relational framing involves an extension of Sidman’s 
equivalence theory to nonequivalent stimulus relations 
such as comparison, opposition, and difference 
(Chapter 20). 


Verbal S?s have the same causal status as nonverbal SPs in 
that both types of SPs acquire discriminative control over 
behavior through a history of differential reinforcement. 


Six types of elementary verbal operants are controlled by 
verbal S?s: echoic, motor imitation (relating to sign lan- 
guage), copying text, textual, taking dictation, and intra- 
verbal. All of these relations are duplic or codic, except for 
the intraverbal. 


Intraverbal relations can be classified by identifying the 
type of verbal stimulus control that evokes the intraverbal 
response. 


The intraverbal response can be controlled by four dif- 
ferent types of verbal stimulus control: (1) simple verbal 
discriminations, (2) compound verbal discriminations, 
(3) verbal conditional discriminations, and (4) verbal 
functional-altering discriminations. 


A simple verbal discrimination involves a single-compo- 
nent word or phrase that evokes an intraverbal response. 


A compound verbal discrimination involves two or 
more verbal S?s (convergent multiple control) that each 
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independently evoke behavior, but when they both occur in 
the same antecedent configuration, a different SP is gener- 
ated, and a more specific intraverbal response is evoked. 


A verbal conditional discrimination is a type of convergent 
multiple control involving a verbal stimulus that alters the 
evocative effects of another verbal stimulus in the same 
antecedent configuration, and evokes a corresponding 
intraverbal response. 


The verbal function-altering effect occurs when verbal 
stimuli condition the behavior of a listener by changing 
the functions of other stimuli and MOs that the listener 
encounters immediately or later in time (e.g., instructions). 


Generalized S' provided by listeners (e.g., praise, approval) 
establishes verbal stimulus control over verbal behavior, in 
the same way it does for nonverbal stimulus control. 


Research has demonstrated that intraverbal relations can 
emerge from previously acquired tact, intraverbal, and 
listener repertoires, and even from direct observation. 
Intraverbal behavior can also emerge through the process 
of bidirectional naming. 


Intraverbal bidirectional naming (I-BiN) involves a bidi- 
rectional relation that is established between at least two 
verbal stimuli (e.g., “cat” & “animal’’). I-BiN occurs when 
AB training on a unidirectional intraverbal relation auto- 
matically produces a reverse BA intraverbal relation, and 
vice versa. 


Listener Behavior 
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A listener is an active participant in verbal exchanges and 
serves many roles in this capacity, such as acting as an 
audience, providing reinforcement for speakers, and taking 
action upon a speaker’s words. 


The same four types of verbal discriminations described 
for intraverbal relations can be used as a framework to 
identify various types of nonverbal listener discrimina- 
tions. In addition, there is a fifth type of verbal discrimina- 
tion that is specific to listener behavior commonly termed 
an auditory conditional discrimination. 


In an auditory conditional discrimination, a verbal stimu- 
lus alters the evocative effect of a nonverbal stimulus due 
to a history of reinforcement and evokes a corresponding 
nonverbal response. 


A listener often engages in overt or covert verbal behavior 
in the process of listening to a speaker; when this occurs 
the listener is now a speaker, and his behavior should be 
classified as such. 


A speaker can also be her own listener. We can, and often 
do, talk to ourselves. 


Autoclitic Verbal Behavor 
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A speaker often adds autoclitic responses to her on-go- 
ing primary verbal behavior because autoclitic responses 
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provide a listener with more information about a speaker’s 
primary response, and a listener reinforces a speaker for 
this extra information. 


A primary verbal response constitutes any of the verbal 
operants (e.g., mand, tact, intraverbal). 


A secondary autoclitic response is controlled by some 
aspect of the primary verbal operant or by its controlling 
variables. There are three types of secondary autoclitic 
behavior: autoclitic mand, autoclitic tact, and autoclitic 
intraverbal frames. 


The autoclitic mand enjoins a listener to take some specific 
action regarding the primary response, and the autoclitic 
behavior is reinforced by that action. 


The autoclitic tact is controlled by some nonverbal feature 
of the primary response or its controlling variables, and the 
autoclitic response informs the listener of that feature. 

This behavior is reinforced by listeners who provide gen- 
eralized S'. 


Intraverbal autoclitic frames provide order, agreement, 
grouping, and composition of larger units of verbal behav- 
ior. An autoclitic frame facilitates the connections between 
words in a given utterance and is reinforced by listeners 
who provide generalized S'. 


Applications of Skinner’s (1957) Analysis of Verbal 
Behavior 
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Behavior analysis provides three major contributions to 
language assessment and intervention programs for indi- 
viduals with special needs: (1) applied behavior analysis 
technology, (2) a behavioral analysis of human develop- 
ment, and (3) a behavioral analysis of language. 


Applied behavior analysis can provide professionals and 
parents with the necessary intervention procedures and 
scientific methods to teach language skills. 


The development of language and social skills by typically 
developing children can provide a valuable guide for a lan- 
guage intervention curriculum. 


Skinner’s analysis of verbal behavior can serve as a frame- 
work for specific language assessment and intervention 
programs. 


A language assessment for a person with language delays 
has three primary goals: (1) to identify the nature of the 
delay or problem, (2) to compare the person’s performance 
with expected norms, and (3) to provide guidance for an 
appropriate intervention program if necessary. 


Language assessment for a person with language delays 
involves measuring the strength of the elementary verbal 
operants, listener skills, self-listener skills, multiply con- 
trolled responding, speaker-listener interactions. 


Assessing a child’s mand repertoire involves identify- 
ing the degree to which the various MOs control verbal 
responses. 
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There are two general methods are used for identifying 
and controlling the value change of MOs for mand train- 
ing: (1) capturing the MO value change as it occurs natu- 
rally and (2) creating the MO by generating some change 
that alters the value of a consequence. 


Assessing a tact repertoire involves determining the degree 
to which nonverbal stimuli control verbal behavior. 


The goal of a tact intervention program is to establish a 
fluent and generative tact repertoire that is functional for 


the child, and integrated with the other speaker and listener 
repertoires. 


A listener assessment can be framed by the parts of speech 
and the five types of verbal discriminations previously 
described. In addition, an assessment of a child’s self- 
listener skills is important. 


Early listener discrimination skills should be established 
simultaneously with early tacting skills. 


Learning to be a listener involves the capability to discrim- 
inate between words (or signs, text, etc.), especially when 
they appear in novel combinations, in different nonverbal 
environments, and under different MOs. 


The four types of verbal discriminations, along with the 
parts of speech, and multiple control involving nonverbal 


KEY TERMS 


audience generic (tact) extension 
autoclitic impure tact 

automatic punishment intraverbal 

automatic reinforcement listener 

convergent multiple control mand 


copying a text 
divergent multiple control 
echoic 


formal similarity 
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multiple control 
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D : : 
S’s and MOs, can provide a framework for intraverbal 
assessment and intervention. 


. The development of intraverbal behaviors by typically 


developing children can offer a frame of reference for an 
intraverbal intervention program. 


There are several prerequisite skills are necessary to ben- 
efit from a formal autoclitic intervention program: (1) A 
speaker’s existing verbal skills are sufficient to support 
secondary verbal behavior, (2) a speaker demonstrates 
self-listening skills, and (3) a speaker can tact aspects of 
her own verbal and nonverbal behavior. 


Generative verbal learning is a major goal for a language 
intervention program. Procedures from stimulus equiva- 

lence, relational framing, bidirectional naming, and joint 
control can all help to facilitate the emergence of genera- 
tive repertoires for a person with language delays. 


The Loss of Verbal Behavior 
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metaphorical (tact) extension 


metonymical (tact) extension 


Verbal repertoires can also deteriorate, as observed in 
cases of dementia, Alzheimer’s disease, aphasia, and trau- 
matic brain injury. Skinner’s analysis of VB can serve as a 
guide for language assessment and intervention for these 
individuals much in the same way it does for those learn- 
ing to communicate for the first time. 


private events 


solistic (tact) extension 
speaker 

tact 

textual 

taking dictation 

verbal behavior 


verbal operant 


point-to-point correspondence 


MULTIPLE-CHOICE QUESTIONS 


The formal properties of language involve the: 
a. Cause of the verbal response 

b. Topography of the verbal response 

c. Unseen forces of verbal language 

d. Language properties are not formal 


Hint: (See “Form and Function of Language’’) 


2. 


The functional properties of language involve: 


a. Causes of the verbal response 

b. Topography of the verbal response 

c. Language properties do not have functional components. 
d. Unobservable psychic antecedents. 


Hint: (See “Form and Function of Language”) 
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3. A mand is a type of verbal operant in which a speaker: 
a. Differentially responds to the verbal behavior of others 


b. Asks for (or states, demands, implies, etc.) what he or 


she needs or wants 


. Repeats the verbal behavior of another speaker 


d. Names things and actions that the speaker has direct 
contact with through any of the sense modes 


Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations’”’) 

. An intraverbal response is a type of verbal operant in 
which a speaker: 
a. Differentially responds to the verbal behavior of others 


b. Asks for (or states, demands, implies, etc.) what he 
needs or wants 


c. Repeats the verbal behavior of another speaker 


d. Names things and actions that the speaker has direct 
contact with through any of the sense modes 


Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations”’) 

. A tact is a type of verbal operant in which a speaker: 
a. Differentially responds to the verbal behavior of others 


b. Asks for (or states, demands, implies, etc.) what he 
needs or wants 


ESSAY-TYPE QUESTIONS 
1. Explain some of the factors in the slow appreciation of 
Verbal Behavior. 
Hint: (See “Form and Function of Language’) 

. Describe the term mand. Be sure to include in your answer 


the definition of the term and its importance for the early 
development of language. 


Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations’’) 


ae eens 


NOTES 


1. For ease of reading throughout the chapter, the male pronoun will be used 

for the listener and the female pronoun will be used for the speaker. 

. However, when using the term verbal behavior to refer to Skinner’s analy- 
sis of language, the “verbal” part involves the behavior of the speaker, but 
where is the behavior of the listener in that term? Skinner did not neglect 
the listener in the analysis, but one must add the phrase “and the behavior 
of the listener” when describing Skinner’s analysis of verbal behavior. An 
alternative is to use the phrasing “speakers and listeners” when describing 
the complete verbal field. 

. For more detail, see Carbone (2013); Endicott and Higbee (2005); Hall and 
Sundberg (1987); Landa, Hansen, and Shillingsburg (2017); Lechago, Carr, 
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c. Repeats the verbal behavior of another speaker 


d. Names things and actions that the speaker has direct 
contact with through any of the sense modes 


Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations’’) 

6. A textual operant has 
a. Point-to-point correspondence but no formal similarity 
b. Point-to-point correspondence and formal similarity 
c. No point-to-point correspondence and no formal 

similarity 

d. None of these 
Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations”’) 

7. Taking dictation consists of: 


a. A discriminative stimulus evoked by a verbal response 
that does not have point-to-point correspondence. 


b. Reading without any implications so that the reader 
understands what is being read 


c. Writing and spelling words that are spoken 
d. None of these 


Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations’’) 


3. Describe the term tact. 
Hint: (See “The Elementary Verbal Operants and Listener 
Discriminations’’) 

4. Compare and contrast convergent multiple control and 
divergent multiple control. 
Hint: (See “Multiple Control”) 
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Grow, Love, and Almason (2010); Oleson and Baker (2014); and Sundberg, 
Loeb, Hale, and Eigenheer (2002). 

4. For reviews and research topics, see Aguirre, Valentino, and LeBlanc 
(2016); Dymond, O’Hora, Whelan, and O’Donovan (2006); Gamba, 
Goyos, and Petursdottir (2015); Petursdottir (2018); Petursdottir and 
Devine (2017); Sautter and LeBlanc (2006); Sundberg (2013). 

5. See Sundberg and C. A. Sundberg (2011) fora developmentally sequenced 
80-item intraverbal subtest for the Verbal Behavior Milestones Assessment 
and Placement Program (VB-MAPP) (Sundberg, 2014). 


PART 


Developing New Behavior 


The five chapters in Part 8 describe methods for developing new behavioral repertoires. 

Chapter 19, by Carol Pilgrim, outlines the research foundations and core concepts for equivalence- 
based instruction (EBI) and discusses several key outcomes of EBI, namely, class formation, delayed 
emergence, class expansion and class merger, transfer of function, and contextual control. The 
chapter ends with a treatment of how to apply EBI in instructional and generalized settings, as well as 
applications viewed through the perspectives of Naming Theory and Relational Frame Theory. 

Chapter 20, by Tom Critchfield and Ruth Anne Rehfeldt, describes how learning can be 
engineered with nonequivalent relations. After defining nonequivalence classes and why they matter, 
these authors present the basic vocabulary for explaining nonequivalence relations, describe types 
and examples of nonequivalence relations, review key theoretical concepts, address the role of 
nonequivalence relations in higher-order psychological constructs, and explain how nonequivalence 
relations provide a basis for enhancing well-being in ways that are quite different from typical 
approaches in applied behavior analysis. 

Chapter 21 details how new behaviors are acquired through modeling, imitation, and 
observational learning. After defining and providing examples of these terms, and outlining a training 
protocol for learners who do not display imitative behavior, the chapter describes how practitioners 
can use modeling as a behavior change strategy and teach children to be skilled observational 
learners. 

Chapter 22 describes how to shape new behavior by reinforcing successive approximations to a 
terminal behavior. The chapter also includes procedures for increasing the efficiency of shaping and 
guidelines for using shaping in applied settings. 

Chapter 23 explains how discrete responses can be linked to create behavior chains of more 
complex behaviors. Procedures for task analysis—breaking down complex tasks into sequenced steps 
for instruction—are described. The chapter addresses the varied methods of chaining and factors 
affecting the performance of behavior chains. 
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CHAPTER 19 


Equivalence-based Instruction 


Carol Pilgrim 


LEARNING OBJECTIVES 


e Understand the research foundations and core concepts of equivalence-based instructions. 


e Learn the designing of equivalence-based instructions procedures. 
¢ Discuss the application and generality of equivalence-based instructions with varied participant populations. 


¢ Discuss the applications stemming from alternative theoretical approaches to relational responding. 


It would be difficult to overestimate the role played by 
symbols in our everyday lives. Consider my commute to 
work this morning: As I left the house, I discovered a small 
vase on my vanity, filled by my better half with fresh aza- 
leas from our garden. With considerable goodwill toward 
the world, then, I pushed the button marked “Start” on my 
car’s dash, moved the shifter to “R,” and punched “5” on 
the console for our local NPR station. An angry debate was 
in progress on the meaning of “building the wall” for U.S. 
Democrats and Republicans, for American immigrants (past, 
present, and future), and, indeed, for observers from around 
the world. With a quick shift to my shuffled music play list, 
John Lennon’s “Give Peace a Chance” created a powerful 
contrast. Shortly thereafter, a swing by the drive-through 

of my local bank branch allowed me to deposit a check and 
move funds from one account to another, with nary a coin 
exchanged. Next, while crossing campus, I negotiated traffic 
lights, traffic signs, and hand signals from Officer O’ Malley. 
Graduating seniors in regalia, having just received their caps 
and gowns, were posing in front of every Seahawk sculpture 
on the university grounds, hamming it up for selfies with 
friends. As I parked and approached my office, the chime of 
the clock tower quickened my pace to a meeting with one of 
my graduate students, to review her thesis progress. 


ones that flow so automatically they’d be unlikely to 

receive a second thought, all seemingly unremarkable. 
On closer inspection, however, an observant reader might note 
that every detail of this anecdote involves a symbol. Ranging 
from the mundane to the sublime (e.g., from braking to love), 
these symbols were responsible for my very ability to have an 
ordinary morning. Indeed, much of the richness of our lives, 


‘k short vignette relates a sequence of ordinary events— 
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both the precision (e.g., account balances) and the breadth (e.g., 
abstractions like peace) of our experience, is made possible by 
the symbolic function of the events that surround us. Imagine 
the limitations that would necessarily characterize the world of 
individuals for whom symbolic function is not so easily acquired 
and, conversely, the benefits that accrue when multiple symbolic 
functions are generated rapidly. What sorts of steps might be 
taken to establish symbols when they are lacking, or to maxi- 
mize their number efficiently? How can a single symbol come 
to hold a range of different meanings across individuals (e.g., 
from security to oppression)? And while we may feel that “we 
know one when we see one,” what, exactly, do we mean when 
we speak of a symbol anyway? The few examples given here 
should help to illustrate that if we are ever to achieve a healthy 
and happy world, or even a minimally functional one, it will be 
important to find answers to such questions. In point of fact, 
these are the questions that lie at the heart of the study of stim- 
ulus equivalence. 

Behavior analytic research on stimulus equivalence 
explores how a person’s experience with the environment estab- 
lishes or changes the function of symbols—that is, their abil- 
ity to impact our behavior, or their meaning. In doing so, this 
research helps us understand a great deal about human behav- 
ior and about how instruction can be designed and delivered to 
maximize learning outcomes. Indeed, one of the most signifi- 
cant characteristics of equivalence-based instruction (or EBT) 
is its generativity. When designed properly, a small amount of 
explicit instruction can yield an extensive and predictable net- 
work of new relations that were not directly targeted by the 
teaching, which is certainly a win-win scenario for student and 
teacher alike. The tremendous “bang for the buck” nature of EBI 
makes it especially relevant to today’s pressing need for effi- 
cient and effective educational strategies. This chapter further 
develops the basic concept of stimulus equivalence by illustrat- 
ing a wide range of equivalence outcomes in applied research 


and by outlining key features of the equivalence procedures that 


practitioners or teachers can incorporate into their own instruc- 
tion efforts. 


RESEARCH FOUNDATIONS AND 
CORE CONCEPTS 


The behavior analytic approach to symbolic function known as 
stimulus equivalence made its debut in the early 1970s (Sid- 
man, 1971; Sidman & Cresson, 1973) and received its first full 
operational treatment in 1982 (Sidman & Tailby, 1982; Sid- 
man et al., 1982). That’s not so very long ago in science years. 
Since that time, probably no other single topic has occupied 
more attention across the domains of basic experimental, con- 
ceptual, and applied behavior analytic science. Let’s begin with 
a brief review of the equivalence phenomenon, before turning 
our attention to the reasons for its emphasis within the field. 


Pioneering Research on Symbolic 
Stimulus Relations 


What Was Taught 


Sidman’s initial studies on reading acquisition (Sidman, 1971; 
Sidman & Cresson, 1973) provide a perfect context for an 
introduction to the basic terms and concepts of an equivalence 
approach. Because these terms and concepts will play a criti- 
cal role in the descriptions to follow throughout this chapter, 
we will spend some time here developing each of them with 
respect to Sidman’s landmark studies. The participants in these 
experiments were three institutionalized young men (ages 17 
to 19) with severe intellectual disabilities. Despite extensive 
teaching histories, none of the participants had developed any 
reading skills whatsoever. Sidman targeted this critical func- 
tional repertoire with a teaching arrangement known as the 
matching-to-sample procedure, an arrangement that has 
much in common with everyday classroom teaching approaches. 
Matching-to-sample is a discrete-trial procedure, in Sidman’s 
case, each teaching trial began with the repeated presentation 
of 1 of 20 different spoken words, referred to as sample stimuli, 
and the illumination of a round key in the center of the teach- 
ing apparatus. The participant was required to touch the illumi- 
nated key, to indicate that the spoken word had been noticed, 
a response referred to as an observing response. The observing 
response resulted in presentation of eight simple line drawings, 
known as comparison stimuli, arranged in a circular configura- 
tion around the sample key. One of the comparison drawings 
corresponded to the spoken word presented as the sample stimu- 
lus on that trial: for example, given the spoken word “car” as the 
sample, the comparisons might include drawings of a car, a sun, 
acup, a cat, a box, a cow, a cap, and a dog. (See Figure 19.1 for 
an illustration of two trial presentations.) The participant’s job 
at this point was to select one of the comparison stimuli, again 
by touching it. Selecting the drawing that corresponded to the 
spoken word (e.g., selecting the picture of a car after hearing 
“car”) produced pennies and a chime; that is, the correspond- 
ing comparison was discriminative for reinforcement. Selecting 
any of the nonmatching comparison stimuli had no programmed 
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Figure 19.1 An illustration of two trials of the AB matching- 
to-sample procedure, with the spoken sample stimulus 
indicated at the top of each diagram. 


From “Equivalence Relations and Behavior: An Introductory Tutorial, ” 
by M. Sidman, 2009, The Analysis of Verbal Behavior, 25, p. 8. Used by 
permission. 


consequences. In either case, a brief interval followed, after 
which the next trial began. 

Across trials, each of the 20 words was presented as a 
sample stimulus. Thus, the comparison stimulus designated as 
correct, or discriminative, on any given trial varied as a func- 
tion of the specific sample stimulus presented on that trial. After 
hearing “car,” selecting the picture of a car produced reinfore- 
ers; after hearing “cup,” selecting the picture of a cup produced 
reinforcers; and so on. The participant could not earn reinforcers 
consistently by selecting the correct comparison from the pre- 
vious trial; correct selections had to “match the sample.” Said 
a bit more technically, stimulus control by the sample stimulus 
determined the discriminative comparison stimulus. The match- 
ing-to-sample procedure thus arranges a four-term contingency, 
the positive elements of which include two antecedent stimuli 
(the sample stimulus and the correct comparison stimulus), the 
selection response, and the reinforcer. The four-term contin- 
gency brings the more familiar three-term contingency under 
another level of antecedent stimulus control in that the sample 
stimulus determines the appropriate three-term, or discrimina- 
tive, unit. The performance produced by this four-term contin- 
gency (i.e., selecting the appropriate comparison on each trial) 
is known as conditional discrimination because discrimination 
between the comparison stimuli is conditional upon, or depends 
upon, the sample stimulus present on each trial. It follows that 
the sample stimuli are also known as conditional stimuli in this 
arrangement. In Sidman’s first training phase, then, the discrim- 
inative function of each picture was conditional on the word 
spoken. Also with respect to terminology, Sidman’s teaching 
arrangement could be described as an arbitrary or sometimes 
even a symbolic matching-to-sample procedure, because the 
correct comparison stimulus on a given trial did not depend 
on any physical relation to the sample (as opposed to identity 
matching-to-sample, where the sample and correct comparison 
are physically identical, or oddity matching-to-sample, in which 
the correct comparison is the only one physically different from 
the sample). 

It is common in the literature on equivalence to find alpha- 
numeric labels used to specify individual training stimuli. This 
practice is useful in providing a standard descriptive system that 
readily accommodates the many variations in stimulus types and 
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training procedures that are represented in the literature. It can 
take a little getting used to, but gaining some familiarity with 
this system will facilitate descriptions of the studies presented 
throughout the remainder of this chapter and make them eas- 
ier to follow as a reader. Each alphanumeric label designates a 
physically unique stimulus, where the letter denotes a stimulus 
set (e.g., a set of sample stimuli, or a set of comparisons) and 
the number denotes a potential equivalence class. For example, 
the spoken-word sample stimuli in Sidman’s reading studies 
could be designated as Al, A2, A3, .. . A20; and across training 
trials, each sample would be presented along with an array of 
eight drawings from the B stimulus set, designated as B1, B2, 
B3 ... B20. As reviewed previously, during training, selecting 
comparison B1 in the presence of sample Al, selecting B2 in 
the presence of A2, and so on, would produce a consequence 
determined to serve as a reinforcer, while selecting any of the 
alternative comparisons would result in the intertrial interval 
only. Once selection of the appropriate B comparison stimulus is 
demonstrated reliably in the presence of each A sample, the per- 
formance is described as an AB conditional discrimination. In 
Sidman’s studies, correctly matching pictures to spoken words 
constituted the AB conditional discrimination. 

Sidman’s second teaching phase began for each partici- 
pant once all AB performances were strong. (In fact, his par- 
ticipants were able to match a number of the pictures to words 
even before training began.) Teaching Phase 2 was conducted 
exactly as Phase | had been, but with one important differ- 
ence. Instead of the pictures, 20 written words now served 
as comparison stimuli, with 8 of the words presented on any 
given trial. Spoken words served again as sample stimuli. (See 
Figure 19.2.) Thus, Phase 2 was designed to establish an AC 
conditional discrimination. 

This training phase took some time to complete, and for 
two participants (from Sidman & Cresson, 1973) the words were 
taught in increasing set sizes; first, 9 words were taught to a mas- 
tery criterion, then 5 more words were added for a total of 14, 
and then all 20 words were presented together. When all AB and 
AC performances were strong, the important questions could 
then be addressed. Had these young men acquired skills that 
might reasonably be described as true reading comprehension? 
Did they really understand the words as symbols, or had they 
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Figure 19.2 An illustration of two trials of the AC matching- 
to-sample procedure, with the spoken sample stimulus 
indicated at the top of each diagram. 


Adapted from “Equivalence Relations and Behavior: An Introductory 
Tutorial,” by M. Sidman, 2009, The Analysis of Verbal Behavior, PABY (Oe Te 


simply learned the rote pairings that had been directly targeted 
by the training procedures? 


What Was Learned 


To distinguish between the two possibilities, Sidman designed 
an ingenious series of test or probe trials, still presented in 
the matching-to-sample format, but in the absence of any pro- 
grammed consequences. In other words, each probe trial was 
presented under extinction conditions, which allowed inter- 
pretations in terms of symbolic, or true matching, function. (In 
fact, these same tests had been presented at the very start of 
the studies, prior to any teaching, to determine baseline levels 
of performance; with the exception of some correct selections 
on AB trials, as noted above, and some correct picture-naming 
responses, baseline accuracy levels were uniformly low.) 
The probe trials presented new combinations of sample and 
comparison stimuli that were never encountered during train- 
ing. Thus, any consistent patterns of responding observed on 
the probes (in the absence of reinforcement) could be described 
as emergent stimulus relations. The term derived stimulus 
relations is sometimes used as a synonym for these emergent 
performances. ! 

There are four critical types of probe trials in equiva- 
lence studies. To illustrate Sidman’s training and testing design, 
Figure 19.3 presents a schematic of the trial types involved, sim- 
plified for the sake of space to show only 3 of the 20 stimuli from 
each training set. Figure 19.4 presents exactly the same infor- 
mation using the alphanumeric stimulus-designation system, to 
illustrate the relation between the two descriptive approaches. 
In both formats, arrows point from the sample stimuli, which 
would be presented individually, to the comparison stimu- 
lus array. Solid arrows indicate stimulus relations for which 
explicit reinforcement contingencies had been arranged during 
training phases (i.e., the AB and AC conditional discriminations 
described above). Broken arrows indicate the untrained stimulus 
relations that could be tested by probe trials. In his 1982 papers 
(Sidman & Tailby, 1982; Sidman et al., 1982), Sidman provided 
descriptors for each probe-trial type, taken from the mathemat- 
ical set-theory definition of equivalence. He reasoned that these 
behavioral tests captured operationally the abstract mathemat- 
ical properties said to hold when a set of events are defined as 
equivalent to one another. 

To explain further, after teaching AB and AC conditional 
discriminations, stimulus reflexivity is tested by probes for 
generalized identity matching between each of the experimental 
stimuli. For example, with stimulus B2 as sample, and stimuli 
B1, B2, and B3 as comparisons, selecting stimulus B2 would 
be indicative of reflexivity. Consistent reflexive selections with 
all of the experimental stimuli (e.g., the pictures and the written 
words, in Sidman’s studies), in the absence of reinforcement 
for such selections, indicate that each stimulus is functioning 
as equal to itself. The mathematical statement of reflexivity 
holds that A = A; thus, generalized identity matching pro- 
vides an operational definition of the mathematical property. 
The mathematical statement of symmetry holds that if A = B, 
then B = A orif A = C, then C = A. This property is typi- 
cally operationalized by probe trials that reverse the roles of the 
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Figure 19.3 Schematic representation of the 
trained and tested conditional discriminations. 
Solid arrows indicate trained relations, and dashed 


Equivalence arrows indicate tested relations. The arrows point 
hs from sample stimuli to comparison arrays. 
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stimuli presented as sample and comparisons during training. 
For example, after learning to select the appropriate B compari- 
son given an A sample (i.e., selecting B1 given Al, B2 given A2, 
and B3 given A3), a symmetry probe would present one of the 
B stimuli as a sample, with Al, A2, and A3 as the comparison 
choices. Similarly, after establishing selections of the appro- 
priate C comparison stimulus given an A sample, probe trials 
would present each of the C stimuli as samples with Al, A2, and 
A3 as the comparison array. Consistent patterns of reversible 
stimulus selections (e.g., choosing Al given B1, or choosing 
A2 given C2), again in the absence of reinforcement for the 
selections, across all relevant probe trials would demonstrate 
the property of symmetry. The young men in Sidman’s early 
studies received a modified version of the typical symmetry 
probes because the A stimuli in their training consisted of spo- 
ken words, and it is difficult to present multiple spoken words 
simultaneously as choices for selection. Instead of the standard 
symmetry test, then, each of the B and C stimuli was presented 
individually, and the participant was asked to name it (a tact 
or textual response, respectively), arguably a more demanding 
task, as the appropriate response had to be produced rather than 
selected from an array. Nevertheless, the exciting result was that 
these boys not only named the pictures correctly but also read 
the written words; that is, they read the words orally where they 
had been unable to do so prior to the teaching phases, despite 
the fact that oral reading had not been directly targeted during 
training, and despite the fact that no reinforcers were available 
on the probe trials. This was a striking accomplishment, but did 
the boys understand the meaning of the words? 

A third probe-trial type evaluated this possibility. The 
mathematical statement of transitivity holds that if A = B, 
and if B = C, then A = C, and when both symmetry and 
transitivity hold, C = A. This latter arrangement is often 
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trained and tested conditional discriminations, with 
individual stimuli indicated by their alphanumeric 
designations. 


described as a combined test for equivalence, or even more 
simply, an equivalence test, because both properties are eval- 
uated simultaneously. The participants in the present studies 
had received AB and AC training; thus, their combined tests 
for equivalence involved either B stimuli presented as sam- 
ples with C stimuli as comparisons, or the converse—C stim- 
uli as samples with B stimuli as comparisons (i.e., if A = B, 
andif A = C, then B = CandC = B). On some probe trials, 
a picture (from stimulus set B) was presented first and the 
task was to select the matching written word (from stimulus 
set C). On other probe trials, a written word was presented 
first, and the task was to select the matching picture. All 20 
of the possible picture-word combinations were evaluated in 
this way. Consistent and appropriate matching of pictures to 
words (e.g., Selecting a picture of a sun when the sample was 
the written word “SUN,” or selecting the written word “CAR” 
when the sample was a picture of a car), again in the absence 
of reinforcement for the selections, across all relevant probe 
trials would demonstrate the combined properties of symme- 
try and transitivity. More importantly for these young men, 
such consistent patterns would also represent true reading 
comprehension, an indication that the meaning of the words 
was understood. Sidman (2007) describes observing the 
first probe trials conducted with these young men, he and 
his colleagues watched from behind a one-way mirror, all 
with baited breath. As one correct response after another 
was demonstrated, the scientists literally danced, and at the 
end of the session, the research assistant jumped up, hugged 
the participant, and shouted, “Kent, you can read!” (Sidman, 
2007, p. 315). This excitement on the part of the research- 
ers helps to communicate the power and the potential of the 
outcome that had just been observed. Indeed, all three of the 
young men involved went on to show the emergent patterns 
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indicative of oral reading and reading comprehension for the 
first time in their lives, even though these particular perfor- 
mances had not been directly taught. 

To complete our lesson on basic equivalence terminol- 
ogy, when successful performances like those just described 
are obtained on probe trials of all three types—teflexivity, sym- 
metry, and transitivity (or combined tests for equivalence)—in 
the absence of reinforcement, the defining criteria for equiva- 
lence-class formation are satisfied. In the studies at issue here, 20 
different stimulus classes, each including a spoken word, a picture, 
and a printed word (i.e., AIB1C1; A2B2C2; . . . A20B20C20) 
could be characterized as equivalence classes in that the mem- 
bers of each class were perfectly interchangeable within a given 
context. Said another way, each stimulus from a given class 
functioned identically, whether as a sample or as a comparison 
stimulus, in relation to any other member of the same class. It is 
exactly this emergent interchangeability that allows us to interpret 
the stimuli within each class as symbolic of one another (e.g., a 
symbolic relation was demonstrated between the spoken word, 
“car,” the written word, [CAR], and the picture of the car). What’s 
more, this interchangeability was the result of a structured teach- 
ing approach involving only a small subset of the total number of 
relations generated. 


Why It Mattered 


The reading accomplishments of these three young men thus 
served to inspire the study of equivalence on multiple fronts. 
First, new forms of stimulus control were created that had never 
been targeted directly by the teaching procedures. Consider the 
productivity of the approach taken in these cases: 20 AB (spo- 
ken word-picture) and 20 AC (spoken word—written word) rela- 
tions either were already demonstrated by these participants or 
were explicitly established by the reinforcement contingencies 
of the matching-to-sample training procedure. The resulting per- 
formances included those 40 relations, as well as the potential 
for 60 new reflexivity relations, 40 new symmetry relations, 
and 40 new combined equivalence relations, for a grand total of 
180 discriminated operant units! This production of reliable and 
predictable patterns of novel stimulus-control relations, partic- 
ularly when the novel control is not based on physical similar- 
ities across the stimuli (as in primary stimulus generalization), 
is a striking behavioral phenomenon, demanding of research 
in its own right, just to understand the processes that might be 
responsible. Prior to Sidman’s work (Sidman, 1971; Sidman 
& Cresson, 1973; Sidman & Tailby, 1982), there was nothing 
within an extensive behavior analysis literature to suggest such 
generative capacity after basic four-term contingency training. 
A tremendous research effort has followed. 

A second reason for immediate excitement about Sidman’s 
early findings followed unequivocally from the first, and is per- 
haps most relevant to the content of this textbook. Given the 
focus on reading in Sidman’s studies, the potential for utilizing 
an equivalence approach to promote teaching efficiency was 
immediately apparent. Sidman’s early successes had captured 
that holy grail of applied outcomes—establishing extensive, 
varied, and flexible repertoires with only a modicum of direct 
training. The way had been paved for creative extensions of 
the equivalence approach to behavioral targets across a broad 


spectrum of academic and functional life skills, a number of 
which will be reviewed in the pages that follow. 

A third striking feature of the equivalence outcome just 
described is the degree to which it captured the essence of what 
is meant when we speak of a “symbol.” The three defining 
probe-irial types revealed that an arbitrary sound (i.e., a spoken 
word), an abstract set of black-and-white markings (i.e., a writ- 
ten word), and a related picture were each treated identically and 
interchangeably, just as would be expected of a symbol and its 
referent. This sort of substitutability has often been described in 
terms of a symbol “standing for,’ or “meaning the same as,” 
its referent; however, a clear specification of the behaviors 
necessary for identifying a true symbol had long been lacking 
(e.g., Sidman, 1994; Wilkinson & Mcllvane, 2001; Box 19.1 
provides an example of the confusions that often result). Sid- 
man’s equivalence criteria had thus established a compelling 
operational definition of symbolic function. In so doing, these 
criteria also provided a basis for empirical behavioral analysis of 
other complex performances critical to much of human behav- 
ior (e.g., categorization, semantic correspondence, syntax), as 
well as a possible approach to the development of (or change 
to) these sorts of performances as clinical targets. One case 
in point involves the study of behavioral novelty or creativity. 
Indeed, creativity has sometimes been erroneously characterized 
as unamenable to behavioral accounts, given that a history of 
reinforcement is lacking, by definition, when a behavior occurs 
for the first time. With equivalence procedures, however, novel 
behavior is illustrated reliably and in predictable patterns by 
the emergent performances on probe trials. Other examples 
of extensions to complex human functioning include equiva- 
lence-based models of instruction following, social stereotypes, 
false memories, and clinical issues such as the spread and treat- 
ment of generalized anxiety or phobias, just to mention a few.” 
An important shared characteristic of these complex patterns of 
responding is the involvement of emergent symbolic function. 


Continued Development of Equivalence 
Research: Key Outcomes Obtained with 
Equivalence Approaches 


Recent years have seen a repeated emphasis on the importance of 
translational research, both for science in general (e.g., Woolf, 
2008; Zerhouni, 2005) and for behavior analysis in particular 
(e.g., Critchfield, 2011; Mace & Critchfield, 2010). A traditional 
conceptualization of translational research has focused on mov- 
ing the findings from our basic laboratory science into the hands 
of practitioners, where our increased understanding can be put 
to important functional ends. In fact, this pragmatic strategy has 
been a defining characteristic of behavior analysis since its early 
formulations. “What we call the scientific conception of a thing 
is not passive contemplation. When we have discovered the laws 
which govern a part of the world about us, we are then ready 
to deal effectively with that part of the world” (Skinner, 1953, 
p 13). At the same time, behavior analysts have long realized 
that when those behavioral principles emerging from our labs 
are applied to significant social problems (Bear, Wolf, & Risley, 
1968, 1987), additional critical questions about the principles and 
their operation inevitably arise. This often necessitates a return 
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BOX 19.1 


But | Know My Dog Understands Me 


We often hear heartwarming stories in the media about clever 
pets—especially dogs—who understand their owner’s words so 
well that they perform life-saving rescues (remember Timmy 
telling Lassie to go get help?), even calling 911 and barking into 
the phone. Indeed, well-trained dogs help owners with mental 
health issues cope with emotional overload; they follow com- 
mands from owners with physical disabilities to retrieve dropped 
items, open and close doors, and provide support for standing 
and walking; and they fetch medications, phones, and other 
items for the elderly in emergency situations. There are reports 
of dogs who recognize more than 1000 human words, and one 
recent functional magnetic resonance imaging (fMRI) study 
(Andics et al., 2016) found different types of brain activity when 
dogs were presented with meaningful words in either a praising 
or a neutral tone compared to meaningless words presented as 
praise. Further, many dog owners are convinced that their pets 
understand not only their words but their emotions as well; we all 
know pets who respond very differently in the face of an owner’s 
laughter versus tears. Surely these reports, and scads of others 
like them, document that dogs understand what we say, right? 
This question gets right to the heart of what it means to 
speak of “understanding a word.” To pose the issue a bit dif- 
ferently, is there any reason to believe that a dog’s response to 
“Sit,” for example, differs from a child’s, or from our own, for 
that matter? In all three cases, the appropriate bodily position 
may be executed perfectly and immediately upon hearing the 
verbal command, so observation of that particular interaction 
does little to reveal a distinction between the three instances. 
Nevertheless, should we feel comfortable concluding that 
the dog in question actually understands the meaning of the 
word “Sit” in the same sense that the child does? Is “Sit” a 
true symbol for a certain action or posture, or is the dog’s 
“understanding” limited to performing a single response that 
has been explicitly trained in the presence of a specific dis- 
criminative stimulus? These are essentially the same ques- 
tions Sidman was grappling with in his early equivalence 
studies (Sidman, 1971; Sidman & Cresson, 1973; described 
above), when he sought to find out whether the match-to- 
sample teaching procedure had produced true reading com- 
prehension for the boys who participated. When the boys 
came to choose the appropriate written word upon hearing it 
spoken after repeated reinforcement for doing exactly that, 
did they understand the meaning of the words, or were they 
simply performing a trained rote response? As with the dog 
example, accurate selection of the directly trained option, 
in and of itself, did not reveal an answer. Further indicators, 
beyond the explicitly reinforced behavior patterns, are needed 
to make judgments about understanding, comprehension, 


to the lab for further investigation, followed by a new round of 
refined applications, and so the cycle continues. In this way, trans- 
lational research represents a reciprocal and dynamic process in 
which the laboratory and the field inform and inspire each other, 


and symbolic function. To return to the “Sit” example, after 
learning to take a seat when asked to, a child might then also 
request that her mother “sit” in an appropriate circumstance, 
accurately label or describe other people as sitting, match 
“Sit” with pictures of novel individuals doing so, and point 
out suitable objects on which one might sit, all in the absence 
of further teaching. These novel or emergent performances 
provide the types of evidence necessary to reflect understand- 
ing of the word. In contrast, responses to “Sit” for even the 
cleverest pets would likely be limited to executing the one 
physical position they had been taught to perform. Dogs may 
indeed be taught to make specific and even useful responses 
when presented with many different spoken words, which is 
impressive, to be sure. It falls short, however, of demonstrat- 
ing true symbolic understanding. 

Of course, operationalizing exactly what is needed to 
convince us of understanding and comprehension provided 
the impetus for Sidman’s development of equivalence, or 
symbolic matching-to-sample, as characterized by the prop- 
erties of reflexivity, symmetry, and transitivity (Sidman & 
Tailby, 1982). Indeed, ever since those defining features 
were first introduced, the possibility that equivalence pat- 
terns might be demonstrated by (nonhuman) animals has 
been a topic of considerable research activity. Early work 
on the topic found little support for responding consis- 
tent with the properties of equivalence, particularly sym- 
metry, with species ranging from pigeons (e.g., Hogan & 
Zentall, 1977), to Cebus monkeys (D’ Amato, Salmon, Loukas, 
& Tomie, 1985), to rhesus monkeys and baboons, (Sidman 
et al., 1982), and even to chimpanzees (e.g., Dugdale & Lowe, 
1990). More recent work has focused on the very real possibil- 
ity that the experimental procedures designed for humans may 
not be optimal for assessing equivalence-class formation in 
animals—in other words, we may have been asking questions 
of our animal subjects in the wrong ways. Methodological 
arrangements better suited to the strengths and learning styles 
of a given species (e.g., using olfactory stimuli with rats; Pena, 
Pitts, & Galizio, 2006) have yielded some promising leads. 
Interestingly, some of the most compelling evidence to date for 
equivalence-class formation in animals has come from two sea 
lions, Rocky and Rio (Kastak, Schusterman, & Kastak, 2001). 

The experimental histories of these subjects took place 
over many years, and included extensive training on a number 
of related tasks, so the complete set of factors responsible for 
their strong test performances cannot be specified. You can rest 
assured, however, that the search for equivalence in animals is 
alive and well in research labs around the world (e.g., Galizio 
& Bruce, 2017). 


leading to increased effectiveness in both domains, and providing 
for the strength of behavior analysis as a discipline. For present 
purposes, there is perhaps no better example of a translational 
research agenda than the one that has characterized stimulus 
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equivalence. The equivalence-research story is especially nota- 
ble in this context in that the core phenomenon was brought 
to light initially in an application, reading instruction, which 
then demanded laboratory follow-up, rather than the other way 
around—a valuable exemplar for applied behavior analysts and 
basic researchers alike. Since Sidman’s landmark studies, many 
hundreds of basic laboratory analyses have investigated the con- 
ditions necessary and sufficient for equivalence-class formation, 
while equivalence applications have benefited learners ranging 
from young typically and atypically developing children, to col- 
lege students, to adults with brain damage and dementia. As this 
body of work developed and progressed, several key outcomes 
were identified that added fuel to excitement about the scope and 
power of equivalence approaches for applied behavior analysis. 
These outcomes include class formation, expansion and merger, 
transfer of function, and contextual control. 


Class Formation 


The outcome of primary importance in any study of equiva- 
lence is demonstration of the emergent performances that 
indicate stimulus interchangeability and thus the formation of 
equivalence classes. The defining criteria for interpretations 
of equivalence have traditionally included strong and stable 
class-consistent responding on reflexivity, symmetry, and tran- 
sitivity/equivalence probe trials in the absence of reinforcement 
(e.g., Sidman & Tailby, 1982; see the dashed lines in Figures 
19.3 and 19.4 for illustration of these critical tests). The most 
common measure of class formation has been the percentage 
of probe trials on which a comparison selection is consistent 
with equivalence, for each defining property. For example, after 
training AB and AC conditional discriminations, stimulus Cl 
would be presented as the sample on some probe trials; selecting 
Cl, Al, or B1 would be consistent with equivalence-class for- 
mation on probes for reflexivity, symmetry, or combined tests 
for equivalence, respectively. (Again, see Figure 19.4.) 

To illustrate, Figure 19.5 provides data from one of 
Sidman’s original participants (Sidman & Cresson, 1973). The 
three panels present outcomes on the test types of particular inter- 
est in that study: word-name relations (i.e., textual responses, or 
CA oral naming) in the top panel, with picture—printed word (BC) 
and printed word—picture (CB) relations (i.e., reading compre- 
hension) in the middle and bottom panels, respectively. The trial 
types were intermixed in the actual test administration but are 
separated in this figure so that each performance can be followed 
individually. Each bar represents the percentage of responses on 
a given probe-trial type (CA, BC, or CB) that were consistent 
with the potential equivalence classes, for each of a series of test 
administrations. (As a reminder, 20 three-member classes were 
made possible by the training, where each class included a spoken 
word, a picture, and a written word.) 

In this study, four pretest administrations were conducted 
over the course of approximately | year. This type of pretest- 
ing is often required in studies of equivalence-based instruction 
when the stimuli targeted for training are common ones found 
in everyday environments. Pretest results play an important 
role in judging whether the performances of interest may have 
been due to learning that occurred outside the programmed 
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Figure 19.5 Percentage of trials on which class-consistent 
comparisons were selected for each of three probe types, 
CA (top panel), BC (middle panel), and CB (bottom panel). 
Each bar or pair of bars represents data from a single test 
administration. Open bars show performances on trials with 
words that had not been involved yet in training; filled bars 
show performances on trials with words that had been part 
of training. 


Adapted from “Reading and Crossmodal Transfer of Stimulus Equiv- 
alences in Severe Retardation,” by M. Sidman and O. Cresson, 1973, 
American Journal of Mental Deficiency, 77, p. 515-523. 


instruction under evaluation. In the present case, the percentage 
of class-consistent responses was consistently low for each of 
the three probe types across the four pretests, never exceeding 
chance levels of accuracy, despite repeated practice with each 
test and despite the fact that the 20 spoken words were being 
matched with their appropriate pictures (that is, the AB relations 
had already been learned). Thus, pretest results indicated that the 
performances of interest were not improving as a result of ongo- 
ing extra-experimental experiences for this participant. The AC 
spoken word—written word relations were taught in three sepa- 
rate instructional phases involving 9, then 14, and then all 20 of 
the words. A posttest with all 20 words followed each teaching 
phase. Results from the first two posttests are presented with two 
bars. The filled bars indicate scores on trials with written words 
that had been included in the preceding teaching phase, and the 
open bars indicate scores on trials with written words that had 


not yet been taught. By the third posttest, all 20 of the written 
words had been included in training. Posttest results were similar 
across the probe-trial types. The percentage of class-consistent 
responses increased well over pretest levels for each posttest, 
but only when the trials included a word from the training set. 
Responding remained at approximately chance levels for those 
trials involving an untrained word. This important finding indi- 
cates that improvements were specific to the instructional proce- 
dure, as opposed to some other extraneous factor (e.g., increased 
motivation or task familiarity, classroom experiences outside the 
experiment). By the final posttest, class-consistent responding 
was very strong, at 80% or better, for all probe types, indicating 
the formation of 20 three-member equivalence classes. 

This example illustrates well the type of data and the 
experimental reasoning that characterize conclusions about 
equivalence-class formation. While the specific criteria required 
to make judgments about class formation vary slightly across 
studies, depending on procedural details, they are always quite 
high (e.g., 90% class consistency or greater across probe types), 
and often include further specification that the number of incon- 
sistent responses made on any given probe type is low (e.g., no 
more than a single error per probe type). 

With many participants, equivalence outcomes are 
observed immediately, on the very first probe tests. However, 
another relatively common finding, known as delayed emer- 
gence, is that probe responses become increasingly consistent 
with class formation as testing continues, sometimes beginning 
at chance levels and improving to perfect or near-perfect class 
consistency across test administrations, despite the absence of 
consequences on these trials (e.g., Sidman, Kirk, & Willson- 
Morris, 1985; Spradlin, Cotter, & Baxley, 1973). In fact, the 
data from Figure 19.5 illustrate this pattern. Even for the words 
that had been included in training, the percentage of class- 
consistent probe responses increased across the successive 
posttests. Delayed emergence has important implications for 
conclusions about equivalence “failures” when probe exposure 
has been limited, and its frequent occurrence suggests that 
tests should be repeated when initial performances are weak, 
especially if the data suggest an improving trend. 


Class Expansion and Class Merger 


If equivalence-based instruction is to provide a basis for gen- 
erating truly functional repertoires, class size becomes an issue 
of interest. In the studies described thus far, three-member 
equivalence classes (e.g., a spoken word, picture, and written 
word; Sidman, 1971; Sidman & Cresson, 1973) were pro- 
duced, this being the minimum number of members necessary 
to demonstrate all equivalence properties. While clearly valu- 
able, and often a huge improvement from beginning skill lev- 
els, three-member classes might not take a child (or adult) very 
far toward fluency in verbal behavior, numeracy, coin usage, 
categorization, and so on. Fortunately, equivalence outcomes 
known as class expansion and class merger have been studied 
and documented extensively (e.g., Saunders, Saunders, Kirby, 
& Spradlin, 1988; Saunders, Wachter, & Spradlin, 1988), pro- 
viding strong evidence that classes can be increased to include a 
potentially unlimited number of members. Importantly, as class 
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size increases, the number of generative or emergent perfor- 
mances also grows exponentially. 

In class expansion, a new member is added to demon- 
strated equivalence classes, typically by teaching a new 
conditional discrimination. One member from each of the 
original classes serves as either sample or comparison in the 
new training, and a novel stimulus set serves in the alter- 
native role. To illustrate, consider again the equivalence 
classes demonstrating Sidman’s reading work, where each 
class included a spoken word, a picture, and a written word 
(A1B1C1, A2B2C2, etc.). Suppose that for the young men 
involved, learning the corresponding signs from American 
Sign Language would also have been useful for their liv- 
ing circumstances. EBI could be used to expand their three- 
member classes to four members by teaching the young men 
to match the signs to spoken words. In other words, the signs 
could be presented as comparison stimuli (e.g., D1, D2, D3) 
with the spoken words serving as samples (e.g., Al, A2, A3), 
and an AD conditional discrimination could be taught by rein- 
forcing the appropriate comparison selections. Figure 19.6 
illustrates the new training with a solid gray arrow. Class 
expansion would be evidenced by equivalence-consistent 
responses (e.g., selecting the appropriate sign given a picture 
or written word; selecting the appropriate written word or 
picture given the sign) on probe trials DA, BD, CD, DB, and 
CB. Figure 19.6 illustrates all of the newly possible emergent 
relations with dashed gray arrows. Note that after teaching 
one additional conditional discrimination (AD), a total of six 
new relations are created for each equivalence class (i.e., AD, 
DA, BD, DB, CD, and DC), dramatically increasing the 
impact of a small amount of teaching. Instructing additional 
conditional discriminations, one by one, can produce a grad- 
ual increase in class size from three, to four, to five members 
or more, even with young children (e.g., Lazar, Davis-Lang, 
& Sanchez, 1984). 


D Stimuli 
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Figure 19.6 Schematic representation of trained and tested 
conditional discriminations. The solid gray arrow indicates 
the new conditional discrimination training, and the 
dashed gray arrows represent the new emergent relations 
made possible by the training. 
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In class merger, independent equivalence classes can be 
combined to produce a larger class by teaching a new but interre- 
lated conditional discrimination. Consider the teaching arrange- 
ment illustrated in Figure 19.7, which might be appropriate for 
young children or individuals with intellectual disabilities who 
lack coin skills. Here, teaching AB and AC conditional discrimi- 
nations could result in two three-member equivalence classes, one 
including the spoken word, “Dime,” the printed stimulus, 10¢, and 
an actual dime (i.e., A1B1C1), and the other including the spoken 
word, “Quarter,” 25¢, and an actual quarter (1.e., A2B2C2).? As 
usual, we would look for evidence of these classes having formed 
by presenting the test trials represented by the dashed lines. Next, 
two additional conditional discriminations, DE and DF, are taught. 
Again, performances on the tests represented by the dashed lines 
would evaluate the formation of two new equivalence classes, one 
including the spoken “10 cents,” two nickels, and five pennies plus 
a nickel (i.e., DIE1F1), and the other including “25 cents,” two 
dimes plus a nickel, and five nickels (i.e., DIE2F2). At this point 
a total of four independent three-member classes have been made 
possible by the training. 

Once class-consistent probe performances confirm the 
formation of these classes, Figure 19.8 illustrates an additional 
teaching step that could result in class merger. The bold solid 
arrow shows that one more conditional discrimination—CE— 
is taught. Given 10¢ or 25¢ as a sample stimulus, reinforcers 
would follow selection of two nickels or two dimes plus one 
nickel, respectively. 

Class merger describes an outcome in which stimuli from the 
originally independent classes AIB1C1 and D1E1F1 are matched to 
each other, such that a single six-member class (AIB1C1D1EIF1) 
emerges. In our example, the spoken words, “Dime” and “10 cents,” 
the written stimulus, 10¢, and each appropriate coin arrangement, 
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whether a dime, two nickels, or five pennies plus a nickel, could all 
prove to be completely interchangeable. Similarly, the six stimuli 
relevant to the “quarter” class could merge and become interchange- 
able for a second six-member class (A2B2C2D2E2F2). In short, the 
four original equivalence classes, composed of three members each, 
could give rise to two six-member classes. Figure 19.9 represents 
with dashed gray arrows all of the emergent relations made possi- 
ble by teaching the CE conditional discrimination, and those that 
would have to be tested to reach conclusions about class merger. 
Positive, class-consistent responding across the various probe types 
would provide not only a basis for important functional skills in 
coin usage but also exactly the evidence necessary for inferences 
about understanding or comprehension of the symbolic relations 
between these stimuli. The network of new performances illustrated 
also paints a striking picture of the generative potential of equiva- 
lence-based instruction, especially as class size increases. Indeed, 
the balance of solid arrows (directly taught relations) to dashed 
(emergent relations) for the figure as a whole captures nicely the 
“bang for your teaching buck” impact of equivalence-based teaching 
procedures. 

Class merger outcomes like those just described have been 
demonstrated repeatedly in the equivalence literature, and with 
many populations, including typically developing children and 
adults (e.g., Sidman et al., 1985; Johnson, Meleshkevich, & 
Dube, 2014), as well as participants with intellectual disabili- 
ties (e.g., Lane & Critchfield, 1998; Saunders, Saunders, Kirby, 
& Spradlin, 1988). To date, few limits have been identified for 
the size of equivalence classes that can be generated through 
expansion or merger, although this would likely depend on the 
particular participant or participant population. Indeed, it has 
even been suggested that measures of the number of members 
that can be added to an equivalence class, either sequentially (as 
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Figure 19.7 Schematic representation of 
trained (solid arrows) and tested (dashed 
arrows) conditional discriminations for 
two training phases, one for AB and AC 
conditional discriminations and one for DE 
and DF conditional discriminations. 
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Figure 19.8 Schematic representation of 
trained (solid arrows) and tested (dashed 
arrows) conditional discriminations for the 
first two training phases, as well a third 
training phase. The bold solid arrow indicates 
the added CE conditional discrimination 
training. 
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tation of trained (solid arrows) and tested (dashed arrows) conditional 
hases. The bold solid arrow indicates the added CE conditional 


discrimination training that made class merger possible. The dashed gray arrows indicate the many 
relations that could result from class merger. 


in class expansion) or simultaneously (as in class merger), might 


provide a more valid assessment of cognitive development or of 


recovery from brain injury than the standardized tests currently 
in use (e.g., Sidman, 1986a; 1994). 


Transfer of Function 


A third major finding from research on stimulus equivalence, 
described as transfer of function, adds even further power to the 
generative potential of EBI. When a group of stimuli are said to be 
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equivalent, the implication is that each of the stimuli in the group 
will function in the same manner: that each is interchangeable with 
any other. As we have reviewed, this interchangeability of function 
is assessed by tests of reflexivity, symmetry, and transitivity/equiv- 
alence in which each member of the class proves to serve equally 
well in relation to any other member of the class, no matter which 
stimulus combinations are presented as sample and comparison. 
But what if a new function—that is, a stimulus-control effect other 
than that of sample or comparison in a match-to-sample task—was 
taught for one member of an equivalence class? If the stimuli that 
compose a given class are truly equivalent, it would follow that 
the new function should be shared by all members of the class—in 
other words, that the function should transfer to all class members. 

Let’s continue with the coin-equivalence example from 
Figure 19.9 and assume that our students have demonstrated the 
two possible six-member classes. As a next teaching step, we’ ll 
establish a new function for the dime and the quarter (B1 and B2). 
First, we’ll teach that the dime can be traded for two pieces of candy 
and that the quarter can be traded for five pieces of candy. When 
this lesson is mastered, we'll provide a choice between the dime 
and the quarter: “Which one would you like to have?” Selection of 
the quarter would confirm that our teaching procedure established a 
new function for each coin—that the quarter is better than or worth 
more than the dime. Subsequent tests would then present the same 


choice with each pair of stimuli (Al and A2, Cl and C2, D1 and 
D2, etc.). Consistent selections of A2, C2, D2, E2, and F2 (instead 
of Al, Cl, D1, El, or Fl) would demonstrate the transfer-of- 
function effect in that each member of the quarter class is preferred 
to, or better than, a member of the dime class, despite the absence 
of direct “value” training for any of these class members. 

The transfer-of-function effect has been demonstrated in a 
great many ways, shapes, and forms, and for many different stim- 
ulus functions (e.g., Green, Sigurdardottir, & Saunders, 1991; 
Kohlenberg, Hayes, & Hayes, 1991; Smeets & Barnes-Holmes, 
2003), highlighting the possibilities for emergent equivalence-based 
performances that go far beyond the matching-to-sample context. 
Indeed, transfer of function is arguably among the most import- 
ant advantages of classes or categories; once a class has formed, 
anything new learned about one member of the class can hold for 
all members of the class, without additional training. However, as 
illustrated in Box 19.2, transfer of function can be implicated in 
the spread of both adaptive and problematic response patterns. The 
transfer effect is accounted for from the perspective of equivalence 
in terms of class merger (e.g., between the established equivalence 
class members and the class of stimuli that serves a particular func- 
tion; e.g., Sidman, 1994, 2000). Equivalence-based applications 
designed to take advantage of transfer-of-function effects are still 
somewhat limited, but the potential is clear. 


BOX 19.2 


Are Equivalence Classes Ever a Problem? 


A chapter on designing and implementing EBI will necessarily 
focus on productive uses of this teaching approach. A completely 
fair question, however, is whether equivalence-class formation 
might ever have a negative impact on an individual’s function- 
ing. As you might guess, in a world in which bad things exist, 
equivalence classes that include bad things as members are also 
a possibility. And as we’ve just reviewed, if one member of an 
equivalence class has a particular function (e.g., if one member 
is “bad”), the other members of the class are likely to share that 
same function—the transfer-of-function effect. As a result, phys- 
ically dissimilar stimuli might come to be treated as “bad” even if 
they have never had any direct pairing or correlation with aversive 
events. Of course, establishing classes of bad things can often be 
useful, even life-saving. Any parent would be delighted if her child 
immediately avoided items marked with the signs shown in Figure 
19.10, after learning to steer clear of POISON, for example. 

In contrast, consider a case in which a college student 
experiences some difficulty while taking a test in a biology class. 
For question after question, she realizes that her study the night 
before was far too superficial, and she cannot compose a single 
coherent answer. As her heart begins to race and her breathing 


ALIN 


becomes more rapid, her thinking is increasingly distracted. She 
rushes from the room with most of the test booklet still incom- 
plete, and thus escapes the unbearable situation. We could all 
agree that this scenario describes a highly aversive experience. 
Unfortunately for the student involved, the aversiveness doesn’t 
stop there. She begins to notice the same increased respiration and 
heart rate with respect to other classes, and not only when taking 
a test. Preparing for a test, or just the announcement of a quiz, 
exam, or final, can initiate the cycle of intense emotions. Study 
patterns are disrupted and absences increase, both for exams and 
for regular class days (pop-quiz possibilities . . . ). Course syllabi 
with test dates and final exam schedules even become difficult to 
look at, and registering for a new semester looks bleak. 

Patterns like this one—often described in terms of test 
anxiety—are not uncommon among college students. How 
might equivalence classes play a role? One possible interpre- 
tation (e.g., Augustson, Dougher, & Markam, 2000; Dougher, 
Augustson, Markham, Wulfert, & Greenway, 1994; Friman, 
Hayes, & Wilson, 1998) goes as follows: For college students, 
it is not hard to imagine an equivalence class that includes actual 
tests; the spoken and written words “Test,” “Exam,” “Quiz,” and 


Figure 19.10 


“Final”; materials describing schedules of such events; and even 
larger academic units comprising these events (like courses or 
semesters). The various members of this suggested class share 
few physical features but may be interchangeable within a given 
context. To the extent that they are equivalent, then, a frightening 
experience with one member of the class could result in strong 
emotional reactions to other members of the class; that is, the 
function of one member could transfer to them all. 

The feasibility of this conceptual account of anxiety 
has received some strong support from laboratory models. To 
illustrate (Dougher et al., 1994), a group of college students 
demonstrated the formation of two four-member equivalence 
classes (AIB1C1D1 and A2B2C2D2) following match-to- 
sample training with abstract stimuli. Trials were then pre- 
sented with either stimulus B1 or B2 on the computer screen. 
When B1 appeared, a brief electric shock was also presented; 
when B2 appeared, no shock occurred. In the next phase of 
the study, the other class members were presented, one at a 
time. Stimuli Cl and D1 elicited skin conductance responses 
(a common laboratory measure of emotional reaction) similar 
to those produced by B1, while C2 and D2 did not (Dougher 
et al., 1994), demonstrating the transfer of an elicitation 
function from B1 to Cl and D1. In a second experiment, the 
eliciting function of B1 was extinguished by presenting that 


Contextual Control 


Anessential characteristic of many everyday classes or categories 
is that class composition can shift, sometimes dramatically, depend- 
ing on the context. For example, the events that might be included 
in a class of “good plays” would be altered considerably if conver- 
sation changed from sports to theater. Similarly, a list of musicians 
would be sorted quite differently, depending on whether one was 
asked about musical genre, time period, or nationality. In other 
words, the very same stimuli can be members of different classes, 
depending on the context. And, of course, we often note that the 
meaning or function of a word or phrase depends on the context in 
which it occurs: We respond differently to the word “right” when 
we follow directions to a destination and when we receive feed- 
back on an answer to an exam question. If the study of equivalence 
classes is to be useful in our understanding of naturally occurring 
classes and categories, including language, this same sort of flex- 
ibility in class membership must be captured—and it has been, 
in investigations of contextual control over equivalence classes 
(e.g., Bush, Sidman, & de Rose, 1989; Lynch & Green, 1991; 
Wulfert, Greenway, & Dougher, 1994). Box 19.3 describes the role 
of contextual control in some common (pun intended) forms of 
humor. 

Contextual control in these examples is the result of a more 
complex teaching arrangement known as a five-term contin- 
gency (e.g., Sidman, 1986b, 2000), where the match-to-sample 
performances described previously are brought under the control 
of an additional antecedent stimulus. Following from our musi- 
cians example, with a context of “Nationality” and the Rolling 
Stones as a sample (A1), Fairport Convention (B1) would be the 
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stimulus multiple times in the absence of shock. Subsequent 
tests with each stimulus presented individually showed that the 
eliciting function of Cl and D1 had also extinguished. These 
interesting results were interpreted as a possible model of pho- 
bic response patterns, in which relatively innocuous settings or 
events can come to occasion severe, even debilitating, anxiety 
and fear, despite the sufferer reporting no bad experiences with 
the setting or event at issue. The study confirms that a nega- 
tive experience need not have taken place in the presence of a 
current anxiety trigger, depending on the stimuli with which 
it shares equivalence-class membership. That extinction func- 
tions could also transfer across class members is potentially 
relevant to the effectiveness of certain clinical treatments for 
phobias (Dougher et al., 1994), such as systematic desensiti- 
zation (e.g., Koegel, Openden, & Koegel, 2004; Wolpe, 1961). 
In sum, that same generativity of equivalence classes that 
provides for great effectiveness in creating productive reper- 
toires has the potential to result in an equally broad impact of 
unwanted learning experiences. A similar analysis has been 
applied to social stereotypes—both positive and negative— 
where learning about a characteristic of one member of an 
established group is then extended to other individuals on 
the basis of nothing more than their group membership (e.g., 
Kohlenberg et al., 1991; Moxon, Keenan, & Hine, 1993). 


appropriate choice (both bands are British); with Bob Dylan as a 
sample (A2), The Doors (B2) would be the best match (as Amer- 
ican musicians). Conversely, if the conversation topic (1.e., the 
context) turned to music genre, The Doors (B2) would be matched 
to the Rolling Stones (A1; both rock), while Fairport Convention 
(B1) would be matched to Bob Dylan (A2; as folkies). 

Said a bit more technically, selecting a given comparison 
stimulus (e.g., B1) is reinforced in the presence of a particu- 
lar sample (e.g., Al) in one condition (i.e., Context 1), while 
selecting a different comparison stimulus (e.g., B2) in the pres- 
ence of the same sample (e.g., Al) is reinforced in a different 
condition (i.e., Context 2). Similarly, selecting B2 is reinforced 
given sample A2 in Context 1, while selecting B1 is reinforced 
in Context 2. The correct match thus depends not only on the 
sample stimulus presented but also on the context. The ques- 
tion of interest for present purposes is whether training condi- 
tional discriminations under contextual control will give rise 
to equivalence classes that are also under contextual control. 

To date, work with contextual control over equivalence 
classes has primarily involved typical adult participants, so extend- 
ing this research to other participant populations will be important. 
However, there has been some use of contextual control procedures 
in equivalence-based instructional packages designed for college 
coursework. For example, Fienup and Critchfield (2010; see also 
Fienup, Critchfield, & Covey, 2009) taught concepts of inferential 
statistics and hypothesis testing to college students using a com- 
puterized equivalence approach, with tests on the lesson content 
presented before and after training. Figure 19.11 illustrates the 
teaching and testing arrangement, with trained relations shown 
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BOX 19.3 


Why Are Puns Funny? 


A pun is defined as “the usually humorous use of a word in such 
a way as to suggest two or more of its meanings or the meaning 
of another word similar in sound” (Merriam-Webster). Some- 
times puns make us laugh, other times they make us groan; we 
may even apologize for a pun (e.g., as in “no pun intended”), 
as if punning is in bad taste. It has been argued variously that 
“Puns are the highest form of literature” (attributed to Alfred 
Hitchcock) and that “Puns are the last refuge of the witless” 
(attributed to Samuel Johnson), but authors from Shakespeare 
to Lewis Carroll to James Joyce to Oscar Wilde have all been 
recognized for their puns. Why would this form of word play 
evoke such different reactions, and what is it really that tickles 
us, or not, when puns are used? 

Consider these examples: 


¢ I wondered why the baseball was getting bigger. Then 
it hit me. 

¢ I saw an ad for burial plots, and thought to myself, this 
is the last thing I need. 

¢ “Denial ain’t just a river in Egypt.” (Mark Twain) 

¢ A Freudian slip is when you say one thing but mean 
your mother. 

¢ You matter. Unless you multiply yourself by the speed 
of light. Then you energy. 

* “I went to a place to eat. It said “breakfast at any time.’ 
So I ordered French toast during the Renaissance.” 
(Stephen Wright) 

¢ And my personal favorite—“Time flies like an arrow. 
Fruit flies like a banana.” (Groucho Marx) 


Are we groaning yet? In everyday terms, we could 
describe each of the instances above as including at least one 
word with multiple meanings, either for the same word (e.g., 


“matter” can refer to physical substance or to importance, 
“flies” can refer to the insect or to speed) or for one that sounds 
the same (e.g., “denial” and “the Nile,” or “your mother” and 
“another”). In more technical terms, we could say that, in each 
case, the word or words on which the joke depends are members 
of more than one equivalence class. For example, the spoken 
and written word “flies” is a member of one equivalence class 
that includes nouns such as the Latin name Diptera; a wide 
range of winged insects with large compound eyes like house 
flies, horse flies, and tsetse flies; and descriptors like “‘pollina- 
tors” or “pests.” It is also a member of an equivalence class that 
includes verbs that specify moving through the air, moving at a 
high speed, moving freely, or simply passing quickly. Typicaily, 
when a given word has more than one meaning (as most do), 
the context in which the word occurs determines its function, 
and we have seen how contextual control can be responsible 
for determining the class membership of a particular stimulus 
(i.e., in this case, whether “flies” is responded to as a member 
of its noun class or its verb class). 

The humor of puns, then, can come about in at least two 
ways. First, either an ambiguous context is provided such that 
more than one class membership is possible, or the context is 
explicitly appropriate to more than one equivalence class; the first 
two examples above illustrate this type of pun. Alternatively, the 
pun can be deliberately set up to evoke conflicting classes, as in the 
last three examples (e.g., Groucho first sets the stage for the verb 
class of “flies,” and then switches to the noun class without much 
additional warning or context). Where skillful writers or orators 
typically use context to make the impact of their words more pre- 
cise, puns from this latter category manipulate the reader/listener’s 
response in opposing directions, thereby creating a smile, but per- 
haps also the cha-grin of those who are not fans of puns. (Sorry.) 
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by dark arrows and tested relations shown by gray arrows. Les- 
son | established two three-member equivalence classes involving 
basic concepts related to inferential statistics (AIB1C1 included 
“low p value,” “statistically significant,” and “p < .05”; and 
A2B2C2 included “high p value,” “not statistically significant,” 
and “p > .05”). Lesson 2 did the same with respect to reaching 
statistical conclusions based on a scientific hypothesis and the direc- 
tion of an effect in the data (i.e., DIEIF1 and D2E2F2, where D 
stimuli described a hypothesis and a result, E stimuli were “consis- 
tent...” or “not consistent with hypothesis,” and F stimuli were 
“reject...” or “fail to reject the Null Hypothesis”). Finally, the 
top panel of Figure 19.12 illustrates Lesson 3, in which contextual 
control by the A stimuli (“low p value” or “high p value”) over DE 
and DF matching was trained. Note that the correct E or F choice 
depended on both the D sample and the A contextual stimulus. In 
other words, selecting “consistent with Scientific Hypothesis” or 
“reject the Null Hypothesis” was reinforced only in the presence 
of “effect matches hypothesis prediction” and “low p value”. For 
any other combination of sample and contextual stimulus, selection 
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of “not consistent with Scientific Hypothesis” or “fail to reject the 
Null Hypothesis” was reinforced. The second and third panels of 
Figure 19.12 illustrate the test trials used to evaluate the transfer of 
contextual control to the B and C stimuli. 

All 10 students demonstrated the ABC and DEF equiva- 
lence classes made possible by Lessons | and 2, as well as the 
relations trained under contextual control in Lesson 3, improving 
from moderate pretest scores at best to perfect or near-perfect 
scores in the posttest. Nine of the 10 students also demonstrated 
high percentages of emergent contextual control by the B and C 
stimuli. Figure 19.13 presents scores for each student on the two 
contextual transfer tests (illustrated in Figure 19.11) for all trials 
(left panels) and for those trials that required control by Lesson 3 
(right panels). In this demonstration, then, many sophisticated 
performances emerged (4.7 times more than the number of rela- 
tions directly taught) after approximately 45 minutes of training 
time. What’s more, these emergent performances reflected an 
understanding of the interrelations that define basic statistical 
reasoning, even in the absence of instructor explanations—an 
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with emergent contextual control for each student on the two contextual transfer tests 
(illustrated in Figure 19.11) for all trials (left panels) and for those trials that required 


control by Lesson 3 (right panels). 
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impressive illustration of the generativity and nuance made pos- 
sible with contextual control. 


DESIGNING EQUIVALENCE-BASED 
INSTRUCTION 


Equivalence outcomes are impressive, but they do not happen 
by magic. Research has shown a number of training and testing 
variables to be important for the successful generation of equiva- 
lence classes, whether in the laboratory or in the field. To illustrate 
these variables, this section of the chapter describes a series of 
decisions that need to be made in planning effective procedures 
for EBI, presented in the order in which they would be imple- 
mented. In point of fact, the equivalence outcome is quite robust, 
having been demonstrated with a wide variety of procedural vari- 
ations, so there is no single route to strong EBI design. A number 
of problems can be avoided with careful design, however, and 
the discussion that follows will highlight some of these potential 
complications. Table 19.1 outlines the major design decisions that 
guide the planning for EBI training and testing. 


Training Baseline Conditional Discriminations 


The most common approach to generating equivalence classes 
involves teaching a minimum of two interrelated conditional 
discriminations with the matching-to-sample procedure. 
Arranging these four-term contingencies requires a number of 


TABLE 19.1 
Decisions 


Equivalence-based Instruction Design 


|. Training Considerations 
A. What stimuli will be used and how many? 
B. How will the stimuli be presented? 


C. Will a sample- or comparison-observing response be 
required? 


. What instructions will be given? 

. How will balanced trial arrangements be ensured? 
. Are special training steps needed? 

. What training structure will be used? 


a G) Serer ae 


. What mastery (accuracy) criteria will be used for training? 
|. How will consequences be scheduled during training? 
ll. Testing Considerations 
A. How will the probe trials be presented? 
B. How will consequences be scheduled? 
C. How will the probe-trial types be ordered? 
D. What will be the criteria for equivalence-class formation? 


methodological choices, including the selection and presentation 
of stimuli, observing responses, instructions, trial-type arrange- 
ments and sequences, instructional programming, training struc- 
ture, mastery (accuracy) criteria, and consequence delivery. 


Selecting and Presenting Stimuli 


A wide range of stimulus types has been used effectively in 
equivalence research. In laboratory studies where the empha- 
S1S 1S ON experimental control, unfamiliar line drawings (e.g., 
Dougher et al., 1994), abstract figures or objects (e.g., Pil- 
grim & Galizio, 1990), letters from a foreign alphabet (e.¢., 
Sidman & Tailby, 1982), or meaningless nonsense sylla- 
bles are common (e.g., Fields, Hobbie, Adams, & Reeve, 
1999), to rule out the possibility that performances demon- 
strated on probe trials are due to experience with the stim- 
uli outside the lab. Although auditory and visual stimuli are 
probably the most convenient options, many different stim- 
ulus modalities have been represented as equivalence-class 
members, including tactile stimuli (hidden objects con- 
tacted only by touch; e.g., Bush, 1993), odors (e.g., Annett 
& Leslie, 1995), tastes (e.g., Hayes, Tilley, & Hayes, 1988), 
and drug effects (DeGrandpre, Bickel, & Higgins, 1992). 
In applications of EBI, the choice of stimuli will necessarily 
depend on the nature of the skills being targeted. 

Again, the options are virtually limitless. Early language 
training programs might include objects or pictures, spoken 
words, and written words, as illustrated in Sidman’s early work 
(Sidman, 1971; Sidman & Cresson, 1973), in either native or 
non-native languages (e.g., Joyce, Joyce, & Wellington, 1993), 
while programs designed to establish basic math skills might 
include quantities of objects, numerals, and spoken number 
names, or fractions, decimals, and pictorial representations of 
fractions (e.g., Lynch & Cuvo, 1995). In other illustrations, 
Braille letters, printed letters, and spoken letter names became 
equivalence-class members for children with degenerative visual 
disorders (e.g., Toussaint & Tiger, 2010); the photographs, office 
name-plates, and spoken names of therapists became equiva- 
lent for adults with brain damage (Cowley, Green, & Braunling- 
McMorrow, 1992); and for college students, the name of brain 
lobes, specific lobe functions, and brain diagrams became inter- 
changeable class members (Fienup, Covey, & Critchfield, 201 0). 
Applied research with stimulus sets such as these typically 
incorporate control procedures to rule out extra-experimental 
learning with the stimuli as responsible for probe performances. 
Strategies include pretests with each possible stimulus relation, 
testing all possible stimulus relations before and after teach- 
ing each of a series of subsets of the stimuli, and simultaneous 
multiple-baseline designs. 

Another important decision in conducting equivalence 
work involves the number of stimuli to use. Once again, the 
best answer here will depend on the participant population and 
the instructional goals. A number of variations are represented 
in the literature, but in many cases a reasonable starting point 
for teaching a first conditional discrimination is to present three 
different sample stimuli (i.e., Al, A2, and A3) across trials with 
three different comparison stimuli (i.e., B1, B2, and B3) on each 
trial. At first glance, beginning with three stimuli for each set 
may seem counterintuitive—why not start more simply, with just 
one or two samples and comparisons? The issue is that doing so 
can allow for the inadvertent reinforcement of stimulus-control 
patterns that conflict with those being targeted by the training. 
Consider a case in which sample Al and comparisons BI and B2 
are presented on every trial for a block of 20 trials, and selection 
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of BI produces a reinforcer on each trial. A participant might 
learn quickly to select B1, but doing so would require only a 
simple discrimination between the two comparisons. Sample A 1 
has no necessary role in determining the appropriate comparison 
selection under these conditions, as BI is always correct. Not 
only is establishing control by A1 over selection of B1 unlikely 
here, but control by sample stimuli more generally (e.g., as in 
control over the participant’s attending or observing) could be 
decreased as well, leaving conditional discrimination more dif- 
ficult to achieve in subsequent teaching steps. 

Next, consider a case in which two sample stimuli (e.g., 
A1 and A2) alternate irregularly across trials in a block with two 
comparison stimuli (e.g., Bl and B2) presented on every trial. 
Now control by the sample is necessary to produce a reinforcer 
on each trial (that is, selecting B1 is reinforced only in the pres- 
ence of sample Al, and selecting B2 is reinforced only in the 
presence of sample A2). In this case, however, it is possible for 
correct selections to be made, and thus reinforced, in either of 
two ways. With Al as the sample, for example, a child could 
learn to either select B1 (known as Type S or select responding) 
or to reject B2 (known as Type R or Reject responding) and 
simply choose whatever else is available. In the first case, the 
reinforced relation is between sample Al and comparison B1; 
in the second case, the reinforced relation is between sample A 
and comparison B2—two different forms of stimulus control 
(also described as different stimulus-control topographies; 
Dube & Mcllvane, 1996; McIlvane & Dube, 2003). The first 
form of stimulus control would be consistent with the teach- 
er’s goal; that is, stimulus-control topography coherence would 
hold between the teacher’s target and the child’s behavior. If 
only the second form of stimulus control was established, a 
lack of coherence would be present, and any resultant equiva- 
lence classes would necessarily differ from those targeted by 
the teacher as well (e.g., Al and B2 could be members of the 
same class; see, for example, Carrigan & Sidman, 1992). Using 
more than two comparison stimuli decreases the likelihood of 
exclusive Type R control, because the number of reject relations 
required to produce reinforcement on any given trial increases 
with each comparison added to the array. For example, given 
A1 as sample, both comparisons B2 and B3 would need to be 
rejected from a three-choice array, while only a single Type S 
relation (e.g., A1B1) is required to produce reinforcers (e.g., 
Sidman, 1987). In certain circumstances, limiting comparisons 
to only two or presenting only a single sample stimulus fora 
series of trials may be appropriate, but the teacher will then 
need to be watchful for evidence of conflicting stimulus con- 
trol, and additional training steps may be necessary to ensure a 
shift in stimulus control. 

Given these issues, it follows that presenting at least 
three comparison stimuli per trial is generally recommended. 
As a complete demonstration of all three properties of equiv- 
alence (i.e., reflexivity, symmetry, and transitivity) requires a 
minimum of two interrelated conditional discriminations (e.g., 
AB and AC) with at least three stimulus sets (e.g., A, B, and C 
stimuli), a total of nine different stimuli would be necessary as 
a starting point. From there, the number of stimuli per set can 
be increased as appropriate to the task at hand (e.g., remember 
that Sidman’s first studies used stimulus sets of 20 each, for a 
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total of 60 stimuli), and as we have seen, the number of sets can 
be increased as well. 

Equivalence-based instruction is also flexible with respect 
to the mode of stimulus presentation. Modern technology makes 
computerized matching-to-sample procedures relatively straight- 
forward to design and implement; both the basic laboratory and 
applied literatures are replete with examples (e.g., see Green 
& Saunders, 1998; Saunders & Williams, 1998). The sample 
stimulus is typically presented either at the top of the computer 
screen or in a center position, with the comparisons presented 
either in a horizontal row below the sample or in the corners of 
the screen. The participant’s responses (typically mouse clicks, 
key presses, or touches) are recorded automatically, with audi- 
tory and/or visual consequences programmed to follow imme- 
diately. Computer presentations have the advantage of ensuring 
procedural integrity, as all teaching and testing parameters are 
pre-programmed and consistently applied. Indeed, the promise 
of developing computerized equivalence-based teaching pro- 
grams for specific targets in educational settings (e.g., reading, 
spelling, coin equivalences) has been long noted (e.g., Sidman, 
1994), freeing teacher time from basic lessons to more individ- 
ualized instruction. To say that current application platforms and 
Internet technologies have significantly enhanced this potential 
would be an understatement (see, for example, Barron, Leslie, 
& Smyth, 2018). 

Sophisticated computer programming is hardly necessary 
for equivalence approaches, however, and many elegant exam- 
ples of “table-top procedures” have been utilized effectively. 
For example, stimuli have been presented manually in a Wis- 
consin General Testing Apparatus (Harlow, 1949), where the 
experimenter and participant sit on opposite sides of a wooden 
partition. Raising a door at the base of the partition allows a tray 
holding stimulus objects to be presented to the participant. For 
example, the sample stimulus might be positioned at the top of 
the tray, with three comparison stimuli arranged in a row at the 
bottom of the tray. A concave well below each stimulus object 
allows a reinforcer (e.g., a small edible, a penny, or a token) to 
be hidden, so that removing the appropriate comparison pro- 
vides access to the reinforcer (e.g., Pilgrim, Chambers, & Gal- 
1zi0, 1995). Similar strategies have included a large teddy bear 
or cloth screen used in place of a wooden partition to block any 
potential experimenter cuing, with stimulus cards arranged in a 
systematic format in front of the participant (e.g., Horne, Lowe, 
& Randle, 2004). In other arrangements, sample and comparison 
stimuli have been presented as pictures on a page of a three-ring 
binder or notebook (e.g., with a sample picture at the top of the 
page and comparison stimuli in a row at the bottom), where each 
page presents a new trial, or even more simply, by arranging 
individual objects or cards with stimulus pictures in a set con- 
figuration on a table in front of the participant. This flexibility in 
stimulus-presentation modality is important in allowing the EBI 
approach to be adapted to the particular needs of a given setting 
or participant. At the same time, table-top procedures carry the 
extra burden of requiring appropriate controls to prevent exper- 
imenter or teacher cuing (which could lead to false conclusions 
about emergent performance), and to ensure procedural integrity 
(the absence of which can dramatically decrease the effective- 
ness of teaching and testing methods). Tactics such as having the 


teacher always sit or stand behind the participant, and incorpo- 
rating interobserver reliability measures, are common with these 
arrangements, for example. 


Observing Responses 


Control by the sample stimulus is critical to the acquisition of 
conditional discriminations in a matching-to-sample procedure. 
For this reason, on each matching trial, standard practice is to 
require the participant to make an observing response to the sam- 
ple stimulus (e.g., a mouse click over the sample stimulus or a 
touch to the sample) once it is presented. The sample-observing 
response then results in presentation of the comparison stim- 
uli and the opportunity to access reinforcers. The observing- 
response requirement is designed to increase the likelihood that 
the participant will actually notice (i.e., observe) the sample. In 
cases where sample control proves difficult to establish, requir- 
ing a unique response for each sample stimulus has often been 
effective (e.g., see Saunders & Williams, 1998). For example, 
a participant might be required to give a different but specific 
name for samples Al, A2, and A3 (e.g., Saunders & Spradlin, 
1993; Pilgrim, Jackson, & Galizio, 2000); alternatively, a differ- 
ent gesture (e.g., a wave, a thumbs up, and a clap) or a different 
schedule performance (e.g., a fixed ratio [FR] and a differential 
reinforcement of low rates [DRL] schedule in a two-compar- 
ison procedure; Sidman et al., 1982) could be required in the 
presence of each sample. The advantage of a differential sample 
requirement is that it requires successive discrimination between 
the sample stimuli that are presented on different trials. 

Similarly, requiring an observing response to each com- 
parison stimulus prior to a final comparison selection can 
sometimes prove beneficial, especially if concerns arise about a 
participant making choices impulsively, without viewing each of 
the alternatives. For example, touching or naming each compar- 
ison in the array prior to the selection response should increase 
contact with each of the choices, and thereby increase the like- 
lihood of control by the correct comparison. 


Instructions 


Task instructions in conditional discrimination training have 
varied from simple (e.g., “Touch one”; Lynch & Green, 1991) 
to more detailed (e.g., “Pick the one that matches” or “When you 
see this [A1], choose this [B1]”; Pilgrim, Jackson, & Galizio, 
2000; Saunders, Saunders, Kirby, & Spradlin, 1988), and the 
best instruction set will again necessarily depend on instruc- 
tional goals and the participants involved. For basic laboratory 
research in which the experimental question focuses on the role 
of the reinforcement contingency in generating equivalence, 
adding instructional control to the mix will complicate the anal- 
ysis. Conversely, when producing an applied outcome is the sole 
focus and the isolation of controlling variables is a less important 
issue, a combination of instructions or other prompts and rein- 
forcement contingencies can enhance acquisition of the condi- 
tional relations necessary to give rise to emergent performance. 
With some populations, such as young children or individuals 
with intellectual disabilities, reinforcement contingencies alone, 
in the absence of other programming, can be insufficient for 
establishing arbitrary conditional discriminations (e.g., Pilgrim, 
Jackson, & Galizio, 2000; Augustson & Dougher, 1992). As 


long as the teacher ensures that the trained conditional discrim- 
inations are maintained once instructional prompts are removed, 
the groundwork is laid for equivalence outcomes. 


Arranging Trial Types and Sequences 


Careful attention to the balanced composition of trial types, as 
well as the order in which those trial types are presented within 
a block of trials, is essential to preventing a number of forms of 
competing stimulus control from interfering with conditional 
discrimination acquisition. Several standard practices have 
been developed specifically to guard against such issues. Let’s 
say that we want to teach an AB conditional discrimination, 
such that comparison stimuli B1, B2, and B3 are selected in 
the presence of sample stimuli Al, A2, and A3, respectively. 
Three different trial types would be needed, one with each of the 
sample stimuli; the same three comparisons would be presented 
as choices with each sample. These three trial types should be 
presented the same number of times within a training trial block, 
to ensure an equal opportunity for reinforcement with each com- 
parison. Imbalance here could create a bias toward selecting one 
(or more) comparison(s) over the others. Further, the position 
of the comparison stimuli should be counterbalanced across tri- 
als, such that each stimulus appears in each possible position 
on an equal number of trials. Imbalance here could result in a 
conditional relation between the sample stimulus and a particu- 
lar position, rather than a particular comparison. Similarly, the 
position of the correct or S+ comparison stimulus should be 
counterbalanced across trials, such that the reinforced choice 
appears in each position on an equal number of trials. Failure to 
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do so could again result in a preference for one stimulus location 
over the others. Finally, we would typically set a limit on the 
number of consecutive trials allowed within a block (e.g., no 
more than two or three trials in a row) (1) with the same sample 
stimulus, so that control by the sample is ensured, and (2) with 
the S+ comparison in the same position, so that responding to a 
particular position will not be inadvertently strengthened. With 
these constraints met, the order of trials within a block should 
otherwise vary unsystematically. To illustrate, the top half of 
Table 19.2 presents nine trials balanced with respect to the num- 
ber of each trial type and the position of each comparison stim- 
ulus, including the reinforced or S+ choice. The bottom panel 
of Table 19.2 shows the same trials in a semi-randomized order 
that could prevent inadvertent reinforcement of sequences of 
stimulus or position selections. Consecutive trial blocks and/or 
teaching sessions should also use a newly randomized sequence 
of trials (maintaining the same constraints) so that a simple 
chain of responses cannot be established. 

In sum, planning these details of trial arrangements prior 
to their presentation will help to ensure stimulus-control topog- 
raphy coherence—in this case, conditional control by the sam- 
ple stimuli over specific comparison selections. Optimally, the 
number of trials presented in a trial block should be sufficient to 
allow for balance along each of the dimensions outlined above, 
while the number of trials or trial blocks to be completed within 
a single session will necessarily depend on the participant pop- 
ulation. For example, young children may lose interest in the 
task after a relatively limited number of trials, so blocks ranging 
from 9 to 27 trials are common, while college students will often 


TABLE 19.2 Arranging Balanced Trial-block Composition—Training 


nnn 


Sample Stimulus 
Left Position 


Al B1 
Al B3 
Al B2 
A2 B1 
A2 B3 
A2 B2 
A3 B1 
A3 B3 
A3 B2 
A3 B3 
A2 B1 
Al B1 
A3 Bi 
A2 B2 
A3 B2 
Al B2 
Al B3 
A2 B3 


Comparison Stimuli 


Center Position Right Position 


B2 B3 
B1 B2 
B3 B1 
B2 B3 
B1 B2 
B3 BI 
B2 B3 
B1 B2 
B3 B1 
BI B2 
B2 B3 
B2 B3 
B2 B3 
B3 B1 
B3 BI 
B3 B1 
B1 B2 
BI B2 


Note: Italicized boldface indicates the S+ or reinforced comparison selection. 
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work for periods of | to 3 hours and completion of many trial 
blocks is possible. 


Instructional Programming 


As noted above, for some participants or participant populations, 
teaching techniques in addition to the matching-to-sample rein- 
forcement contingencies are necessary to produce mastery of the 
baseline arbitrary conditional discriminations. Verbal instructions 
have already been mentioned; other enhancements have included 
various prompting procedures, modeling the correct choices (e.g., 
Michael & Bernstein, 1991), and correction procedures, whereby 
a given trial is repeated until a correct choice is made (e.g., see 
Green & Saunders, 1998), for example. Some approaches have 
incorporated a systematic introduction of specific training-trial 
types designed to target particular dimensions of conditional 
control, in order to facilitate acquisition of the required baseline 
relations; these include teaching the component simple discrim- 
inations, stimulus-control shaping, exclusion training, and pro- 
grammed introduction of the multiple conditional discriminations. 


Teaching the Component Simple Discriminations. This train- 
ing approach is based on the recognition that accurate condi- 
tional discrimination performances require two different types 
of simple discriminations, in addition to conditional control by 
the sample (e.g., Carter & Eckerman, 1975). These include suc- 
cessive discrimination between the sample stimuli, which are 
presented sequentially across trials, and simultaneous discrim- 
ination between the comparison stimuli, which are presented 
together as the choice array on a given trial. While the former 
often proves more challenging than the latter (e.g., Carter & 
Eckerman, 1975), assuming either can be problematic, particu- 
larly with young children or individuals with intellectual disabil- 
ities, and the absence of either will necessarily result in failure 
on the conditional discrimination task. 

One teaching sequence designed to ensure all of the neces- 
sary component discriminations introduces a series of pretraining 
steps. Simple simultaneous discrimination is taught first with the 
stimuli that will serve as comparisons (e.g., B1, B2, and B3). Each 
stimulus serves as the S+ (that is, selection of that stimulus is 
reinforced) across phases (e.g., selecting B1, then B2, and then B3 
is reinforced), and a new phase begins only after reliable selection 
of the S+ has been demonstrated (Saunders & Spradlin, 1993). In 
the second pretraining step, the stimuli that will eventually serve 
as samples (e.g., Al, A2, and A3) are presented individually, one 
at a time, over a series of trials, and a different response (e.g., 
naming, patterns of button pressing) is shaped and reinforced 
in the presence of each until this discriminative performance is 
strong (e.g., Saunders & Spradlin, 1989, 1993). In the third step, 
conditional discrimination trials are presented for the first time 
in blocks, such that sample Al is presented for some number of 
trials in a row with B1, B2, and B3 as comparisons. A block of 
trials with sample A2, and then with A3, follows, with the same 
comparisons and for the same number of trials. The differential 
sample responses are still required, and the number of trials in a 
block is gradually decreased until the trial types with Al, A2, and 
A3 as sample are thoroughly intermixed. In this way, control by 
the sample stimulus alone becomes increasingly necessary. At this 
point, the procedure is a standard arbitrary matching-to-sample 


arrangement with accurate conditional responding established 
step by step. 


Stimulus-control Shaping. Two approaches in this category 
resemble fading or errorless discrimination procedures by first 
teaching an easily acquired discrimination and then shifting 
gradually to a more difficult arbitrary conditional discrimina- 
tion. In both cases, identity matching-to-sample serves as the 
starting point. Zygmont, Lazar, Dube, and MclIlvane (1992) 
demonstrated the effectiveness of stimulus-control shaping with 
two 4- to 6-year-old typically developing children and two indi- 
viduals with moderate to severe intellectual disabilities, none of 
whom had been successful in arbitrary match-to-sample train- 
ing. The shaping programs began by first establishing identity 
matching with two abstract black-and-white-line drawings as 
sample and comparison stimuli; trials with each sample alter- 
nated here and throughout all training steps. Next, in Phase 1, 
the shape of one sample stimulus was modified in small incre- 
ments across a series of training steps, while maintaining selec- 
tion of the appropriate comparison. Figure 19.14 denotes the 
initial training stimuli as Step B for baseline, and shows that the 
sample and S+ (that is, the correct comparison) were physically 
identical at baseline; the subsequent rows of the figure illustrate 
the sequence of changes that were made to the sample drawing 
across training steps. When the target shape was achieved for 
Sample | (in Step 9, labeled FP for Final Performance), Phase 
2 presented a similar training series for Sample 2, with grad- 
ual alterations made to its shape across steps. By the end of 
training, all four participants demonstrated accurate arbitrary 
conditional discrimination performance involving physically 
distinct samples and comparisons. Although the stimuli involved 
in this translational study were abstract line drawings, the shap- 
ing strategy could be easily applied to a wide range of teaching 
materials. 

A second approach also shaped arbitrary sample control 
across a series of phases. To start, accurate identity matching 
(Pilgrim, Jackson, & Galizio, 2000) was established with young, 
typically developing children, first with pictures of familiar 
everyday objects (e.g., a pencil, a flower, and a boy’s face), and 
then with abstract line drawings, indicating conditional control 
by sample stimuli across a range of stimulus types. The next 
training phase presented a thematic matching task in which the 
sample and correct comparison stimulus were physically dis- 
similar, but members of a common category or theme presumed 
to be familiar to the children. For example, with a picture of an 
ice cream cone as sample, selecting a picture of a birthday cake 
instead of a car or a bird was reinforced. Accurate performance 
on this training step extended conditional control to stimuli that 
had no physical resemblance to the correct comparison. When an 
arbitrary matching task with unfamiliar line drawings was pre- 
sented next, the conditional discrimination was learned quickly, 
both by children who had failed to master the task previously 
and by experimentally naive children taught with this sequence 
from the start (Pilgrim et al., 2000). 


Exclusion Training. This approach to teaching a new arbitrary 
conditional discrimination is based on a robust outcome known 
as exclusion, in which a novel comparison stimulus will be 
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selected over a known one in the presence of a novel sample 
(e.g., de Rose, de Sousa, & Hanna, 1996; MclIlvane & Stoddard, 
1981). Let’s say, for example, that the starting repertoire of a 
young child with autism includes selecting the appropriate pic- 
ture of a bird, a dog, and a cat when she hears the corresponding 
spoken word, but that matching printed letters of the alphabet 
to their spoken names occurs at chance levels of accuracy. A 
teacher might then present the spoken words “bird,” “dog,” and 
“C” as samples, with pictures of a bird, a dog, and the printed 
letter C as comparison choices. The exclusion effect would be 
demonstrated by the child’s selecting the printed letter when 
presented with “C” as the sample, which would be followed by a 
reinforcer. Selecting other letters (e.g., B and D) in the presence 
of their spoken names could be established in similar fashion. 
Thus, by carefully intermixing new sample-comparison combi- 
nations with familiar stimulus relations, teachers can repeatedly 
create the opportunity for reinforcing target comparison selec- 
tions in the presence of target sample stimuli. Although the name 
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Figure 19.14 Steps of stimulus control shaping. 
Identity matching at baseline (B) was gradually 
changed to arbitrary matching by modifying the 
form of the sample stimulus in a series of steps, 
first for one sample (shown on the left side of the 
figure) and then for the other (shown on the right 
side of the figure). 

"Teaching Arbitrary Matching via Sample Stimulus-control 
Shaping to Young Children and Mentally Retarded Individuals: 
A Methodological Note by D.M. Zygmont, R.M. Lazar, 

W.V. Dube, and W.J. Mcllvane, 1992, Reproduced with 
permission of John Wiley & Sons Inc. 
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implies that the “known” comparison stimulus is excluded (a 
form of Type R or Reject relation), studies have consistently 
demonstrated that once demonstrated through exclusion, condi- 
tional control by the sample is maintained when the new relation 
is presented together with other samples and comparisons from 
the training set (rather than the already familiar combinations); 
that is, Type S or Select control is demonstrated. For example, 
a later training step following from our example above might 
present the three letter names as samples with the three printed 
letters as comparison choices. In this way, exclusion program- 
ming can be utilized to build the new arbitrary conditional dis- 
criminations required as baseline performances for equivalence 
outcomes. 


Sequential Versus Simultaneous Introduction of Multiple 
Conditional Discriminations. The previous sections have 
focused on teaching a single conditional discrimination but, 
as described earlier, two (or more) conditional discriminations 
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(e.g., AB and AC) are necessary to allow for all defining tests 
of equivalence-class formation. Any of the teaching strategies 
outlined above could be applied in training additional condi- 
tional discriminations, of course, but fortunately, this is often 
unnecessary; establishing a first arbitrary relation frequently 
allows for rapid acquisition of those taught later (e.g., Pilgrim 
et al., 2000), an example of a learning-set outcome (Harlow, 
1949). When children or individuals with intellectual challenges 
receive EBI, standard practice is to teach each conditional dis- 
crimination independently (e.g., AB training followed by AC 
training), and then to intermix the trial types making up each 
set of relations within a single trial block. A next conditional 
discrimination (e.g., AD) could then be trained in isolation, sub- 
sequently added to a mix of trials from all three, and so on, such 
that the baseline from which equivalence classes could emerge 
is developed sequentially. The alternative—introducing the mix 
of trial types from multiple conditional discriminations from the 
start of training—can sometimes save training time for select 
participant groups, such as college students. 


Training Structures 


Whether approached sequentially or simultaneously, teach- 
ing multiple conditional discriminations necessarily requires 
decisions about the specific sample-comparison combinations 
that will make up each conditional discrimination, and about 
the manner in which the different conditional discriminations 
will be interrelated. These dimensions of the teaching arrange- 
ment are described generally as the training structure. At least 
one stimulus set, often referred to as the nodal stimulus set 
or, more simply, the node, must be held in common across a 
minimum of two conditional discriminations to provide a basis 
for all equivalence properties. A number of different training 
structures are well represented in the experimental and applied 
literature on equivalence. For example, in Sidman’s early stud- 
ies reviewed in detail above, AB and AC conditional discrimi- 
nations were trained. This arrangement is known as one-to-many 
training, in that one set of sample stimuli (e.g., the A stimuli in 
this case) is trained with multiple sets of comparison stimuli 
(e.g., B and C, in this example). A synonymous term for this 
training structure is sample-as-node training, which emphasizes 
that the A stimulus set serves a sample function in both of the 
trained conditional discriminations and thus provides the basis 
for emergent relations between stimuli from the different con- 
ditional discriminations (e.g., between B and C stimuli, in this 
case). A second popular training protocol is the many-to-one or 
comparison-as-node structure. In this arrangement, multiple sets 
of sample stimuli are trained with a single set of comparisons, 
and the comparison stimuli thus serve as node. For example, 
AB and CB conditional discriminations could be trained such 
that the B comparison stimulus set is held in common. A third 
structure is known as a linear series training. Here, the compari- 
son stimuli from one conditional discrimination serve as sample 
stimuli in the next; for example, AB and BC relations might be 
taught. Each of these training structures could be applied with 
more than two conditional discriminations as well (e.g., AB, 
AC, and AD for one-to-many training; AB, CB, and DB for 
many-to-one training; AB, BC, and CD for linear series training, 


where two different nodes would be involved). Figure 19.15 
illustrates these common training structures with a single stim- 
ulus from each of four sets (A to D). A total of three conditional 
discriminations are represented for each training structure, and 
the arrows in this diagram always point from a sample stimulus 
to the comparison selection that would be reinforced on a given 
training trial. In each case, teaching the matching performances 
shown by the arrows would serve as the baseline for emergent 
matching between any of the stimulus pairs not linked by an 
arrow. Finally, many examples of EBI have utilized mixed train- 
ing structures with multiple nodal stimuli (e.g., AB, AC, and DC 
training); the variations in this category are extensive. 

A number of studies have compared the effectiveness of 
these training structures for teaching the baseline conditional 
discriminations and for generating equivalence outcomes 
(e.g., Arntzen, 2012; Saunders, Saunders, Williams, & Spradlin, 
1993). There are some reports of faster baseline acquisition with 
one-to-many training, perhaps because the more challenging 
successive discriminations between sample stimuli are required 
for only one set of stimuli (see Saunders & Green, 1999, for 
analysis), but these findings have not always been replicated. 
Similarly, across studies, one-to-many and many-to-one training 
structures appear to be equally effective for producing equiv- 
alence performance, with some studies favoring one-to-many 
slightly (e.g., Arntzen & Holth, 1997) and others favoring many- 
to-one (e.g., Fields et al., 1999). Certainly both provide a strong 
basis from which equivalence classes can be demonstrated. The 
one training structure that has consistently underperformed, 
compared to the others, in generating equivalence is linear 
series training (e.g., Arntzen, 2012), particularly as the number 
of trained conditional discriminations increases. Although linear 
series training has proved effective with many participants, this 
training arrangement is probably best avoided unless needed for 
other experimental reasons (e.g., to avoid a ceiling effect for 
equivalence successes, so that the impact of other procedural 
variables can be better analyzed). Either of the other structures 
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Figure 19.15 A schematic representation of three different 
training structures. Three conditional discriminations are 
represented for each structure, by presenting one stimulus 
from each of four sets (A, B, C, and D). The one-to-many (or 
sample-as-node) structure is presented in the left column, 


the many-to-one (or comparison-as-node) structure in the 
middle, and the linear series structure on the right. 


Linear Series 


can be a good choice for EBI, and the best selection will depend 
on factors such as the type of stimuli to be used (e.g., auditory 
stimuli are difficult to present as comparisons in the absence 


of special procedures) and the instructional goals of the EBI 
program. 


Mastery (Accuracy) Criteria 


The trained conditional discriminations provide the necessary 
prerequisites for equivalence-class formation, so it is essential 
that performance on all trial types is strong and consistent. To 
ensure this end, EBI programs incorporate predetermined accu- 
racy or mastery criteria that must be met in each phase of train- 
ing before moving on to the next. Typically, overall accuracy 
is measured for each trial block and defined as the number of 
trials on which the S+ comparison is chosen divided by the 
total number of trials in the block. (When acquisition is prob- 
lematic, separate accuracy measures for specific trial types— 
e.g., all trials with each sample stimulus—are important to 
analyze as well.) Because intermediate accuracy scores (e.g., 
75%) can reflect multiple sources of stimulus control (e.g., see 
Sidman, 1980, 1987), the mastery criteria are typically set at a 
relatively high accuracy percentage and are often required for 
more than one consecutive trial block (unless the number of tri- 
als composing the block is large). For example, in one series of 
studies (Pilgrim & Galizio, 1990; Pilgrim, Chambers, & Galizio, 
1995), adults and children received conditional discrimination 
training in trial blocks consisting of 16 trials each. The mastery 
criteria for each training phase required accuracy of at least 88% 
(14 out of 16 trials correct per block), with no more than one 
error on any given trial type, for two consecutive sessions. 


Scheduling Consequences During Training 


Match-to-sample training for one or more conditional discrimi- 
nations will typically include reinforcers for selection of the S+ 
comparison on every trial until the specified mastery criteria are 
met. Choosing an appropriate and effective reinforcer for EBI 
training is subject to the same considerations that would hold 
for any applied behavior analysis program. Common examples 
have included points or tokens worth money or prizes, small 
edibles, written feedback (e.g., “Correct” or “Incorrect’’), attrac- 
tive visual displays or cartoons, and auditory jingles or chimes. 
Some teaching programs also include mild punishers (e.g., a 
brief buzzer, a short time-out, or a trial repetition) for S— selec- 
tions, again programmed for every training trial until mastery 
is achieved. Before moving immediately from continuous rein- 
forcement to tests for the properties of equivalence, however, a 
common practice is to first arrange intermittent reinforcement, 
where the number of trials per block that include programmed 
consequences is decreased, often in a series of steps, with mas- 
tery criteria required at each step. For example, the steps might 
include having reinforcers available on 100%, then 75%, and 
then 50% of the trials. At issue in these reinforcement reduc- 
tions is preparation for the introduction of the test or probe 
trials designed to evaluate equivalence-class formation. Probe 
trials are typically presented in extinction, to allow for interpre- 
tations of emergent versus directly trained relations. Without 
adequate preparation, the absence of reinforcers for novel probe 
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performances can produce variability in comparison selections, 
whereas training to mastery with intermittent reinforcement 
helps to ensure response maintenance in the absence of trial-by- 
trial consequences. Relatedly, one benefit of including punishers 
as part of the training contingency is that doing so provides 
a basis for discrimination between incorrect responses and the 
mere absence of a reinforcer. 

Some EBI programs have included final training steps 
with still leaner schedules, arranging programmed consequences 
for only 20% or even 0% of the trials (e.g., Sidman & Tailby, 
1982). Although motivation can become an issue, experience 
with lean reinforcement schedules for trained relations makes 
discrimination between training trials and later probe trials less 
likely, with the net result that probe responding is not differen- 
tially extinguished. Strategies designed to counteract the low 
reinforcer density have included providing alternative bases for 
reinforcement (e.g., color-matching trials presented at the end 
of the AB and AC training block; e.g., Sidman & Tailby, 1982) 
and/or instructions (e.g., “In this part of the game, I can’t tell 
you whether you are right or not, but you should continue to 
do your best.”). As usual, the appropriateness of such strategies 
will depend on the participant population and the reasons for 
conducting the EBI program. 


Testing Protocols 


As was the case for baseline training, a number of different 
testing procedures are reflected in the equivalence literature. 
Many decisions about testing arrangements carry important 
advantages and disadvantages, such that careful consideration 
is warranted for procedural details, including the presentation 
and order of probe trials, consequence scheduling, and criteria 
for equivalence-class formation. 


Composition of Probe-trial Blocks 


Two general methods are used for presenting probe trials: 
massed testing and interspersing probe trials with baseline-trial 
types. In the first method, the trial block is composed of probe 
trials only. In the second, the block includes probe trials as well 
as each of the baseline training trial types presented in a semi- 
randomized order, often with no more than one or two probe trials 
presented in succession. Massed testing allows for more rapid 
assessment of equivalence-class formation, but when equivalence 
outcomes are not demonstrated, this approach leaves open the 
question of whether the prerequisite baseline relations have been 
maintained. Introducing novel trial types in discrimination stud- 
ies can sometimes disrupt previously mastered performances, at 
least temporarily, and this issue may be particularly relevant with 
some participant populations. If the trained conditional discrim- 
inations are not strong, there is no reason to expect equivalence 
to emerge, and in the absence of baseline-trial types, this possi- 
bility can be neither evaluated nor remedied. Related strategies 
have included beginning test sessions with a baseline-review trial 
block (e.g., Kennedy, Itkonen, & Lindquist, 1994) or alternating 
baseline-review and massed-probe trial blocks. These arrange- 
ments necessarily mean that the equivalence assessment is less 
rapid and actually represents approximations of the interspersed 
trial-type approach. In any case, some provision for evaluating the 


518 Part8 * Developing New Behavior 


maintenance of baseline performances is critical in those cases in 
which equivalence outcomes are not demonstrated. When inter- 
spersed, the proportion of probe trials to baseline trials varies 
considerably across EBI programs and is often determined by the 
overall reinforcement density being targeted (i.e., with no rein- 
forcers for any probe trial, and intermittent reinforcement for the 
baseline trials) and the necessary balanced composition of trial 
types described for training arrangements. To illustrate, Table 
19.3 presents a balanced trial block composed of three symmetry 
probes and nine baseline training trials. 


Scheduling Consequences During Testing 


When baseline-trial types have been included as part of the test 
sessions, decisions must be made about scheduling consequences 
for those trials. In some EBI programs, consequences are pro- 
grammed for at least a portion of the baseline trials (e.g., Sidman 
et al., 1985), such that the trained relations are not only evaluated 
but also possibly strengthened during testing, and overall task 
motivation is facilitated. In other work, extinction conditions are 
in effect for all trial types—baselines as well as probes (e.g., Bush 
et al., 1989). The latter arrangement is often preceded by instruc- 
tions, as described previously for leaning the reinforcement den- 
sity during training (e.g., “You will no longer receive feedback on 
your selections, but there is still a correct answer, and you should 
continue to do your best;” e.g., Dougher et al., 1994), although 
the effectiveness of this strategy will be limited with participants 
who have poor verbal skills. An advantage of testing in complete 
extinction is that at least one basis for discrimination between 
baseline- and probe-trial types is removed, and the possibility that 
probe trials will be treated differentially (e.g., attended to less 
carefully) is thereby lessened. 

There is also the issue of programming consequences 
for probe trials. Certainly, the overwhelming majority of 
equivalence-based instruction, basic or applied, has presented 


all probe trials under extinction conditions, because a primary 
research question involves the extent to which new or emer- 
gent performances result from the experimental training. In 
an alternative strategy, however, reinforcers are arranged for 
probe responses consistent with equivalence-class formation 
(e.g., Kastak, Schusterman, & Kastak, 2001). In these cases, 
data analysis is focused on comparison selections for the first 
presentation of each probe-trial type. Because direct reinforce- 
ment cannot yet be responsible for these responses, conclu- 
sions about emergent equivalence are still possible. Further, 
when EBI is implemented for purely practical reasons—for 
example, to establish an important functional skill for the first 
time—answering the research question noted above may be less 
important than ensuring optimal conditions for the acquisition 
of that skill (e.g., Sidman, 1981), and these will surely include 
reinforcement of the skill when it is demonstrated, for whatever 
reason. 


Test Order 


Whether probe trials are massed or interspersed with baseline 
trials, one common practice in EBI is to present only one probe 
type (i.e., reflexivity, symmetry, transitivity and/or combined 
equivalence) per test-trial block. This practice can allow for 
multiple presentations of each probe combination within a test 
block, and it simplifies data analysis. As with the other proce- 
dural variables discussed in this section, however, the literature 
reveals great variety in the order of presentation of the probe 
types. Indeed, the range of test approaches has run the gamut 
from introducing the most involved test types first (e.g., after 
training AB, BC, and CD conditional discriminations, present- 
ing DA tests for combined equivalence as the first probe types; 
Sidman & Tailby, 1982; Sidman et al., 1985) to beginning with 
the “simplest” (e.g., testing for BA symmetry first, after AB 
training combination; Imam, 2006). As we’ ve also seen for the 


TABLE 19.3 Arranging Balanced Trial-block Composition—Interspersed Probe 


and Baseline Trials 


Sample Stimulus 
Left Position 


A3 B3 
A2 B1 
B1* A2 
Al BI 
A3* B1 
A2 B2 
B2* Al 
A3 B2 
Al* B2 
B3* A3 
Al B3 
A2* B3 


Comparison Stimuli 


Center Position Right Position 


BI B2 
B2 B3 
A3 Al 
B2 B3 
B2 B3 
B3 BI 
A2 A3 
BS BI 
B3 Bl 
Al A2 
Bl B2 
BI B2 


Notes: Italics indicates the S+ comparison stimuli. Boldface indicates probe trials. 


* indicates a trial with no programmed consequences. 


other procedural variables, the optimal sequence of test presen- 
tation will depend largely on the goals of the particular study 
and the participant population. 

For programs of EBI with young children or individuals 
with intellectual disabilities, where the generation of equiva- 
lence classes is of primary interest (as opposed to the factors 
underlying them), a simple-to-complex testing protocol has 
much to recommend it (e.g., Adams, Fields, & Verhave, 1993). 
In this arrangement, probe types are introduced sequentially, 
beginning with symmetry, followed by transitivity (if relevant), 
and then combined tests for equivalence. Similarly, probe types 
requiring fewer nodes are presented prior to those requiring 
more nodes.* Further, each probe type is introduced immediately 
following the minimal training necessary to provide the prereq- 
uisites for the probe relation; when probes and baseline trials are 
interspersed, only those baselines that serve as prerequisites for 
the probed relation would be included. 

For example, consider an EBI program that teaches three 
conditional discriminations—AB, BC, and CD, as in the linear 
series training illustrated in Figure 19.15. The simple-to-complex 
testing protocol for this training is outlined in Table 19.4, 
and involves the following sequence of training and testing 
phases: train the AB (coin to price sign) conditional discrim- 
ination to mastery; present BA (price sign to coin) symmetry 
probes interspersed with AB baseline trials; train BC (price 
sign to written coin name) to mastery; present CB (written coin 
name to price sign) symmetry tests interspersed with BC base- 
line trials; train mixed AB and BC; present AC (coin to written 
coin name) transitivity tests, and then CA (written coin name to 
coin) combined tests in that order, both interspersed with AB 
and BC baseline trials; train CD (written coin name to quantity 
of nickels) to mastery; present DC (quantity of nickels to writ- 
ten coin name) symmetry tests interspersed with CD baseline 
trials; train mixed BC and CD; present one-node BD (price sign 
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to nickels) transitivity tests and then one-node DB (nickels to 
price sign) combined tests, both interspersed with BC and CD 
baseline trials; train mixed AB, BC, and CD; present two-node 
AD (coin to nickels) transitivity tests and then two-node DA 
(nickel to coin) combined tests, both interspersed with AB, BC, 
and CD baseline trials. This approach to systematic increases in 
test complexity develops experience with each of the necessary 
simple discriminations as it progresses, and has been shown to 
increase the likelihood of strong equivalence outcomes (e.g., 
Adams et al., 1993). For older typically developing children and 
adults, fewer steps will be necessary, but a testing sequence of 
symmetry followed by transitivity and combined tests at each 
nodal level can maximize class formation. 


Judging Class Formation 


Our earlier section titled “Class Formation” described the basics 
for assessing this most fundamental EBI outcome. At issue is 
whether emergent performances indicative of stimulus inter- 
changeability will be demonstrated. Because interpretations of 
equivalence require reliable class-consistent responding on the 
defining probe-trial types (e.g., Sidman & Tailby, 1982), data 
analysis typically focuses on the percentage of probe trials on 
which comparison selections reflect the equivalence classes that 
would be expected to follow from the training provided. For 
example, if teaching signed word to picture (AB) and signed 
word to written word (AC) conditional discriminations pro- 
duced equivalence relations, AIB1C1 and A2B2C2 classes 
would be predicted, with each class including the correspond- 
ing signed word, picture, and printed word. For trials presenting 
a printed-word C1 as sample, then, class-consistent responses 
would include selections of the matching signed word, picture, 
or written word (Al, B1, or Cl) comparisons on symmetry, 
combined, and reflexivity tests, respectively. The percentage of 
class-consistent responses could thus be calculated, either for 


TABLE 19.4 A Simple-to-Complex Testing Protocol 


Training/Testing Phase 


Training Trial Types 


Test-Trial Types 


1 AB 

2 AB BA 
3 BC 

4 BC CB 
5 AB & AC 

6 AB & AC AC 
il AB & AC CA 
8 CD 

9 CD DC 
10 BC & CD 

11 BC & CD BD 
12 BC & CD DB 
13 AB, BC, & CD 
14 AB, BC, & CD AD 
iS AB, BC, & CD DA 


Dl etd abled 1 ea let A SSA SSS 
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all probe trials combined or, especially when class-consistent 
selections are below 100%, for each probe type individually. 
When multiple test blocks are presented (versus one massed test 
presentation, for example), these measures are often graphed 
and followed across the individual tests. As was the case with 
judging baseline acquisition, intermediate percentages of class- 
consistent responses (e.g., 60% to 80%) are indicative of multi- 
ple sources of stimulus control (e.g., see Saunders & Williams, 
1998; Sidman, 1980, 1987). For this reason, strong conclusions 
about class formation require relatively high levels of class- 
consistent responding (e.g., 90% or greater). As with many of 
our procedural variables, the specific criteria vary across studies, 
and for purely practical uses, any increase in the probed relations 
may provide a useful basis for further development. 

When the expected emergent performances are not 
demonstrated immediately, several different strategies have 
proven helpful. Perhaps the simplest involves repeated presen- 
tation of the probe blocks. As noted previously, delayed emer- 
gence, in which the probe performances improve with additional 
testing, is a relatively common finding (e.g., Sidman et al., 1985; 
Sidman et al., 1986). Delayed emergence has been interpreted as 
a weakening in competing sources of stimulus control that are 
not available consistently across probe trials, leaving the equiva- 
lence relation as the dominant source of control over comparison 
selections (e.g., Sidman, 1992, 1994). In addition to repeated 
testing, low scores on initial probe tests are often improved by 
re-exposure to the training blocks, especially those that include 
the baseline prerequisites for the problematic probe types. Other 
intervention strategies include training or testing modifications 
that target particular forms of competing stimulus control. For 
example, if a probe-trial error analysis documents a position 
preference in which a participant consistently chooses the lower 
right corner of the computer screen regardless of the stimulus 
presented, the presentation format could be redesigned so that 
no stimulus appears in that position on any trial. 


Procedural Variations 


Teaching a minimum of two conditional discriminations in 
the match-to-sample format provides the core prerequisites for 
assessing equivalence-class formation. However, a number of 
alternative training approaches have been utilized effectively 
in efforts to (1) explore the range of procedures that might 
generate equivalence classes; (2) enhance the efficiency of 
equivalence-based instruction; and (3) enhance the variety of 
performances that might be produced by EBI. To illustrate, a 
few of the most prominent of these alternatives are reviewed 
here, including the use of compound stimuli, class-specific rein- 
forcement, and simple discrimination training. 


Compound Stimuli 


One procedural variation builds on the matching-to-sample train- 
ing format by presenting a stimulus compound as the sample. For 
example, the printed word DOG and a picture of a dog could be 
presented one above the other as a pair in the sample-stimulus 
position, and selecting a particular comparison stimulus (e. g., the 
printed word CANINE) would be reinforced in the presence of that 
compound. EBI programs utilizing such compounds have revealed 


that each component of the compound can come to function as 
an independent equivalence-class member; that is, in the present 
example, DOG, CANINE, and the dog picture could all be matched 
interchangeably when presented individually (e.g., Groskreutz, 
Karsina, Miguel, & Groskreutz, 2010; Lane & Critchfield, 1998). 
Thus, by using a compound sample, a given amount of training 
results in many more emergent relations and larger equivalence 
classes—a finding with important implications for EBI. 

In one demonstration of the applied significance of this 
finding, Stromer and MacKay (1992) used compound stimuli 
in match-to-sample training designed to target spelling perfor- 
mances in three 9- to 13-year-old boys with academic deficits. 
Each sample stimulus included a printed word together with a 
picture (e.g., DOG and a line drawing of a dog). In addition 
to using compound sample stimuli, the procedure incorporated 
three additional variations from the standard teaching approach 
described thus far. First, an identity match-to-sample procedure 
was employed; that is, the S+ comparison stimulus was physi- 
cally identical to one part of the sample compound. In the present 
case, on any given trial, the comparison stimuli included either 
three different pictures, one of which was identical to the picture 
component of the compound sample, or a group of 10 letters, 
some combination of which would be identical to the written 
word component of the sample. The two types of identity trials 
(i.e., with picture or letter comparisons) were presented equally 
often in an unpredictable order. Selection of the comparisons 
that were physically identical to the sample resulted in reinforce- 
ment. The second procedural variation involved in this study is 
known as a constructed response requirement. On trials that pre- 
sented letters as comparisons, the participant’s task was to touch, 
in the correct order, the letters that produced the written word 
from the sample compound. After the identity matching perfor- 
mances had been taught, a third variation was introduced—a 
delayed match-to-sample procedure. In this step, the compound 
sample was presented as usual to start each trial; however, an 
observing response to the compound sample removed it from 
the computer screen, and the comparison stimuli were presented 
without the sample being present. Because the comparisons to be 
presented were unpredictable (i.e., pictures versus letters), this 
addition required attention to both components of the sample. 

The top half of Figure 19.16 illustrates the delay proce- 
dure; each column represents a different trial type as it would 
have appeared on the computer screen. In each case, the com- 
pound sample is shown in the top box, and an observing response 
to the sample produced the display illustrated in the bottom box. 
Note that the sample stimulus is no longer present in the bottom 
display, and that the comparisons to choose from include either 
three pictures for the trial type shown on the left or, for the trial 
type shown on the right, an array of letters from which the sam- 
ple word could be composed. 

Over the course of the study, this training was arranged 
with three different sets of printed words (i.e., dog, cat, owl; 
canine, feline, avian; etc.), one set at a time. The same pictures 
appeared as part of the sample compound with all printed-word 
sets; that is, the pictures were held in common across the trained 
conditional discriminations and thus could serve as nodes. The 
bottom half of Figure 19.16 illustrates training trials with words 
from the second and third word set, on the top and bottom rows, 


Figure 19.16 The top half of the figure illustrates the 
delayed matching task. Each column represents a different 
trial, with the compound sample shown first; an observing 
response to the sample produced the display in the bottom 
box. The bottom half of the figure illustrates training trials 
with words from the second and third word sets. The delay 
procedure is not represented here but was in place. 
Adapted from “Spelling and Emergent Picture-printed Word Relations 
Established with Delayed Identity Matching to Complex Samples,” by 

R. Stromer and H. A. Mackay, 1992, Journal of Applied Behavior Analysis, 
25, p. 898. 


respectively; the delay procedure was in place here, too, but is 
not represented in this figure. Note that in these trial examples, 
the picture of the dog is common to each sample compound. 
Test trials presented each word and picture individually 
as samples; all possible combinations of the printed words, pic- 
tures, and constructed responses were tested after training with 
each of the three stimulus sets (1.e., a multiple baseline across 
stimulus sets design). For each test, all three boys demonstrated 
emergent performances involving only those stimuli for which 
training had been provided. By the end of training, probe per- 
formances revealed the formation of equivalence classes, each 
of which included a picture, three different printed words, and 
three different constructed words. That is, any of the visual stim- 
uli within a class could be matched to any other (€.g., DOG 
and CANINE could be matched) and could occasion any of the 
three word constructions (€.g., with CANINE as sample, the 
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word DOG could be constructed, depending on the letter choices 
provided). Further, when the corresponding spoken words were 
presented as samples on probe trials, any of the three printed 
words or the picture in the class was selected, and the appro- 
priate spelling was constructed. At this point, the boys had an 
established spelling repertoire, resulting from EBI. 


Class-specific Reinforcement 


A second variation on the match-to-sample format involves a 
particular arrangement of consequences known as differential 
outcomes or Class-specific reinforcement. In this procedure, 
not only is the correct comparison choice conditional on the 
sample stimulus, but the type of consequence delivered is, too. 
For example, given sample A1, correct selection of comparison 
B1 would produce Reinforcer 1: a small marshmallow, per- 
haps. Given sample A2, correct selection of B2 would produce 
a different reinforcer: Reinforcer 2—let’s say, a sip of juice. 
Teaching a second conditional discrimination would work sim- 
ilarly; for example, selecting comparison Cl given sample Al 
would produce the marshmallow, while selecting comparison 
C2 given sample A2 would produce access to juice. Finally, 
let’s add identity match-to-sample training with D stimuli, such 
that selecting D1 given a D1 sample produces the marshmallow, 
with selecting D2 given a D2 sample produces juice. 

Several important findings have been obtained with 
this procedure. Not only can differential outcomes facilitate 
the acquisition of conditional discriminations (e.g., Litt & 
Schreibman, 1981), but reinforcer-probe trials reveal that the 
class-specific consequences themselves become members of 
the equivalence classes, thereby increasing the number of class 
members that can be added by teaching a single conditional dis- 
crimination (e.g., Dube, McIlvane, Mackay, & Stoddard, 1987; 
Dube, Mcllvane, Maguire, Mackay, & Stoddard, 1989). From 
our example above, when pictures of the marshmallow or juice 
are presented as sample or comparison stimuli, they are matched 
with the related A, B, C, or D stimuli (i.e., these are the rein- 
forcer probes). Further, the A, B, C, and D stimuli prove to be 
interchangeable with each other, indicating equivalence-class 
formation (e.g., Dube et al., 1987, 1989). Thus, the class-specific 
reinforcer can also serve as a node for class formation; in this 
example, the D stimuli could have become class members only 
by virtue of their relation to the class-specific consequences held 
in common across the trained relations. 

These sorts of findings also carry important implications 
for equivalence-based instruction. First, differential-outcomes 
procedures provide another route to producing more emergent 
relations with a given amount of training, because the rein- 
forcers join the respective classes. Second, equivalence-class 
formation can result from identity match-to-sample training 
when class-specific reinforcers are utilized (e.g., Dube et al. 
1987, 1989), and for certain target populations, identity train- 
ing is more easily mastered than the arbitrary training that is 
more typical in equivalence research (e.g., Pilgrim, Jackson, & 
Galizio, 1990; Zygmont et al., 1992). 

Studies are beginning to explore class-specific reinforcement 
procedures as a route to increasing the power of equivalence-based 
applications. To illustrate, preschool children who could count but 
otherwise scored poorly on math pretests were first taught to match 
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“Four” 
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J 


Figure 19.17 An illustration of the trained conditional 
discrimination. Each row represents a trained relation, 

with sample stimuli presented in the left column, the $+ 
comparison in the middle column, and the compound 
consequences that followed selection of the $+ in the right 
column. Note that the consequence for any given trial 
included the spoken word and either the numeral or the 
written word. 


pictures with equal quantities of objects totaling 4, 7, or 10 (e.g., given 
4 moons as sample, selecting 4 hearts rather than 7 lightning bolts 
or 10 squares; Luffman, 2012). Incorrect matches produced a brief 
buzzer, while correct matches produced a two-part class-specific 
consequence. Every correct match produced the correspond- 
ing spoken quantity value (e.g., “Four!”) and one of two visual 
stimuli, the corresponding numeral (e.g., 4) or the written word 
(e.g., FOUR), each presented on half of the relevant trials. Figure 
19.17 summarizes the training; each row of the figure presents an 
example of the matches that were taught, with the sample stimulus 
presented in the left column, the correct comparison in the middle 
column, and the possible consequences in the right column. 

On the probe trials that followed this teaching, the chil- 
dren demonstrated class-consistent matching of each quantity, 
numeral, written word, and spoken word. Later teaching phases 
arranged the same compound consequences for accurately match- 
ing quantities of objects to a “+ 1” addition problem (e.g., given 
four objects, selecting “3 + 1,” rather than “6 + 1” or “9 + 1”; 
AB training) and then for matching quantities of objects to 
a “+ 2” addition problem (e.g., given four objects, selecting 
“2 + 2,” rather than “S + 2” or “8 + 2”; AC training). Figure 
19.18 shows all of the stimuli presented in AB and AC conditional 
discrimination training, including the class-specific consequences. 

After completing the three teaching phases, probe per- 
formances documented the emergence of three equivalence 
classes with at least six members (i.e., a quantity of objects, 
the corresponding numeral, spoken and written words, 


Stimulus Sets Used in Conditional Discrimination Training 


A B C Consequences 
Quantities 
Trained + 1 Stimuli + 2 Stimuli 
4 3+1 2+2 “Four,” 4, FOUR 
7 6+1 54+2 “Seven,” 7, SEVEN 
10 9+1 8+2 “Ten,” 10, TEN 


Figure 19.18 Stimulus sets used in conditional 
discrimination training. Note that the consequence for 
any given trial included the spoken word and either the 
numeral or the written word. 


“+ 1” and “+ 2” addition problems), any of which could be 
matched to any other interchangeably by children who had 
been unable to do so prior to training. (Probe batteries were 
presented just before and just after each training phase.) In 
this example, EBI resulted in the symbolic relations defin- 
ing of a rudimentary but fundamental start to understanding 
mathematics. 


Three-term Contingency Training 


As noted previously, the acquisition of arbitrary conditional 
discriminations has sometimes proved challenging for young 
children and individuals with developmental delays. Thus, for 
both practical and theoretical reasons (Sidman, 1994, 2000), 
the possibility of generating equivalence classes based on 
simple discrimination, or three-term contingency, training 
has been explored, and with promising results (e.g., Debert, 
Matos, & Mcllvane, 2007; Pilgrim, Boye, Hogan, & Groff, in 
press; Sidman, Wynne, Maquire, & Barnes, 1989). Indeed, the 
ease with which simple discriminations are typically acquired 
suggests considerable potential for this approach to applica- 
tions of EBI. 

To illustrate, consider a follow-up to the math teaching 
program described above. In this case, class-specific conse- 
quences were arranged again, but for simple discrimination 
training (Yonkers, 2012). Four young children with learning 
disabilities were identified by their teachers as needing extra 
help in basic math skills. The first training phase taught three 
simple discriminations with the correctly oriented numerals 4, 7, 
or 10 as the S+ stimuli. For each discrimination, the two S— 
stimuli presented the S+ numeral in either an upside-down or 
a 90° orientation. Figure 19.19 illustrates a trial presentation 
for each of the three discriminations, with the S+ and two S— 
stimuli appearing in three of the four corners of the computer 
screen. 


Figure 19.19 An illustration of example 
training-trial configurations, one with each 


of three trained simple discriminations. 


Selecting either S— stimulus produced a buzzer, while 
selecting the S+ produced a compound class-specific conse- 
quence consisting of the spoken numeral name (e.g., “Four!”) 
and the corresponding written word (e.g., FOUR), as illustrated 
in Figure 19.20. 

Rapid acquisition of these simple discriminations was fol- 
lowed by conditional discrimination probe trials in which the 
spoken word, the written word, and the numerals were matched 
class-consistently, regardless of which served as sample and 


Compound 
S+ Consequence 
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Figure 19.20 An illustration of the trained simple 
discriminations. Each row represents the $+ stimulus in 
the left column, and the compound consequences that 
followed selection of the S+ in the right column. 
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which as comparison. Additional simple discriminations were 
taught in subsequent training phases, in which the stimuli pre- 
sented depended on each child’s skill level, as measured by 
pretests. For the children with the lowest pretest scores, the 
B stimulus set consisted of correctly oriented “+ 1” addition 
problems (1.e.,3 + 1,6 + 1,and9 + 1), while the C stimulus 
set consisted of correctly oriented “+ 2” problems (i.e., 2 + 2, 
5 + 2, and 7 + 2). One training trial of each type is illustrated 
in Figure 19.21; Figure 19.22 presents the S+ stimuli for each 
of the nine trained simple discriminations. 

Again, selecting one of the correctly oriented S+ stim- 
uli, rather than an upside-down or sideways presentation of the 
problem, resulted in the same class-specific consequences used 
for numeral training (i.e., the appropriate spoken and written 
number name, also illustrated in Figures 19.21 and 19.22). For 
children with more advanced skills, multiplication problems or 
Roman numerals were presented as stimulus sets, illustrating the 
flexibility of this approach to meet individual children’s needs. 
Upon completion of the simple discrimination training with dif- 
ferential outcomes, match-to-sample probe tests documented the 
formation of equivalence classes (e.g., 4, “Four”, FOUR, 3 + 1, 
and 2 + 2). Figure 19.23 illustrates just a few of the many probe 
arrangements made possible by the training program, with a sam- 
ple stimulus presented in the center of the screen and comparison 
stimuli presented in three of the four corners of the screen. In this 
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Figure 19.21 Illustration of a trial 
presentation with each of the nine 
trained simple discriminations. Each 
row represents trials with a particular 
stimulus set (A, B, or C), and each 
column represents trial types for which 
the same class-specific compound 
consequence was programmed, as 
indicated in the bottom rows. 
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Stimulus Sets Used in Simple Discrimination Training 


A B C Consequences 
Numerals +1 Stimuli + 2 Stimuli 

4 3+1. 242 “Four,” FOUR 

7 6+1 54+2 “Seven,” SEVEN 

10 9+1 84+2 “Ten,” TEN 


Figure 19.22 A representation of all of the S+ and 
consequence stimuli presented during simple discrimination 
training. 


EBI program, then, nine rapidly learned simple discriminations 
resulted in three five-member equivalence classes, as evidenced 
by more than 60 emergent conditional performances. These 
findings not only established the feasibility of such an approach 
to EBI but also provided a striking demonstration of the effi- 
ciency of EBI for producing generative and functional symbolic 
skills. 


APPLICATIONS AND GENERALITY 


The past years have witnessed enthusiastic attention to the pos- 
sibility that equivalence-based instruction might be utilized 
creatively to establish or enhance performances across varied 
participant populations and across an increasingly broad range 
of target skills. The inherent flexibility of the equivalence 
paradigm, in which literally any stimulus combination can be 
employed, and the increasing variety of effective teaching and 
testing procedures that it has inspired, make the approach partic- 
ularly attractive to applied behavior analysts. As each new appli- 
cation is documented, the generality of the equivalence outcome 
is further confirmed, a critical issue in the development of any 
scientific concept. In the sections that follow, some highlights 
from the accumulating application literature will be reviewed, to 
illustrate the scope and the promise of the equivalence approach 


to instruction with children and adults with intellectual disabil- 
ities, young typically developing children, adult clinical popu- 
lations, and college students. 


Children and Adults with Intellectual 
Disabilities and Young Typically Developing 
Children 


To date, these populations stand as the major recipients of 
equivalence-based applications, because skill acquisition is a 
necessary and ongoing focus for such individuals. Important cat- 
egories of EBI focus have included language- and number-skills 
training as well as other functional academic and life skills. 


Language-Skills Training 
Perhaps the greatest advances in programmatic EBI are found in 
the area of language-skills training. Building on Sidman’s (1971; 
Sidman & Cresson, 1973) original findings, a number of studies 
have replicated the formation of equivalence classes as a basis 
for establishing rudimentary reading skills (e.g., Elias, Goyos, 
Saunders, & Saunders, 2008, with adults with intellectual disabil- 
ities; Osborne & Gatch, 1989, with hearing-impaired preschool 
children; Sidman, Cresson, & Willson-Morris, 1974, with two 
boys with Down syndrome), where each class included a spoken 
word or sign (demonstrated in American Sign Language), a writ- 
ten word, and a picture or object. Participants, all of whom were 
nonreaders at the start, went on to sign or orally name both the 
pictures/objects and the written words following class formation. 
As illustrated earlier in this chapter, another group of stud- 
ies has demonstrated that spelling performances can be effectively 
incorporated into an equivalence approach (e.g., de Souza et al., 
2009; Goyos, Souza, Silvares, & Saunders, 2007; Melchiori, de 
Souza, & de Rose, 2000). The constructed-response procedure has 
often been used for this purpose whereby, in the presence of a spo- 
ken or written word, the participant’s task is to produce that word 
by selecting a series of letters or syllables from a choice pool, in 


as 


Figure 19.23 Examples of conditional 
discrimination test-trial arrangements. 


order, either by touching each choice sequentially on a computer 
presentation or by moving letter tiles in a table-top task. This teach- 
ing approach has proven effective with a range of participant groups 
including preschoolers, young students from regular and from spe- 
cial education classes, and adult nonreaders. 

One particularly exciting outcome of the studies just cited 
has been that spelling improvements successfully generalized to 
words that were not directly targeted in the teaching steps. As 
was the case in the work described above with compound sam- 
ple stimuli, it seems that the individual components, or units, of 
the trained compounds came to exert stimulus control on their 
own (e.g., the individual letters, in the case of spelling). Thus, 
when these smaller units appeared in novel arrangements (e.g., 
when new words were presented), appropriate responses (e.g., 
accurate spelling sequences) were possible. The ability to produce 
and recombine minimal units (e.g., words, syllables, phonemes, 
letters) in the presence of novel stimulus compounds is clearly 
essential to many functional language skills, such as reading and 
spelling. Not surprisingly, then, another important development 
in equivalence-based language-instruction programs has involved 
incorporating steps to deliberately facilitate such recombinations, 
also known as recombinative generalization. The general strat- 
egy involves teaching with compounds that have been specifically 
chosen to include overlap in the individual units that will be nec- 
essary for correct responding to novel compounds. For example, 
a child might be taught to select the written words MAP, SAP, 
MUM, or SET correctly upon hearing the corresponding spoken 
word, and then prove able to select the novel words MET or SUM 
on test trials. This teaching arrangement is sometimes described as 
matrix training, to emphasize the designed overlap in units across 
training and testing compounds. The key is to establish all of the 
prerequisite units for successful performance prior to test trials 
with the new combinations. (For further elaboration and examples, 
the interested reader should consult, as representative sources, de 
Souza et al., 2009; Hanna et al., 2011; Mueller, Olmi, & Saunders, 
2000). 

One group of behavior analysts has extended the gener- 
ality of this training strategy considerably with a sophisticated, 
comprehensive, and ongoing reading-instruction program for 
children in Brazil who have experienced repeated failure in the 
public school system (de Souza et al., 2009). Many of the chil- 
dren are from families with low socioeconomic status and they 
carry significant risk for school drop-out. The reading curricu- 
lum as a whole incorporates many of the findings described in 
this chapter as contributing to maximal effectiveness, and it has 
produced many first-time readers as a result (e.g., de Rose, de 
Souza, Rossito, & de Rose, 1992: Matos & Hubner-D’ Oliveira, 
1992). By way of overview, some key characteristics of the early 
instructional phases include a matching-to-sample procedure 
with constructed responses and a matrix training approach that 
builds on an interesting feature of the Portuguese language. Many 
Portuguese words are formed by consonant-vowel syllables (e.g., 
BOLA, which means “ball” in English, and LATA, which means 
“can’”). Recombinations of these syllables form new words (€.g.., 
BOTA, which means “boot” in English), thus setting the stage 
nicely for training designed to produce generative reading rep- 
ertoires, with syllables as the minimal units. For the constructed 
responses, students learn to select the appropriate sequence of 
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syllables (rather than individual letters, as in the spelling studies 
above) given a printed word, dictated word, and picture. AB (spo- 
ken word to picture matching) and AC (spoken word to written 
word matching) conditional discriminations are also taught, as in 
several of our earlier examples (e.g., Sidman, 1971). 

Tests at this point reveal the formation of equivalence 
classes (i.e., spoken word, picture, and written word) both for 
trained words and for recombinative generalization words. Sig- 
nificant improvements have also been obtained in reading and 
in spelling, both in the syllable-constructed response format and 
in cursive writing, again with training and generalization words 
alike. Subsequent teaching phases of this integrated instructional 
program move from acquisition of reading and writing syllables 
to whole words to sentences and short stories, and refinements 
continue. This increasingly complete reading curriculum has 
now benefited more than 3000 students at reading centers in 
multiple schools throughout Brazil, and may well stand as the 
single most comprehensive application of equivalence-based 
instruction in existence to date. 


Number-Skills Instruction 


Just as basic reading comprehension has been approached 
through EBI, so has basic numeric comprehension, with tar- 
gets ranging from simple number classes to more complex 
mathematical concepts. Examples of the former have involved 
match-to-sample training in which participants learn to select 
the appropriate quantity of objects or pictures (e.g., *, **, or 
**) in the presence of dictated number names (e.g., “one,” 
“two,” and “three”), and the appropriate numeral (e.g., 1, 2, 
or 3) in the presence of the same number names (i.e., AB and 
AC conditional discriminations). Probe tests following such 
training have reliably resulted in evidence of equivalence-class 
formation, with each class including a dictated name, a quan- 
tity, and a numeral (e.g., Gast, VanBiervliet, & Spradlin, 1979; 
Maydak, Stromer, Mackay, & Stoddard, 1995; both with indi- 
viduals with intellectual disabilities). Emergent matching of 
quantities and numerals supports an interpretation of number 
comprehension, and participants come to name the quantities 
and the numerals correctly as well. Variations on this train- 
ing have also established number names from two unfamiliar 
languages (i.e., Native American Ojibwe and Dakota words) 
as equivalence-class members (Haegele, McComas, Dixon, & 
Burns, 2011) with typically developing preschoolers. 

These simple number classes have been elaborated in sev- 
eral ways in subsequent studies. One example was described 
previously (Luffman, 2012) in which match-to-sample training 
using compound class-specific reinforcers was conducted with 
young typically developing children, and probe performances 
documented the formation of six-member equivalence classes 
that included quantities, numerals, spoken number names, writ- 
ten number names, and X + 1 and X + 2 arithmetic problems 
for each of three values (i.e., 4, 7, and 10; see Figure 19.18 for 
all stimulus sets). Each class member was matched accurately 
to every other class member, yielding hundreds of emergent 
numeric relations, after teaching only AA, AB, and AC condi- 
tional discriminations. The children in this study also learned 
to perform a constructed response. For each quantity sam- 
ple presented during training, they were required to slide the 


526 Part 8 * Developing New Behavior 


appropriate number of beads on an abacus, prior to the presen- 
tation of comparison stimuli; that is, a class-specific construct- 
ed-observing response was required. At the study’s conclusion, 
generalization trials were presented in which the three numer- 
als served as either samples or comparisons, and novel addition 
problems equaling 4, 7, or 10 (e.g.,2 + 2,4 + 3,5 + 5) served 
as comparisons or samples, respectively. (Note that, as illus- 
trated in Figure 19.18, all values from | through 10 had been 
presented during training). The children made accurate matches 
on these novel trial types, and they performed the appropriate 
constructed response to the novel sample combinations, thus 
demonstrating recombinative generalization of the values. 
Many beginning math learners struggle to understand and 
respond accurately with respect to fractions and decimals, and 
EBI has been designed to facilitate these more complex mathe- 
matical concepts as well (e.g., Hall, DeBernardis, & Reiss, 2006; 
Lynch & Cuvo, 1995). To illustrate, match-to-sample training 
was conducted with seven 1|1- to 13-year-old students identified 
as having math difficulties; grade-equivalent math scores for 
these fifth- and sixth-graders ranged from 3.0 to 3.7 (Lynch & 
Cuvo, 1995). AB and then BC conditional discriminations were 
taught in which the A stimuli included 12 fractions expressed 
as ratios (e.g., 3/5), the B stimuli included 12 fractions repre- 
sented pictorially (i.e., as a grid of 100 squares with the appro- 
priate number of squares shaded), and the C stimuli included 
12 values presented as decimals (e.g., 0.60). The top portion of 
Figure 19.24 illustrates the trained and tested relations. In 
contrast to generally poor pretest scores, all seven students 
improved dramatically, often to near-perfect levels, on the 
fraction-to-decimal probe tests (i.e., AC and CA) indicative of 
equivalence-class formation. Thus, 12 equivalence classes con- 
sisting of a fraction, a shaded grid, and a decimal value emerged, 
and a subsequent AD training step with 12 new fractions 
(D stimuli) expanded the classes to include a fourth interchange- 
able member. Results were mixed on paper-and-pencil tests for 
solving fractions with no choices provided, and on match-to- 
sample generalization tests involving novel fractions, with some 
of the students showing large improvements from pretests and 
others none at all. It might be noted that the teaching procedures 
in this early study did not utilize a matrix-training approach. 
Another obviously important functional-skill category 
for young children and individuals with disabilities involves 
money equivalencies, in which dictated or printed prices and 
various coin or dollar combinations must become interchange- 
able (see our example from Figure 19.7). This target has also 
been the subject of a number of EBI studies (e.g., Keintz et al., 
2011; Stoddard, Brown, Hurlbert, Manoli, & Mcllvane, 1989) 
using the match-to-sample procedure. In an early classic report 
(McDonagh, Mcllvane, & Stoddard, 1984), a woman with mod- 
erate intellectual disability was taught to match two different 
coin combinations to a printed price for each of three values 
(5, 10, and 15 cents). With no further training, she could then 
match the coin combinations to each other and give the val- 
ues of each, demonstrating equivalence-class formation. The 
three-member classes were expanded successfully by teaching 
additional coin combinations for some of the values, and these 
results were replicated systematically across multiple individu- 
als with intellectual disabilities (e.g., Stoddard et al., 1989). In 


a more recent illustration (Keintz et al., 2011), two preschoolers 
with autism received training with three dictated coin names (the 
A stimuli; “penny,” “nickel,” and “dime’”), three actual coins (the 
B stimuli; a penny, a nickel, and a dime), three printed prices 
(the C stimuli; $0.01, $0.05, and $0.10), and three dictated prices 
(the D stimuli; “1 cent,” “5 cents,” and “10 cents”). After learn- 
ing AB, BC, and DC conditional discriminations (see the bottom 
half of Figure 19.24), both children demonstrated four-member 
equivalence classes including the dictated coin and price names, 
the actual coins, and the written prices, and both named each of 
the coins. Collectively, the results from these studies demon- 
strate the great potential of EBI for establishing symbolic equiv- 
alencies involving large numbers of individual coin relations. 


Other Functional Skills 


A range of skills in addition to those involving basic language 
and number classes have been established through EBI, provid- 
ing further testament to the considerable flexibility of this teach- 
ing approach. For children and adults with cognitive disabilities 
and typically developing children, several examples will serve 
to reflect the variety of performances that have been targeted to 
meet specific needs of the individual participants. 

One interesting study (Arntzen, Halstadtrg, Bjerke, & 
Halstadtrg, 2010) focused on facilitating music skills for a 
16-year-old with autism and intellectual disability; the boy was 
interested in music and could play a few simple songs on a key- 
board when guided by a color-code that showed which note to 
play. The equivalence-based training was designed to teach him 
to read and play musical chords. For each potential class, the 
stimulus sets in this study included (a) the name of a chord in 
Norwegian (the boy’s second language); (b) a keyboard diagram 
with dots indicating finger placement; (c) a musical staff with 
the chord represented us notes; and (d) the name of the chord in 
Vietnamese (the boy’s first language). Across the phases of the 
study, eight different major chords and an equal number of minor 
chords were targeted; none of these chords were played or named 
correctly on a pretest. Posttest results verified the emergence of 
a large number of.four-member equivalence classes. The authors 
also reported that the boy continued to use the chords after the 
study concluded, and they recommended EBI approaches for 
establishing other hobby skills, to possibly enhance the lives of 
individuals with intellectual disabilities. A similar study with typ- 
ically developing preschoolers was also effective in establishing 
simple music skills with an equivalence-based approach (Tommis 
& Fazey, 1999). Again, improvements in playing simple melodies 
were maintained, at a 7-week follow-up test in this case. 

Several studies have focused on expanding sources of stim- 
ulus control for key functional skills in young children and adults 
with disabilities. For example, activity schedules are an import- 
ant aid to helping children work independently in many special 
education classrooms (e.g., MacDuff, Krantz, & McClannahan, 
1993; McClannahan, MacDuff, & Krantz, 2002). Typically, activ- 
ity schedules present a sequence of pictures, one at a time, each 
of which indicates a particular task to be completed. To make 
the developmentally appropriate shift from control by pictures to 
control by written words, an equivalence-teaching approach was 
arranged for two 6-year-old children with autism who responded 
accurately to picture schedules (Miguel, Yang, Finn, & Ahern, 
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Figure 19.24 The top half of the figure presents 
a schematic representation of trained (solid 
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2009). After being taught to match six activity-schedule pictures 
to their dictated names (AB training) and six corresponding writ- 
ten words to the same dictated names (AC training), both chil- 
dren demonstrated accurate responding to activity schedules with 
written words, as well as accurate matching of words and pic- 
tures, indicating comprehension of the words, and oral reading. 
A similar strategy was used (Rehfeldt & Root, 2005; Rosales & 
Rehfeldt, 2007) to augment the manding skills of male adults with 
severe intellectual disabilities who had learned to request items 
with pictures via the Picture Exchange Communication System 
(PECS) (Frost & Bondy, 1994). After successful picture-exchange 
requests had been established, participants were taught to choose 
the correct picture given its dictated name and to choose the cor- 
responding written word given the same dictated names (AB and 
AC conditional discriminations). Although unable to do so on pre- 
tests, all participants demonstrated equivalence-class formation, 
matching pictures to their written words and vice versa, and they 
successfully requested items with the appropriate written word. 
Some of the men also named the written words and requested 
items by name as well as by written-word exchanges. For the 
participants who received maintenance tests 1 month after training 
(Rosales & Rehfeldt, 2007), lasting improvements to functional 
communication were demonstrated. 

Equivalence-based instruction has also been used to teach 
academic content such as geography to children with disabilities, 


arrows) and tested (dashed arrows) conditional 
discriminations from Lynch and Cuvo (1995). 
The bottom half of the figure presents the same 
information from Keintz, Miguel, Kao, and Finn 
(2011). 


as well as to typically developing children. For example, basic 
geography lessons were presented to two boys with autism by 
teaching them to match a drawing of each of nine states to the 
dictated state name (AB training) and then to match the name 
of the state’s capital city to the state drawing (BC training; 
LeBlanc, Miguel, Cummings, Goldsmith, & Carr, 2003). Where 
all pretest scores were at approximately chance levels of accu- 
racy for both boys, posttests revealed strong emergent perfor- 
mances indicative of equivalence-class formation; state names 
were matched to their capitals and vice versa, and each trained 
relation was reversible. One of the boys also demonstrated per- 
fect accuracy on vocal posttests in which correct responses had to 
be produced orally (rather than selected) in response to relevant 
geography questions about the states involved in training (e.g., 
“What is the capital of Florida?”). This teaching approach was 
replicated systematically in a study with five adolescents with 
fragile X syndrome where three state names, drawings, and cap- 
itals were taught (Hall et al., 2006). In another variation, also 
involving geography content, five typically developing 6-year- 
olds were taught AB and BC intraverbal sequences rather than 
match-to-sample performances (Perez-Gonzalez, Herszlikowicz, 
& Williams, 2008). In this procedure, a geography question was 
presented orally (e.g., “What is the capital of Argentina?”), and 
the child was taught to give the correct answer orally. Thus, the 
procedure involved production of an oral response rather than the 
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more typical selection of a stimulus in a match-to-sample task 
(i.e., topography-based rather than selection-based responding). 
The stimulus sets included (a) two countries; (b) two capital 
cities; and (c) two city parks. After mastering the trained intra- 
verbal sequences (i.e., country-capital and capital-park), probe 
trials presented novel questions with each possible combination 
of elements (i.e., capital-country, park-capital, country-park, and 
park-country). Results showed accurate emergent responding on 
at least some probe types for most participants. 

Another important research direction for EBI in applied 
behavior analysis involves utilizing equivalence classes as an 
explicit means to promote generalization after teaching other func- 
tional skills. That is, if a new performance is taught in the presence 
of one stimulus setting, and that stimulus is a member of an equiv- 
alence class, we might expect the performance to be controlled by 
other members of the class as well (i.e., the transfer-of-function 
effect). Although this possibility has received relatively little atten- 
tion in applied work, some initial exploration has been promising. 
Taylor and O’ Reilly (2000) worked with six participants with mild 
intellectual disabilities to establish a response chain of grocery-store 
shopping skills. A task sequence (e.g., enter the store, consult a 
two-item shopping list, find the first item, place it in the cart, etc.) 
was trained first in a classroom, and then in a particular grocery 
store for all six. For two participants, this constituted the complete 
training (i.e., single-instance training). For another two participants, 
the same task-sequence training was conducted in two additional 
grocery stores (i.e., multiple-exemplar training). For the remain- 
ing two participants, single-instance training was followed by EBI. 
AB and BC conditional discriminations were taught with spoken 
words (“Supermarket,” “Shop,” and “Restaurant”), the correspond- 
ing written words, and photographs taken from inside each setting. 
An interesting feature of the training was that three different photo- 
graphs of supermarket settings were presented across training trials. 
Probe trials for these participants documented the formation of three 
equivalence classes, each including a spoken word, a written word, 
and the relevant photographs. After training, shopping skills were 
evaluated for all participants in a generalization grocery setting. 
The task sequence was followed accurately and equally well for 
the participants who received EBI and multiple-exemplar training, 
while those who received only single-instance training performed 
more poorly. Although this study is limited by the small number 
of participants in each teaching condition, the results suggest that 
EBI can provide the same positive impact on generalization as more 
costly and labor-intensive training in multiple sites. Certainly, these 
are findings worthy of further exploration. 

A final and significant category of equivalence-based 
instruction with children involves programs designed to facil- 
itate skills related to sensory deficits, such as vision or hearing. 
One illustrative study used an equivalence approach to teach 
Braille letter naming to four children (ages 7 to 12 years) with 
degenerative visual impairments (Toussaint & Tiger, 2010). 
All four students could name printed letters, but despite having 
a year or more of classroom Braille instruction, none demon- 
strated proficiency in reading Braille words. The stimulus 
sets used for teaching and testing included (a) Braille letters: 
(b) printed letters; and (c) spoken letter names, for 5 to 26 letters, 
depending on the participant. With printed letter-name relations (BC 
relations) already intact, teaching focused on having the children 


match Braille letters to printed ones (AB relations). On subsequent 
posttests, all four children demonstrated strong emergent match- 
ing between printed-letter samples and Braille comparisons, and 
between spoken name and Braille comparisons. They also named 
the Braille letters, verifying the formation of equivalence classes 
that provided a basis for further Braille reading instruction. With 
respect to hearing deficits, another program of study has undertaken 
the goal of establishing symbolic function for auditory stimulation 
provided by a cochlear implant in children with deafness (Almeida- 
Verbu et al., 2008; da Silva, de Souza, & de Rose, 2006). To 
illustrate, 6- to 9-year-old children with early-onset deafness first 
received AB and AC conditional discrimination training with all 
visual stimuli (Almeida-Verbu, et al., 2008). After three-member 
equivalence classes were demonstrated, a new conditional discrim- 
ination was taught with auditory stimuli as samples (DC training). 
All participants then demonstrated class expansion, indicating that 
the sounds had become the symbolic equivalent of the existing class 
members (e.g., D1 was interchangeable with Al, B1, and C1, and 
so on). These preliminary results suggest great promise for helping 
young children learn the meaning of sounds once cochlear implants 
make auditory stimulation available to them, perhaps for the first 
time in their lives. 


Adult Clinical Populations 


Although a relatively small category compared to many we have 
reviewed, applications of EBI to adult clinical issues have been 
explored in several studies, noteworthy because of the potential 
that is exemplified for these populations. As a case in point, indi- 
viduals with acquired brain injury or illness often display deficits 
for which equivalence approaches seem well suited, such as the 
inability to provide or respond to words for everyday objects, 
events, or people. One study (Cowley et al., 1992) focused on 
three adult men with brain injury who were unable to relate their 
therapist’s name to his or her face, even after many months of 
working together—a problem for independent functioning within 
the rehabilitation facility. The solid arrows in Figure 19.25 depict 
the stimulus relations that were demonstrated by each of the men 
at pretest for each of three different therapists. The relations tar- 
geted in training are depicted by the bold arrow, and the potentially 
emergent relations are indicated by the broken arrows. Posttest 
results revealed equivalence-class formation for all three men; 
that is, spoken names, written names, and faces were matched 
together in any combination. The two men who received the test 
were also successful at providing oral names for their therapists’ 
faces. Facial-emotion recognition was targeted in another study 
with three adults with brain injury (Guercio, Podolska-Schroeder, 
& Rehfeldt, 2003). Although these participants were unable to do 
so on pretests, learning to match two sets of facial expressions 
to their appropriate written labels (i.e., happiness, sadness, and 
anger; AB and CB training) produced significant improvements 
in matching different representations of the same emotion to each 
other, and in providing correct oral labels for those emotions. 
The field of gerontology represents another obvious target 
for equivalence approaches, although work in this area is in its 
infancy. Although it is clear that some elderly individuals demon- 
strate equivalence-class formation readily (e.g., Perez-Gonzalez 
& Moreno-Sierra, 1999; Saunders, Chaney, & Marquis, 2005), it 
is also apparent that age matters (Wilson & Milan, 1995), with 
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older participants (62 to 81 years old) less likely to demonstrate 
class formation than young adults (19 to 22 years old). Further, a 
correlation has been noted (Gallagher & Keenan, 2015) between 
demonstrations of equivalence-class formation and scores on the 
Mini Mental Status Examination (MMSE), a widely used assess- 
ment of cognitive impairment in the elderly. Indeed, the ability 
to demonstrate equivalence-class formation may prove to be a 
more sensitive measure of impairment, in that deficits in class 
formation were present for individuals with MMSE scores above 
those indicative of cognitive difficulties. One investigation of a 
participant diagnosed with Alzheimer’s disease (an 84-year-old 
woman with an MMSE score of 20, or mild cognitive impair- 
ment) found evidence of delay-identity matching but not equiva- 
lence-class formation (e.g., Steingrimsdottir & Arntzen, 2011; see 
also Steingrimsdottir & Arntzen, 2014). Equivalence approaches 
may thus have much to offer in the way of assessing cognitive 
declines. Whether EBI can be used effectively to target lost skills, 
or to facilitate maintenance of remaining skills at least in individ- 
uals with mild impairment, remains to be explored. 

The broad domain of health care would seem to hold many 
possibilities for skill improvement based on EBI. In one inno- 
vative study relevant to the critical issue of obesity (Hausman, 
Borrero, Fisher, & Kahng, 2014), researchers sought to teach 
college students to make accurate estimations of portion size 
because portion size can be directly related to caloric intake, 
especially for overweight individuals (e.g., Burger, Fisher, & 
Johnson, 2011). Nine students, five of whom were overweight 
or obese, were taught to match measuring cups filled at a given 
level (i.e., one-quarter cup, one-half cup, and one or two cups of 
a dry food) to a quantity of the same food piled on a plate, and 
vice versa (i.e., AB and BA training). They were also taught to 
match one of four measuring aids (i.e., a golf ball, tennis ball, 
baseball, and softball) to each measuring cup amount, and vice 
versa (i.e., BC and CB training). Figure 19.26 illustrates these 
training procedures. At pretest, posttest, and 1-week mainte- 
nance and generalization tests, a large plastic container filled 
with a food was presented, and each participant was asked to 
move one-half cup of the food to a plate. No cues, aids, or mea- 
suring cups were provided. The generalization tests involved 
novel foods that had not been presented during training sessions. 
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Figure 19.25 A schematic representation of the existing 
(thin solid arrows), trained (bold solid arrow), and tested 
(dashed arrows) conditional discriminations. 

Using Stimulus Equivalence Procedures to Teach Name-face 

Matching to Adults with Brain Injuries” by BJ. Cowley, G. Green, and 
B. Brounling-McMorrow, 1992, Reproduced with permission of 


John Wiley & Sons Inc. 
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Figure 19.26 A schematic representation of the trained 
conditional discriminations. 


All nine participants demonstrated improvements in the accu- 
racy of their portion estimates from pretraining to posttraining 
and for seven, those improvements were maintained 1 week 
later. Five participants demonstrated similar accuracy with at 
least one novel food in the generalization tests. Thus, a brief 
teaching intervention (50 to 90 trials) was sufficient to impact a 
potentially influential skill relevant to healthful eating practices. 
Further investigation of EBI for health-related skills is called for. 


In the College Classroom 


The benefits to be gained from programmed instructional tech- 
niques based on behavioral principles have long been noted for 
university-level instruction (e.g., Keller, 1968, 1982; Skinner, 
1968), and modern technology has only increased the feasibil- 
ity of providing course content in a more effective manner. Of 
course, many college classes or course sequences require facil- 
ity with large sets of basic facts and relations before advanced 
study is possible. Unfortunately, the rote memorization of facts 
that characterizes some common study techniques (e.g., flash 
cards) often fails to suffice for true understanding of the subject. 
Instead, comprehension of the sort at issue throughout this chap- 
ter is necessary for progress. It only stands to reason, then, that 
behavior analysts would apply equivalence-based instruction to 
teaching university-course content, extending the generality of 
the instructional approach to much more complex stimulus sets, 
a sophisticated group of learners, and new settings. 
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One early study of this type (Fields et al., 2009) sought 
to teach introductory psychology students concepts of sta- 
tistical interaction in a two-way analysis of variance, a sub- 
ject matter that has presented challenges for many learners 


Al BI 


== Kids 
154— Adults 
7 ee 
5 


1 2 3 
Days of no sleep 


Aggression was directly 
related to sleep deprivation 
for both kids and adults. For 

each level of sleep 
deprivation, a decrease in 
age produced a constant 

increase in aggression. 
Degree of aggression in kids 
and adults did not intersect at 
any level of sleep 
deprivation. 


# Aggressive Rs 


== Kids 
154— Adults 


In kids, aggression was 
inversely related to sleep 
deprivation. In adults, 
aggression was directly 
10 related to sleep deprivation. 
Age reversed the effect of 
sleep deprivation on 
aggression. Levels of 
aggression seen with kids 
and adults intersected at an 
intermediate level of sleep 
deprivation. 


# Aggressive Rs 


5 
1 = 


1 2 3 
Days of no sleep 


== Kids 


ae 


1 2 3 
Days of no sleep 


In kids, aggression was 
directly related to sleep 
deprivation. In adults, 
aggression was inversely 
related to sleep deprivation. 
Age reversed the effect of 
sleep deprivation on 
aggression. Levels of 
aggression in kids and 
adults did not intersect at 
any level of sleep 
deprivation. 


# Aggressive Rs 


== {Kids 


Aggression was directly 
15+— Adults 


related to sleep deprivation 
for both kids and adults. 
Increased sleep deprivation 
produced a larger increase 
in aggression in kids than in 
adults. Levels of aggression 
in kids and adults did not 
intersect at any level of 
sleep deprivation. 


# Aggressive Rs 


Days of no sleep 


at this level. Match-to-sample training established AB, BC, 
and CD conditional discriminations with the four sets of 
stimuli depicted in Figure 19.27, and probe performances 
documented the emergence of four classes, each with four 
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Figure 19.27 Stimulus sets used in training AB, BC, and CD conditional discriminations. Each 
column represents a stimulus set (A, B, C, D), and each row represents a potential equivalence class. 


From “Equivalence Class Formation: A Method for Teaching Statistical Interactions” by L. Fields, R. Travis, D. Roy, 
E. Yadlovker, L. de Agular-Rocha, and P. Sturmey, 2009, Reproduced with permission of John Wiley & Sons Inc. 


members. Each class included a line graph illustrating a par- 
ticular statistical interaction, text describing the interaction 
shown in the graph, the label for the type of interaction, and 
text defining the interaction type. Interchangeability between 
the members of each of the four classes (1.e., no interaction, 
crossover interaction, divergent interaction, and synergistic 
interaction) indicated comprehension of the statistical con- 
cepts, based solely on the EBI. As further evidence, the stu- 
dents completed a paper-and-pencil multiple-choice test of 
the same interaction types but with novel graphs and graph 
descriptions, both before and after the match-to-sample 
training. After completing the 90-minute training, students 
showed significant improvement in test scores, from an aver- 
age of approximately 50% on the pretest to an average greater 
than 90% on the posttest; all students earned scores corre- 
sponding to a grade of “A” or “B.” Thus, the comprehension 
that emerged from training generalized to new exemplars. In 
contrast, a matched group of control students took the test 
twice with 90 minutes in between, but did not receive train- 
ing. For these students, there was no change in test scores as 
a result of repetition, and 73% of them earned scores corre- 
sponding to a grade of “D” or “F.” 

Fienup, Covey, and Critchfield (2010) arranged an EBI 
program designed to take advantage of the generative power 
of class expansion or merger. The lesson content in this study 
was brain structures and functions, another complex subject 
matter consisting of a great number of interrelated facts. 
Volunteer undergraduate students received match-to-sample 
training with the stimuli illustrated in the top half of Figure 
19.28. All possible transitive/equivalence relations were pre- 
tested and posttested following each phase of training. The 
diagram at the bottom of Figure 19.28 represents all tested 
relations with dashed lines and all trained relations with solid 
lines. The first phase of training established AB and AC con- 
ditional discriminations, and probe tests documented the for- 
mation of four three-member classes (i.e., ABC classes, each 
including the name of a lobe of the brain and two different 
functions corresponding to that lobe). The second training 
phase taught AD and AE conditional discriminations; again, 
four three-member classes resulted (i.e., ADF classes, each 
including the name of a lobe of the brain, a brain schematic 
depicting the location of the lobe, and a deficit caused by 
damage to that lobe). Probes for class merger/expansion 
tested for the emergence of all possible relations between the 
B, C, D, and E stimuli in each class. To sum, the five-member 
classes documented by 80 different stimulus relations in this 
study were generated by teaching a subset of only 16, in 
lessons that required only about 15 minutes—a tremendous 
efficiency in teaching. 

The applications of EBI to college coursework would 
seem to be limited only by the teacher’s creativity. Indeed, 
a number of different subject matters have been targeted 
with this teaching approach. Statistical concepts have been a 
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frequent choice for instructional programs (e.g., in addition 
to those above, Albright, Reeve, Reeve, & Kisamore, 2015; 
Fienup, Hamelin, Reyes-Giordano, & Falcomata, 2011). Other 
targeted content has included concepts from research design 
(e.g., Lovett, Rehfeldt, Garcia, & Dunning, 2011; Sella, Ribeiro, 
& White, 2014; Walker & Rehfeldt, 2012), from diagnostic 
categories of disabilities (Walker, Rehfeldt, & Ninness, 2010); 
and from classes of drug names in psychopharmacology (Zinn, 
Newland, & Ritchie, 2015). Although all of these studies have 
demonstrated tremendous teaching efficiency in the number 
of emergent match-to-sample relations generated, compared 
to the number trained, some have extended that efficiency 
significantly by including generalization tests of various 
kinds. Generalization testing strategies have included probes 
for topography-based responding that was not targeted in 
training (e.g., orally naming stimuli or writing answers; 
Albright et al., 2015; Sella et al., 2014); probes presented 
in a different format (e.g., on multiple-choice paper-and- 
pencil tests akin to those used in traditional classroom set- 
tings; Albright et al., 2015; Critchfield & Fienup, 2010); novel 
stimulus presentations (e.g., Albright et al., 2015; Walker & 
Rehfeldt, 2012); and follow-up maintenance tests (e.g., 
Albright et al., 2015; Walker & Rehfeldt, 2012). 

Finally, in addition to evaluating the success of EBI 
with improvements at the level of the individual participant 
(e.g., from pretest to posttest), some studies have also com- 
pared performances across groups. As noted above, Fields et 
al. (2009) compared students who received training to those 
who did not, and found significantly stronger performances 
after training. Fienup and Critchfield (2011) compared the 
outcomes of EBI instruction to those from two different 
groups—one that received no training and one that received 
training for all possible relations (i.e., all of the relations that 
were trained as well as those that were probed for the EBI 
group). The students with no training performed poorly on 
the comparison tests; the EBI and full-training groups did 
equally well, but the EBI group had received significantly 
less training. Zinn et al. (2015) compared a group of stu- 
dents receiving EBI to each of two control groups trained on 
a randomly selected (rather than interrelated) subset of the 
EBI training and probe trials. One control group was matched 
to the EBI condition with respect to the accuracy criterion 
required for training; the other was matched with respect to 
the total number of training trials presented. Probe perfor- 
mances following equivalence training were significantly 
superior to those of the unstructured-instruction groups. 
Lovett et al. (2011) compared a group of students trained 
with EBI to a group that received a standard lecture on the 
same subject matter, presented as a video. The two groups 
performed equally well in this comparison. If EBI is to have 
widespread impact on university teaching practices, further 
comparisons across teaching formats will continue to be an 
important direction for future research. 
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Figure 19.28 The top half of the figure presents the stimuli used in training; each row represents 
a stimulus set (A, B, C, D, E), and each column represents a potential equivalence class. The bottom 
half of the figure presents a schematic representation of all trained (solid arrows) and tested (dashed 


arrows) conditional discriminations. 


From “Teaching Brain-behavior Relations Economically with Stimulus Equivalence Technology” by D. Fienup, D. Covey, and 
T. Critchfield, 2010, Reproduced with permission of John Wiley & Sons Inc. 


APPLICATIONS STEMMING FROM 
ALTERNATIVE THEORETICAL APPROACHES 
TO RELATIONAL RESPONDING 


Sidman’s (1994, 2000) theoretical account of equivalence-class 
formation holds equivalence to be a basic behavioral process, 
akin to reinforcement or discrimination in that individuals do not 
have to learn how to behave in accordance with these processes. 
Just as no one had to teach you to respond more frequently when 


your behavior produces a reinforcer, in Sidman’s view, no one 
had to teach you to produce the patterns of responding defined 
as equivalence. Rather, basic behavioral processes, like equiv- 
alence by this view, are a direct outcome of the reinforcement 
contingencies in place. In other words, arranging a three- or 
four-term reinforcement contingency will automatically result in 
equivalence performances as long as no methodological features 
constrain or interfere (e.g., by creating competing sources of 


stimulus control), at least for most human participants. Further, 
Sidman’s view is that all elements of the reinforcement contin- 
gency (e.g., for a four-term contingency, that would include the 
sample stimulus, the comparison stimulus, the response, and 
the consequence) can function as equivalence-class members 
in a given context (again, as long as no methodological issues 
interfere). In keeping with this emphasis on the reinforcement 
contingency as the basis for equivalence, the EBI literature 
reviewed thus far has highlighted many creative arrangements 
of three-, four-, and even five-term training contingencies, with 
focus on the make-up of the classes made possible by those 
contingencies. 

Alternative theoretical accounts of equivalence perfor- 
mances have also been developed since Sidman’s seminal work 
on the topic (e.g., Sidman, 1971; Sidman & Cresson, 1973; 
Sidman & Tailby, 1982), as the phenomenon has been explored 
and extended, both experimentally and conceptually. The two 
most prominent of these alternatives are naming theory, developed 
by Horne and Lowe (1996) and colleagues, and relational frame 
theory (RFT), developed by Hayes, Barnes, and colleagues (e.g., 
Hayes, Barnes-Holmes, & Roche, 2001). A complete review of 
the literature related to these accounts is well beyond the scope 
of this chapter; Chapter 20 provides a careful examination of 
work based on RFT, and the interested reader can consult the 
references on naming theory provided in this chapter. Some dis- 
cussion of the two positions is warranted here, however, particu- 
larly because of the practical strategies for producing emergent 
(or derived) behavior that follow from each theory. Thus, the 
necessarily brief overviews that follow are intended to introduce 
a few representative illustrations of application work based on 
the naming and relational frame theories, to provide the reader 
with a basis for further study of these productive directions in 
applied behavior analysis. 

One key theoretical feature shared by the naming theory 
and RFT distinguishes these accounts from Sidman’s (e.g., 
(2000). In both cases, emergent or derived responding 1s seen as 
learned behavior, rather than as a direct outcome of a reinforce- 
ment contingency. In other words, a specific learning history is 
viewed as necessary before a given contingency could give rise 
to emergent performances. The nature of the training history 
held to be necessary differs across the two accounts, provid- 
ing the basis for differences in the application approaches to be 
described. For both accounts, however, emphasis is placed on 
training that produces a generalized or higher-order operant 
class. Higher-order operants are defined in terms of general 
relations between antecedents and responses, rather than in 
terms of specific stimuli and responses, and any given instance 
of antecedent control is seen as just one example of a more 
general relation (e.g., Catania, 1998). Generalized imitation is 
considered a classic example of a higher-order operant (e.g., 
Catania, 1998; Dinsmoor, 1995). Imitation is typically viewed 
as a correspondence between the actions of a model and those 
of an observer, rather than in terms of any specific modeled 
action or response, and imitative repertoires are generalized 
when they include novel correspondences that have not been 
directly taught. To give a further feel for the central role played 
by this sort of conceptualization, manding (Catania, 1998; 
Skinner, 1957) and instruction following (e.g., Catania, 1998) 
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stand as other examples of higher-order operant classes. The 
creation of higher-order operants requires teaching a number 
of examples, or multiple-exemplar training. For instance, a 
series of different modeled actions could be presented, with a 
child’s responses approximating those of the model shaped and 
reinforced in the presence of each. After teaching a number of 
such correspondences, children have come to show appropriate 
approximations when presented with novel model actions (e.g., 
Baer, Peterson, & Sherman, 1967; Poulson, Kymissis, Reeve, 
Andreatos, & Reeve, 1991). Naming and relational framing are 
similarly viewed as higher-order operants, created by training 
many specific examples that ultimately provide for the acqui- 
sition of a more generalized class of discriminated responding. 


Naming Theory 


Naming has been suggested (Horne & Lowe, 1996) as a technical 
term for a higher-order operant in which an individual engages 
in bidirectional speaker and listener behavior with respect to a 
given object or event. In one example, a child might see a shoe, 
tact “shoe” (either overtly or covertly), hear herself say “shoe,” 
and then exhibit listener behavior occasioned by hearing herself 
say “shoe”’—perhaps by orienting to the shoe or placing her foot 
in it. The complete cycle is necessary to satisfy the requirements 
for naming in this technical sense, and naming is viewed as the 
basic unit of verbal behavior. To establish the naming oper- 
ant, each component of the cycle (e.g., listener responses, tact 
responses) needs to be directly taught (e.g., Horne et al., 2004; 
Lowe, Horne, Harris, & Randle, 2002), and with many differ- 
ent stimuli, or exemplars (e.g., shoes, dolls, dogs). Once fully 
established, however, the entire cycle could be initiated from 
any point (seeing the shoe and then saying “shoe,” or hearing 
“shoe” and then looking for the shoe, thus meeting the bidirec- 
tional dimension of the definition), and the acquisition of new 
instances of naming would require teaching only one component 
of the cycle (e.g., teaching a child to select a new toy given a 
novel name would be sufficient to produce the full naming rela- 
tion). Figure 19.29 illustrates that seeing any number of different 
shoes could occasion saying “Shoe,” and that hearing “shoe” 
could occasion listener behavior toward a number of different 
shoes, which would then occasion saying “Shoe,” and so on. 
Naming is thus viewed as classifying, so that any two or more 
items or events with the same name would be treated similarly; 
they would occasion the same tacts and thus the same listener 
behaviors (e.g., Horne, Hughes, & Lowe, 2006; Lowe, Horne, 
& Hughes, 2005). In this way, naming is seen as responsible for 
categorization, equivalence performances, and various forms of 
transfer of function (Horne & Lowe, 1996). 

Of course, a large number of the EBI programs described 
so far involved names (e.g., Sidman, 1971; Sidman & Cresson, 
1973), whether presented as auditory samples, as written sam- 
ples or comparisons, or as responses, and certainly the data 
are clear that providing stimulus names can facilitate equiv- 
alence-class formation (e.g., Randell & Remington, 1999, 
2013; Sprinkle & Miguel, 2012). Although such findings are 
consistent with naming theory, they are also consistent with 
Sidman’s theoretical position that any term in a reinforcement 
contingency, including a class-consistent response such as a 
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says Jit. hears 


“shoe” /shoe/ 


Figure 19.29 An illustration of the naming relation, where 
hearing “shoe” occasions listener behavior (e.g., orienting 
toward any of a number of different shoes), which in 

turn occasions tacting “shoe.” The naming unit could be 
initiated from any point in the cycle. 

From “On the Origins of Naming and Other Symbolic Behavior” by 

P.J. Horne and C.F. Lowe, 1996, Reproduced with permission of 

John Wiley & Sons Inc. 


name, can become an equivalence-class member. Applications 
more unique to naming theory have sought to generate emer- 
gent performances with training procedures that target each of 
the individual components that constitute the complete naming 
unit. Indeed, well-developed protocols have been developed for 
teaching each of the listener and speaker skills involved (e.g., 
Horne & Lowe, 2000; Miguel & Petursdottir, 2009). Not sur- 
prisingly, the populations involved in these applications have 
been very young children or children with intellectual disabili- 
ties, for whom higher-order naming is not a part of the existing 
repertoire. 

To illustrate, two studies compared the impact of lis- 
tener training and tact training on the emergence of intraverbal 
responses involving a native language word and a second- 
language word (Petursdottir & Haflidadottir, 2009; Petursdottir, 
Olafsdottir, & Aradottir, 2008). A total of six typically devel- 
oping 5-year-olds were involved. The stimuli included color 
photographs, native Icelandic words, and corresponding words 
from a different language (either Spanish or Italian). The chil- 
dren were able to name each picture in Icelandic at baseline, and 
training was conducted with two stimulus sets for each child. 
Two of the children received tact training only, two received 
listener training only, and two received each form of training 
for a different stimulus set. Tact training involved presenting 
a picture and teaching the child to produce its name orally in 
the second language. Listener training involved presenting three 
pictures and teaching the child to select the appropriate one 
when its second-language name was presented orally by the 
experimenter. Pretests and posttests presented each Icelandic 
word and asked the child to say its second-language name, and 


vice versa. Across the studies, both forms of training resulted 
in significant improvements from pretest to posttest, with tact 
training producing the greatest gains in emergent intraverbals 
(see also May, Downs, Marchant, & Dymond, 2016). Similar 
findings were obtained with a study of two 5-year-old children 
with autism (Grannan & Rehfeldt, 2012), in which pre- and 
post-intraverbal tests involved asking the child to produce the 
names of members of a category (e.g., body parts or vehicles). 
No correct responses were given on the pretests, but after lis- 
tener and tact training, substantial improvements in intraverbal 
skills emerged. 

Other studies have evaluated the impact of listener (Kobari- 
Wright & Miguel, 2014) and tact training (Miguel & Kobari- 
Wright, 2013) on categorization skills in young children with 
autism, a commen target in many curricula designed for this 
population and one that frequently requires extended training. 
The training stimuli used here were nine cards picturing three 
examples of each of three categories of dogs (i.e., hound dogs, 
toy dogs, and working dogs). On each trial of the categorization 
test, a picture from a particular category was presented as a sam- 
ple stimulus along with three additional pictures as comparisons, 
one from the same category and two from different categories. 
The categorization test was given twice before and twice fol- 
lowing training; the second test of each pair also required the 
child to name the sample before the comparisons were presented. 
Accuracy was at approximately chance levels on the pretests, 
and training was then conducted with all nine cards. Each child 
received either listener training, in which she learned to select the 
correct picture from three comparisons when given its category 
name, or tact training, in which she was taught to produce the 
category name orally when shown a picture, or both if necessary. 
Of the four children who received listener training, three demon- 
strated near-perfect emergent performance on the category tests 
and also on posttraining tact tests. For the fourth, neither tact 
responses nor categorizations improved; when tact training was 
added, however, accuracy was perfect on the subsequent tests. Of 
the two children exposed to tact training first, both demonstrated 
accurate listener behavior and strong performance on the cate- 
gorization tests, especially when a tact of the sample was also 
required on each test trial. Collectively, these data suggest that 
active tacting may be important for accurate categorization with 
this population, and that listener training alone can be insuffi- 
cient for generation of tacting with some children. Similar results 
were obtained in a study with two young men with moderate 
intellectual disabilities and extremely limited vocal repertoires 
(Ribeiro, Elias, & Goyos, 2010) for whom signing was the tar- 
geted skill. During listener training, a video clip of each of six 
signs being performed was presented as the sample stimulus, 
along with three picture comparisons on each trial (AB training). 
After training, both men demonstrated emergent signing (tacting) 
and emergent manding with the same signs. 

As a group, the studies described here illustrate an 
approach to producing novel performances that follows from 
the premises of naming theory. For certain participant pop- 
ulations, teaching critical components of the naming unit 
(i.e., listener behavior, tacting) can be effective in generat- 
ing repertoires indicative of symbolic relations, classes, and 
categories. 


Relational Frame Theory 


In relational frame theory (RFT), framing is viewed as the key 
higher-order operant, and it is characterized as responding to one 
stimulus in terms of another (Hayes et al., 2001). RFT empha- 
sizes that we learn many different types of relations in our lives 
(e.g., larger than, opposite of, different from), each of which can 
be considered a frame in that any pair of stimuli can function 
within it. Equivalence is thus just one of many frames (known 
as the frame of coordination in RFT terminology), although cer- 
tainly an early and important one. As noted previously, framing 
is behavior and it must be taught; multiple-exemplar training 
is viewed as the basis for learning to frame. In a classic exam- 
ple, a mother might teach symmetry by engaging her child with 
a picture book. Mom could point to the picture of a dog, say 
“dog,” and then ask, ““Where’s the dog?””—shaping, prompting, 
and reinforcing the child’s pointing to the appropriate picture. 
This same interaction would unfold for many different pic- 
tures, with the child’s symmetrical response directly trained 
in each case. Training with a number of exemplars allows for 
abstraction of the bidirectional relationship between names and 
pictures; this relation could then be applied to new examples 
without the need for further shaping, demonstrating emergent 
or derived relations, in RFT terms. To describe a performance 
as derived thus implies an active response on the part of the 
behaver—a relating or framing response that has been directly 
taught with many previously experienced exemplars. In the nat- 
ural acquisition of many frames, early exemplars are likely to be 
characterized by real physical differences; for example, initial 
examples of “larger than” training are likely to involve pairs of 
stimuli in which one is physically greater in size than the other. 
Once abstraction of the “larger than” relation has been learned, 
however, it could be applied to novel stimulus pairs, including 
those for which the physical dimension of size is unimportant; 
for example, a learner could respond appropriately after being 
told that 7 is larger than 5, or that a dime is larger in value than 
a nickel. 

Given the broad range of possible relations in our world, 
RFT introduces generic terms for the critical properties of fram- 
ing. These include mutual entailment (e.g., if A is larger than 
B, then B is smaller than A; in equivalence relations, symme- 
try would be an example of mutual entailment); combinatorial 
entailment (e.g., if A is larger than B, and B is larger than C, 
then A is larger than C, and C is smaller than A; in equivalence 
relations, transitivity would be an example of combinatorial 
entailment); and transformation of function (e.g., if A is larger 
than B, and B is good, then A would be better; in equivalence 
relations, transfer of exactly the same function from one stimu- 
lus to another in its class would be an example). Also, given the 
number of relations possible, RFT emphasizes the critical role 
of contextual stimuli to indicate exactly which relation should be 
applied in any given instance. According to RFT, with the proper 
training history and given the appropriate contexts, frames can 
be combined in virtually limitless networks, providing for 
increasingly complex derived performances. Thus, framing 1s 
viewed as the basic behavioral unit not only of language but also 
of almost all complex cognitive functioning for humans. Indeed, 
at a conceptual level, RFT has been seen as relevant to domains 
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ranging from the acquisition of basic relations (e.g., opposite, 
different) by young children to religion and spirituality (e.g., 
Barnes-Holmes, Hayes, & Gregg, 2001) to adult psychopathol- 
ogy and its treatment (especially in the form of Acceptance and 
Commitment Therapy; e.g., Hayes, Luoma, Bond, Masuda, & 
Lissis, 2006; Wilson, Hayes, Gregg, & Zettle, 2001). 

In keeping with the breadth of the theory, applications of 
RFT have been varied with respect to targeted skills and partic- 
ipant populations. A few examples here must suffice for illus- 
trating the focus on multiple-exemplar training and on emergent 
relational performances other than equivalence. (Because more 
than one relation is often targeted, RFT procedures can be a 
bit more complicated than many of those described thus far.) 
One early RFT application study (Berens & Hayes, 2007) asked 
whether multiple-exemplar training could establish a frame of 
comparison (more than and less than) with four typically devel- 
oping 3- and 4-year-old girls who had low scores on pretests 
with this frame. For each test or training trial, either two or 
three pictures were presented (e.g., a smiley face, a lightning 
bolt, and a heart), and comparative relations between the stim- 
uli were taught. For example, on a two-picture trial, the child 
might be told, “This (with a point to the smiley face) is more 
than that (with a point to the lightning bolt).” Next, the child 
was asked, “Which would you use to buy candy?” Probe tests 
were conducted with three different stimulus sets before training 
began and then after each phase of multiple-exemplar training. 
All training was conducted with a single stimulus set. The first 
training phase taught the “more than” relation with the same two 
pictures, and the picture designated as “more than” the other 
varied from trial to trial (i.e., A might be more than B on one 
trial, and B more than A on the next). Correct responses to the 
candy-buying question were reinforced across the repeated tri- 
als. The training thus provided multiple exemplars of the “more 
than” relation, and required control by the contextual stimulus 
(“more than”) and the specifics of the instruction, as opposed to 
control by the physical properties of the actual stimuli involved. 
Subsequent training phases taught “less than” with two-stimulus 
trials, “more than” and “less than” with three-stimulus trials, and 
“more than” and “less than” in the same trial (e.g., “A is more 
than B, and C is less than B”). After the teaching steps were 
mastered, all four girls demonstrated emergent comparative rela- 
tions with new stimulus sets. In many instances, accurate test 
performances were demonstrated on trial formats that hadn’t yet 
been targeted in training. That is, performance on three-picture 
trials improved after training with two-picture trials, and perfor- 
mance on mixed-relation trials improved after the “more than” 
and “less than” relations were taught independently. In short, the 
multiple-exemplar training was successful in producing emer- 
gent performances, generalized across many different stimulus 
pairs and across test arrangements. 

A similar approach was used to establish emergent 
manding for “more” or “less” in young children with autism 
(Murphy & Barnes-Holmes, 2009; Murphy, Barnes-Holmes, & 
Barnes-Holmes, 2005) and to generate derived tacting in a sec- 
ond language (Rosales, Rehfeldt, & Lovett, 2011). In the latter 
case, 3- to 4-year-old Spanish speakers were taught to select 
the appropriate picture when presented with its English name 
(i.e., listener training). When this failed to produce tacting in 
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English, additional listener training with multiple novel words 
was followed by tact training with the same words. This resulted 
in substantial improvements in emergent tacting with the orig- 
inal words after listener-training only, with evidence of main- 
tenance for many of the tacts after 1 month. Results like these 
demonstrate the value of multiple-exemplar approaches for 
language training programs, where generalized or higher-order 
responding is especially important. Other interesting work with 
multiple-exemplar training has sought to establish generalized 
perspective-taking skills in both typically developing chil- 
dren and those with autism (e.g., Barnes-Holmes, McHugh, & 
Barnes-Holmes, 2004; Heagle & Rehfeldt, 2006). After teaching 
multiple instances of each relation, children have shown appro- 
priate derived responses with new examples for relations of “I” 
as contrasted with “You,” “Here” as contrasted with “There,” 
and “Now” as contrasted with “Then” (Heagle & Rehfeldt, 
2006), all thought to be critical components of the perspec- 
tive taking that is necessary for more complex reasoning and 
for social skills (Barnes-Holmes, Hayes, Dymond, & O’ Hara, 
2001). Indeed, intensive multiple-exemplar training for four 
frames (same, opposite, more, and less) was reported to cor- 
relate with increased Wechsler Intelligence Scale for Children 
(4th ed.; WISC IV) scores (a common measure of intelligence 
quotient [IQ]) for fifteen 11- to 12-year-old typically developing 
children, and with increased scores on the Differential Aptitude 
Test (Sth ed.) for 30 typically developing 15- to 17-year-olds 
(Cassidy, Roche, Colbert, Steward, & Grey, 2016), an intriguing 
outcome certainly worthy of further study. 

With respect to adult participants, multiple-exemplar 
training approaches have been used successfully to establish 
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rather complex mathematical skills (i.e., matching standard and 
factored forms of mathematical formulas with each other and 
with graphical representations of the formulas) in college stu- 
dents (e.g., Ninness et al., 2005), where the role of contextual 
stimuli (Ninness et al., 2006) and of multiple frames (same as 
and reciprocal of; Ninness et al., 2009) has also been demon- 
strated. Certainly these results add to those of studies on equiv- 
alence-based instruction for efficient approaches to teaching 
college-level academic content. 

In another program of study, multiple-exemplar training 
has been utilized to impact the behavior of recreational and 
pathological gamblers (e.g., Dixon & Holton, 2009; Hoon, 
Dymond, Jackson, & Dixon, 2007, 2008; Zlomke & Dixon, 
2006). To illustrate, five pathological gamblers completed delay 
discounting tests in which a series of choices were made between 
an amount of money to be delivered now and an amount to be 
delivered at some future time (e.g., a choice between $500 today 
and $700 in 12 weeks). Past studies have established strong 
reliability for the discounting task (e.g., Lagorio & Madden, 2005) 
and have shown that delayed consequences hold less value for 
gamblers than for nongamblers; that is, gamblers show a greater 
degree of discounting (e.g., Dixon, Jacobs, & Sanders, 2006). 
In the present study, the discounting task was presented with the 
smaller, sooner choice always shown in purple and the larger, 
later choice shown in pink. Multiple-exemplar training was then 
conducted in a match-to-sample format as follows. On each 
training trial, either a pink or a purple square was presented as 
a contextual stimulus, followed by a sample stimulus and three 
comparisons from the A stimulus set illustrated in Figure 19.30. 
When the purple context was presented, selecting the comparison 
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Figure 19.30 Stimulus sets used in multiple-exemplar conditional discrimination training. 


From “Altering the Magnitude of Delay Discounting by Pathological Gamblers” by M.R. Dixon and B. Holton, 2009 
Reproduced with permission of John Wiley & Sons Inc. 


that was worse than the sample produced a reinforcer (e. g., given 
C+ as the sample, selecting D— rather than B— or A); when the 
pink context was presented, selecting the comparison that was 
better than the sample produced a reinforcer (e.g., given C— as 
the sample, selecting B— rather than D— or F). Next, training 
trials were conducted with stimuli from Sets A, B, and C (i.e., 
multiple-exemplar training), and then trials from Sets A to F were 
presented with no consequences, to assess control by the con- 
textual stimuli with novel stimuli. Test performances confirmed 
that the purple context controlled selections of the “worse than” 
comparison, while the pink context controlled selections of the 
“better than” choice. When the discounting task was repeated as 
a posttest, all five participants demonstrated a substantial shift in 
their discounting, such that the smaller, sooner choices shown in 
purple were selected far less often than on the pretest. In short, 
the reliable discounting outcome was changed dramatically by 
simple stimulus cues, the function of which had been established 
via a brief multiple-exemplar teaching program with “better” and 
“worse” relational frames. These results are worthy of follow-up 
given their implications for the development of assessments as 
well as possible interventions with this clinical population. 

To sum, the studies described in this section emphasize 
an approach to generating novel symbolic behavior that reflects 


SUMMARY 
Research Foundations and Core Concepts 


1. Stimulus equivalence describes a behavior analytic 
approach to understanding and establishing symbolic 
function. Our field’s interest in the equivalence phenom- 
enon can be traced back to early work by Sidman (1971; 
Sidman & Cresson, 1973) on reading comprehension. 


2. Arbitrary match-to-sample procedures are commonly 
used in equivalence studies to teach the necessary base- 
line performances and to test for emergent (or untrained) 
ones. Arbitrary match-to-sample training involves a 
discrete-trial four-term contingency arrangement with 
physically dissimilar stimuli. A trial begins with the 
presentation of a sample stimulus (e.g., Al or A2); an 
observing response to the sample produces two or more 
comparison stimuli (e.g., B1 and B2). Selection of the 
comparison designated as correct for that sample results in 
a reinforcer, while selection of any other comparison does 
not. Arbitrary match-to-sample training produces condi- 
tional discriminations, in which discrimination between 
the comparison stimuli is conditional on the sample pre- 
sented (e.g., given an Al sample, choosing B1; given an 
A2 sample, choosing B2). 


3. After teaching two interrelated conditional discriminations 
(e.g., AB and BC), probe trials are presented to test for 
emergence of the untrained relations that define equiva- 
lence. There are no programmed consequences for probe 
trials; thus, consistent patterns of responding on these trial 
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the tenets of relational frame theory. Ample evidence shows that 
multiple-exemplar training can help to establish the sorts of gen- 
eralized or higher-order forms of responding that contribute to 
complex symbolic skills. 


CONCLUDING REMARKS 


This chapter has presented a case for equivalence-based 
instruction as one of the most exciting directions in research 
and practice in applied behavior analysis. Indeed, the pow- 
erful generativity of an equivalence approach, demonstrated 
repeatedly across participant populations, procedures, set- 
tings, and stimulus sets, demands the attention of practi- 
tioners, applied scientists, basic laboratory scientists, and 
conceptual theorists alike, and ensures the sort of ongoing 
translational effort that characterizes the best of behavior 
analysis. The pattern of work to date suggests that ever more 
effective and efficient equivalence protocols will continue 
to emerge, as scientists learn to combine critical features of 
the most successful programs. There can be little doubt that 
the studies described and reviewed in this chapter represent 
only the tip of the proverbial iceberg in terms of what might 
be accomplished with EBI approaches. 


types are described as emergent or derived. The critical 
probe types include: 


* reflexivity tests, that is, tests for generalized identity match- 
ing with all training stimuli, or AA, BB, and CC; in mathe- 
matical set theory, reflexivity is defined as A = A. 


* symmetry tests for the reversibility of sample and com- 
parison functions, or BA and CB; in mathematical set 
theory, symmetry is defined as, If A = B, then B = A. 


° transitivity tests for relations between stimuli that were 
never presented together during training but that share a 
trained relation with a nodal stimulus, or AC; in mathe- 
matical set theory, transitivity is defined as, If A = B, 
and B = C, then A = C. 


* combined or equivalence tests in which both symmetry 
and transitivity are evaluated simultaneously, or CA. 


4. Positive outcomes on all probe types document inter- 

changeability among a given group of stimuli and allow 

the conclusion that an equivalence class has formed. That 
is, responses on reflexivity, symmetry, and transitivity or 
equivalence trials show that any member of the class can 
serve effectively as sample or as comparison with any other 
class member. The equivalence class consists of all such 
interchangeable stimuli (e.g., A, B, and C, in this example). 


5. The equivalence outcome is significant because: 


* Many stimulus relations are generated after teaching 
only a small number (i.e., there is an impressive “bang 
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for your buck” effect). This generativity carries tremen- 
dous implications for behavioral applications. 


The defining criteria for equivalence provide a com- 
pelling operational definition of symbolic function, 
and make possible a behavioral analysis of a range of 
complex human performances (e.g., categorization, 
semantic correspondence, creativity). 


6. The literature on stimulus equivalence is characterized by 
a true translational approach. Advances in our understand- 
ing and application of equivalence have resulted from a 
necessary and continuing interplay between basic labora- 
tory research, applied behavior analysis, and conceptual 
developments. 


7. Several key outcomes of equivalence approaches have 
contributed to the effectiveness of EBI programs in 
important ways. These include: 


¢ class formation—typically measured as the percentage 
of probe trials on which a comparison selection is con- 
sistent with equivalence, for each defining probe type. 
Such measures are often compared in a pretest, train, 
posttest design, or in a multiple baseline across stimulus 
sets experimental design. 


¢ delayed emergence—a relatively common observation 
in which equivalence performances grow stronger with 
repeated testing. 


* class expansion—increasing the size of an established 
equivalence class to include an additional member or 
members. This is typically accomplished by teaching 
a new relation between a novel stimulus and an estab- 
lished class member. Class expansion is verified by 
emergent relations between the novel stimulus and the 
rest of the established class members. 


* class merger—a combining of independent equivalence 


classes such that a larger class is established. This is typ- 


ically accomplished by teaching a new relation between 
one member of each of the classes to be combined. 
Class merger is verified by emergent relations between 
the other stimuli from the formerly independent classes. 


transfer of function—the finding that teaching a new 
function for one member of an established equivalence 
class results in the same function holding for all mem- 
bers of the class. 


contextual control—stimulus control requiring three 
levels of antecedent stimuli, such that the functions of 
the stimuli in a conditional discrimination vary depend- 
ing on the context. Contextual control training requires 
a five-term contingency. It allows for the same stimuli 
to be members of more than one equivalence class, 
depending on the context. 


Designing Equivalence-based Instruction 


8. A number of procedural variables have proven important 
for the successful generation of equivalence classes, and 
for prevention of problematic stimulus-control patterns 


10. 


11. 


that could compete with demonstrations of class forma- 
tion. These variables are considered in the context of 
making decisions about training and testing procedures. 


. In planning match-to-sample training procedures, critical 


decisions will include the number and type of stimuli to 
be used, the mode of stimulus presentation, inclusion and 
type of observing response, content of the instructions to 
be presented, the approach to balanced trial presentations, 
the necessity of special training steps, the type of training 
structure to be used, the mastery criteria for training steps, 
and the arrangement of consequences for training trials. 


In planning the testing protocol, critical decisions will 
include the composition of the probe-trial blocks, the 
arrangement of consequences within the probe-trial 
blocks, the order of probe-type presentation, and the 
criteria to be used for judging class formation. 


In addition to the standard match-to-sample training 

and testing approach to equivalence, several effective 
procedural variations have been identified that may carry 
benefits for particular populations or for certain EBI 
targets. These include: 


¢ stimulus compounds—when a two-part stimulus (e.g., a 
word presented together with a picture) is presented as the 
sample in a MTS procedure, for example, each element 
of the compound can come to function as an independent 
equivalence-class member, thus increasing the size of the 
class created by a given amount of training. 

* class-specific reinforcement—providing a unique rein- 
forcer for discriminated responses as a function of the 
antecedent’s intended class (e.g., in match-to-sample 
training, choosing B1 given Al would produce Reinforcer 
1, while choosing B2 given A2 would produce Reinforcer 
2). Class-specific reinforcers can function as members of 
an equivalence class, again increasing the size of the class 
made possible by a given amount of training. 

° three-term contingency training—simple discrimina- 
tion training is mastered more easily than conditional 
discrimination training by some learners, and can pro- 
duce equivalence-class formation. 


Applications and Generality 


12. 


14. 


The generality of the equivalence outcome has been repeat- 
edly confirmed and extended by creative applications of 
EBI across populations, settings, and instructional targets. 


. Work with children and adults with intellectual disabilities, 


as well as young typically developing children, represents 
the largest category of EBI applications. Targets have 
included language- and number-skills instruction, as well 
as other types of functional skills (e.g., music, academic 
content, Braille reading). 


Important work with adult clinical populations includes 
examples with health-related behaviors and skills impacted 
by brain injury; investigations in the field of gerontology 
are also being explored. 


15. Equivalence-based instruction targeting university course 


content has had promising results across a range of subject 
matters and modes of presentation. 


Applications Stemming from Alternative Theoretical 
Approaches to Relational Responding 


16. Naming theory and relational frame theory (RFT) provide 18 


differing accounts of the basis for novel relational respond- 
ing. Both emphasize that novel responding is due to a 
special learning history, rather than a direct outcome of a 
reinforcement contingency (as in Sidman’s view). Naming 
Theory and RFT differ, however, with respect to the nature 
of the learning history that is required. These differences 


give rise to unique applications for generating emergent or 
derived responding. 


17. For naming theory, naming is a technical term for a 
bidirectional higher-order unit of speaker and listener 
behavior that is seen as responsible for any form of clas- 
sifying, including equivalence performances. Following 


sons anusan aR SEDO 


KEY TERMS 
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from this view, applications designed to promote equiva- 
lence include instruction in the components of naming— 
listener behavior and tacting. For participants lacking 
these skills, targeting them directly can be effective in 
producing response patterns indicative of classes and 
categories. 


. For RFT, framing, or responding to one stimulus in terms 
of another, is the key higher-order operant. It is established 
by teaching many different examples of a given relation, 
which allows for abstraction and then application of the 
relation to new examples, given the proper context. RFT 
emphasizes that equivalence is just one example of the 
many relations we learn in this manner. Following from 
this view, applications designed to promote novel, or 
derived, relational responding feature multiple-exemplar 
training as the key instructional approach. A range of 
examples has provided evidence for the production of gen- 
eralized response patterns that can contribute to complex 
symbolic skills. 


class expansion 

class merger 

class-specific reinforcement 
conditional discrimination 


contextual control 


equivalence test 

exclusion 

higher-order operant class 
matching-to-sample procedure 


nodal stimulus (node) 


stimulus-control topographies 
stimulus equivalence 
symmetry 

training structure 


transfer of function 


derived stimulus relations reflexivity 


emergent stimulus relations simple discrimination 


equivalence-class formation 


s 


MULTIPLE-CHOICE QUESTIONS 


1. Interchangeability of function can be assessed by 


a. Reflexivity 
b. Symmetry 
c. Transitivity 
d. All of the above 
Hint: (See “Transfer of Function”) 

2. In simple-to-complex testing protocol, probe types are 
introduced sequentially 
a. Beginning with symmetry followed by transitivity 
b. Beginning with transitivity followed by symmetry 
c. Beginning with reflexivity followed by transitivity 
d. Beginning with reflexivity followed by symmetry 


Hint: (See “Test Order’) 


A LS eS oo x “ 


transitivity 


simple-to-complex testing protocol 


3. Which of.them is a critical property of framing as 
proposed in relational frame theory? 


a. Mutual entailment 
b. Transformation of function 
c. Combinational entailment 
d. All of the above 
Hint: (See “Relational Frame Theory”) 
4. In the study by Sidman, 1971; Sidman & Cresson, 1973, 
three member equivalence classes ( : 
. written word), can be increased to include 


a potentially number of members based on 
class and class 


a. Spoken word, picture, unlimited, expansion and merger 


b. Touch, sound, unlimited, contraction and demerger 
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2. 


ESSAY-TYPE QUESTIONS 
1. 


c. Touch, sound, limited, contraction and merger 
d. Spoken word, picture, limited, expansion and demerger 


Hint: (See “Class Expansion and Class Merger” 


. In the EBI program designed by Fineup, Covey and 


Critchfield (2010), to document five-member classes, how 
many subsets were used? 


a. 16 subsets produced 80 different stimulus relations in 
15 minutes 


b. 80 subsets produced 16 different stimulus relations in 
15 minutes 


c. 80 subsets produced 15 different stimulus relations in 
25 minutes 


d. 15 subsets produced 80 different stimulus relations in 
25 minutes 


Hint: (See “In the College Classroom’”’) 


Describe the training structures used in equivalence-based 
instructions. 


Hint: (See “Training Structures”’) 


Define higher-order operants with an example. 


Hint: (See “Applications Stemming from Alternative 
Theoretical Approaches to Relational Responding”’) 


NOTES 


il 


N 


As we will see in the section on Relational Frame Theory, equating these 
terms can be problematic, because the latter carries important theoreti- 
cal implications about the basis for responding. Thus, “emergent perfor- 
mances” is often preferred as a theory-neutral description. 

Exemplary experimental analyses of the complex behavior patterns men- 
tioned have been conducted by Augustson and Dougher, 1997; Augustson, 
Dougher, and Markham, 2000; Challies, Hunt, Garry, and Harper, 2011: 
Dougher, Augustson, Markham, Wulfert, and Greenway, 1994; Griffee 
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. Methods used for presenting probe trials is/are: 


a. Massed Testing 

b. Interspersing 

c. Both of the above 

d. None of the above 

Hint: (See “Composition of Probe-trials Blocks”’) 


. Component Simple Discriminations training approach 


is based on the recognition that accurate conditional 
discrimination performances require: 


a. Conditional control by the sample 
b. Successive discrimination between the sample stimuli 


c. Simultaneous discrimination between the comparison 
stimuli 


d. All of the above 


Hint: (See “Instructional Programming”) 


RCE CREEE HERNSDO RO TH SESS DLS OS SSS ANSOS SE SNSIO DEO POE ESTE SIE SNODNOO ESRI SAN OO EISEN NICS OSES ERS EDO OSSD UIRIEBENDODO HESSEN CTS SOSDONIOIO 


3. Discuss transfer of function. Elaborate on the negative 


impact of equivalence-class formation on an individual’s 
functioning. 


Hint: (See “Transfer of Function”) 


and Dougher, 2002; Guinther and Dougher, 2010; Hayes, Thompson, and 
Hayes, 1989; Kohlenberg, Hayes, and Hayes, 1991; Moxon, Keenan, and 
Hine, 1993; and Watt, Keenan, Barnes, and Cairns, 1991. 


. EBI typically targets at least three classes. The example here is simplified 


for illustrative purposes. 


. With many populations, probes for reflexivity are omitted, primarily 


because identity matching can be assumed and is likely the result of 
experiences prior to the EBI training. 
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Engineering Emergent Learning 
with Nonequivalence Relations 


Thomas S. Critchfield and Ruth Anne Rehfeldt 


LEARNING OBJECTIVES 


¢ Understand what nonequivalence relations are and why do they matter. 


e Discuss the vocabulary of nonequivalence relations. 
e Explain some types of nonequivalence relations. 


e Discuss the theoretical foundations of nonequivalence relations. 


self-concept, and intellectual development. 


Understand the connections between nonequivalence relations and psychological constructs: knowledge structures, 


e Understand the connection between derived stimulus relations and general well-being. 


hapter 19 explained how equivalence (“sameness”) 

relations! function and how they can be harnessed 

to generate valuable new learning. But equivalence is 
only one beast in a menagerie of ways in which stimuli can 
be symbolically and behaviorally linked. Nonequivalence 
relations—those in which stimuli are related on some basis 
other than “sameness”—are a big part of how people make 
sense of, and function effectively in, the world around them. 
We believe, therefore, that every behavior analyst needs a solid 
working understanding, and mastery, of both the general concept 
of stimulus relations and the specific domain of nonequivalence 
relations (e.g., Critchfield, 2018; Critchfield, Barnes-Holmes, & 
Dougher, 2018). 

Our discussion of nonequivalence relations is intended to 
complement Chapter 19’s discussion of equivalence relations. 
To properly understand nonequivalence relations, you need 
grounding in stimulus relations generally. By mastering the con- 
ceptual foundations of equivalence relations and the procedures 
that have been used to establish them, you prepare yourself to 
benefit from the present chapter. 

This chapter is divided into six sections. The first section 
explains generally what nonequivalence relations are and why 
they matter. The second section presents basic vocabulary for 


The authors thank Bryan Roche, Dan Fienup, and Daniel Assaz for providing 
helpful feedback on a draft of this chapter. 


talking about nonequivalence relations and phenomena asso- 
ciated with them. This is necessary because the vocabulary of 
equivalence relations is not always adequate for describing non- 
equivalence relations. The third section describes selected types 
of nonequivalence relations and illustrates their everyday impor- 
tance. The fourth section introduces key theoretical concepts 
relevant to nonequivalence relations. The fifth section discusses 
the role of nonequivalence relations in big-picture psychological 
constructs that often are of interest in the practical world. The 
final section explains how derived stimulus relations, including 
nonequivalence relations, provide a basis for enhancing general 
well-being in ways that are quite different from what is typical 
in applied behavior analysis (ABA). 

Although this chapter highlights many opportunities to 
incorporate nonequivalence relations into applied interventions, 
we will say little about practical tactics of implementation, for two 
reasons. First, because the behavioral repertoires of interest can be 
complex, procedures for creating them often are as well. Before 
developing protocols to teach nonequivalence relations, the inter- 
ested student may need to digest other sources that provide the 
required detail (e.g., Rehfeldt & Barnes-Holmes, 2009). Doing 
so, however, requires conceptual skills that develop with prac- 
tice. Thinking about derived stimulus relations is a detail-heavy 
logical process that does not come naturally to many people, and 
the present chapter is an introduction to this process. Second, in 
many respects the mechanics of building equivalence relations, 
as discussed in Chapter 19, also apply to the construction of 
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nonequivalence relations (e.g., Luciano et al., 2009). For example, 
both depend on directly teaching baseline relations consisting of 
overlapping conditional discriminations. In both cases, baseline 
relations must be learned before untaught relations can arise, and 
two stimulus classes with a shared member can merge to form a 
larger class—and so forth. Chapter 19 is a guide to these basics, 
and the present chapter builds on it by expanding the range of 
derived stimulus relations that can be the focus of intervention. 


WHAT ARE NONEQUIVALENCE RELATIONS? 
WHY DO THEY MATTER? 


To explain nonequivalence relations, we’ll begin by expand- 
ing on a general point from Chapter 19: Most stimulus rela- 
tions are arbitrary relations (Hayes, Fox, et al., 2001), stimuli 
that “go together” not because they are physically identical or 
because a law of the universe demands this, but rather because 
social-verbal reinforcement contingencies teach people to relate 
them in a certain way. Building on an example presented in 
Figure 19.7, there is no inherent reason that a token exchange- 
able for goods and services has to be called a dime, to be a disk 
of silver-colored metal, or to have its value represented with 
“10¢.” Humans have arbitrarily created these symbols and the 
relations among them (such as the equivalence class in Figure 
19.7). Had the token been a lump of pink salt and been called 
armadillo, for instance, the equivalence class in Figure 19.7 
would look different but function the same. 

Because relations among stimuli are arbitrary, the vari- 
ety of possible relations is limited only by the repertoires and 
experiences of humans. People create all kinds of symbols and 
devise many different ways to compare and contrast them. How 
they do this is a core feature of how they understand the world, 
as each bit of the world is given meaning through the other bits 
of the world to which a person’s experiences connect it. Con- 
sider the word beer, which is an arbitrary printed symbol for a 
particular beverage. The behavioral functions of beer are many 
and nuanced. To get a sense of this, think about what beer means 
to you by answering the questions below (adapted from Barnes- 
Holmes, Hayes, Dymond, & O’Hora, 2001): 


¢ What are other names for beer? 
e¢ What are the attributes of beer? 
¢ What is beer composed of? 

e What is beer similar to? 

¢ Where is beer found? 

¢ What is beer different from? 

¢ What is beer an example of? 

¢ What does beer include? 

¢ What is beer better than? 

e What does beer do? 

e What came before beer? 


Notice that all of these questions demand an active response; how 
you understand beer is evident in how you respond to it. Notice, 
too, that all of the required responses involve relating the target 
word to something else. The behavioral functions of beer are 
inherent in the things to which people relate it. 

Figure 20.1 provides a few sample answers to the beer 
questions. The resulting diagram is reminiscent of the cognitive 


psychology concept of a semantic network, a hypothetical con- 
struct intended to model how bits of knowledge are organized in 
long-term memory.” Based on studies using a variety of meth- 
ods (such as free association, in which people are presented 
with a word and asked for the first word that comes to mind), 
many cognitive researchers have concluded that bits of stored 
knowledge connect to one another through shared meaning 
(e.g., Collins & Loftus, 1975). 

The semantic network construct is worth mentioning 
partly because it predates the earliest research by behavior 
analysts on derived stimulus relations (Sidman, 1971), and thus 
psychologists outside of behavior analysis tend to be famil- 
iar with it. Also, despite the fact that behavioral psychology 
and cognitive psychology are often portrayed as intellectual 
archenemies (Uttal, 2000), the semantic network construct 
incorporates at least three observations with which behavior 
analysts agree (see Hayes, Barnes-Holmes, & Roche, 2001; 
Reese, 1968; Sidman, 1994). First, as Figure 20.1 suggests, 
people relate various “bits of the world” in a Jot of different 
ways. Second, the relationships of interest are forged through 
experience, such that each individual will have a relational net- 
work as unique as her learning history. Third, the relationships 
of interest are, in essence, the fundamental units of meaning, 
because no “bit of the world” is understood separately from its 
connection to others. A person with no beer experiences cannot 
answer any of the questions listed above. That person does not 
know the meaning of beer. 

Behavior analysts believe they have improved on the 
semantic network concept in two ways. First, cognitive psy- 
chologists assume that “bits of knowledge” and the relation- 
ships among them are somehow represented (stored) inside the 
brain (Collins & Loftus, 1975). Behavior analysts say instead 
that each of us learns various ways of responding to stimuli 
like beer, and two stimuli are related if the way we respond 
to them is in some way intertwined (Critchfield et al., 2018: 
Reese, 1968). This is a network of behavior, not of knowledge.* 
Second, although most observers agree that relational networks 
arise from individual experience, cognitive psychologists have 
devoted more effort to describing the characteristics of already- 
formed networks than to understanding their formative expe- 
riences. By contrast, behavior analysts have typically tried to 
understand relational networks by studying their acquisition 
(Zentall, Galizio, & Critchfield, 2002). Because applied inter- 
ventions exist to engineer changes in behavior, this approach, 
which focuses on experiences that create relational networks, 
is more informative than one that seeks to describe the features 
of relational networks that are the product of unspecified prior 
learning (Critchfield & Fienup, 2008). 

As Chapter 19 explained, stimuli in the relational net- 
works called equivalence classes are united by “same as” rela- 
tions. For instance, beer has several synonyms (e.g., cold one, 
brewski, and draft) and together these stimuli may form an 
equivalence class. As Figure 20.1 indicated, however, beer can 
be important in many ways other than what it is. In fact, one 
can fully understand beer only by knowing what it is not, what 
it is similar (but not identical) to, what it’s an exemplar of, what 
kinds of beer exist, and so on. This is the essence of nonequiv- 
alence relations. 
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Figure 20.1 A variety of things to which beer is related, illustrating the wide variety of 
stimulus relations. Except for the “same as” example, all are nonequivalence relations. 


In distinguishing nonequivalence relations from equiva- 
lence relations, it is important to note that the former some- 
times have been included in behavioral research without proper 
acknowledgment. Figure 20.2 illustrates by reference to a study 
in which college students learned neuroanatomy concepts 
(Fienup, Covey, & Critchfield, 2010). As part of the lesson, stu- 
dents saw the term frontal lobe as sample stimulus and, from 
several pictures of brains, learned to select the one on which 
the frontal lobe was indicated by an arrow. With frontal lobe 
as the sample stimulus, the students also learned to select 
impulsiveness from among several alternatives as a symptom 
caused by damage to this lobe. The lesson was described as 
equivalence-based instruction, but it did not really involve 
“same as” relations. Rather, it involved a spatial relation (where 
the brain structure called frontal lobe is located) and a causal 
relation (what symptom is produced by damage to the brain 
structure called frontal lobe). 

In one sense, it doesn’t matter what Fienup and col- 
leagues called the stimulus relations in Figure 20.2, because 
their students both mastered what was directly taught and 
demonstrated untaught relations (e.g., when shown the location 
of the frontal lobe, they could indicate that impulsiveness was 
caused by damage to that area). This was possible because the 
match-to-sample procedures used in the study gave students 
limited options for how to respond, so they chose compari- 
son stimuli that were related to the sample in some fashion.’ 
Difficulties might have arisen in a different kind of test. For 


instance, imagine that the students were shown a picture of 
the frontal lobe and asked to say its name. A student who 
answered impulsiveness (treating the brain picture, frontal 
lobe, and impulsiveness as “the same”) would get the question 
wrong. What a person “needs to know,” therefore, depends on 
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Figure 20.2 A neuroanatomy stimulus class that was 
established in college students. The published report called 
this an equivalence class, but it is more accurately described 
as a nonequivalence class. 

Based on Figures 1 and 4 of “Teaching Brain-Behavior Relations 
Economically with Stimulus Equivalence Technology,” by D. M. Fienup, 
D. P. Covey, and T. S. Critchfield, 2010, Journal of Applied Behavior 
Analysis, 43, 19-33. 
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contingencies of reinforcement that often involve nonequiva- 
lence relations (for more on this, see the “Theoretical Founda- 
tions” section). In fact, nonequivalence relations appear to be 
at the root of many abilities thought of as representing “deep 
understanding,” “higher thought,” or “complex cognition” 
(Hayes et al., 2001), a point on which we will comment later. 


THE VOCABULARY OF 
NONEQUIVALENCE RELATIONS 


Much of the vocabulary coined to describe equivalence rela- 
tions is too narrow to describe nonequivalence relations, so more 
inclusive alternatives have been proposed (Hayes, Fox, et al., 
2001). Recall from Chapter 19 that stimulus equivalence classes 
are defined by relations of “sameness.” Two emergent mani- 
festations of “sameness,” symmetry and transitivity (Sidman, 
1994; see also Chapter 19),> are illustrated in the top left panel 
of Figure 20.3 as part of how three words (overdue, tardy, and 
late) come to “mean the same thing” and are treated by a learner 
as interchangeable. By emergent we mean relational respond- 
ing that experience does not directly teach, but that arises as an 
indirect function of experience (Chapter 18 similarly referred to 
generative effects). Emergent relations also are called derived 
relations, and this is the term we will use moving forward. 


Entailment 


Symmetry is when, after learning to respond to stimulus B as if 
it were stimulus A, a learner spontaneously can do the reverse. 
Symmetry implies mutual “sameness.” For example, in the top 
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left panel of Figure 20.3, a learner who is shown overdue and 
taught to identify tardy as “the same” should also be able to 
identify overdue as “the same” when given tardy. Now con- 
sider the nonequivalence relation shown in the top right panel 
of Figure 20.3. A learner is shown early and taught to identify 
tardy as “opposite.” The reverse of this directly taught relation 
is expected to emerge spontaneously: Shown tardy, the learner 
should be able to spontaneously identify early as “opposite.” 
Because not all derived relations involve “same,” the reverse of 
a learned relation can be of a different type than the learned rela- 
tion. For instance, if a learned relation is that 25¢ is “more than” 
10¢, the reverse relation will be that /0¢ is “less than” 25¢. The 
more general term mutual entailment applies to any derived 
bidirectional relation in which one “direction” has been directly 
learned (Hayes, Fox, et al., 2001).° Symmetry is a special case 
of mutual entailment, but not all mutual entailment is symmetry. 

Transitivity is when a learner can spontaneously treat two 
stimuli as interchangeable (“same”) because they are related 
indirectly through “sameness” with a common third stimulus. 
In the top left panel of Figure 20.3, a learner who is shown tardy 
should be able to spontaneously identify late as “the same,” and 
vice versa. Now consider the nonequivalence relation in the top 
right panel of Figure 20.3. The derived relation between tardy 
and /ate (“same’’) is different from that between these words 
and their common associate, early (“opposite”). Tardy and late 
qualify as “same”—but not because they are both “the same” 
as a common third stimulus. Rather, in a case of “the enemy 
of my enemy is my friend,” tardy and late both are opposites 
of the same word. The general term combinatorial entailment 
applies to any derived relation involving stimuli that are “linked” 


Figure 20.3 Relationships between terms used 

to describe derived relations in equivalence and 
nonequivalence stimulus classes. Top: Mutual 
entailment and combinatorial entailment as 
replacements for symmetry and transitivity, 
respectively. Bottom: Transformation of function 

as replacement for transfer of function. Emoticons 
represent emotion functions (fear and happiness); 
those in black are acquired through direct experience 
and those in gray are the result of transformation of 
function. 
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via common associates (Hayes, Fox, et al., 2001). To be more 
precise, combinatorial entailment is the relation involving 
two stimuli that both participate in mutual entailment with 
some common third stimulus. Transitivity is a special case of 


combinatorial entailment, but not all combinatorial entailment 
is transitivity. 


Transformation of Function 


Another defining feature of equivalence classes, as described 
in Chapter 19, is transfer of function. Chapter 19 noted that 
equivalent stimuli come to function in the same manner (control 
the same responses), and this applies to any new behavioral 
function acquired by any of the stimuli in a class. For instance, 
Dougher and colleagues showed that when classical condition- 
ing made one stimulus in an equivalence class fear-eliciting, 
all of the other stimuli in the class became fear-eliciting as 
well (Dougher, Augustson, Markham, Greenway, & Wulfert, 
1994; see also Dymond, Schlund, Roche, & Whelan, 2014). The 
bottom left panel of Figure 20.3 illustrates this effect. Imagine 
that a sensitive student is one day harshly reprimanded for 
being tardy to class. This scares her and makes the word tardy 
a fear-eliciting stimulus. If tardy is part of the equivalence class 
shown in Figure 20.3, then words like overdue and late will 
also come to elicit fear. “Transfer” of function in this case is the 
propagation of a single behavioral function across stimuli that 
are ‘“‘the same” (an equivalence class). 


Now consider the nonequivalence class in the bottom right 
panel of Figure 20.3. When tardy becomes fear-eliciting, so does 
late, which is “the same” as tardy. But it would be illogical to 
expect early, the “opposite” of tardy, to become fear-eliciting, 
and research in fact suggests it should not (e.g., Dymond & 
Barnes, 1995, 1996). In this example, early now elicits pleasant 
responses that are “opposite” to fear. Thus, the function of tardy 
does not “transfer” to early but, within the constraints of the rela- 
tion that unites tardy and early, it does influence the new function 
that early acquires. The general term that subsumes these kinds 
of effects is transformation of function, which can be defined as 
the behavioral function of one stimulus changing systematically 
due to the change in function of another stimulus in the same 
class (Hayes, Fox, et al., 2001). The transfer of function seen in 
equivalence classes is a special case of transformation of func- 
tion, but not all transformation of function is transfer of function. 

The critical thing about transformation of function, as 
illustrated in Figure 20.3, is that, for a given stimulus, alter- 
ation of behavioral function will be consistent with the type 
of relation that unites the stimulus with others. This incorpo- 
rates many possibilities, including that stimuli nor united via 
“same” relations can all acquire the “same” behavioral func- 
tion, just as happens in equivalence classes. Box 20.1 describes 
an example. Note that the stimuli in this example are related 
temporally, and there is nothing inconsistent in responding 
the “same” way to stimuli regardless of whether they happen 
“before” or “after.” 


BOX 20.1 


Transformation of Function in a Nonequivalence Class: A Clinical Example 


Shawn was a high school student diagnosed with “school pho- 
bia”’ He was rather shy and did not like public attention, but 
had always enjoyed going to school—until one day a teacher 
lost her temper and began shouting at him. This frightened 
Shawn and humiliated him in front of the whole class. In a 
likely example of classical conditioning, Shawn began to fear 
his teacher. It was the shouting that had frightened him, but 
now just glimpsing the teacher upon entering the classroom 
was enough to cause him to shake and perspire and feel an 
awful tightness in the pit of his stomach. 

Soon, without any additional unpleasant experiences, 
Shawn also began to fear the classroom itself. Just viewing 
the other students in their desks from the hallway—even if 
the teacher was not present—caused Shawn to experience the 
same unpleasant symptoms. Over time the symptoms were 
caused as well by seeing the outside of the school building; 
by seeing the school’s sign at the start of a long, curvy drive 
that led up to the school building; and by hearing the sound 
of the bus that brought Shawn to school. Before long, Shawn 
was refusing school entirely, including by leaving the house 
early in the morning to hide in the neighborhood until after 
his bus had departed for school. Note that this is not a case of 
stimulus generalization, involving shared behavioral functions 
among stimuli that are physically alike (Chapters 17 and 30). 
The stimuli Shawn feared are as dissimilar as can be. This 


probably also is not a case of higher-order classical condition- 
ing (in which a neutral stimulus, having been paired with a 
conditioned stimulus, can elicit the same response as the condi- 
tioned stimulus); responding based on this kind of conditioned 
stimulus tends to be unreliable (Kehoe & Macrae, 1998). 

Figure 20.4 summarizes how the stimuli were tempo- 
rally related in Shawn’s experience of typical school mornings. 
First, Shawn heard the bus approaching his bus stop. The bus 
picked him up and traveled past the school sign and up to the 
front of the school building, where Shawn disembarked and 
proceeded to his classroom. Finally, he entered the classroom 
and encountered the scary teacher. The various stimuli, there- 
fore, we related in a temporal hierarchy involving what “hap- 
pens before” and what “happens after.” 

Shawn’s condition likely resulted from the stimuli in 
the temporal hierarchy being transformed, through direct or 
indirect association with the scary teacher, to elicit fear. This 
temporal hierarchy is not an equivalence class. The bus, for 
instance, isn’t the “same as” the teacher, because normally we 
do not respond in the same way to buses and teachers (we 
speak to teachers and ride on buses; confusing these behavioral 
functions would be awkward, to say the least). 

How stimuli in a class are functionally transformed 
depends on the nature of the stimulus relations the class incor- 
porates. In Shawn’s case, nothing in the temporal hierarchy 
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conflicts with a behavioral function of fear. You can be equally 
afraid of things that “happen before” and “happen after,’ and 
so all of the stimuli acquire the same function. But other kinds 
of relations create other kinds of function transformations. 
For example, imagine that posted right next to the Sidman 
High School sign was a second sign indicating the way back 
to Shawn’s home. The two signs would be related as oppo- 
sites (discussed further in the next section). One is about get- 
ting from home to school, and the other is about getting from 


Sidman 


High School 


SOME TYPES OF NONEQUIVALENCE 
RELATIONS 


We now identify, and provide examples of, a few basic types 
of nonequivalence relations. All are relevant to everyday expe- 
rience, but we do not mean to suggest special importance for 
any of the specific relations that we describe. When harnessing 
nonequivalence relations for applied interventions, one does not 
start with a specific type of relation in mind. Rather, the job 
is to identify specific problems that need solving, and to tar- 
get the things that people do that require a functional analysis 
(e.g., Hayes et al., 2001; Rehfeldt & Barnes-Holmes, 2009). The 
stimulus relations that are implicated will vary from problem 
to problem, and may even include relations on which no pre- 
vious research has been conducted. The onus thus falls on the 
applied behavior analyst to determine what relations are relevant 
in any specific case and to figure out how to integrate them into 
an intervention protocol (Rehfeldt & Barnes-Holmes, 2009). 
The underlying conceptual analysis can be the most laborious 
part of behavior programming involving derived stimulus rela- 
tions, but with both equivalence and nonequivalence relations 
it is critical to precisely define what needs to be directly taught 
and what untaught relations are logically expected to emerge 
(e.g., Critchfield & Twyman, 2014). Becoming familiar with 
several different types of relations will aid in this process. 


Distinction Relations 


Distinction relations involve responding to stimuli in terms of 
their differences. In some distinction relations, there is a well- 
defined basis of stimulus difference. For example, on a dimension 
of quantity, “many” and “few” are extremes (or at least highly 
divergent values). Figure 20.5 provides a highly simplified exam- 
ple of a procedure for teaching many/few relations to children 
(based on Barnes-Holmes, Barnes-Holmes, Smeets, Strand, & 
Friman, 2004). Here the stimuli are several different types of 


Figure 20.4 The temporal-sequence relations among stimuli that became fear-inducing to a school-avoidance student. 


school to home. Within this relation of opposites, you cannot 
fear both stimuli. Rather, to remain consistent with the under- 
lying relation of opposites and the learning history that created 
it, if you fear one sign, then in the presence of the other you 
should feel the opposite of fear (approach? joy?). The general 
point is that, within nonequivalence classes, transformation of 
function can involve either the same or different behavioral 
functions, depending on the nature of the specific relations 


that define the class. 
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tokens,’ and the children learn that these tokens can be exchanged 
for “many” sweets versus “few” sweets. Relations are taught using 
a verbal procedure.® Thus, to teach AB relations, the teacher says, 
“This token [points to A] can buy many sweets. This token [A] is 
opposite to that token [points to B]. Which token would you use 
if you wanted to buy as many [or few] sweets as possible?” A cor- 
rect answer is reinforced. To teach BC relations, the teacher says, 
“This token [points to B] is opposite to that token [points to C]. 
Which token would you use if you wanted to buy as many [or few] 
sweets as possible?” In the resulting stimulus class, the function 
of stimulus C should transform so that it becomes a “many” token 
that is “the same” as A. Children exposed to this kind of teaching 
in fact showed the expected derived relations. They were able to 
mand “more” and “less” using the coins as symbols. 

The derived stimulus relations literature tends to divide 
distinction relations into two subtypes called opposition and 
comparison relations, but we are aware of no firm definitions 
of these subtypes. A reasonable way to think of comparison 
is as a dimensional distinction relation that does not involve 


©, 
Figure 20.5 A distinction relation. 


Based on “Establishing Relational Responding in Accordance with More- 
than and Less-than as Generalized Operant Behavior in Young Children, ” 
by Y. Barnes-Holmes and colleagues, 2004, International Journal of 
Psychology and Psychological Therapy, 4, pp. 531-538. 
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“opposites” —but we know of no ironclad rule for deciding when 
a relation involves “opposites.” Sometimes, stimuli that we call 
“opposite” define endpoints or extremes on some theoretically 
measurable dimension: “Beautiful” and “ugly” are extremes on 
a dimension of attractiveness, and “hyperthermia” and “hypo- 
thermia” are extremes on a dimension of body temperature. As 
Box 20.2 illustrates, other “opposites” reflect mutually exclu- 
sive instances for which there are no intermediate values. 

Recalling that arbitrary relations are defined by the 
social-verbal reinforcement contingencies that create them, the 
simplest guide to “opposite” relations may be that often you will 
know one when you see it. Still, the distinction (pun intended) 
between opposition and comparison is not always clear cut. For 
instance, is a many versus few relation (Figure 20.5) really a 
case of opposites (“extremes on some dimension’), as Barnes- 
Holmes et al. (2004) described the relations in Figure 20.5? Or 
is this a comparison relation (after all, “more” is not “most,” and 
“few” is not “least”)? We are, frankly, unsure, so we recommend 
the following perspective to students who are just learning about 
distinction relations. If good practical reasons exist to teach a 
particular relation, don’t worry too much about what to call it. 
Instead, expend your energy determining what baseline relations 
need to be taught to make your desired derived relations a logi- 
cal outcome (see Luciano, Becerra, & Valverde, 2009). 


Spatial and Temporal Relations 


Spatial relationships involve the position of a stimulus in refer- 
ence to the learner or some other stimulus: above versus below, 


left versus right, in front of versus behind, and so on. Temporal 
relationships involve the relative position of stimuli (or events) 
in time (e.g., before versus after, per Box 20.1 and Figure 20.4). 
Spatial and temporal relations are inherent in academic subjects 
like geography and history, respectively, and these relations 
are obvious components of everyday nonacademic experience. 
A child who does not respond appropriately to the parent 
instruction “Please stand behind Tony” or “Eat your ice cream 
after you finish your peas” may soon encounter difficulties. 
More serious consequences await someone who cannot respond 
appropriately to the statement “Behind you! An axe murderer!” 

We are not aware of any applied studies that focus on con- 
structing derived spatial or temporal relations in “pure” form, 
but several laboratory studies lay out basic procedures that 
might be translated for applied use (Brassil, Hyland, O’ Hora, 
& Stewart, 2018; May, Stewart, Baez, Freegard, & Dymond, 
2017). Spatial and temporal relations also are components in 
complex deictic relations, which involve relationships between 
the self and the world, and which have been the focus of con- 
siderable research, as we will describe in the “Nonequivalence 
Relations and Big-Picture Psychological Constructs” section. 


Causal Relations 


Causal relations are if-then relations. Making accurate causal 
attributions is an important part of negotiating the everyday 
world, as illustrated by the prevalence of faulty causal attributions 
in a variety of clinical problems (Curtis, 1989; Hayes et al., 2001; 
Matute & Miller, 1998: Torneke, 2010). If-then relations also 


BOX 20.2 


Opposite Relations in Higher-Order Mathematics 


Opposite relations are important to many academic areas of 


study in the form of either-or categories. Chris Ninness and 
his colleagues used derived relational procedures to teach col- 
lege students trigonometry concepts that included cosine and 
secant, which are reciprocals (Ninness et al., 2009). According 
to the rules of mathematics, an expression can be cosine or 
secant, but nothing is intermediate to these two cases. 

Figure 20.6 summarizes some of the relations that the 
students mastered. An interesting feature of this work is that, 
consistent with what many classrooms require, the learners 
had to construct a response rather than simply point to a cor- 
rect stimulus. For instance, shown an example of a cosine (A), 
they had to write it in reciprocal form (B). This opposite rela- 
tion was combined with several “same” relations. Given the 
reciprocal (B), students had to write the equivalent formula 
for secant (C). Given the secant, they had to construct a graph 
representing the correct function (D). 

Once the baseline relations had been learned, students 
demonstrated both “same” and “opposite” derived relations. 
These included the converse of the baseline relations (mutual 
entailment). For instance, shown the reciprocal of a cosine, 
they could write the cosine (opposite). Shown a secant, they 
could write the corresponding cosine reciprocal (same). Also 


demonstrated were combinatorially entailed relations between 
stimuli that were not directly associated during training. For 
example, shown a graphed function, the students could write the 
relevant cosine formula (opposite) or cosine reciprocal (same). 
These outcomes generalized to many novel instances, showing 
that the lesson produced a general skill rather than simply teach- 
ing the students about specific examples of cosine and secant. 
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Figure 20.6 A simplified version of same and opposite 
relations that were established in a college-level 
trigonometry lesson. 

Based on “Constructing and Deriving Reciprocal Trigonometric Relations: 
A Functional Analytic Approach,” by C. Ninness and colleagues, 2009, 
Journal of Applied Behavior Analysis, 42, pp. 191-208. 
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are a central feature of understanding and doing science. Just as 
for other kinds of nonequivalence relations, causal relations can 
either define the structure of a stimulus class or define the behay- 
ioral function through which stimuli in a class are transformed. 
We are aware of no research on either phenomenon, but in the 


former case it seems likely that stimuli united by overlapping 
causal relations (e.g., if A, then B; if B, then C) could cohere into 
a stimulus class. In the latter case, Box 20.3 illustrates how, for 
a phenomenon of potential clinical importance, the stimuli in a 
relational class can be transformed in terms of causal attributions. 


BOX 20.3 


Learned Helplessness and Transformation of Function 


Many clinical conditions are characterized by negative 
self-evaluations (Curtis, 1989) that a behavior analyst might 
describe as behavior-controlling rules (Torneke, 2010; see 
also Chapter 18’s treatment of rule-governed behavior). For 
instance, depressed patients may be heard to remark, “I’m no 
good at anything” or “Why should I try? I'll just fail” (Peterson 
& Bossio, 1989). According to learned helplessness theory, 
these self-evaluative rules are sometimes acquired under con- 
ditions of specific unpleasant experiences, during which the 
individual’s behavior is, by definition, ineffective. Then they 
begin to appear in new situations, where they discourage effec- 
tive action (Abramson, Seligman, & Teasdale, 1978). 

To put it another way, self-evaluative rules can inter- 
fere with behavior that could generate important reinforc- 
ers (Torneke, 2010). For example, imagine that you are the 
victim of a mugging. You’re walking down the street one 
night when three people jump out of nowhere, punch you 
several times, and rob you. The experience is very frightening 
and, importantly, apparently unpreventable. You were in the 
wrong place at the wrong time, and nothing you did could 
have prevented you from being overpowered. Perhaps, as a 
result of this experience, you say to yourself, “I’m weak and 
ineffective. I can’t even look out for myself.” This is a causal 
rule that can be stated in if-then form: “If placed in a situation 
where my well-being depends on acting effectively, I will 
fail.” Later, you find yourself being inappropriately passive 
in many situations. You begin exerting little effort in your 
classes, which causes your grades to drop. You stop invest- 
ing in your social relationships, which causes your friends 
to distance themselves from you. Learned helplessness the- 
ory assumes that these things happen because new situations 
somehow begin to evoke your “helpless” rule (Abramson 
et al., 1978). 

How self-attributions (causal rules) are generated in 
the first place is a topic unto itself. For now suffice it to say 
that considerable research shows that humans are prolific in 
generating them (Barnes-Holmes, Hayes, & Dymond, 2001; 
Matute & Miller, 1998). Of special interest is how “helpless” 
self-rules, once formed, become applied to new circumstances. 
It seems unlikely that this is a matter of stimulus generaliza- 
tion (Chapters 17 and 30), as situations like “being mugged” 
and “taking a college class” have few physical or perceptual 
features in common. Thus, something else must be responsible 
for the spread of inappropriate passivity beyond the situation 
where unavoidable unpleasant events first were experienced. 


Cognitive psychologists have proposed that the common factor 
is a “helpless” mental state (Abramson et al., 1978) that moves 
with the individual from situation to situation. But Dack, 
McHugh, and Reed (2009) suggested the common factor as 
transformation of function involving the original unpleasant 
experiences and stimuli in new situations. 

In the Dack et al. (2009) laboratory investigation, par- 
ticipants first demonstrated the formation of two equivalence 
classes within a match-to-sample procedure. One class incor- 
porated the word useless plus two nonsense words (lewoly 
and gedeer), and the other incorporated good and two non- 
sense words (matser and rigund). Next, participants worked 
under two different reinforcement contingencies; in only one 
of these could reinforcement rate be increased by increasing 
response rate. Using a rating scale, they said that they felt 
more effective in gaining points on this schedule than on the 
other one. The two reinforcement schedules were signaled by 
different-colored lights. For simplicity, let’s say that blue sig- 
naled the schedule that made people feel effective and yellow 
signaled the schedule that made them feel ineffective. Finally, 
in a match-to-sample test, the participants were shown a word 
(or nonsense word) from the equivalence classes as the sample 
stimulus, and the two schedule-correlated colors as compari- 
son stimuli. They matched blue to words in the useless class 
and yellow to words in the good class. 

What do these results show? The fact that participants 
matched schedule-correlated colors to useless and good stim- 
uli reflects partly a verbal history (they already knew what 
those words mean) and partly new learning (they learned the 
“meaning” of the colors by working on the reinforcement 
schedules). By assigning the labels useless and good to schedule- 
correlated colors, participants basically generated a verbal 
rule to describe experience with the reinforcement schedules. 
Through transformation of function, we would expect the non- 
sense words to also be entangled in rule control. For instance, 
in a situation where the stimulus /ewoly is present, the useless 
rule might be occasioned. 

This experiment is not a perfect parallel to learned help- 
lessness, but it does show how inappropriate passivity can 
spread to new situations through pre-existing stimulus classes. 
By analogy, perhaps your mugging, your school experiences, 
and your social relations are united in a class that might be 
called “places where effort and skill matter.” Once a “helpless” 
self-rule arises in the presence of one of these situations, it 
should arise as well in the others. 


Chapter 20 
Three Practical Considerations 


Using Transformation of Function to Create 
New Reinforcers 


Reinforcers are at the heart of just about everything in ABA, 
but for individuals with highly restricted interests it can be dif- 
ficult to identify effective reinforcers (Tiger & Kliebert, 2011). 
In such cases there would be considerable value in a means of 
reliably creating new reinforcers, and transformation of func- 
tion can be used for this purpose. Before considering how this 
works for nonequivalence relations, let us take up the some- 
what simpler case of transformation of function in equivalence 
relations. If an already-effective reinforcer becomes a member 
of an equivalence class (e.g., using procedures described in 
Chapter 19), then through transformation of function, other 
stimuli in the class should also begin to function as reinforc- 
ers, even if they have never been directly paired with the orig- 
inal reinforcer. Table 20.1 shows an example (based on Hayes, 
Kohlenberg, & Hayes, 1991). Imagine that raisins already func- 
tion as a reinforcer for some individual; audio clips of a particu- 
lar song do not. The individual is taught to choose a blue triangle 
when shown a raisin, and also to choose the blue triangle when 
the audio clips are played. This should establish an equivalence 
class among the three stimuli, and therefore promote transfor- 
mation of function. Subsequently, the audio clips (which have 
never been directly paired with raisins) should function as a 
reinforcer.” New consequences created in this manner have 
been found effective in changing behavior in standard operant 
situations such as reinforcement schedules and discrimination 
learning (Dougher, Hamilton, Fink, & Harrington, 2007; Hayes 
et al., 1991; Whelan, Barnes-Holmes, & Dymond, 2006). 

As the preceding example shows, the behavioral function 
acquired by new stimuli in an equivalence class is the “same 
as” that served by an original stimulus. In a nonequivalence 
class, by contrast, transformation of function is consistent with 
the types of relations that define the class—sometimes with 
surprising results. Table 20.1 illustrates (based on Whelan 
and Barnes-Holmes, 2004). Imagine that pickle juice (P), an 


TABLE 20.1 
and Nonequivalence Classes 
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already-effective punisher for some individual, becomes part 
of a class incorporating several other stimuli and both “same 
as” and “opposite” relations (for explanation of how different 
kinds of relations can incorporate the same stimuli, see the 
“Theoretical Foundations” section). Two “same as” relations 
are taught: After smelling pickle juice, choose a blue triangle, 
and after hearing an audio clip of a song, choose the blue 
triangle. Two “opposite” relations are taught as well: Shown 
a picture of a cat, choose the blue triangle as its opposite, 
and shown a particular doll, choose the blue triangle as its 
opposite. Through transformation of function, the audio clip 
becomes a punisher (it was “same as” the blue triangle, which 
was “same as” the punisher). Of special importance to the 
present discussion, the cat and the doll became reinforcers 
(they were opposites of the blue triangle, which was “‘same as” 
the punisher). Thus, new reinforcers are created even though 
no other stimulus in the nonequivalence class initially func- 
tioned as a reinforcer. 

Do not feel bad if you find the examples of Table 20.1 
to be a little mind-bending, because standard thinking about 
reinforcers has not prepared you for them. The usual story 
told about conditioned reinforcers is that they result from the 
pairing of neutral stimuli with already-effective reinforcers (see 
Chapter 11). But the stimuli that become learned reinforcers in 
Table 20.1 were never directly paired with an already-effective 
reinforcer, suggesting that a pairing-focused account of 
conditioned consequences is too narrow. This is not surprising 
given that the pairing view arose long before the start of 
systematic research on derived stimulus relations (e.g., Hull, 
1943; Skinner, 1938; Wyckoff, 1959). But even more recent 
theoretical accounts of conditioned consequences are silent on 
derived effects (e.g., Fantino, Preston, & Dunn, 1993; Grace & 
Savastano, 2000; Shahan, 2010). Thus, it is not exaggerating 
to say that, in this area at least, research on derived stimulus 
relations forces a reconsideration of the very fundamentals of 
behavior theory. 

Returning to the mission of the present book, the effects 
described in Table 20.1 suggest a way to use derived stimulus 


Creating New Reinforcers Through Transformation of Function Within Equivalence 


Stimulus Paired With Type of Relation Original Function Transformed Function 
Equivalence Class 

Raisin Blue triangle Same Reinforcer _ 

Audio clip Blue triangle Same — Reinforcer 
Nonequivalence Class 

Pickle juice Blue triangle Same Punisher — 

Audio clip Blue triangle Same — Punisher 

Cat Blue triangle Opposite — Reinforcer 

Doll Blue triangle Opposite — Reinforcer 


Equivalence class example: Based on “The Transfer of Simple and General Consequential Functions Through Simple and Conditional Equivalence Relations,” 


by S. C. Hayes and colleagues, 1991, Journal of the Experimental Analysis of Behavior, 56, 119-137. Nonequivalence class example: a on “The Trans- 
Penton of Consequential Functions in Accordance with the Relational Frames of Same and Opposite,” by R. Whelan and colleagues, 2004, Journal of the 


Experimental Analysis of Behavior, 82, 177-195. 
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relations to create potentially endless reinforcers for any indi- 
vidual. Much remains to be learned, however. For example, to 
our knowledge the long-term durability of learned reinforcers 
established as in Table 20.1 has not been explored (we assume 
that the new reinforcers would be subject to the same motivating 
operations as the original reinforcer, and that derived reinforcing 
effects would depend on retention of the stimulus class on which 
transformation of function is based). This is an interesting 
direction for future research and application. 


Incoherent Stimulus Classes 


So far we have described derived stimulus relations in which 
the directly learned baseline relations predict the exact nature of 
derived relations. However, this will not be the case for every set 
of overlapping conditional discriminations. Consider an exam- 
ple involving distinction relations, summarized in Figure 20.7 
(left panel), in which it is unclear on what precise basis the 
stimuli differ: 


If you are told that “bees are different from mammals,” 
you cannot know what bees are like, or exactly which 
features differentiate them from mammals. Furthermore, 
the derived combinatorially entailed difference relations 
also are unspecified. For example, if you are told that 
bees are different from mammals, and that mammals 

are different from birds, you cannot know how bees and 
birds differ, or whether they are even the same (Luciano 
cual 20095 pe Loo): 


Figure 20.7 (right panel) shows another example. A child 
who is just starting to learn about money is taught that 25¢ 
is “more than” /0¢ and also that /0¢ is “more than” JS¢. 
These relations, by themselves, do not provide any basis for 
judging whether /0¢ or 25¢ is “more” (note that the issue 
is not what is true, but what can emerge from the already- 
mastered relations). 

Also potentially undefined in classes like these is how 
transformation of function might proceed. For instance, in the 
left panel of Figure 20.7, what happens if you are attacked by 
a pugnacious parrot and thereafter fear birds? How (if at ali) 
does this affect your emotional response to bees and mammals? 
Something may happen (stimulus control will always exist, and 
when experimental control is weak, unknown influences from 
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Figure 20.7 Two logically incoherent stimulus classes. Both 
include overlapping conditional discriminations, but the 
directly learned baseline relations do not support firm 
predictions about combinatorial entailment. 


the subject’s learning history will predominate; Sidman, 1960), 
but in this case, based on the directly taught relations in the 
stimulus class, we cannot guess what.!° 

These examples highlight why we have stressed the impor- 
tance, in derived stimulus relations interventions, of carefully 
mapping out the relations to be taught and the relations that log- 
ically should emerge from this teaching. A defining advantage 
of applied work involving derived stimulus relations is the “free 
learning” (in the form of untaught relations) that arises from 
it (e.g., Critchfield, 2018; Rehfeldt, 2011; Rehfeldt & Barnes- 
Holmes, 2009). From this perspective, applied relational inter- 
ventions are pointless if one cannot specify the derived relations 
that training is expected to yield. 


Tools for Planning Interventions Involving 
Nonequivalence Relations 


Historically, an applied behavior analyst interested in teaching 
derived stimulus relations had to create stimuli and devise train- 
ing protocols from scratch. Recently, however, efforts have been 
made to develop standardized tools to ease this process. We will 
mention two. 

Training and Assessment of Relational Precursors and 
Abilities (TARPA) is a free software program for assessing and 
establishing derived stimulus relations. Several studies support 
its reliability and validity (Moran, Stewart, McElwee, & Ming, 
2010, 2014; Moran, Walsh, Stewart, McElwee, & Ming, 2015). 
As of this writing, the program was available for equivalence 
relations only, but expansion into nonequivalence relations was 
under development. For information, go to http://www.vb3 
.co.uk/ 

The PEAK (Promoting the Emergence of Advanced 
Knowledge) Relational Training System is a commercially avail- 
able assessment and treatment curriculum for adults and children 
with autism and related disabilities. Each volume contains 184 
structured lessons and related assessment tools. One of the four 
volumes focuses on derived equivalence and another focuses on 
derived nonequivalence relations. More than 20 peer-reviewed 
studies provide evidence of the PEAK curriculum’s reliability, 
validity, and effectiveness (for a review, see Dixon et al., 2017). 


THEORETICAL FOUNDATIONS 


The examples of the last section were intended to clarify the 
“what” of derived nonequivalence relations. In the present sec- 
tion we take up the “why” of those relations, in two ways. First, 
in a discussion of contextual stimuli, we explain how relational 
behavior, like all operant behavior, is situation specific. Second, 
we introduce a leading theory of derived relations, with empha- 
sis on two of its most interesting propositions. 


Stimulus Relations Are Defined 
by Contextual Control 


To repeat an observation from earlier, in arbitrary stimulus rela- 
tions nothing about the stimuli themselves dictates how they 
can be related. Rather, human social/verbal customs and rein- 
forcement contingencies decide this. These customs and contin- 
gencies exert their influence through contextual stimuli, which 
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may be said to signal the specific way in which stimuli are to be 
related (Bush, Sidman, & Rose, 1989: Hayes, Fox et al., 2001). 
Figure 20.8 shows how the relations among a single set of stim- 
uli might change depending on contextual cues. In this example 
(adapted from Berens & Hayes, 2007), a child must master the 
relations among unfamiliar coins, including a U.S. quarter (25¢), 
dime (/0¢), and nickel (5¢). In the left panel, when the child is 
shown 25¢ and then asked, ““Which is less?” a correct response 
is to identify /0¢. In similar fashion the child can be taught that 
5¢ is “less than” /0¢. This should yield a derived relation in 
which the child, shown /0¢ and allowed to select either 5¢ or 
25¢, correctly identifies S¢ as “less.” Everything reverses, how- 
ever, when the context is “more than” (center panel). And given 
the context of “money,” the coins could be treated as equiva- 
lent (right panel; they are all types of money).!! Thus, the same 
stimuli can be related in any number of ways, depending on the 
contextual cues. 

In a sense, contextual stimuli signal what a person “needs 
to know” (that is, what way of relating stimuli will be reinforced) 
in a given situation. To get a better sense of contextual stimuli, 
imagine that you are completing a match-to-sample task, and the 
sample stimulus is Picasso. The comparison stimuli are Spain, 
Michelangelo, and Gris. Which one should you choose? With- 
out knowing “the question being asked,” this is an impossible 
decision, because each of these comparisons is an appropriate 
match for Picasso under some circumstance. Picasso was born 
in Spain. Like Michelangelo, he did some sculpting. Like Juan 
Gris, he used the painting style known as analytical cubism. 
Here “the question being asked” is a contextual cue that tells 
you on what basis you should choose a comparison stimulus. 
Because learned contextual cues mediate the nature of arbitrary 
stimulus relations in this way, contextual stimuli are an import- 
ant component of applied stimulus relations interventions. 

In many of the examples provided in this chapter, we men- 
tion verbal contextual stimuli (e.g., “sculpting,” above) because 
these are easy to understand. But contextual stimuli don’t have 
to be verbal. Imagine that you live in a lovely house, but your 
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hobby involves survivalist adventures in which you must live 
off the land in the wilderness. In the context of being at home, 
insects fall into a class of stimuli that are to be avoided—for 
instance, you might call an exterminator to rid your house of 
termites. In the context of being alone in the forest without food, 
however, termites, as a source of life-saving nourishment, may 
fall into a class of things to be sought rather than avoided. A con- 
textual stimulus is merely one whose presence correlates with 
contingencies favoring a particular way of relating. 


Are Contextual Stimuli Always Essential? 


The preceding sets the stage for a critical look at match-to-sample 
procedures for establishing derived stimulus relations like those 
described in Chapter 19. In much of the published research on 
stimulus equivalence that has been conducted to date, you will 
be hard pressed to find reference to contextual stimuli as we 
are discussing them here.!* The example described in Figure 
19.1 illustrates. During the training of each baseline relation, the 
sample stimulus was a spoken word, and the comparison stimuli 
were pictures. Thus, on one type of trial, the sample might be 
‘car’ and the comparisons might be pictures of a dog, a car, a 
cow, and a coffee mug. Selecting the car was reinforced. At no 
point during this procedure, however, was there any discrete 
stimulus whose presence correlated with the “same” contin- 
gency. Earlier we said that derived stimulus relations depend 
on contextual stimuli to signal the type of relation that should 
operate. Many different relation types could unite the stimuli. 
For instance, a coffee mug is something you might take with you 
in acar, “cow” starts with the same sound as “car,” and a dog is 
something that chases a car. 

How, then, did the participants know what to choose from 
trial to trial? Two possibilities exist. The first is that, in the relatively 
simple confines of the experiment (and unlike in the everyday 
world) only one type of relation was ever reinforced, so there was 
no need for contextual cues. In the experiment of Figure 19.1, 
once “same” responding was shaped up, it applied equally well 
(i.e., was reinforced) across all parts of the experiment. 
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Figure 20.8 How contextual cues define stimulus relations. Left: In a “less” 
context, coins are related in a certain way (arrows point from the more valuable 
to the less valuable coin). Center: In a “more” context, those relations are 
reversed. Right: In the context of “money,” the coins are essentially “the same.” 


Left and center panels are based on “Arbitrarily Applicable Comparative Relations: Experimental 
Evidence for a Relational Operant,” by N. M. Berens and S. C. Hayes, 2007, Journal of Applied 


Behavior Analysis, 40, 45-71. 
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A second possibility is that the experiment actually con- 
tained contextual stimuli, but they were implicit, meaning the 
overall experimental setting became a compound contextual 
stimulus that was correlated with reinforcement contingencies. 
In effect, perhaps the participant learned, “When I’m here, in 
this room, with this experimenter and these stimuli, I should 
respond with ‘same.’” There is plenty of precedent for contex- 
tual cues of this type. In the classical conditioning literature, 
for instance, it is well established that contextual stimuli can be 
compounds of discrete stimuli plus aspects of the setting where 
conditioning took place (Bouton & Nelson, 1998). 

Neither of these possibilities bodes particularly well for 
the generality of what is learned (Chapter 30). Someone whose 
relational responding is not under contextual control will be at 
a loss for what to do in a world where the contingencies are 
always changing (it will not pay, for example, to point to a 
picture of a car when asked, “What do you take with you in a 
car?”’). Someone whose contextual stimuli for “same” respond- 
ing are highly specific to one training situation may not make 
“same” responses when somewhere else. 

What all this suggests to the applied behavior analyst is 
that efforts to create derived stimulus relations should, when- 
ever possible, incorporate explicit, naturalistic contextual stimuli 
in the form of cues that are likely to be present in everyday 
environments. In the human world, these cues will often be 
verbal in nature—hence the many verbal examples employed 
in the present chapter. While it may be possible to build an 
equivalence class using a strictly nonverbal match-to-sample 
procedure, for verbal learners we recommend including a 
contextual prompt such as “pick the one that is the same,” or 
simply, “same.” 


Relational Frame Theory 


Now that you have a sense of what nonequivalence relations 
are, it will be helpful to briefly examine the leading theoretical 
account of them, relational frame theory (RFT; Hayes et al., 
2001), which you encountered briefly in Chapters 18 and 19. In 
the present chapter we will not take up the question of whether 
RFT or some other theory of stimulus relations, like Sidman’s 
(2000), is “better.” However, RFT has had sufficient impact 
on research into derived stimulus relations that any student 
who wishes to become familiar with this topic needs a working 
understanding of the theory. To date, the vast majority of studies 
conducted on nonequivalence relations have been inspired by 
RFT, and as a result the vocabulary of nonequivalence relations 
used in the present chapter (e.g., mutual entailment and combi- 
natorial entailment) comes from RFT. Additionally, although 
there is wide agreement that derived stimulus relations provide 
a basis for understanding “higher cognitive abilities” of humans 
(e.g., Hayes et al., 2001; Sidman, 1986, 1994), RFT research- 
ers have been especially focused on this possibility (e.g., see 
the “Nonequivalence Relations and Big-Picture Psychological 
Constructs” section). A complete explanation of RFT is beyond 
the scope of the present discussion, but we will mention two key 
features of the theory that will help in understanding points to 
be made later in the chapter. 


Proposition 1: The Behavior of Relating Stimuli 
Is Verbal Behavior 


RFT assumes that derived stimulus relations are inherently 
verbal relations (Hayes, 1984).!3 That is, the behavior of relat- 
ing stimuli is verbal behavior. We know of no evidence that 
directly confirms or refutes this assumption—it may not even 
be definitively testable—but it is supported by circumstantial 
evidence. For example, derived stimulus relations have, at best, 
been produced in nonhumans only rarely and as the product of 
considerable effort (e.g., Galizio & Bruce, 2018). This stands in 
stark contrast to the apparent ease with which verbally proficient 
humans derive stimulus relations and the broad flexibility with 
which they apply them (e.g., Figure 20.1). Moreover, as we will 
describe later in the chapter, human relational abilities appear 
to arise during the same period of child development in which 
verbal behavior grows precipitously. These observations suggest 
a profound linkage of derived stimulus relations and the unique 
verbal capabilities of humans. 

Thinking of stimulus relations in terms of verbal behavior 
raises the interesting question of what is going on in match- 
to-sample protocols for establishing stimulus relations, which 
Chapter 19 portrayed as largely nonverbal procedures. One pos- 
sibility is that in “nonverbal” circumstances learners apply covert 
verbal behavior to master stimulus relations (e.g., Horne & Lowe, 
1996; see Chapter 19 for information about naming theory). For 
present purposes, however, we merely consider the possibility 
that, if the relating of stimuli 1s a verbal process, then verbal 
behavior can be strategically harnessed to construct stimulus 
relations. A number of studies show that baseline relations can 
be established, and derived relations promoted, by simply telling 
the learner about the required overlapping conditional discrimi- 
nations (e.g., Berens & Hayes, 2007; Smyth, Barnes-Holmes, & 
Barnes-Holmes, 2008). This suggests a way, in applied work, to 
established stimulus relations more efficiently than through labo- 
rious match-to-sample protocols, as Box 20.4 discusses. Note 
that research of this type foreshadows other ways in which verbal 
interactions may intersect with derived stimulus relations, a topic 
that we will take-up in this chapter’s final section. 


Proposition 2: Reinforced Relational Experience 
Yields a Generalized Relational Repertoire 


Chapter 19 described equivalence classes as the result of 
reinforced practice with baseline relations involving specific 
stimuli. For example, consider the class in Figure 20.9 (top 
panel). According to the standard view, acquiring this class 
requires two kinds of baseline learning experiences: reinforced 
experience selecting p = .05 (rather than p > .05) in the pres- 
ence of low p value, and reinforced experience selecting statis- 
tically significant in the presence of low p value (selecting the 
opposite answers in the presence of high p value also would 
need to be reinforced). Also according to the standard view, 
acquiring each new equivalence class (e.g., overdue, tardy, and 
late; top right panel of Figure 20.3) would require the analo- 


gous steps. RFT, however, proposes a developmental process 


that alters the nature of stimulus-class acquisition. At the core 
of this proposal is a disagreement with Sidman’s (2000) theory 
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BOX 20.4 


Telling Isn't Teaching—Right? 


About that much-maligned mainstay of classroom teaching, 
the lecture, it has long been observed that “Telling isn’t teach- 
ing.” The corollary is that passive listening isn’t learning, 
as Skinner (1958) suggested when advocating for teaching 
machines: “Unlike lectures . . . the machine induces sustained 
activity. The student is always alert and busy” (p. 971). In 
behavioral systems of instruction (e.g., Keller, 1968; Skinner, 
1958), student learning is said to depend on frequent, active 
responding with feedback. This has been suggested specifi- 
cally about equivalence-based instruction (e.g., Critchfield & 
Twyman, 2014). 

It would therefore seem unlikely that derived stimulus 
relations could be established simply by telling learners about 
the baseline relations. But consider the case of some college stu- 
dents who learned about inferential statistics through an auto- 
mated lesson in an online course delivery system (Critchfield, 
2014). At each step of the lesson, the students were asked to 
read “notecards” presenting brief statements such as: 


With a low p value, the results are statistically significant. 
With a high p value, the results are not statistically 
significant. 


The preceding statements incorporate two AB rela- 
tions (one of which is diagrammed in Figure 20.9, top panel). 
After reading each statement, a student answered two to four 
multiple-choice questions such as 


Low p value 


A. not statistically significant 
B. statistically significant 


These questions were intended to serve as an attention 
check. Correct answers earned the feedback “Correct,” and 
incorrect answers earned the feedback “Wrong,” along with a 
prompt to review the relevant notecard. After this exposure to 
the baseline relations AB and BC, between 89% and 100% of 
the students answered test questions correctly about expected 
derived relations (BC; right panel of Figure 20.9). For instance, 
relevant to the relations shown in the left panel, students cor- 
rectly answered this question: 


pes 0d 


A. _ statistically significant 
B. not statistically significant 


The noteworthy feature of this lesson was its efficiency. 
With “telling” substituted for the elaborate sequences of match- 
to-sample trials that are found in many stimulus relations pro- 
tocols, students completed the lesson in about 25% to 50% 
of the time previous students needed to master the same con- 
cepts ina match-to-sample protocol (see Fienup & Critchfield, 
2010, 2011). In this particular instance, perhaps “telling” was 
teaching. Perhaps. There is uncertainty on this point because 
the attention-check questions did require some active respond- 
ing, and even though students did not have to answer these 
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Figure 20.9 Stimulus relations established in college 
students by telling about baseline relations rather than 
reinforced match-to-sample experience. Top: Concepts 
related to inferential statistics. Middle: Concepts related 
to neuroanatomy. Bottom: Correct responses on one 
relation for students directly taught it versus those 
expected to demonstrate it emergently. For each group, 
separate bars show correct responding for students who 
did and did not first show mastery of baseline relations. 


Top based on “Online Equivalence-based Instruction About Statisti- 

cal Inference Using Written Explanation Instead of Match-to-Sample 
Training,” by T. S. Critchfield, 2014, Journal of Applied Behavior 
Analysis, 47, 606-611; middle based on “Using Stimulus Equivalence 
Technology to Increase Efficiency in Teaching Neuroanatomy, " by 

C. L. Pytte and D. M. Fienup, 2012, Journal of Undergraduate 
Neuroscience Education, 10(2), A125-A131; bottom “Using Stimulus 
Equivalence Technology to Increase Efficiency in Teaching Neuroanat- 
omy,” by C. L. Pytte and D. M. Fienup, 2012, Journal of Undergraduate 
Neuroscience Education, 10(2), A125-A131, Reproduced by permission 
from the Pytte and Fienup study. 
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questions correctly (there was no mastery criterion), responses 
did produce contingent feedback. Thus one cannot rule out 
that active responding and feedback, even in small doses, were 
necessary to the study’s good outcomes. 

Here a study by Pytte and Fienup (2014) is potentially 
informative. College students learned about neuroanatomy 
concepts involving brain areas, their locations in the brain, and 
their behavioral functions. The medium of instruction was a 
lecture organized to emphasize certain baseline relations. The 
center panel of Figure 20.9 shows one example. For seven 
brain regions, the AB and BC relations were explained in the 
lecture, but the expected derived AC relations were not. For 
seven other brain regions, which served as positive controls, 
the AC relation also was explained. 

On the course exam, students averaged approximately 
82% correct on questions about untaught AC relations, regard- 
less of whether or not those relations were addressed in the 
lecture. Thus, “telling” about baseline relations was sufficient 
to promote combinatorial entailment. This is not really sur- 
prising, because past laboratory research had shown similar 
effects from “telling” about baseline relations (Smyth et al., 
2008), and from simply “demonstrating” them by presenting 
pairs of stimuli without any requirement for overt responding 
(e.g., Leader & Barnes-Holmes, 2001a). 

What is the “best” approach to establishing stimulus 
classes? Although “telling” and “demonstrating” alone appar- 
ently can work under at least some circumstances, perhaps 
even better outcomes would be obtained by requiring active 
responding with feedback, per the Critchfield (2014) study. On 
this point, however, the evidence is mixed. In comparisons of 
the “demonstration” procedure with match-to-sample, stimu- 
lus classes were more reliably established via the latter in one 
study (Clayton & Hayes, 2004) and by the former in another 


asserting that derived relations are an automatic outcome of 
specific reinforcement contingencies. RFT assumes instead 
that, given the right learning history, the deriving of stimulus 
classes becomes a generalized repertoire that RFT calls arbi- 
trarily applicable relational responding (Barnes-Holmes & 
Barnes-Holmes, 2000) that potentially applies to any number 
of specific stimuli. 

The required learning history is reminiscent of that 
required to create generalized imitation (Baer, Peterson, & 
Sherman, 1967). For at least some learners, a teacher’s com- 
mand to “Do this” initially is insufficient to prompt the copy- 
ing of an action like touching one’s nose. Direct teaching can 
bring that particular action under the control of “Do this,” but 
the prompt initially may not work with other actions (e.g., clap- 
ping). Given direct instruction of imitation of enough actions, 
“Do this” becomes a generalized contextual cue that prompts 
the copying of any action under the stimulus control of what 
the teacher does next. In this example you may recognize 
multiple-exemplar training, which was described in Chapter 17 
as the repeated acquisition of some behavioral repertoire with 
variations in situations, stimuli, or response features (see also 


(Leader & Barnes-Holmes, 2001b). In at least one study 
with college students learning about single-subject designs, 
“telling” via lecture and a match-to-sample protocol promoted 
derived stimulus classes equally well (Lovett, Rehfeldt, Garcia, 
& Dunning, 2011). 

To help make sense of these findings, it is worth not- 
ing an especially interesting feature of the Pytte and Fie- 
nup (2014) study that is summarized in the bottom panel of 
Figure 20.9. For both taught and untaught AC relations, Pytte 
and Fienup calculated accuracy separately for students who 
did, and did not, correctly answer questions about baseline 
relations. Regardless of whether students were told about AC, 
the critical issue about AC relations seemed to be that base- 
line AB and BC relations first were thoroughly mastered. For 
students who did acquire the baseline relations, telling was 
not necessary for combinatorial entailment. For students who 
did not master baseline relations, telling was insufficient for 
combinatorial entailment. 

The Pytte and Fienup (2014) results suggest that, as 
with match-to-sample protocols, the critical thing about 
“telling” or “demonstrating” procedures may be whether 
they firmly establish baseline relations. Unfortunately, 
although “telling” and “demonstrating” have been staples 
of classroom instruction for centuries, the conditions under 
which these strategies reliably change behavior remain to 
be fully understood. It is therefore prudent, for the time 
being, to follow the lead of most work on behavioral sys- 
tems of instruction (e.g., Keller, 1968; Skinner, 1958) and 
assume that active responding and contingent feedback, in 
some measure, are useful features of stimulus relations pro- 
tocols. This does not, however, negate the potential of other 
procedures to greatly streamline the acquisition of stimulus 
relations by learners with adequate verbal skills. 


Holth, 2017). In the RFT account of generalized relational oper- 
ants, each new stimulus class of a particular type (like equiva- 
lence) that reinforcement teaches is one exemplar, and given 
enough reinforced class formations, the deriving of stimulus 
classes becomes a higher-order operant that potentially applies 
to any number of specific stimuli. It does this through the con- 
textual stimuli that were mentioned earlier in the chapter. In 
short, what multiple-exemplar training creates is generalized 
control by contextual stimuli. 

When contextual stimuli become generalized, the rela- 
tional behavior they mediate is called arbitrarily applicable 
relational responding (AARR; Hayes, Fox et al., 2001), which 
means forming new stimulus classes with little to no new rein- 
forced practice. AARR involving any specific type of relation 
is called a relational frame (Hayes, Fox et al., 2001) and is 
thought to be acquired separately from that involving other 
types. A possible example of AARR is the following: When 
you see a supermarket sign indicating that the store’s price on 
your favorite snack is “less than” the price at another store, 


you don’t need any new learning experiences to know how to 
respond to that. 


Chapter 20 * Engineering Emergent Learning with Nonequivalence Relations 555 


Because AARR is thought to result from accumulated 
multiple-exemplar experience, very young children, who lack 
an extensive history of reinforced relating, would be expected 
not to spontaneously demonstrate derived relations after learn- 
ing baseline relations. Rather, they would need to acquire those 
relations by learning about baseline relations through reinforced 
practice. As relational experience accumulates, however, chil- 
dren should begin to form stimulus classes without extensive 
reinforced conditional discrimination training. 

This perspective has two important implications. The 
first concerns developmental trajectories: Normatively speak- 
ing, in childhood, relational abilities should improve with age, 
although not necessarily at the same pace for all types of rela- 
tions because each type is its own generalized class requiring 
its own formative experiences. In the next section, we describe 
evidence that supports this prediction. The second implication 
is that, using multiple-exemplar training, it should be possible 
to experimentally create a particular type of AARR. Numer- 
ous studies have shown that when enough exemplars of one 
type of stimulus relation are directly trained, new relations of 
the same type may subsequently be formed without further 
training (Barnes-Holmes et al., 2004; Berens & Hayes, 2007; 
Dunne, Foody, Barnes-Holmes, Barnes-Holmes, & Murphy, 
2014; Heagle & Rehfeldt, 2006; May et al., 2017; Ninness et 
al., 2009).!* Collectively, these studies remind us that a derived 
stimulus relations intervention can have two goals: to establish a 
stimulus class involving specific stimuli, and to establish a gen- 
eralized repertoire through which the learner might form similar 
classes with new stimuli. 

A brief editorial comment is in order: You needn’t be a 
fan of RFT to appreciate studies showing generalized reper- 
toires of relating. One of the most difficult challenges in ABA 
is to create behavior change that generalizes beyond the con- 
fines of a particular intervention setting (Stokes & Baer, 1968; 
Chapter 30). AARR may well be this kind of effect, although 
much remains to be learned about it. To date, research has 
shown mostly that, given multiple-exemplar experience, newly 
acquired relational repertoires can apply to novel stimuli in 
the same setting where an intervention took place. It is quite 
another matter to suggest that relational abilities acquired in 
one setting extend to very different ones (e.g., clinic-established 
relating also appears at home or at school). We are not aware 
of any research conducted to date that applies this stricter test 
of the AARR concept.!° Nevertheless, findings like those just 
described underscore that it may be possible with stimulus 
relations interventions not just to establish a particular stim- 
ulus class of interest but also to set generalized repertoires of 
relating into motion. 


NONEQUIVALENCE RELATIONS AND BIG- 
PICTURE PSYCHOLOGICAL CONSTRUCTS 


As many of the diagrams in this chapter illustrate, behavior ana- 
lysts often think about behavior in terms of discrete stimuli and 
responses, often as these unfold moment by moment. Outside 
of behavior analysis, people often operate at a different level of 
analysis. They speak of big-picture constructs like “intelligence” 


and “knowledge” that refer to functioning over extended time 
frames. Skinner (1945) saw a means of reconciling the two lev- 
els of analysis by viewing psychological constructs as umbrella 
terms for potentially complex behavioral phenomena. If you 
want to know what “intelligence” is, Skinner suggested, look 
at the various behavioral circumstances under which people 
speak of “intelligence.” In his view, psychological constructs 
tact something. That something is behavior, and it can be infor- 
mative to specify what kind of behavior. 

From the outset, pioneering researchers of relational learn- 
ing saw their mission as providing an alternative to mainstream 
thinking about big-picture psychological constructs (Hayes, 
1984; Hayes et al., 2001; Sidman, 1986, 1994). In the present 
section, we briefly explore some possible connections between 
derived stimulus relations and three such constructs: knowledge 
structures, self-concept, and intellectual development. 


Categorical Knowledge: Hierarchical Relations 


We opened the present chapter by referring to semantic networks 
as hypothesized structures that organize bits of knowledge in 
long-term memory. Figure 20.1 showed that many different 
kinds of derived stimulus relations may be implicated in the 
behavioral phenomena that lead observers to speak of “knowl- 
edge” in the first place. One such case involves responding to 
stimuli in categories, which are classes within which (a) non- 
identical stimuli are treated as “‘same,” and (b) the same stimuli 
all are simultaneously “different” from stimuli in other classes. 
As Lakoff (1987) has written: 


There is nothing more basic than categorization to our 
thought, perception, action, and speech. Every time we 

see something as a kind of thing . . . we are categorizing. 
Whenever we reason about kinds of things—chairs, nations, 
emotions, any kind of thing at all—we are employing 
categories. .. . An understanding of how we categorize is 
central to any understanding of how we think and how we 
function, and therefore central to an understanding of what 
makes us human. (pp. 5—6) 


Category relationships, by their nature, involve nonequiv- 
alence, and thus hierarchical relations have much in common 
with distinction relations. Though related to ale, beer is not 
the “same as” ale. Rather, beer is a category that includes ale 
(Figure 20.1), but ale does not include beer. Conversely, ale 
is a member of the category beer, but beer is not a member 
of the category ale. Such asymmetrical stimulus relations in 
categorization have been described as hierarchical relations, 
which involve a stimulus “including” (serving as a category 
label for) other stimuli, and those stimuli being “member of” 
the category stimulus. Hierarchical relations imply a nesting of 
multiple classes of stimulus relations. Thus, India pale ale and 
English bitter, though “different” in some ways, are “the same” 
in that they are members of the stimulus class ale. Ale and lager, 
though “different,” are “the same” in being types of beer. Beer 
and wine, though “different,” are “the same” in being types of 
alcoholic beverages, which are “different” from nonalcoholic 
beverages, though both the alcoholic and nonalcoholic varieties 
are types of beverages—and so on. 
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We will not attempt to explain the procedures by which 
derived stimulus relations researchers have built hierarchical 
relations because these can be quite complex (e.g., Griffee & 
Dougher, 2002; Paliliunas, Belisle, Barron, & Dixon, in press). 
A good place for the interested reader to begin may be with 
general considerations that apply to establishing hierarchical 
relations (Luciano, et al., 2009), one of the most important of 
which is establishing contextual stimuli that mediate whether 
two stimuli are related via “member of” or “includes” relations 
(e.g., Slattery & Stewart, 2014). For instance, in the context of 
“includes,” animal “includes” dog (which is a type of animal), 
but dog does not “include” animal (which is not a type of dog). 
In the context of “member of,’ dog is a “member of” animal, 


but not vice versa. Once established, the contextual stimuli can 
assume generalized properties such that they guide the catego- 
rization of novel stimuli (Gil-Luciano, Ruiz, Valdivia-Salas, & 
Sudrez-Falc6n, 2017; Slattery & Stewart, 2014). 

To summarize thus far, the research literature shows that 
derived stimulus relations protocols can be used to establish rela- 
tions with hierarchical properties, but this does not necessarily 
show that similar processes operate when people learn to catego- 
rize in the everyday world. Real-world categorization typically 
exhibits three effects (Murphy, 2002) that any approach based 
on derived stimulus relations must reproduce to be credible. 
Box 20.5 introduces the phenomena and illustrates how hier- 
archical categories can be conceptualized in terms of derived 


BOX 20.5 


Three Characteristics of Hierarchical Categories 


Researchers outside of behavior analysis have found that three 
properties reliably apply to the way that people categorize 
(Murphy, 2002). Two of these ‘containment,’ referring to what 
stimuli are subsumed by which categories, and the third con- 
cerns “property induction,’ which may be thought of as the 
transformation of behavioral functions. 

Asymmetrical class containment refers to the fact that a 
higher-order class (e.g., “animal’) contains a lower-order class 
(e.g., “dog”’) but not vice versa (i.e., “dog” does not contain 
“animal”) (Slattery & Stewart, 2014, p. 61). 

Figure 20.10 (top panel) shows that contextual stim- 
uli controlling “includes” and “member of” relations can be 
thought of as dictating unidirectional stimulus control. The 
figure is arrayed such that in the “includes” context an item 
includes only items positioned to its right. In the “member of” 
context, an item is only a member of items positioned to its 
left. This applies equally to relations that are directly taught 
(e.g., in the figure, that animal “includes” dog and ant) and 
derived (e.g., animal includes poodle) relations. 

Transitive class containment refers to classifying a stimu- 
lus (A) as a member of a higher-order class (C) on the basis that 
it is a member of a subclass (B) that is a member of that high- 
er-order class. For example, if a child is taught that “poodle” 
(A) is a type of “dog” (B), then the child may also classify 
“poodle” as an “animal” (C) on the basis that “dog” (B) is a 
type of animal (Slattery & Stewart, 2014, p. 61). 

The derived stimulus control of transitive class con- 
tainment flows unidirectionally in accordance with con- 
textual cues. Because animal “includes” dog and dog 
“includes” poodle, in Figure 20.10 animal is understood to 
“include” poodle even if this relation is not directly taught. In 
similar fashion, poodle would be understood as “member of” 
animal. 

Unilateral property induction refers to the concept that 
properties or features of a higher-order class (e.g., animal”) 
will also be found in a lower-order class (e.g., “dog’’) but not 
vice versa. For example, all animals breathe and thus dogs 
breathe; however, while dogs have four legs, not all animals 
do (Slattery & Stewart, 2014, p. 61). 


Figure 20.10 (bottom panel) depicts unilateral property 
induction as transformation of function flowing unidirection- 
ally in accordance with the contextual cues. An individual who 
learns that “breathes” is a property of animals will understand 
that dogs breathe and poodles breathe. An individual who 
learns to identify “curly hair” as a property of poodles will not 
assume that all dogs or all animals share this property. 
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Figure 20.10 Examples of derived stimulus relations inherent 
in three effects commonly seen in hierarchical categorization. 
Note that, unlike in other diagrams in this chapter, arrows 

do not point from sample to comparison stimulus. Rather, 
arrows indicate class containment; for instance, for the 
contextual cue “includes,” an arrow pointing from animal to 
dog indicates that the category animal includes dog. 
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stimulus relations. Recent studies have shown that these phe- 
nomena do in fact arise when hierarchical relations are created 
via derived stimulus relations protocols (Slattery & Stewart, 
2014; Slattery, Stewart, & O’Hora, 2011). 


Self-Concept: Deictic Relations 


Much of this chapter has described how people relate different 
aspects of the world to each other. But people also focus a great 
deal on the relations between themselves and the world. Vari- 
ous psychological constructs such as identity, self-concept, and 
self-image at least partly address this self/world relation. As is 
true for beer (Figure 20.1), however, a “self” cannot be under- 
stood outside the context of things to which it is related. RFT 
provides a detailed theoretical account of various aspects of this 
“self” and the relations through which it arises and manifests 
(Barnes-Holmes, Hayes, & Dymond, 2001; Torneke, 2010). As 
we will note briefly in the chapter’s final section, this account 
is useful in understanding and resolving a variety of clinical 
problems. 

Let’s start to unpack “sense of self’ by considering how 
“self” is related to other people. Asked what her dog in the next 
room sees, a young child may instead describe what she sees. 
The child does not understand that one’s vantage point is differ- 
ent from that of others. Perspective shifting'© means respond- 
ing as if you see the world as from another’s vantage point, or 
perceive a situation as if were you in another’s position (Batson, 
Early, & Salvarani, 1997). But there are other types of perspec- 
tive shifting. Very young children may also confuse the perspec- 
tives of now with then (asked, ““What did you do yesterday?” the 
child may describe what she is doing now) or the perspectives 
of here and there (asked, “What did you see at Grandma’s?” the 
child may describe what she sees in her own house). 

Perspective-shifting abilities correlate with general social 
success (e.g., McHugh, Barnes-Holmes, Barnes-Holmes, 
Stewart, & Dymond, 2007), and disruptions in this ability are 
associated with reduced “emotional intelligence” (Bar-On & 
Parker, 2000) and several clinical conditions (e.g., Ingram, 1990; 
Mullins-Nelson, Salekin, & Leistico, 2006), including autism 
(Baron-Cohen, Leslie, & Frith, 1985). Additionally, as this chap- 
ter’s final section explains, perspective-shifting repertoires are 
relevant to certain problems of adjustment experienced by typi- 
cally developing individuals. 

In the behavior analytic view, perspective shifting involves 
specific kinds of stimulus relations. These are learned relations 
rather than the automatic product of maturation (Weil, Hayes, 
& Capurro, 2011), and the everyday environment does not 
always provide the needed learning experiences (McHugh, 
Barnes-Holmes, & Barnes-Holmes, 2009; Valdivia-Salas, 
Luciano, Gutierrez-Martinez, & Visdomine, 2009). Fortu- 
nately, research is beginning to show how to promote the 
acquisition of selected stimulus relations related to perspec- 
tive shifting, for both typically developing children (Heagle & 
Rehfeldt, 2006; Montoya-Rodriguez & Cobos, 2016; Rehfeldt, 
Dillen, Ziomek, & Kowalchuk, 2007) and children with autism 
(Jackson, Mendoza, & Adams, 2014; Rehfeldt, Dillen, Ziomek, 
& Kowalchuk, 2007). 

Most commonly, perspective shifting has been established 
using variants of the “Barnes-Holmes Protocol” (e.g., McHugh, 


Barnes-Holmes, & Barnes-Holmes, 2004; though see Guinther, 
2017). The protocol incorporates a variety of self-focused dis- 
tinction relations called deictic relations, which define the rela- 
tion between the self and various aspects of the external world. 
Deictic relations have three dimensions. The I- YOU dimension 
distinguishes self from other people. The HERE-THERE dimen- 
sion distinguishes between geographic locations of self versus 
others (HERE is always associated with I, and THERE always 
with YOU). The NOW-THEN dimension distinguishes between 
events taking place in the present versus other times. 

The everyday importance of the deictic dimensions can be 
appreciated by considering the kinds of self-referencing verbal 
interactions that adults tend to have with young children. Adults 
may ask children questions like, “What is Mommy doing? 
(1/YOU), “Where are you?” (HERE/THERE), and “What did 
you do this morning at preschool?” (NOW/THEN). The rein- 
forcing of correct answers forces attention to important deictic 
distinctions (e.g., it is difficult to correctly answer “Where are 
you?” without grasping that you are not your dog who is sitting 
in the other room). However, these are “simple” deictic relations 
that are not perspective shifting per se but rather prerequisites to 
it. Adults also ask children questions that require a reversal of 
perspective along one or more of the three deictic dimensions. 
For example, the question “What do you like to do when you are 
at Grandma’s house?” requires a reversal of both HERE/THERE 
and NOW/THEN. This is an example of perspective shifting. 

The Barnes-Holmes Protocol uses verbally presented 
scenarios about which the learner is asked to answer ques- 
tions reflecting the three deictic dimensions. Table 20.2 shows 
examples of these scenarios. This verbal format differs from 
match-to-sample procedures that, in many cases, contain as lit- 
tle verbal content as possible. !7 The scenarios in the Barnes- 
Holmes Protocol represent three levels of deictic complexity 
(Table 20.2, leftmost column). Simple deictic relations involve 
the three core deictic dimensions mentioned earlier and, as 
mentioned previously, are not instances of perspective shifting. 
The building blocks of perspective shifting are reversed deictic 
relations (involving reversal along one of the three dimensions) 
and double-reversed relations (involving reversal on multiple 
dimensions). 

You may be struck by how confusing some of the scenar- 
ios in Table 20.2 seem, even to an advanced learner like yourself. 
This illustrates that deictic relations do not necessarily “come 
naturally” and thus may not always be acquired under everyday 
circumstances (even by high-functioning adults; look ahead to 
Figure 20.11). It is therefore noteworthy that the Barnes-Holmes 
Protocol has been successfully used to teach perspective shifting 
to young children and persons with disabilities (e.g., Barron, 
Verkuylen, Belisle, Paliliunas, & Dixon, 2018; Belisle, Dixon, 
Stanley, Munoz, & Daar, 2016; Heagle & Rehfeldt, 2006; 
Jackson et al., 2014; Rehfeldt et al., 2007). 


Intellectual Development 


Intelligence 

The construct of intelligence traces back to well before 
psychology was an independent discipline, and psychologists 
have sought to measure it for more than 100 years. Individual 
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TABLE 20.2 Example Items from the Barnes-Holmes Protocol for Teaching Perspective Shifting 
LEVEL | Example Item? 1/YOU HERE/THERE | NOW/THEN 
| have a red brick and you have a green brick. Which brick do | have? Simple 
x | am sitting here on the blue chair and you are sitting there on the black Simple Simple 
= chair. Where are YOU sitting? 
i Yesterday | was watching television. Today | am reading. What am | doing Simple Simple 
NOW? 
| have a red brick and you have a green brick. If | was YOU and YOU were Reversed 
ME, which brick would | have? 
== | am sitting here on the blue chair and you are sitting there on the black Reversed | Simple 
a chair. If |was YOU and YOU were ME, where would YOU be sitting? 
2 | am sitting here on the blue chair and you are sitting there on the black Simple Reversed 
i chair. If HERE was THERE and THERE was HERE, where would YOU be sitting? 
Yesterday | was watching television. Today | am reading. If NOW was THEN Simple Reversed 
and THEN was NOW, what would | be doing THEN? 
lam sitting here on the blue chair and you are sitting there on the black Reversed | Reversed Simple 
chair. If | WAS YOU and YOU were ME, and HERE was THERE and THERE 
© was HERE, where would YOU be sitting? 
g Yesterday you were sitting here on the blue chair and today you are sitting Reversed | Simple Reversed 
: there on the black chair. If | was YOU and YOU were ME, and NOW was 
2 THEN and THEN was NOW, where would | be sitting? 
ra Yesterday | was sitting here on the blue chair and you were sitting there on Simple Reversed Reversed 
the black chair. If HERE was THERE and THERE was HERE, and if NOW was 
THEN and THEN was NOW, where would | be sitting? 


“Items can represent three different deictic dimensions (I/YOU, HERE/THERE, and NOW/THEN). Each dimension can be represented in non-deictic (simple) 


or deictic (reversed and double-reversed) form. 


Based on Table | of “Teaching Perspective-Taking to Typically-Developing Children Through Derived Relational Responding,” by A. I. Heagle and 
R. A. Rehfeldt, 2006, Journal of Early and Intensive Behavioral Interventions, 3, 1-34. 


differences in intelligence are seen as having practical impor- 
tance; for instance, intelligence tests are used in schools for 
diagnosing learning issues and for placing children into educa- 
tional tracks (Kaufman, 2018). Suffice it to say that any theoret- 
ical or empirical effort that sheds light on intelligence is likely 
to be of substantial interest. 

The chapter introduction suggested that “knowledge” 
and “understanding” consist of derived stimulus relations (e.g., 
Figure 20.1). If this is true, then three things follow. First, 
people whose experiences create a wide variety of derived 
stimulus relations should be perceived as especially knowl- 
edgeable. Second, because what is already known is part of 
the raw material for new stimulus relations, those who “know 
a lot” have an advantage in future learning—they have more 
conceptual hooks, as it were, on which to hang new learning. 
Third, recall that rich histories of relational learning are the 
source of the generalized relational repertoires (AARR) that 
RFT hypothesizes. This may allow people to learn new things 
with unusual speed and flexibility, which is one basis on which 
they might be called intelligent. On the basis of these observa- 
tions, RFT theorists have equated “general intelligence” with 
AARR (Barnes-Holmes, Barnes-Holmes, et al., 2001; see also 
Cassidy, Roche, & O’Hora, 2010), and have begun to study the 
connections between relational repertoires and performance on 
standardized measures of intelligence. 


Early results are encouraging. Several investigations 
have found positive correlations between standardized intelli- 
gence test scores and performance on relational tasks, including 
those emphasizing hierarchical relations (Mulhern, Stewart, & 
McElwee, 2017), temporal relations (O’ Hora et al., 2008), and 
distinction relations (O’ Hora, Pelaez, & Barnes-Holmes, 2005), 
as well as a battery involving multiple relation types (Colbert, 
Dobutowitch, Roche, & Brophy, 2017). Even more exciting, 
several studies have shown that intensive relation-focused 
interventions can significantly increase children’s standardized 
intelligence test scores (Amd & Roche, 2018; Cassidy, Roche, 
Colbert, Stewart, Grey, 2016; Hayes & Stewart, 2016; Parra & 
Ruiz, 2016). These outcomes suggest a potentially important 
link between the kind of behavior change that applied behavior 
analysts target and larger scale measures of functioning that are 
valued by schools and other institutions, making this an area ripe 
for further investigation. 


Developmental Trends in Relational Behavior 


On a reiated topic, if relational abilities are learned over time, 
as RFT proposes, then one would expect to see orderly age- 
correlated changes in these abilities. To be clear, we are not 
suggesting that relational abilities automatically develop as a 
function of age. Rather, age is a marker for multiple-exemplar 
relational experience that, under the right circumstances, 
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Figure 20.11 Left: Number of children (out of 10) in several age groups who 
demonstrated mutual and combinatorial entail ment based on hierarchical stimulus 
relations. Each panel compares competence on a relatively simple (nonarbitrary 
containment) and relatively complex (arbitrary hierarchical) categorization. Right: 
Percentage of correct responses by children in several age groups who demonstrated 
self-referential relations. Results are shown separately for non-deictic (simple relations) 


and two kinds of deictic relations. 


Left: Based on Table 5 of “Investigating Relational Framing of Categorization in Young Children,” by 
T. Mulhern, |. Stewart, & J. McElwee, Psychological Record, 67, 519-536. Right: Adapted from Figure 1 of 
“Perspective-taking as Relational Responding: A Developmental Profile,” by L. McHugh, Y. Barnes-Holmes, 


& D. Barnes-Holmes, Psychological Record, 54, 115-144. 


accumulates over time. It is entirely possible for an individual 
of any age to lack the relevant experience to exhibit a particular 
type of generalized relating. A few investigations have directly 
sought evidence of the development of relational abilities. Two 
studies, each using a longitudinal design with a single child, 
found evidence of equivalence-class formation only after about 
19 to 23 months of age (Lipkins, Hayes, & Hayes, 1993; Luciano 
et al., 2007). Such findings are interesting for two reasons. 
First, in these studies, equivalence-class formation was pre- 
ceded by the appearance of presumably prerequisite relational 
abilities, such as symmetrical relations. Second, normatively 
speaking, it is around this same age that children first reliably 
exhibit verbal behavior (e.g., the naming of objects) that might 
be thought to draw upon equivalence relations (Luinge, Post, 
Wit, & Goorhuis-Brouwer, 2006). 

Regarding how early in development relational abilities 
can arise, the longitudinal studies just described are somewhat 
ambiguous, because they provided relational experience along 
with assessing relational abilities. Other studies have used cross- 
sectional designs that provided only enough relational experience 
to assess performance on a single occasion, thereby reducing the 
chances that the research protocol instated any new abilities. 
Figure 20.11 (left panel) summarizes selected findings of a study 
that assessed hierarchical relations in children of different ages 
(Mulhern et al., 2017). The figure shows the number of children, 
out of 10, in each age cohort who met a mastery criterion for 
each of several kinds of relations. The general pattern was for 
derived mutual entailment and combinatorial entailment based on 
hierarchical classes to become more prevalent with age. Shown 


separately are the results for relatively difficult hierarchical rela- 
tions versus nonarbitrary containment, an elementary, nonhier- 
archical type of categorization that is based on simple physical 
properties of stimuli like size and color. Developmental trajecto- 
ries were steeper for nonarbitrary containment, which all children 
demonstrated in the age 7 to 8 cohort, than for hierarchical rela- 
tions, which were not reliably present even for the older children 
who were tested. This is consistent with the general assumption 
that “difficult” levels of categorization develop relatively slowly 
(Bornstein & Arterberry, 2010). 

Figure 20.11 (right panel) summarizes the findings of 
a study that assessed deictic relations in individuals aged 3 to 
30 years (McHugh et al., 2004). For each relation type, the figure 
shows the percentage of assessment trials on which these indi- 
viduals answered correctly. Simple deictic relations, which do 
not involve perspective shifting, were evident even in the young- 
est children. Reversed and double-reversed relations, which do 
involve perspective shifting, were absent before age 6, and not 
completely reliable even in the 18 to 30 age cohort. This find- 
ing is intriguing for two reasons. First, the spotty acquisition of 
reversed/double-reversed relations is consistent with the assump- 
tion that perspective shifting is a difficult skill—for instance, 
compare to the developmental pattern for hierarchical relations 
in the left panel. Second, the trend for reversed/double-reversed 
relations is broadly consistent with other research reporting 
that, in normally developing children, reliable evidence of self- 
focused constructs like “theory of mind” arises around age 5. 
Note, however, that there may be cultural differences in devel- 
opmental trajectory, as Chinese children show “theory of mind” 
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earlier than U.S. children (Sabbagh, Xu, Carlson, Moses, & Lee, 
2006). Assuming that “culture” implies different kinds of social 
learning environments, this finding is consistent with the RFT 
position that relational abilities are learned. 


DERIVED STIMULUS RELATIONS 
AND GENERAL WELL-BEING 


The previous section addressed some big-picture psychologi- 
cal constructs to which derived stimulus relations of various 
types may be related. Here is one more: general psychological 
well-being. “Well-being” can mean many things, but a critical 
component likely involves behaving in ways that make contact 
with one’s most valued reinforcers. A three-term-contingency 
view assumes that behavior follows reinforcement, so a question 
of interest is when this would ever not happen. As we describe 
below, one answer intersects with what is known about derived 
stimulus relations, and this sets the stage for acceptance and 
commitment therapy (ACT), an evidence-based behavior 
therapy that counts RFT among its influences and is designed to 
help people better contact their reinforcers (Hayes, 2005; Hayes, 
Strosahl, & Wilson, 2016; see also McEnteggart, 2018). 

Soon we will describe selected aspects of ACT inter- 
ventions, which, unlike conventional ABA interventions, rely 
heavily on verbal exercises (e.g., Assaz, Roche, Kanter, & Oshiro, 
2018).'® This is different enough from the modus operandi of 
conventional ABA that we must begin by explaining ACT’s con- 
ceptual undergirding so that its verbally based interventions make 
sense. Doing so will also clarify how ACT connects to the derived 
stimulus relations described here and in Chapter 19 (for more on 
these connections, see Assaz, et al., 2018; McEnteggart, 2018). 

Because some effort will be required to digest ACT’s 
precepts, let us try to provide some motivation for doing so: 
ACT works. Behavior analysts are interested in data and, as an 
exercise in reinforcer sampling, Figure 20.12 presents some, in 
the form of outcomes from an intervention for direct support 
staff who worked with clients with developmental disabilities 
(Chancey et al., in press). During baseline the staff engaged 
clients in positive interactions only rarely. After an interven- 
tion began, positive interactions with clients more than doubled. 
Note, however, that this intervention was not preceded by a 
functional assessment (Chapter 27) and included no prompts or 
consequences directly targeting interactions with clients. That is, 
the intervention was effective despite omitting much that is used 
in conventional ABA. Later we will describe the intervention, 
but for now we proceed to the present section’s punchline: We 
think that applied behavior analysts should be interested in any 
intervention that works, and curious about how it works, even if 
that means stepping outside the confines of familiar practices. 
In reading about ACT, prepare yourself for a striking departure 
from the type of analysis employed in conventional ABA. 


How Private Events Curtail 
Contact with Reinforcers 


The core rationale of ACT is that private events (see Chapter 18) 
are at the root of a variety of human challenges (Hayes, 2005). 
The following is a trivial example of how private events can be 


problematic. You are waiting for a bus to take you to work, which 
is a significant source of reinforcers. A woman walks by who 
reminds you of an old friend you haven’t seen since high school. 
You begin to think about experiences you and your friend shared 
and you try to imagine what she might be doing today. This rev- 
erie ends suddenly with the sound of your bus—which you hadn’t 
noticed approaching or loading—pulling away. By being “in your 
mind” (Hayes, 2005) rather than in the situation, you have missed 
the bus, and with it a good part of your work day. This isolated 
incident illustrates how private behaviors can compete with public 
behaviors. Now imagine what it would be like to be constantly “in 
your mind.” Engaging in an extensive suite of private behaviors 
might cause you to miss a lot of reinforcers that are contingent on 
your public behavior. This experience is common in a variety of 
clinical disorders (Chawla & Ostafin, 2007), and indeed to many 
people without clinical disorders who fail to get full enjoyment 
out of life (Hayes, 2005). Box 20.6 provides a fictional, but not 
unrealistic, example of the latter. 


Unpleasant Private Events Monopolize Attention and 
Occasion Problem Public Behaviors 


As Box 20.6 suggests, one way in which private events com- 
pete with “well-being behaviors”—those that contact high- 
priority reinforcers—is simply to crowd them out. Aversive events 
may evoke unpleasant emotional responses that interfere with 
well-being behaviors (see Friman, Hayes, & Wilson, 1998). This 
is similar to how, for laboratory animals, “fear” elicited by electric 
shocks can interrupt positively reinforced behavior (e.g., Estes & 
Skinner, 1941). Unpleasant emotions also can potentiate public 
behaviors that interfere with well-being behaviors. The woman 
who kicks her dog after accidentally striking her thumb with 
a hammer illustrates the phenomenon, as do individuals who 
become socially aggressive after experiencing unpleasant events 
(e.g., Roberton, Dafferm, & Bucks, 2012). Also relevant is a 
host of possible avoidance behaviors. For instance, imagine that 
a college professor must write a lengthy book chapter on a tight 
time line. Through stimulus relations formed during past experi- 
ences involving missed deadlines, working on the chapter elicits 
considerable anxiety, which the professor escapes by taking long 
bicycle rides when he should be writing. In this strictly hypothet- 
ical example (having nothing whatsoever to do with either of the 
present authors), biking is negatively reinforced and occurs with 
increasing frequency, even though in the long run this compro- 
mises reinforcers associated with finishing the chapter. 


Transformation of Function Increases 
Contact with Suffering 


Unpleasant events like being mugged (Box 20.3) may happen 
only once, but through transformation of function in derived 
stimulus relations they may be experienced repeatedly. To illus- 
trate, let’s return to the school-anxious boy, Shawn, described in 
Box 20.1. The boy had an unpleasant moment with his teacher, 
but suffering related to that moment expanded to fill a good 
portion of his life away from the teacher. The “spread of suffer- 
ing” (see Dymond et al., 2014) began when direct experience 
(the reprimand) transformed the function of one stimulus (the 
teacher) in an already-existing stimulus class (Figure 20.4). 
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Other stimuli in the class acquired the same function. For 
Shawn, this functionally was like being yelled at by his teacher 
multiple times every day. 

Less direct effects are possible. Let’s say that on vacation 
you are having a lovely time visiting a Scottish castle. You 
adore the architecture, are impressed with the beauty of the 
medieval tapestries hanging on the walls, and find the history 
described by the tour guide to be fascinating. Thus, you’re 
deriving an intensely pleasant stimulus class involving the 
castle and all of these stimuli. Then you stumble upon a coat of 
arms, which reminds you of your mother, who was descended 
from Scots and treasured the coat of arms that had been handed 
down through her family. Your mother passed away 2 years 
ago after a painful illness, and thinking of her makes you feel 


melancholy. This new transformation of function means that 
the stimulus class involving the castle will no longer involve 
untempered joy. 

Finally, generalized relational abilities can contribute to 
the “spread of suffering” by transforming pleasant functions into 
unpleasant ones: 


Consider a six-year-old girl who drew a picture at school. 
Her father says, “That’s nice. You did really well on that!” 
Given her verbal competence, the child will also be in 
contact with the opposite of what is said, that is, “not so 
nice” and “not doing very well.” She may not contact this 
at the moment her father speaks, but she may do so at a 
later time, such as when she tries to draw another picture 
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BOX 20.6 


The Private Events of Walter Mitty 


“We're going through!” The Commander’s voice was 
like thin ice breaking. He wore his full-dress uniform, 
with the heavily braided white cap pulled down rakishly 
over one cold gray eye. “We can’t make it, sir. It’s spoil- 
ing for a hurricane, if you ask me.” “I’m not asking you, 
Lieutenant Berg,” said the Commander. “Throw on the 
power lights! Rev her up to 8,500! We’re going through!” 
(Thurber, 1999, p. 55) 


So begins the iconic James Thurber short story, “The 
Secret Life of Walter Mitty” (which was loosely adapted into 
a 2013 film starring Ben Stiller and Kristin Wiig). The story 
follows a middle-aged man through a series of afternoon 
errands in town with his wife. There is no real hurricane or 
Lieutenant Berg; these are part of elaborate fantasies in which 


and it doesn’t turn out the way she wanted it to. She will 
not need anyone else to make the judgment that this is 
“not so nice” or that she “did not do well.” (Torneke, 
2010, pp. 136-137)” 


In the end, it doesn’t matter whether unpleasant experiences are 
“real” or derived—in fact, there is no difference in the experience 
of the two (Friman et al., 1998). Because of transformation of 
function within stimulus classes, unpleasant private events that 
constrain well-being behaviors can occur any place, at any time. 


Excessive Rule Following Interferes 
with Well-being Behaviors 


Recall the learned helplessness paradigm described in Box 20.3. 
Unpleasant experiences, like being mugged, can lead to the 
creation of generalized self-rules that occasion behaviors 
that interfere with well-being behaviors (recall how an “I’m 
ineffective” rule undermined college classes and social rela- 
tionships). Rule-governed behavior, also called instructional 
control, occurs when verbal stimuli (rules) occasion behavior, 
alter motivating operations, and/or specify the consequences of 
that behavior (Hayes, 1989; Skinner, 1969). Rules and instruc- 
tions can come from other people, as when a parent tells a child, 
“To avoid being hit by a car, look both ways before crossing 
the street.” However, people contacting environmental contin- 
gencies also generate their own rules, and these verbal stim- 
uli can control behavior just as externally generated rules do 
(e.g., Rosenfarb, Newland, Brannon, & Howey, 1992). Behavior 
under the control of rules (both self- and other-generated) often 
is insensitive to change by local contingencies (Hayes, 1989; 
Skinner, 1969). 

RFT views rule governance as operating through derived 
stimulus relations. Rules are thought to control behavior partly 
because they include stimuli (words) that participate in stimu- 
lus classes with “real” (nonverbal) stimuli that control behav- 
ior. For example, for a student whose failed geometry test 


Mitty engages while the day’s events unfold around him. 
At various turns, Mitty pictures himself a brave military com- 
mander, a famous surgeon, and a convict facing a firing squad. 
The story is usually said to illustrate a man’s desperate 
efforts to escape from a boring life, but quite possibly his life is 
boring (devoid of external reinforcers) because of the fantasies. 
Each time Mitty lapses into a fantasy, to all the world he becomes 
behaviorally inert. He misses out on conversations. He fails to 
move his car forward when the traffic light changes. He forgets 
what to buy in a store. The private behaviors of his fantasies 
crowd out public behaviors that might contact reinforcers. As 
a result, other people—including his wife—think Mitty strange 
and either withdraw from him or browbeat him in attempt to 
prompt useful behavior. This magnifies Mitty’s isolation. 


evokes the rule “I am terrible at math,’ the words “terrible” 
and “math” are experienced in much the same way as the 
actual unpleasant experience (failing the test) to which they 
are related. You might recognize self-rules as similar to deic- 
tic relations, person-location-time relations in the format of 
I-HERE-NOW. Self-rules can pervade everyday experience and 
broadly disrupt well-being behaviors when they are extended 
along deictic dimensions. The self-rule “I am terrible at math” 
is based on an experience that occurred at a specific time and 
place, but describes (hypothesized) outcomes at all times and 
places. This is an extension on the HERE/THERE and NOW/ 
THEN dimensions. The same happened with the mugging vic- 
tim of Box 20.3: A factual statement (J was not effective in 
that place at that time) extended to other situations where it 
might not be accurate but still could control behavior (J am 
ineffective at all times and in all places). Other problem rules 
may conflate the I/YOU terms, as when a man whose father 
was divorced 4 times derives that “J can never find lasting 
happiness in a relationship” (it is the father, not the son, whose 
relationship skills are in question). For an accessible general 
introduction to how stimulus relations facilitate rule gover- 
nance, see Torneke (2010, Chapter 6). 

Self-rules become problematic when the public behav- 
iors they control interfere with well-being behaviors. Box 20.7 
describes an example. Note especially in this example how 
behaviors controlled by self-rules are maintained by negative 
reinforcement. In cases like this, unpleasant private events create 
the motivating operations for both public behavior and the self- 
rules that control it. Behavior that terminates these private 
events is automatically strengthened (e.g., Dymond & Roche, 
2009). Thus, for the school-avoidant boy in Box 20.1, hiding 
in the neighborhood until after the school bus departs prevents 
school-related fear. For the germ-obsessed parent in Box 20.7, 
cleaning reduces anxiety. This is one reason why rule-governed 
behavior is insensitive to change: It generates its own immediate 
reinforcers (McEnteggart, 2018). 
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BOX 20.7 


The Sanitation Complication?" 


Joachim was the father of two girls, ages 2 and 3. When the 
first daughter was born, he and his wife decided that Joachim 
would be a stay-at-home dad, because Joachim’s wife earned 
more money than he did as a first-grade teacher, and in that 
job Joachim had been great with kids. Joachim looked for- 
ward to teaching his girls about the world through “field 
trips,” like those he used to arrange for his students, to the 
park, to museums, and to the zoo. He was excited to help 
the girls develop strong basic academic skills before they 
entered school. 

But Joachim struggled unexpectedly in his role as full- 
time parent. For reasons even he did not understand, Joachim 
worried constantly about protecting his daughters from germs, 
and he developed a suite of self-rules like “I have to keep my 
children safe from disease in order to be a good parent,” and 
“T need to keep the house clean.” Before long Joachim became 
a cleaning machine. He walked around the house with a bottle 


How ACT's Verbally Guided Exercises Aim to 
Short-circuit Harmful Private Events 


In ACT, the persistent interference of private events with 
well-being behaviors is called psychological inflexibility (Hayes, 
2005). We will call this behavioral inflexibility, and it is the 
opposite of flexibility, a capacity to be “in the moment” such that 
one’s behavior is under the real-time stimulus control of events 
happening in the current situation (Hayes, 2005). ACT inter- 
ventions use mainly verbally guided exercises to disrupt inflex- 
ibility and to potentiate behavior that contributes to well-being 
(Harris, 2008; Hayes et al., 2016). One type of ACT exercise, 
called values clarification, is designed to help people identify 
their most valued positive reinforcers in important life domains 
such as relationships, work, and leisure (Lundgren, Dahl, & 
Hayes, 2008). Among other functions, these exercises aid in 
recognizing behaviors on which the high-priority reinforcers are 
contingent (and with which private events may interfere). 

A second kind of ACT exercise is designed to alter the 
aversive function of private events. Some exercises mimic expo- 
sure therapies (e.g., Wolpe, 1961) by forcing repeated contact 
with anxiety-provoking thoughts—for example, by repeatedly 
speaking aloud those thoughts until their emotional properties 
abate and the thought-statement is experienced merely as sound; 
or by imagining the anxiety-provoking thought being spoken in 
the ridiculous voice of a cartoon character (Harris, 2008). 

A third kind of ACT exercise is designed to lessen con- 
trol of private events over other behavior. An organizing con- 
cept is mindfulness, which has been defined as “bringing one’s 
complete attention to the present experience on a moment- 
to-moment basis” (Marlatt & Kristeller, 1999, p. 68), and as 
observation without judgment (Baer, 2003). Some mindfulness 
exercises, like “Take Ten Breaths” in Box 20.8, focus on mun- 
dane external stimuli to distract attention from troublesome pri- 
vate events.22 Others (like “Floating Leaves” and “Soles of the 


of cleaning spray, wiping down every doorknob and light 
switch that he passed. He scrubbed the floors and disinfected 
every surface of the kitchen multiple times each day. 

These behaviors “worked” in the sense that they left the 
home spotless and reduced Joachim’s anxiety about germs. But 
they were incompatible with many of the things Joachim had 
planned to do as a parent. As a result, he rarely played with 
the girls, and when he did, the games were interrupted by fre- 
quent spot cleaning. Joachim did not arrange any “field trips” 
or teach the girls to read because he was too busy disinfecting. 
In short, Joachim became an absentee parent in his own home. 
This had adverse effects on his daughters, and it deprived 
Joachim of many of the joys of interacting with them and of 
helping them develop as people. At a conscious level Joachim 
understood this tradeoff, but the notion of being responsible 
for a dangerously unsanitary environment was so aversive that 
he simply did not think he could help himself. 


Feet” in Box 20.8) first focus attention on the mundane and 
then shift it to private thoughts and rules, including those that 
may be problematic. Here the goal is to teach how to notice pri- 
vate events without acting on them. For example, school-fearful 
Shawn (of Box 20.1) may feel anxious and afraid when he hears 
his school bus, but he can learn to observe these private events 
and still get on the bus. The cleaning-obsessed parent can notice 
that he repeats his rules about protecting his children without 
having to scrub floors every hour. 


Effectiveness Evidence 


By now it should be obvious that ACT interventions take a very 
different form than those of conventional ABA. We understand 
that readers who are accustomed to functionally analyzing 
behavior in terms of its relation to external stimuli and conse- 
quences may find ACT difficult to understand?? and may, like 
some other behavior analysts, resist the suggestion that public 
behavior should be examined through the lens of private events 
(e.g., Baum, 2011; Fisher, Groff, & Roane, 2011). Yet, as empir- 
icists, behavior analysts are committed to following the data, and 
there is considerable evidence supporting the effectiveness of 
ACT for “well-being” challenges in many domains. This includes 
mental health issues in schools (Burckhardt, Manicavasagar, 
Batterham, Hadzi-Pavlovic, & Shand, 2017; Van der Gucht et al., 
2017) and in a variety of psychological conditions like anxiety, 
depression, addiction, chronic pain, and somatic health problems 
(e.g., A-Tjak et al., 2015; Hann & McCracken, 2014; Kanter, 
Baruch, & Gaynor, 2006). Positive effects are not, however, lim- 
ited to clinical conditions. For example, ACT (or an interven- 
tion using some of its components) has reduced workplace stress 
(Ly, Asplund, & Andersson, 2014), increased college students’ 
engagement with physical activity (Butryn, Forman, Hoffman, 
Shaw, & Juarascio, 2011), and improved athletes’ performance 
in competition (e.g., Schwanhausser, 2009). 
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BOX 20.8 


Three ACT Mindfulness Exercises 


These exercises instruct “observation without judgment” 
(attending to stimuli without otherwise reacting to them). The 
goal of practicing them is not to prevent problematic private 
events but rather to disrupt their control over other behavior. 


Take Ten Breaths (Harris, 2008, p. 35) 


1. Throughout the day, pause for a moment and take ten 
slow, deep breaths. Focus on breathing out as slowly 
as possible, until the lungs are completely empty, and 
breathing in using your diaphragm. 

2. Notice the sensations of your lungs emptying and your 
ribcage falling as you breathe out. Notice the rising and 
falling of your abdomen. 

3. Notice what thoughts are passing through your mind. 
Notice what feelings are passing through your body. 

4. Observe those thoughts and feelings without judging 
them as good or bad, and without trying to change them, 
avoid them, or hold on to them. Simply observe them. 

5. Notice what it’s like to observe those thoughts and feel- 
ings with an attitude of acceptance. 


Floating Leaves on a Moving Stream 
(Hayes, 2005, pp. 76-77) 


Imagine a beautiful, slow-moving stream. The water flows over 
rocks, around trees, descends downhill, and travels through a 
valley. Once in a while, a big leaf drops into the stream and 
floats away down the river. Imagine you are sitting beside that 
stream on a warm, sunny day, watching the leaves float by. 
Now become conscious of your thoughts. Each time a 
thought pops into your head, imagine that it is written on one 
of those leaves. If you think in words, put them on the leaf as 
words. If you think in images, put them on the leaf as an image. 
The goal is to stay beside the stream and allow the leaves on 
the stream to keep flowing by. Don’t try to make the stream 
go faster or slower; don’t try to change what shows up on the 
leaves in any way. If the leaves disappear, or if you mentally 
go somewhere else, or if you find that you are in the stream 
or on a leaf, just stop and notice that this happened. File that 


Interventions using ACT or some of its components have 
reduced the stress and depressive symptoms of parents of chil- 
dren with autism spectrum disorder (ASD; Cachia, Anderson, & 
Moore, 2016; Singh et al., 2014), as well as developmental dis- 
abilities staff (Biglan, Layton, Jones, Hankins, & Rusby, 2013; 
McConachie, Mckenzie, Morris, & Walley, 2014). ACT also has 
been used to increase positive interactions between staff and 
clients (Castro, Rehfeldt, & Root, 2016). Figure 20.12, which 
we introduced earlier with little explanation, illustrates with 
data from a study by Chancey et al. (in press). Staff completed 
mindfulness exercises, but the intervention did not focus on 
client interactions per se, and there was no in situ prompting 


knowledge away and then once again return to the stream, 
watch a thought come into your mind, write it on a leaf and let 
the leaf float away down the stream. 


Meditation on the Soles of the Feet 
(Singh et al., 2011, p. 1055) 


This exercise was used in an intervention, to be discussed 
shortly, to reduce aggression by adolescent boys (results are 
summarized in Figure 20.12). 


1. If you are standing, stand in.a natural rather than an 
aggressive posture, with the soles of your feet flat on 
the floor. 

2. If you are sitting, sit comfortably with the soles of your 
feet flat on the floor. 

3. Breathe naturally, and do nothing. 

4. Cast your mind back to an incident that made you very 
angry. Stay with the anger. 

5. You are feeling angry, and angry thoughts are flowing 

through your mind. Let them flow naturally, without 

restriction. Stay with the anger. Your body may show 
signs of anger (e.g., rapid breathing). 

Now, shift all your attention fully to the soles of your feet. 

7. Slowly, move your toes, feel your shoes covering your 
feet, feel the texture of your socks, the curve of your arch, 
and the heels of your feet against the back of your shoes. 
If you do not have shoes on, feel the floor or carpet with 
the soles of your feet. 

8. Keep breathing naturally and focus on the soles of your 
feet until you feel calm. 

9. Practice this mindfulness exercise until you can use it 
wherever you are and whenever an incident occurs that 
may otherwise lead to you being verbally or physically 
aggressive. 

10. Remember that once you are calm, you can walk away from 
the incident or situation with a smile on your face because 
you controlled your anger. Alternatively, if you need to, you 
can respond to the incident or situation with a calm and 
clear mind without verbal threats or physical aggression. 


a 


or reinforcement of interactions with clients. Rather, improved 
interactions presumably were a diffuse side effect of improved 
staff well-being. 

Because of ACT’s reliance on verbal exercises, it is not 
yet known how broadly applicable it may be to persons with 
intellectual disabilities. However, a few published applications 
exist, all focusing on higher-functioning individuals. In one 
case, ACT exercises targeting anxiety, coupled with behavioral 
skills training, improved job-interviewing performance in two 
of three individuals with developmental disabilities (Brazeau 
et al., 2017). A randomized controlled trial found mindfulness 
exercises to reduce symptoms of depression and anxiety in 
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adults with an ASD diagnosis (Spek, Van Ham, & Nyklicek, 
2013). For a further example, examine Figure 20.13, which 
shows the effects of mindfulness practice on aggression in three 
adolescents with an ASD diagnosis (Singh et al., 2011). In this 
case, the mindfulness exercise (“Soles of the Feet” in Box 20.8) 
was expressly intended to create replacement behavior for 
aggression. During the first week of the intervention, the boys 
practiced the exercise with parental assistance for 30 minutes. 
Afterward they practiced under parental supervision twice 
per day until aggression was absent for 4 consecutive weeks. 
Practice stopped at that point, but the boys retained a digital 
audio recording of the exercise instructions that could be used as 
desired. Serious aggression (hitting, kicking, and biting), which 
occurred during baseline about 2 to 3 times per week, dropped to 
near-zero levels following introduction of mindfulness exercises 
and stayed there for about 3 years. 

Although few published ACT interventions have targeted 
persons with developmental disabilities directly, an espe- 
cially interesting phenomenon is that when ACT is used to 
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improve the well-being of parents or staff, those they care for 
may experience positive trickle-down effects (Cachia et al., 
2016). Figure 20.14 shows an example (Singh et al., 2014). 
Participants were three mothers of high-functioning adoles- 
cents with an ASD diagnosis. All three had in the past worked 
with applied behavior analysts but had abandoned behav- 
ioral programs because of the stress of daily implementation; 
they had not used behavioral programs for at least 2 years 
prior to the study. Across 8 weeks, the mothers completed 
a once-weekly workshop focusing on mindfulness-based 
meditation. Afterward, they were instructed to continue prac- 
ticing mindfulness each morning, and parent diaries showed 
that on most days they did. Mother stress, as measured by a 
self-report rating scale, decreased from baseline to the practice 
phase. Concomitantly, although the workshops did not focus 
on behavioral interventions or even on the changing of child 
behavior, incidents of child aggression decreased. Disruptive 
behavior also decreased, and child compliance with mother 
requests increased (not shown in Figure 20.14). 
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Figure 20.14 Weekly incidents of aggression by three adolescents with an autism spectrum 


disorder diagnosis. 


Republished with permission of Springer, Mindfulness-based Positive Behavior Support (MBPBS) for mothers of 
adolescents with autism spectrum disorder: Effects on adolescent behavior and parental stress, N. N. Singh and 
colleagues, 2014, Mindfulness, 5, pp. 646-657. Permission conveyed through Copyright Clearance Center, Inc. 


Section Conclusion: The Functional 
Significance of “Well-Being” 


Results like those shown in Figures 20.12 through 20.14 
capture the essence of what is meant by “well-being” and of 
why applied behavior analysts should be interested in this con- 
struct. In contrast to most ABA interventions, those grounded 
in ACT do not usually target discrete problem behaviors, incor- 
porate direct prompts, or systematically manipulate externally 
arranged consequences. They may teach no specific replace- 
ment behaviors. Yet by addressing very general aspects of 
functioning, they can have a positive impact on the kinds of 
problems that applied behavior analysts often address. Practi- 
tioners of ACT might argue, therefore, that conventional ABA 
has ignored an important context for behavior problems, that 
taking place within the individual. Well-being, from the per- 
spective of ACT, exists when troublesome private events do not 
occasion problem behaviors or constrain behavior that contacts 
reinforcement. 

Whether that argument registers or not, ACT is worth 
exploring for at least two reasons. First, relevant to the mission 
of the present chapter, ACT extends concepts of derived stim- 
ulus relations in unexpected directions, and this necessarily 
challenges the interested observer to make sure those concepts 


are well understood. Second, ACT illustrates how the scope of 
practice of an applied science can be much broader than what 
is often seen in ABA, which has remained heavily focused on 
persons with disabilities. Those who think ABA should system- 
atically address the full range of human problems and interests 
(e.g., Friman, 2006; see also Chapter 1) should be intrigued 
by ACT’s primary focus on typically developing people, and 
its contention that verbal behavior contributes to many human 
problems and can therefore be used as a tool of intervention. 
Those who are not persuaded by ACT’s theoretical framework 
must reckon with data showing that it works and generate a 
compelling alternative explanation for why. Overall, we believe 
that anything that forces careful thinking about how to do 
the most good for the most people will be good for ABA and the 
people it serves. 


A FINAL COMMENT 


You would not be alone if you found reading the present chap- 
ter to be hard work. It is difficult to learn about equivalence 
classes (Chapter 19), and things become more complicated once 
the many additional ways of relating are taken into account. 
A goal of this chapter has been to persuade you that the effort 
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required to master this difficult topic is worth it. We have men- 
tioned a number of ways in which derived stimulus relations con- 
cepts map onto important everyday behaviors and suggested some 
avenues for application. Yet even though research on derived 
stimulus relations began almost 50 years ago, bench-to-bedside 
translation has been fairly slow in developing (e.g., Critchfield, 
2018; Dixon, Belisle, Rehfeldt, & Root, 2018; Rehfeldt, 201 1). 

There are good reasons for this. To learn about derived 
stimulus relations, one must first master a new vocabulary, 
although, as you have seen, derived stimulus relations can be 
complex enough that mere words do not do them justice (hence 
a heavy reliance on diagrams). This is a problem because most 
people don’t think naturally in terms of the alternatives, like 
arrow-laden schematics. Humans, after all, are storytellers 
by nature (e.g., Hineline, 2018), not diagrammers and logi- 
cians. Something that this chapter and Chapter 19 can sug- 
gest but not fully illuminate is that, because derived stimulus 
relations have many moving parts, interventions focusing on 
them can be effortful to devise. No matter what potential may 
exist to harness derived stimulus relations concepts to create 
valuable behavior change, this can only happen after someone 
has carefully determined what to teach, how to teach it, and 
what derived abilities are expected to result from this. 

The challenge of fully harnessing the potential of derived 
stimulus relations interventions now falls to you, the student, and 
your peers. To accomplish this, you must become so fluent with 
derived stimulus relations concepts, and the procedures arising 
from them, that applications and extensions suggest themselves 
effortlessly. Fortunately, there is ample precedent for master- 
ing such challenges. Those who teach behavior analysis know 
that many people find even its most rudimentary concepts, like 
positive/negative reinforcement/punishment, to be extremely 
difficult. But as the present book illustrates, for those who 
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SUMMARY 


What Are Nonequivalence Relations? 
Why Do They Matter? 


1. Stimulus relations are arbitrary associations between 
stimuli, with “association” meaning that how one stimulus 
is responded to is intertwined in some way with how other 
stimuli are responded to. 


2. Stimulus equivalence is only one of a wide variety 
of stimulus relations. Not all stimuli are related via 
“game as.” and nonequivalence relations are integral to 
many everyday experiences. 


The Vocabulary of Nonequivalence Relations 


3. Key ways of talking about stimulus equivalence do 
not work for nonequivalence relations, so a substitute 
vocabulary is needed. 


4. What’s called symmetry in the stimulus equivalence 
literature is subsumed under the term mutual entailment, 
which means the converse of any A-to-B relation. For 


persist, new concepts eventually become intuitive and essential 
building blocks for very sophisticated procedures. 

Many opportunities exist to develop new derived stimulus 
relations interventions that target specific skills. Built prop- 
erly, these are not just effective but also efficient, establishing 
new behaviors without directly teaching them (Critchfield, 
2018; Rehfeldt, 2011; Chapter 19). In this regard, you might 
say that derived stimulus relations interventions can help ABA 
become better at what it historically has done. However, in 
this chapter we mentioned generalized relational repertoires 
(AARR), and devoted substantial space to big-picture con- 
structs like intelligence and well-being, to illustrate that the 
stakes are higher than this. Looking past the complexities of 
the underlying concepts, you should have noticed, here and 
in Chapter 19, that derived stimulus relations research is not 
limited to particular populations or behavior problems. In this 
regard, it addresses the long-standing aspiration for behav- 
ior analysis to be a general-purpose science relevant to all of 
human experience (e.g., Skinner, 1953). There is one more 
consideration. Mastering derived stimulus relations concepts 
and following your curiosity about them may lead to work 
that (as the example of ACT illustrates) looks very different 
from conventional ABA. You may be concerned, therefore, 
about straying too far from the rich traditions that have made 
ABA so successful. To a large extent, derived stimulus rela- 
tions concepts expand, rather than invalidate, ABA’s founda- 
tions, but, inevitably, mastering and applying those concepts 
requires updates in how ABA does business. Fear not; we have 
it on good authority that change is acceptable. 


Regard no practice as immutable. Change and be ready 
to change again. Accept no eternal verity. Experiment. 
(Skinner, 1979, p. 346) 


example, if the original relation is “A is greater than B,” 
the converse will be “B is less than A.” 


5. What’s called transitivity in the stimulus equivalence 
literature is subsumed under the term combinatorial 
entailment, which means any derived relation in which the 
function of stimuli depends on the other stimuli to which 
they are indirectly related. 


6. What’s called transfer of function in the stimulus 
equivalence literature is subsumed under the term 
transformation of function. This means that the stimuli 
in a relational class do not all acquire the same behavioral 
functions; however, the functions acquired by any stimulus 
depend on the functions acquired by others in the class. 


Some Types of Nonequivalence Relations 


7. Structured protocols similar to those used in equivalence- 
based instruction have been successful in establishing 
nonequivalence repertoires of everyday importance. 


568 


Part 8 * Developing New Behavior 


8. The notion that there are many ways of relating the same 


10. 


11. 


stimuli implies the existence of contextual stimuli that 
signal which way of relating is appropriate to a given 
situation. Explicit contextual stimuli are recommended as 
part of applied interventions focusing on derived stimulus 
relations. 


. Distinction relations involve relative stimuli being judged 


as different. Jn opposition relations, stimuli are oppo- 
sites. In comparison relations, stimuli differ along some 
definable dimension. 


Stimuli can also be related in terms of spatial or 
temporal juxtaposition and in terms of cause-effect 
relations. 


Structured tools are just beginning to become available 
for programming derived stimulus relations like those 
described here. 


Theoretical Foundations 


Nonequivalence Relations and Big-Picture Psychological 
Constructs 


15. 


16. 


17. 


Categorical knowledge: Establishing a type of nonequiv- 
alence relation called hierarchical relations produces 
several effects seen in cognitive psychology studies of 
categorization. 


Self-concept: A type of nonequivalence relation called 
deictic relations may underlie important self-related and 
social repertoires, a primary example of which is perspec- 
tive shifting, in which an individual experiences the world 
as another person would. 


Intellectual development: Normatively speaking, 
various types of derived stimulus relations tend to 
emerge gradually during child development. 
Importantly, systematically training stimulus relations 
has been found to boost scores on standardized intelli- 
gence tests. 


12. Most research on nonequivalence relations has been 
guided by relational frame theory, which assumes 


Derived Stimulus Relations and General Well-Being 


18. “Well-being behaviors” are those on which a person’s 


13. 


14. 
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(a) that verbal processes are bound up in derived 
stimulus relations, and (b) that given enough reinforced 
experience with multiple exemplars, people develop 

a higher-order ability to derive relations among sets 

of stimuli, even in the absence of further reinforced 
experience. 


The role of verbal behavior in derived stimulus relations 

is not fully understood, but assuming a close connection 

has led to interesting research. For example, as the theory 
would predict, providing reinforced multiple-exemplar 
experience has been shown to create generalizable relational 
abilities. This may be an example of what relational frame 
theory calls arbitrarily applicable relational responding, a 
generalized repertoire that allows new classes to form with- 
out new reinforced experience. 


It has also been found that, for some learners, the 

baseline relations of stimulus classes can be built simply by 
telling learners about them. Among other things, this find- 
ing could imply that verbal experience is functionally sim- 
ilar to the largely nonverbal match-to-sample experience 
provided in many stimulus equivalence protocols. 


KEY TERMS 


acceptance and commitment therapy 


arbitrarily applicable relational 
responding 


arbitrary relations 
behavioral inflexibility 
causal relations 


contextual stimulus 


deictic relations 
derived relations 
distinction relations 


hierarchical relations 


mutual entailment 


combinatorial entailment 


multiple-exemplar trainin 


most valued reinforcers are contingent. Many challenges 
to general psychological well-being involve the constrain- 
ing of well-being behaviors by private events that include 
emotional responses and self-generated rules. 


19. Acceptance and commitment therapy is a behavior ther- 


20. 


nonequivalence relations 


oO 
S 


apy based partly on relational frame theory and research 
on derived stimulus relations (including nonequivalence 
relations). It focuses on promoting general well-being. In 
many cases, the therapy does not target public behaviors 
relevant to the referring problem. Instead, to undermine 
the control of behavior by problematic private events, the 
therapy employs verbally guided exercises that are very 
different from most ABA interventions. 


The therapy has been shown to lessen symptoms of 
certain clinical disorders and to enhance performance 
in nonclinical situations. Another important application 
has been to reduce stress and alleviate depression in 
parents and caregivers of persons with disabilities. 

In some documented cases, improving caregiver 
well-being has been followed by improvements in 
client behavior. 
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perspective shifting 
relational frame theory 
relational frame 
rule-governed behavior 
spatial relations 
transformation of function 


temporal relations 
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MULTIPLE-CHOICE QUESTIONS 


1. Relational responding that experience does not directly 


teach, but that arises as an indirect function of experience 
is called: 


a. Emergent functions 
b. Derived functions 
c. Generative effect 
d. All of the above 


Hint: (See “The Vocabulary of Nonequivalence Relations’’) 


2. The act of responding as if you see the world as from 
another’s vantage point, or perceive a situation as if you 
were in another’s position is called: 

a. Perspective shifting 
b. Empathy 

c. Social success 

d. Positive regard 


Hint: (See “Self Concept: Diectic Relations”) 


3. The core rationale of acceptance and commitment therapy 
suggests that the root of human challenges is/are: 


a. Private events 
b. Personality 

c. Past behavior 
d. All of the above 


Hint: (See “How Private Events Curtail Contact with 
Reinforcers’’) 


ESSAY-TYPE QUESTIONS 


1. Discuss the types of nonequivalence relations. 


Hint: (See “Some Types of Nonequivalence Relations”) 


2. Name any two tools used for planning interventions 
involving nonequivalence relations. 


Hint: (See “Three Practical Considerations”) 
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NOTES 


1. In keeping with tradition, we'll use the term stimulus relations as shorthand 
for various “ways of relating” (Reese, 1968; see also Critchfield, Barnes- 
Holmes, and Dougher, 2018). But the term is a bit of a misnomer. A stim- 
ulus relation is really a behavioral relation, a pattern in how an individual 
responds to groups of stimuli. 

2. Compared to the semantic networks that cognitive researchers have mapped out, 
Figure 20.1 is quite rudimentary. Beer is related to many “bits of the world” that 
are not shown here, and each “bit” can itself be related to many other things. Cold 
is opposite to hot. Lager is made using bottom fermentation. A baseball stadium 
may remind you of your father, who used to take you to games, and so on. 

3. But note that stimulus relations allow you to respond to a stimulus even 
when it is not physically present. For instance, you do not need to have 
a cold malt beverage in front of you to talk about beer, to picture how it 
looks, to imagine its taste, or to recall the embarrassing things you did last 


4. In acceptance and commitment therapy, the persistent 
interference of private events with well-being behaviors is 
called: 


a. Maladaptive behavior 

b. Psychological inflexibility 

c. Psychosocial dysfunction 

d. Negative emotionality 

Hint: (See “How ACT’s verbally guided Exercises aim to 


short-circuit harmful private events’’) 


5. Classifying a stimulus as a member of a higher-order class 
on the basis that it is a member of the subclass that is a 
member of that higher-order class is called: 


a. Asymmetrical class containment 

b. Unilateral property induction 

c. Transitive class containment 

d. None of the above 

Hint: (See “Three characteristics of Hierarchical Categories”) 
6. The repeated acquisition of some behavioral repertoire 


with variations in stimuli, situations, or response feature 
is called 


a. Stimulus generalization 

b. Response generalization 

c. Single-exemplar training 
d. Multiple-exemplar training 


Hint: (See “Relational Frame Theory”) 


3. Discuss ACT’s verbally guided exercises that aim to 
disrupt harmful private events. 


Hint: (See “How ACT’s Verbally Guided Exercises Aim 
to Short-circuit Harmful Private Events”) 


night after consuming too many. These responses can, instead, be occa- 
sioned by stimuli that participate with beer in arbitrary stimulus relations. 
This point is especially important to the chapter's final section. 


4. Each trial was similar in format to a multiple-choice question. For example, 


when shown a picture of the frontal lobe, the student’s job was to select 
an answer (impulsiveness) to which it was related, and ignore answers to 
which it was not (impaired vision, spatial perception problems, and hearing 
problems). Chapter 19 called this selection-based responding, and it likely 
allowed the students to succeed by responding on the basis of “any relation 
that exists,” not of equivalence specifically. 

5. The third derived property of equivalence classes, reflexivity, will not be 
addressed here. Reflexivity implies that a stimulus in an equivalence class 
is recognized as itself. This is an assumption for nonequivalence classes 
as well. 
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6. 


11. 


The word entailment means simply that one thing is dependent on another, 
so mutual entailment is when one direction of a relation (A to B) depends 
on, or is predictable from, the other (B to A). 

This description is our contrivance for ease of explanation. Actually, the stim- 
uli were several colored disks of paper, arrayed on a table surface such that 
the leftmost or rightmost disk served as a symbol of “many” or “few.” Tech- 
nically, then, this procedure confounded spatial relations (described below) 
with opposition relations, although that does not matter for present purposes. 
This approach appears very different from the match-to-sample proto- 
cols described in Chapter 19. We will say more about the difference in the 
“Theoretical Foundations” section. 

To repeat a point from earlier in the chapter, this might be construed as 
higher-order pairing, in which a neutral stimulus is paired with a stimu- 
lus that has been paired with an already-effective consequence. However, 
because higher-order conditioning typically is weak, it is unclear whether 
this explanation can account for effects like those just described. 

There is an additional problem: For reasons too complicated to deal with 
here, generalized relational repertoires (arbitrarily applicable relational 
responding) are unlikely to arise from incoherent stimulus reiations, even 
with multiple exemplars. 

This phenomenon, of nonidentical stimuli being “the same” by virtue of 
inclusion in a common category, is one basis for hierarchical relations that 
we discuss later in the chapter. 

Except in a handful of studies in which contextual cues are the explicit topic 
of investigation (e.g., Bush et al., 1989). 


. Naming theory (Horne & Lowe, 1996), introduced in Chapter 19, makes a 


similar assumption. 


. These studies illustrate a possible reason why “telling” and “demonstrat- 


ing” procedures, which require no overt practice and provide no feedback, 
can produce derived stimulus relations—they may tap into already- 
established AARR. 


. What's also unclear is just how broadly applicable these generalized rep- 


ertoires are expected to be. The RFT literature is somewhat vague on this 
point. The crux of the matter may concern what, precisely, functions as a 
contextual stimulus. The RFT literature tends to refer to discrete stimuli 
(like those verbal labels such as “opposite’’), but if contextual stimuli can 


16. 


18. 


19. 


20. 


Oi 
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be compounds of discrete stimuli plus aspects of the setting where condi- 
tioning took place, as suggested earlier, then the repertoires they control 
might not readily generalize from one setting to another. 

The traditional term is perspective taking, but that term normally is used 
to emphasize only the difference in perspective between self and other. We 
actually shift perspectives along multiple dimensions, as elaborated in this 
section. 


. Although the Barnes-Holmes Protocol has been used with children 


as young as 3 years of age, because of its narrative format it may yield 
the best results with learners at a developmental age of about 5 or older 
(e.g., Montoya-Rodriguez, Molina, & McHugh, 2017). 

Among several reasons for this is that RFT assumes an intractable link 
between derived stimulus relations and verbal behavior, in which case any 
problem involving derived stimulus relations is also a problem involving 
verbal behavior. Verbal interventions also are compatible with the brief 
consultations through which clinicians often serve clients. 

ACT theorists also assume that, through transformation of function, private 
events can acquire aversive properties and become the motivating operation 
for avoidance behaviors that reduce the private aversive stimulation but 
interfere with well-being behaviors. A complete explanation is beyond the 
scope of the present chapter. 

This example may be easiest to appreciate by recalling from Chapter 19 that 
the baseline relations of stimulus classes are conditional discriminations, 
which means learning not just what a stimulus “is” in relational context 
but also what it “is not.” Thus, opposition relations always are inherent in 
the acquisition of stimulus relations, and through a generalized opposition 
repertoire we are always, by experiencing pleasure, put at risk of experi- 
encing pain when contextual cues evoke “opposite.” 

We thank Ciara McEnteggart for suggesting this example. 

This is, perhaps, functionally similar to response interruption/redirection 
procedures that use “distraction” to prevent imminent stereotypical and 
repetitive public behaviors (e.g., Ahrens, Lerman, Kodak, Worsdell, & 
Keegan, 2011). Other interpretations are possible. 

The primary literature on ACT, which any interested reader will need to 
consult in order to become properly conversant with ACT interventions, 
employs its own complex vocabulary that we mostly avoided here. 


Ci el gg 21 


Imitation, Modeling, and 
Observational Learning 


LEARNING OBJECTIVES 


¢ Define and provide examples of echoics and imitation. 


¢ Discriminate between examples and nonexamples of imitation. 


e Describe the imitation process; state what a trainer should do during the training process, based on learner 


performance. 


Consider a trio consisting of two guitarists and a 
singer-songwriter performing in a contemporary rock 
band. During their opening set, the lead guitarist impro- 
vises a short riff. His accompanying guitarist, and band’s 
newest and youngest member, hears the riff and then 
immediately reproduces it note-for-note. 

Backstage between sets, the singer-songwriter 
says to the lead guitarist, “That was an impressive riff 
you dropped into our first number, but the style doesn’t 
fit our sound. Don’t play that type of riff again.” The 
accompanying guitarist overhears the singer-songwriter’s 
comment. During the group’s second set, the lead guitarist 
bangs out several riffs in the same style that he played in 
the opening number. The accompanying guitarist does not 
imitate these riffs. 


mitation, modeling, and observational learning are respon- 

sible for a great deal of learning, much of it occurring with- 

out the intentional intervention by others. The two guitarists 
anecdote illustrates all three processes. The accompanying gui- 
tarist imitated a riff modeled by the lead guitarist. To be most 
adaptive, observational learning requires more than attending to 
and duplicating the behavior of others. Observational learners 
who discriminate the consequences produced by another per- 
son’s behavior can better determine which behaviors to imitate. 
After observing the singer’s disapproval of the lead guitarist’s 
improvised riff, the accompanying guitarist stopped playing 
similar riffs. 

The first section of this chapter defines imitation and 
outlines an imitation training protocol for learners who do not 
display imitative behavior. The chapter’s second and final sec- 
tions describe how applied behavior analysts use modeling as a 
behavior change strategy and teach children how to be skilled 


observational learners. 


IMITATION 


An imitative repertoire promotes the quick acquisition of behav- 
iors such as a young child’s development of social and com- 
munication skills. Without an imitative repertoire, a person has 
little chance for the agile acquisition of behaviors. Imitation has 
received considerable experimental and theoretical attention 
spanning several decades (e.g., Baer & Sherman, 1964; Carr & 
Kologinsky, 1983; Garcia & Batista-Wallace, 1977; Garfinkle 
& Schwartz, 2002; Wolery & Schuster, 1997). The experimental 
literature shows that individuals can acquire and maintain imita- 
tive behavior in the same fashion as they acquire and maintain 
other operant behaviors. That is, (a) reinforcement increases 
the occurrence of imitation; (b) when some imitative behaviors 
receive reinforcement, other imitative behaviors occur without 
specific training and reinforcement; and (c) some children who 
do not imitate can be taught to do so. 


Imitation Defined 


A technical definition of imitation entails four criteria: (a) The 
imitative behavior is occasioned by another person’s model of 
the behavior (or symbolic representation thereof), (b) the imi- 
tative behavior has formal similarity with the model, (c) the 
imitative behavior follows the modeled behavior closely in time, 
and (d) the model is the primary controlling variable for the 
imitative behavior. 


Models 


A model is an antecedent stimulus with topographical similar- 
ity to the behavior the analyst wants to be imitated. A model 
can be a person who actually demonstrates a behavior (our lead 
guitarist playing a riff), or a model can be symbolic (i.e., a pic- 
ture, video) (Bandura, 1977). 

Planned models are prearranged demonstrations of behavior 
that show the learner exactly what to do, be it acquiring new skills 
or refining the topography of certain elements of existing skills. 
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Planned models can be live performances, as when a yoga instruc- 
tor demonstrates how to achieve the adho mukha svanasana pose 
(a stretch in which one stands on the hands and feet; Downs, 
Miltenberger, Biedronski, & Witherspoon, 2015). Videos of a per- 
son performing the target behavior often serve as planned models. 
LeBlanc and colleagues (2003), for example, used video models 
to teach perspective-taking skills to three children with autism. 
The children learned to imitate a video model touching or point- 
ing to objects, such as a bowl or a box. Also, the children imi- 
tated vocal-verbal behaviors, such as saying “‘under the bowl, or 
saying “one.” 

Unplanned models, for better or worse, occasion imitation 
in everyday social environments (e.g., school, work, play, com- 
munity). A tourist who has never ridden the subway system in a 
foreign country imitates the ticket-scanning behavior of the rider 
ahead of her. The turnstile opens and she enters the platform. 
While visiting a prospective day-care center with his parents, a 
child sees a series of photos of children cleaning up litter from 
their playground. On the way out the door, the child picks up a 
discarded plastic bottle and places it in the recycle bin. An unde- 
sirable outcome of imitating an unplanned model might result 
if an international tourist imitates a local resident jaywalking 
across a dangerous intersection. 


Formal Similarity 


Formal similarity occurs when the model’s and the imitator’s 
behavior physically resemble each other and are in the same 
mode (i.e., they look, or sound, alike) (Michael, 2004). For 
example, when a student observes a teacher fingerspell the 
word house (1.e., the model), and the student then duplicates the 
fingerspelling correctly (i.e., the imitation), the student’s finger- 
spelling has formal similarity with the model. When a baby sit- 
ting in a high chair taps the food tray with his hand after seeing 
his mother tap the tray with her hand, the baby’s tap has formal 
similarity to the mother’s tap. 


Temporal Relationship 


A close temporal relationship between the model’s performance 
and the learner’s behavior is an important feature of imitation. We 
believe most instances of “delayed imitation” are better understood 
by referencing other behavioral principles and processes. The 
longer the latency between a previously observed model and the 
occurrence of behavior that duplicates the model, the more likely it 
is that variables other than the model are controlling the response. 

Let’s consider two examples that at first glance would 
appear to be imitation. When exiting the subway station at her 
destination, our international tourist emits the same turnstile 
operating behavior she imitated from another rider’s model to 
gain entry to the train platform at the start of her ride. When 
asked by his parent, “What did you learn at school today?” 
a student fingerspells h-o-u-s-e exactly as he imitated a deaf 
classmate’s fingerspelling the word earlier in the day. 

When a response with the same topography as a previous 
imitation occurs in the absence of the model (e.g., another rider 
operating the turnstile), that response is not imitative behavior. 
The relation between a discriminative stimulus (e.g., the turnstile) 
that signals the availability of reinforcement because of prior 


experience with ticket-scans that opened a turnstile—or between 
a motivating operation (e.g., the parent’s question) that increases 
the current frequency of behavior that has produced relevant rein- 
forcement—is functionally different from the relation between an 
antecedent model and an imitative behavior evoked by the model. 
Therefore, by definition, delayed behaviors replicating the topog- 
raphy of a prior imitative model, are not imitative. 


Model as Primary Controlling Variable 


Imitation is typically viewed as doing the same. Although the 
formal similarity of doing the same is a necessary condition for 
imitation, it is not sufficient. Formal similarity can exist with- 
out the model functionally controlling the similar behavior. The 
most important property defining imitation is the controlling 
relation between a model’s demonstration and a learner’s per- 
formance of topographically similar behavior. 

Baer, Peterson, and Sherman (1967) state: “Any behavior 
may be considered imitative if it temporally follows behavior 
demonstrated by someone else, called a model, and if its topog- 
raphy is functionally controlled by the topography of the mod- 
el’s behavior.” (p. 405, emphasis added) 

A controlling relation between the model’s behavior and 
the imitator’s behavior is inferred when a model evokes a similar 
behavior without a prior history of reinforcement. An imitative 
behavior is a new behavior that follows a novel antecedent event 
(i.e., the model). After the model evokes an imitative behav- 
ior, that behavior typically contacts contingencies of reinforce- 
ment. These new contingencies of reinforcement then become 
the controlling variables for the repetitions of the behavior (i.e., 
MO > SP > R-> S8). 

Holth (2003) explained how novel instances of imitation 
differ from discriminated operants in this way: 


Let us imagine that a dog is trained to sit whenever the 
owner sits down in a chair, and to turn around in a circle 
whenever the owner turns around. Does the dog imitate 
the owner’s behavior? Almost certainly not. The dog could 
have as easily been taught to sit whenever the owner turns 
around in a circle and to turn around in a circle when 

the owner sits down. Thus, what may look like imitation 
may be nothing more than a series of directly taught 
discriminated operants. The source of control can be 
determined only by introducing novel exemplars. In the 
absence of a demonstration that the dog responds to noyel 
performances by “doing the same,” there is no evidence 
that a similarity to the owner’s behavior is important in 
determining the form of the dog’s response. Hence, there 
is no true demonstration that the dog imitates the behavior 
of the owner unless it also responds to new instances of the 
owner’s behavior by “doing the same.” (p. 157, emphasis 
in original) 


Holth (2003) concluded by stating that imitation “can be 
inferred only through instances that do not have a direct rein- 
forcement history” (p 157). This chapter’s opening anecdote 
included an example we infer to be true imitation. The lead 
guitarist improvised a short riff. The accompanying guitarist, 
who had never heard or played the riff, immediately reproduced 


it note-for-note. The example meets the conditions for imitation: 
A model was present, formal similarity was achieved, a brief 
latency occurred between the model and the imitation, and the 
accompanying guitarist had no reinforcement history with the 
novel riff. 

We’ ll use our two guitarists one more time to illustrate 
an instance of replicating another person’s behavior that may 
appear to be imitation but is not. The lead guitarist says, “I have 
an idea for our opening song. Let me play it for you. If you like 
it, | will teach it to you.” The accompanying guitarist liked what 
he heard. The two musicians practice the riff together until the 
accompanying guitarist can play it as well as the lead. Later 
on stage, the lead guitarist plays the riff and the accompanying 
guitarist immediately reproduces it. This is not an imitative 
behavior; it is an example of a discriminated operant with a 
history of reinforcement. 


Imitation Training 


An imitative repertoire enables the quick acquisition of new, 
complex behaviors characteristic of many human endeavors. 
Typically developing children acquire many skills by imitat- 
ing unplanned models. Most children imitate the behavior of 
others without undue use of prompts or contrived reinforcement 
arrangements by parents, teachers, or other caregivers. Melzoff 
and Moore (1977) reported that infants as young as 12 to 21 days 
old can imitate adult hand and facial models. 

For reasons that are not entirely clear, some infants and 
children with autism and developmental disabilities do not 
imitate. As a result, learning that might ordinarily progress 
quickly requires intensive stepwise and time-consuming 
instruction. Children without an imitative repertoire have little 
chance of rapidly developing new behaviors and great difficulty 
advancing beyond basic skills. Baer et al. (1967) state the impor- 
tance of training such children to imitate: 


The development of a class of behaviors which may fairly 
be called “imitation” is an interesting task, partly because 
of its relevance to the process of socialization in general 
and language development in particular, and partly because 
of its potential value as a training technique for children 
who required special methods of instruction. (p. 405) 


Imitation training is a systematic, research-based set of 
steps for teaching a non-imitative learner to imitate models of 
novel behaviors. Baer and colleagues’ (1967) classic study, saline 
Development of Imitation by Reinforcing Behavioral Similarity 
to a Model,” provided a pioneering demonstration of how to 
teach an imitative repertoire. They worked with three young 
children with severe to profound intellectual disabilities who 
did not exhibit imitative behavior. The children were taught to 
emit simple discriminated (i.e., similar to the model) responses 
(e.g., raise left arm) when the experimenter presented the verbal 
response cue “Do this,’ and then provided the model (e.g., raise 
left arm). Baer and colleagues selected appropriate skills for 
their participants (e.g., nod yes, stand up, put on hat, open the 
door). The experimenter initially used physical guidance to 
prompt the similar response, and then gradually reduced the 
guidance over several trials. The word “Good” and small bits of 
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food were used as reinforcement to shape closer and closer topo- 
graphical similarity to the experimenter’s modeled responses. 

The criterion for the establishment of an imitative response 
was the subject displaying a novel response demonstrated for 
the first time by the experiment with no shaping or prompting 
procedures for that trial. 

The imitation training protocol developed by Baer and 
colleagues taught learners who did not imitate to imitate, meaning 
that a novel model controlled imitative behaviors in the absence 
of specific training and reinforcement of those behaviors. One 
participant imitated a novel model only after receiving imitation 
training for similarity on 130 different models. Results for the 
second participant were similar to those of the first. After learn- 
ing just eight discriminated operants of imitative topography, 
Subject 3 imitated the ninth response on its first presentation. 
This participant imitated the ninth training model, a novel model 
without a history of training and reinforcement. 

As aresult of imitation training, two participants imitated 
the model with 100% similarity, but only after receiving in 
excess of 100 reinforced training trials. The importance of 
delivering positive reinforcement during and after imitation 
training cannot be overstated. 


The initial training procedure contained occasions when 

the extent of the developing imitative repertoire of each 
subject could be seen. These were occasions when behavior 
was demonstrated by the experimenter to the subject for 

the first time. Any attempt by the subject to imitate such 
new behavior before direct training or shaping could be 
attributed to the history of reinforcement for matching other 
behavior of the experimenter. (Baer, Peterson, & Sherman, 
1967, p. 411) 


Additionally, all three children continued to imitate new 
responses maintained over many sessions as long as at least 
some imitative responses were reinforced. When reinforcement 
was no longer contingent upon imitative behavior, both the pre- 
viously reinforced imitations and the never reinforced probe 
imitations decreased markedly. However, all imitative behavior 
recovered when, with a small amount of shaping, reinforcement 
was again made contingent upon imitative behavior. Figure 21.1 
shows these results for Subject 3. 

The results of Baer and colleagues (1967) can be sum- 
marized as follows: (a) Children who did not have an imitative 
repertoire learned to imitate following a protocol that included 
shaping, cues, physical guidance, and reinforcement; (b) when 
target imitative behaviors produced reinforcement, the partici- 
pants imitated other novel models without reinforcement; (c) the 
children demonstrated a learning set effect (Harlow, 1959), or 
a so-called learning-to-learn phenomenon (as the participants 
progressed through imitation training, they required fewer train- 
ing trials to acquire new imitative repertoires); and (d) the chil- 
dren’s imitative repertoire maintained over many sessions as 
long as reinforcement was contingent on at least some imitative 
responses. 

The objective of imitation training is to teach learners 
to do what the person providing the model does regardless of 
the behavior modeled. If the training is effective, the learner 
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Figure 21.1 Percentage of demonstrated 
imitated behaviors by a child with 
severe/profound intellectual disabilities 
during reinforcement for imitation, DRO 
(reinforcement for any behavior other 
than imitation), and re-establishment 

of reinforcement. Solid data path shows 
results for eight reinforced imitations; 
dotted data path shows results for 
imitations that were never reinforced. 
From “The Development of Imitation by Reinforcing 
Behavioral Similarity to a Model,” by D. M. Baer, 

R. F. Peterson, and J. A. Sherman, 1967, Journal of 
the Experimental Analysis of Behavior, 21, p. 413. 
Copyright 1967 by the Society for the Experimental 
Analysis of Behavior, Inc. Reprinted by permission. 
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is likely to imitate models that have not been associated with 
specific training, and those imitations are likely to occur in 
many situations and settings, frequently in the absence of 
planned models and reinforcement. The term generalized 
imitation is often used when a learner imitates a wide variety 
of unprompted, untrained, non-reinforced modeled behaviors 
in different settings and situations. Generalized imitation could 
be considered a higher order response class in which partic- 
ipants imitate novel models without training and that are not 
predictive of reinforcement. 

The extent to which imitation generalizes types or classes of 
modeled behaviors may depend on the parameters of the response 
class used during training and may require programming directly. 
For example, Young, Krantz, McClannahan, and Poulson (1994) 
found that children with autism imitated novel models within the 
vocal, toy-play, and pantomime response types used for training, 
but imitation did not generalize across model types. 


An Imitation Training Protocol 


Building upon the experimental methods of Baer and colleagues, 
Striefel (1974) developed an imitation training protocol for prac- 
titioners. The components of Striefel’s program are (a) assess, 
and teach if necessary, prerequisite skills for imitation training; 
(b) select models for training; (c) pretest; (d) sequence models 
for training; and (e) conduct imitation training. 


Assess, and Teach if Necessary, Prerequisite 

Skills for Imitation Training 

Learners cannot imitate if they do not attend to the presentation 
of the model. Therefore, attending to the model is a prerequi- 
site for imitation training. Striefel (1974) defined attending as 
staying seated during instruction, keeping one’s hands in one’s 
lap, looking at the trainer whenever one’s name is called, and 
looking at objects identified by the trainer. Also, practitioners 
often need to decrease problem behaviors that interfere with 
training (e.g., aggression, screaming, odd hand movements). 


Subject #3 


Reinforced imit. 
eccceee Unreinforced imit. 
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Suggested procedures for assessing attending skills 
include the following: 


1. Staying seated. Seat the learner and record the duration of 
time the learner remains seated. 

2. Looking at the teacher. Say the learner’s name in a com- 
manding voice and record whether the student makes eye 
contact. 

3. Keeping hands in lap. Prompt the student to put his hands 
in his lap and record the duration of time the student’s 
hands remain in that position. 

4. Looking at objects. Place several objects on a table and 
say, “Look at this.” Immediately following the command, 
move a finger from in front of the learner’s eye to one of the 
objects and record whether the student looked at the object. 


Teachers often assess attending skills for a minimum of 
three sessions. The teacher can begin imitation training if the 
assessment data show adequate attending skills. The teacher will 
need to teach these skills before beginning imitation training if 
attending skills need to be developed. 


Select Models for Imitation Training 


Practitioners may need to select and use about 25 behaviors as 
models during initial imitation training. Including gross motor 
(e.g., raising a hand) and fine motor movements (e.g., manual 
sign language) as models provides learners with opportunities to 
develop more refined differentiations with their imitative skills. 

Practitioners usually use one model at a time during the 
initial training trials, rather than a sequence of movements. 
Practitioners may choose to use more complex models such as 
sequences of behavior after the learner can imitate one model per 
occasion successfully. Also, the initial training usually includes 
models of (a) the movement of body parts (e.g., touching nose, 
hopping on one foot, bringing hand to mouth) and (b) the manip- 
ulation of physical objects (e.g., passing a basketball, picking up 
a glass, zipping a coat). 


Pretest 


The learner’s responses to the selected models should be pre- 
tested. The pretest may show that the learner will imitate some 


models without training. The pretesting procedures advocated 
by Striefel (1974) are as follows: 


1. Prepare the learner’s attending behaviors for the pretest 


(e.g., seated, hands in lap) and often assume the same 
ready position as the learner. 


2. If you are using an object model, place one object in front 
of the learner and one object in front of yourself. 


3. Say the learner’s name to start the pretest, and when the 
learner makes eye contact, say, “Do this” (i.e., child’s 
name, pause, “Do this’’). 


4. Present the model. For example, if the selected behavior 


is to pick up a ball, pick up the ball yourself and hold it 
for a few seconds. 


5. Immediately praise each response that has formal simi- 
larity to the model, and deliver the reinforcer (e.g., a hug, 
an edible) as quickly as possible. 


6. Record the learner’s response as correct or incorrect (or 
no response), or as an approximation of the model (e.g., 
touches the ball, but does not pick it up). 


7. Continue pretesting with the remaining models. 


Practitioners can use the pretest procedure with all motor 
and vocal verbal models (e.g., name, pause, “Do this,” “Say ball’). 
Striefel recommended pretesting for several sessions until all 
models have been pretested at least 3 times. If the learner correctly 
responds during pretesting to a selected model at a set criterion 
level (e.g., three of three correct), then the practitioner should 
advance to other models. If the learner does not meet the criterion, 
the practitioner should select that model for imitation training. 


Sequence Selected Models for Training 


Practitioners use the pretest results to arrange the presentation 
sequence for the selected models, arranging the sequence from 
the easiest to the most difficult models to imitate. The first 
models selected for imitation training are those that the learner 
imitated correctly on some, but not all, of the pretest trials. The 
models that the learner responded to incorrectly, but that approx- 
imated the model, are selected next. Finally, the models that the 
learner failed to perform, or performed incorrectly, are the last 
to be selected for training. 


Conduct Imitation Training Sessions 


Striefel (1974) suggested using four conditions for imitation 
training sessions: preassessment, training, postassessment, 
and probing for imitative behaviors. The procedures used in 
imitation training are the same as those in the pretest, with the 
exception of when and how often the practitioner presents the 
selected models. 


Preassessment. The preassessment is a short pretest given 
before each training session. Practitioners use the first three 
models currently selected for training for the preassessment. 


Chapter 21 * Imitation, Modeling, and Observational Learning 575 


These three models are presented 3 times each in random 
order during the preassessment. If the learner’s behavior has 
similarity to the model on all three presentations, that model 
is removed from the training. The preassessment procedure 
allows practitioners to evaluate the learner’s current perfor- 
mance on the models selected for training that session, and 


to determine the learner’s progress in learning to respond to 
the model. 


Training. During training, practitioners use repeated presenta- 
tions of one of the three models used in the preassessment. The 
model selected first for training is the one most often responded 
to during the preassessment (i.e., the behavior was similar to the 
model on some, but not all, of the preassessment presentations). 
If, however, the learner made approximations only, the behavior 
with the closest similarity to the model is selected first for train- 
ing. Training continues until the learner responds to the model 
correctly on five consecutive trials. 

Imitation training will likely include physical guidance to 
prompt the response if the learner fails to respond. For exam- 
ple, the practitioner may physically guide the learner’s behavior 
through the complete response. Physical guidance allows the 
learner to experience the response and the reinforcer for that 
specific movement. After physically assisting the complete 
response, the practitioner will gradually withdraw the physical 
guidance, letting go of the student’s body just before the entire 
movement is completed, and then continue to fade the phys- 
ical guidance by withdrawing the physical support earlier on 
each subsequent trial. Eventually, the learner may complete the 
movement without assistance. When the learner responds to the 
model without prompting for five consecutive trials, that model 
is included in the postassessment. 


Postassessment. During the postassessment, the practitioner 
presents, 3 times each, five previously learned models and 
five models that are still included in imitation training. The 
practitioner will remove a most recently learned behavior 
from imitation training following three consecutive postas- 
sessments in which the learner responds correctly without 
physical guidance to the model 14 out of the 15 opportunities. 
Physical guidance, however, is appropriate to use during the 
postassessment. If the learner does not reach this criterion (14 
out of 15 postassessment opportunities), Striefel (1974) rec- 
ommended continuing imitation training with that model. The 
postassessment procedure allows the practitioner to evaluate 
how well a learner performs the previously and most recently 
learned behaviors. 


Probe for Imitation with Novel Models. Practitioners will 
use approximately five nontrained, novel models to probe for 
occurrences of imitation at the end of each imitation training 
session, or they will intermix the probes with the training tri- 
als. The probe procedure uses the same procedures as the pre- 
assessment activities, but without using the antecedent verbal 
response prompt (i.e., child’s name, pause, “Do this”) or other 
forms of response prompts (e.g., physical guidance). Probing for 
nontrained imitations provides data on the learner’s progress in 
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developing an imitation repertoire—in other words, learning to 
do what the model does. 


Guidelines for Imitation Training 


Assuming that the learner has the prerequisite skills for imitation 
training (e.g., attending skills, existing imitative repertoire), as 
was the case in the Garcia et al. study, and a decision has been 
made as to using live versus symbolic models, the following 
guidelines maximize imitation training. 


Keep Sessions Active and Brief 


Most practitioners use short training sessions during imitation 
training, typically 10 to 15 minutes, but often schedule more 
than one session per day. Two or three 5-min sessions may be 
more effective than one long session. To maintain quick and 
active training, allow no more than a few seconds between 
model presentation and expected imitative response. 


Reinforce Both Prompted and Imitative Responses 


In the early stages of imitation training, reinforce each occur- 
rence of prompted responses and all occurrences of imitation. 
If the learner’s participation requires a reinforcer other than 
praise, use a consequence that can be presented immediately, 
in small amounts, and that the learner can quickly consume 
(e.g., cereal bits, 5 seconds of music).! Reinforce only match- 
ing responses or imitative behaviors that immediately follow the 
model (e.g., within 3 to 5 seconds). Learners who consistently 
emit correct matching responses, but not immediately following 
the model, should be reinforced for shorter response latencies 
(e.g., a decreasing contingency from 7 seconds to 5 seconds to 
3 seconds, and so forth). 


Pair Verbal Praise and Attention 
with Tangible Reinforcers 


Often the learner’s behaviors during imitation training 
require the use of tangible consequences such as edibles 
or liquids. Even so, as training progresses, it is hoped that 
social attention and verbal praise will maintain the matching 
responses or imitative behaviors. To achieve this goal, the 
practitioner should pair the delivery of other consequences 
with social and verbal praise. Social (e.g., patting the stu- 
dent's arm affectionately) or descriptive verbal praise should 
immediately follow each correct response or approximation, 
simultaneously with the other consequence. A learner's will- 
ingness to participate in imitation training may increase when 
the practitioner schedules a favorite activity to follow each 
session. 


If Progress Stalls, Back Up and Move Ahead Slowly 


There may be identifiable reasons for a worsening performance, 
such as reinforcer satiation or context distractions; or perhaps 
the practitioner presented models that were too advanced for the 
learner. Whether or not a clear reason exists for the worsening 
performance, the practitioner should immediately return to an 
earlier level of successful performance. Once progress resumes, 
training can slowly advance. 


Fade Out Verbal Response Prompts 
and Physical Guidance 


Parents and caregivers of young children almost always teach 
imitation skills using verbal response prompts and physical 
guidance. For example, the caregiver (a) tells the child to 
wave bye-bye, the caregiver models waving, and then phys- 
ically guides the child’s wave; (b) asks the child, “What 
does the cow say?” The caregiver presents a model, “The 
cow says moo,” and then tells the child to say “moo”; and 
(c) provides praise and attention if the child says “moo.” 
This natural instructional process is the same as that advo- 
cated in this chapter for teaching imitative skills to learners 
who do not imitate: that is, a verbal response prompt, “Do 
this”; the presentation of a model; and physical guidance 
when needed. However, verbal response prompts and phys- 
ical guidance are not functional for producing imitation in 
everyday environments. Imitation training is not complete 
until all response prompts have been withdrawn. Children 
need to learn to do what the model does without the supports 
of response prompting. Therefore, to promote the effective 
use of imitation, fade out the response prompts used during 
the acquisition of trained matching responses. 


Make Data-based Decisions to Terminate 
Imitation Training 


Decisions to stop imitation training depend on the learner's 
behavior and program goals. For example, the practitioner 
could apply the criterion to terminate motor imitation training 
when the student imitates the first presentations of five con- 
secutive novel models across three sessions or imitates several 
untrained behavior chains (e.g., washing hands, brushing teeth, 
fingerspelling). 


MODELING 


Modeling is a behavior change strategy in which learners 
acquire new skills by imitating demonstrations of the skills by 
live or symbolic models. The model shows, demonstrates, or 
conveys exactly the behavior the learner is expected to perform. 
Models can be live demonstrations or symbolic representations 
of the desired behavior (Bandura, 1977). A father showing his 
teenage son how to shave by holding and moving a razor gently 
across his face, and a professional golf coach demonstrating how 
to eliminate a hitch in a putting stroke by realigning his head and 
hand positions, are examples of live models. Symbolic models 
would be a picture icon, a photo, a video, an audio, or a mixed 
media presentation that illustrates or depicts the desired behay- 
ior. For example, Delli Bovi, Vladescu, DeBar, Carroll, and 
Sarokoff (2017) used a video model combined with voiceover 
instructions to teach staff how to conduct a multistep stimulus 
assessment procedure. 


Video Modeling 


Bellini and Akullian (2007) define video modeling as “a tech- 
nique that involves demonstration of desired behaviors through 
video representation of the behavior. A video modeling inter- 
vention typically involves an individual watching a video 


demonstration and then imitating the behavior of the model. .. . 
Video self-modeling (VSM) is a specific application of video 
modeling that allows the individual to imitate targeted behaviors 
by observing her or himself successfully performing a behay- 
ior” (p. 264). 

Numerous studies have shown the positive effects of 
video modeling and video self-modeling across a range of 
ages and skills, for children and adults with various disabil- 
ities (Bellini & Akullian, 2007; Qi, Barton, Collier, & Lin, 
2017; see Table 21.1). Video modeling has also been used in 
training applications to teach complex, multistep procedures 
to staff on how to conduct assessment or intervention pro- 
tocols (Deliperi, Vladescu, Reeve, Reeve, &)DeBar, 2015). 
Finally, video or self-modeling can be fashioned using standard 
videotape, digitally through smart pad or tablet applications, 
through animations, and from various camera angles and back- 
grounds, including so-called point-of-view perspectives. The 
fact that video modeling (VM) or VSM can be combined with 
instructions, feedback, and rehearsal options makes it more 
practical and desirable for developing new behaviors. Regard- 
less of medium or mode, the evidence shows that video and 
self-modeling approaches produce imitation outcomes and, in 
some cases, generalized imitation outcomes. 

Aldi et al. (2016) provide an example of using modeling 
and imitation training. They used point-of-view video models 
to teach daily living skills (DLS) to two 18-year-old young 
men with autism spectrum disorder (ASD). The two young 
men received training in each of their residences on three daily 
living skills. Participant | received training on cooking, setting 
the table, and folding jeans. Participant 2’s tasks included set- 
ting the table, cleaning the bathroom sink/counter, and clean- 
ing the mirror. 

The authors used Apple iPad iOS software to create the 
point-of-view video models by filming family members’ arms 
and hands doing the assigned tasks. The filming included a 
verbal description of each step of the tasks. Depending on the 
DSL to be modeled, the length of the video training tape var- 
ied, usually less than 6 minutes. Further, the number of steps 
that were modeled varied from 9 to 23, depending on the task 
(e.g., folding jeans required 9 steps, whereas cooking required 
23 steps). Intervention consisted of participants watching a video 
related to the skill set being modeled. Subsequent to viewing the 
video, the participant was asked to complete the task (e.g., fold 
clothes). If the participant made an error, the video segment for 
that task was repeated as many times as necessary to achieve 
mastery. In some instances, gestural prompts were provided 
as part of an error correction procedure. Correct responses 
were praised. 

Figure 21.2 shows that both participants mastered their 
respective skill sets. During a 1-month follow-up, skills main- 
tained well above baseline levels, but were lower than the 
established treatment mastery levels, suggesting that additional 
programming for maintenance would be necessary. 

Applied behavior analysts have researched the utility of a 
variety of models and behaviors to ascertain the effectiveness of 
modeling and imitation training. Overall, these advances stem 
from the pioneering work of Baer et al. (1967) and Bandura 
(1977). Table 21.1 provides a sampling of models, participants, 


Chapter 21 * Imitation, Modeling, and Observational Learning 577 


and participants’ behaviors addressed in training across daily 
living, safety, and academic skills. 


Guidelines for Effective Modeling 


It has long been established in the experimental literature that 
a number of factors characterize effective models (Noell, Call, 
& Ardoin, 2011; Sulzer-Azaroff & Mayer, 1977). The more of 
these characteristics that the model shares with the imitator, the 
more likely the modeling—imitation relationship will be estab- 
lished and maintained. The following sections outline several 
of these factors. 


Model's Similarity with Learner 


The similarity between the individual providing the model and 
the learner can influence the likelihood that imitative behavior 
will occur (Bandura, 1969). Almost any variable can relate to 
similarity: for instance, past experience, age, sex, or physical 
appearance. If imitation is used to teach a teenager how to 
reduce drug use, a person who has successfully “gone straight” 
might serve as an effective model, more so than someone who 
has not. 


Prestige 


A model with prestige or social status can increase the likeli- 
hood of imitative behaviors. These individuals are often lead- 
ers. For instance, high school students may imitate the dress or 
hair style of their favorite rock stars. Young business executives 
may imitate the lifestyles of their firm’s senior executives. Con- 
sumers tend to buy products that are “pitched” by movie stars, 
celebrities, or other high-status individuals. The influence of a 
prestigious model is enhanced when the model and the imitator 
have other similar characteristics (e.g., they share music, art, or 
athletic interests). 


Emphasize the Critical Aspect of Behavior 


Emphasizing the critical aspects of the modeled stimulus can 
increase the likelihood of imitation. For instance, a teacher giv- 
ing instruction on sight words might hold up a card with the word 
blue written on it and tell the students that the word is blue. The 
word blue can be emphasized by inserting a short pause between 
her direction, “Say .. . blue.” The word can also be emphasized 
by destressing the intensity of the word say and increasing her 
voice volume with the word blue, “Say ... BLUE.” According 
to Risley and Reynolds (1970), emphasized stimuli increase the 
probability of an imitative response and boost the likelihood of 
imitation as a function of the emphasized model. 


Instructions 


Imitation can be enhanced when instructions are combined 
with a model’s presentation. Instructions—whether spoken or 
written words—set the occasion for the occurrence of a specific 
behavior. Instructions can serve as an orienting and prompting 
function that increases the likelihood of imitation. A number 
of researchers have combined instructions, including voiceover 
instructions, as part of a package program to enhance the effects 
of video modeling (Deliperi et al., 2015). 
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TABLE 21.1 A Sampling of Modeling Studies Across Social, Daily Living, Safety, Academic, and Vocational Skills 


Model/Format Behavior 
Social Skills 


Learners 


Self-modeling/Video (Sherer et al., 2001) Conversation skills 


Typically developing peer/Point-of-view video —_— Vocal or gestural initiations 
(Nikopoulos & Keenan, 2004) 


Daily Living Skills 
Point-of-view video modeling Cooking, setting the table, folding jeans, 
(Aldi et al., 2016) cleaning bath sink/counter and mirror 
Point-of-view video modeling/ Starting a load of laundry and 
Step-by-step video prompting hand-washing a plate, spoon, and cup 
(Cannella-Malone et al., 2011) 
Point-of-view video modeling Making orange juice, preparing a letter 
(Shipley-Benamou, Lutzker, & to be mailed, setting the table, cleaning a 
Taubman 2002) fish bowl, feeding pet cat 
Point-of-view step-by-step computer- Making microwave popcorn 
presented video prompting 
(Sigafoos et al., 2005) 
Live therapist modeling (Fu et al., 2015) Food consumption 

Safety Skills 

Live instructor model with practice and First-aid skills: treating cuts 


feedback (Christensen, Lignugaris-Kraft, 
& Fiechtl, 1996) 


Experimenter live (Garcia, Dukes, Brady, Scott, Fire safety skills 
& Wilson, 2016) 


Live and video modeling with rehearsal, praise Abduction-prevention responses 


and corrective feedback (Gunby, Carr, & (e.g., saying “no” when presented with a 
LeBlanc, 2010) lure by a stranger, immediately running 
to a safe area) 
Adult trainer/Live (Miltenberger Gun safety skills 
et al., 2005) 
Academic Skills 
Live peer vocal model (Taylor, DeQuinzio, Reading sight words 
& Stine, 2012) 
Picture prompts (Vedora & Barry, 2016) Receptive labeling 
Video modeling (LeBlanc et al., 2003) Perspective-taking skills 


Step-by-step video peer model with handheld Geometry skills 
computers (Cihak, 2009) 


Children with autism 
Children with autism 


Young adults with autism 


Students with severe intellectual 
disabilities 


Children with autism 


Adults with developmental disabilities 


Two boys with autism and food selectivity 


Preschoolers with disabilities 


Three 4- to 5-year-old children with 
autism 


Children with autism 


Typically developing children ages 
4 and 5 years 


Children with autism 


Teenagers with autism 
Children with autism 


Secondary students with learning 
disabilities 


Vocational Skills 
Video model of scripted and naturalistic Wear “Rocky the Raccoon” WalkAround 
settings (Allen, Wallace, Renes, Bowen, costume and greet customers at retail 
& Burke, 2010) warehouse 
Video models of peers, coworkers, job coaches — Vacuuming lobby, cleaning sidewalk, 
performing tasks with written or voice-over taking inventory of items to restock, 


instructions iPods (Kellems & Morningstar, 2012) recycling cardboard, etc. 


Adolescents and young adults with 
autism 


Young adults with autism 


Context Rehearsal and Feedback 


The model’s presentation is more likely to be imitated if it | Providing the learner with multiple opportunities to respond, 
occurs in a real situation or in a facsimile manner to the real combined with feedback on key elements of the model’s presen- 
situation. Further, these situations should be expanded to tation, enhances the effectiveness of modeling (Poche, Yoder, 
account for variations in circumstances and to promote gen- & Miltenberger, 1988). Rehearsal provides the analyst with the 
eralized imitation. chance to (1) determine if the learner has acquired the imitative 
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skill, (2) reinforce imitative behaviors, and (3) correct any errors 
through feedback and redirection (Garcia, Dukes, Brady, Scott, 
& Wilson, 2016). 


Reinforcement 


The more frequently reinforcement is delivered to the imi- 
tator, the more likely imitative behavior is to increase in the 
future. Likewise, the more frequently the imitator observes, 
detects, and recognizes the model being reinforced, the more 
effective the model becomes (Bandura, Ross, & Ross, 1963; 


Noell et al., 2011). 


OBSERVATIONAL LEARNING 


Observational learning entails detecting another person’s 
behavior and its consequence(s) and using that information 
to determine whether or not to imitate the behavior (Delgado 
& Greer, 2009; DeQuinzio & Taylor, 2015). Observational 
learning typically involves imitation but does not require it 
(Catania, 2013). For example, after seeing another person 
walking ahead of her slip on “black ice,” a pedestrian may 
avoid that part of the sidewalk by crossing the street or step- 
ping around it. 


580 Part 8 * Developing New Behavior 


Does observational learning require sight? Box 21.1 
explores this important question. 

The objective of observational learning is that the partic- 
ipant “learns from indirect contact with consequences experi- 
enced by others” (Delgado & Greer, 2009, p. 408). MacDonald 
and Ahearn (2015) provide a classroom-related example of 
observational learning (OL): 


An example of OL would be when a student does not cor- 
rectly point to a written word in the presence of a picture 
associated with the written word. The student observes 
another student receive instruction by a teacher followed 
by consequences (in the form of positive or corrective feed- 
back). Then, because of observing this performance and the 
consequences that follow, the student correctly points to 
the word associated with the picture in the absence of direct 
instruction. (p. 801) 


Delgado and Greer (2009) taught children with autism 
to discriminate the correct and incorrect sight word reading 
responses of confederate peers by selecting a green block if the 
teacher’s feedback indicated the confederate peer’s response 
was correct and selecting a red block if the teacher’s feedback 
indicated an incorrect response. Only under conditions in which 
monitoring correct and incorrect feedback took place did desir- 
able observational learning occur. The authors concluded that 
participants learned new sight words as a function of monitoring 
the correct and incorrect responses of their confederates. 

It is critical that observational learners discriminate feedback 
received by the model. Recall that is exactly what happened in our 
anecdote with the guitarists. The accompanying guitarist discrim- 
inated the singer-songwriter’s displeasure with the lead guitarist’s 
improvised riff and quit playing such riffs. According to DeQuinzio 
and Taylor (2015), “learning this discrimination is a fundamental 
precondition for observational learning to occur” (p. 41). 

Delgado and Greer (2009) suggest that the acquisition of 
observational learning skills may be a behavioral cusp: 


OL appears to be a behavioral developmental cusp. .. . a 
developmental cusp occurs when a child can contact con- 
tingencies that he or she could not before. . . . and it shows 
not only a cusp but a capability, in that the children can 
now learn in ways they could not before. (p. 430) 


In a review of the literature, Taylor and DeQuinzio (2012) 
noted that children with ASD often have deficits with many of 
these responses. They suggested that teaching sustained atten- 
tion to peer models, generalized imitation of a peer’s vocal and 
motor responses, and discrimination of consequences could 
facilitate the emergence of observational learning in the natural 
environment. To date, however, no method exists for assessing 
the presence of observational learning and the skills necessary 
to engage in it, although studies have specifically examined 
teaching OL. Researchers have tried to teach OL through peer- 
yoked contingencies, peer monitoring, and teaching a differ- 
ential observing response. Each study focused on teaching the 
children to attend to discriminative stimuli, to discriminate con- 
sequences related to the OL task, or both. Taylor, DeQuinzio, 
and Stine (2012) assessed the effects of teaching a monitor- 
ing response on the acquisition of differentially responding to 
words. During the training condition, participants were taught 
to imitate vocally the word read by the peer and to match the 
vocal response to the written word on the matching board. 
During the exposure condition, participants observed only the 
peer’s reading response. Test conditions occurred 10 minutes 
after each condition to assess acquisition of the previously 
observed reading words. Results, though limited, suggested that 
all participants acquired expressive identification of words with 
and without monitoring. 


Teaching Observational Learning Skills 


As indicated earlier, many children with autism and develop- 
mental disabilities do not imitate, and observational learning 
is nonexistent. Children without observational learning rep- 
ertoires “can learn new operants only via direct instruction, 
a process that is time-consuming and expensive” (Delgado & 
Greer, 2009, p. 431). 

Three requisite skills have been identified as central to 
developing an observational learning repertoire: attending, imi- 
tating, and discriminating (DeQuinzio & Taylor, 2015; Greer, 
Dudek-Singer, & Gautreaux, 2006; Taylor & DeQuinzio, 2012; 
Taylor, DeQuinzio, & Stine, 2012). 

Taylor and DeQuinzio (2012) and Taylor (2013) describe 
interventions for developing imitation and observational 
learning skills in persons without them. These interventions 


Is Observational Learning for Sighted Persons Only? 


Because observational learning connotes sight or vision, some 
readers may assume incorrectly that only sighted persons can 
profit from it. An individual who is blind uses observational 
learning when he attends to the auditory stimuli and conse- 
quences produced by the behavior of others and imitates those 


behaviors or not, accordingly. So, too, does an individual with 
dual sensory impairments (i.e., blind and deaf) when he senses 
and imitates tactile and kinesthetic stimulus products of anoth- 
er’s behavior. Observational learning requires attention, imita- 
tion, and discrimination training regardless of sensory modes. 


generally include the following three components: (a) develop 
attending skills; (b) teach imitation and program for gener- 
alized imitation; and (c) reinforce the discrimination of 
consequences—1.e., the response class “Do This, but Don’t 
Do That”—and continued practice. 


Observational learning requires several responses during 
and after the modeled action. 


Attending 


Attending means more than just being in proximity and ori- 
ented toward a stimulus. Delgado and Greer (2009) and Taylor 
and DeQuinzio (2012) indicated that attending at its funda- 
mental level means that the participant, through peer- or con- 
federate-mediated tactics, not only sustains attention toward 
the relevant stimulus but also makes an imitative and matching 
response that confirms he or she is observing that stimulus. For 
example, in Delgado and Greer’s study the participant had to 
observe the confederate and then raise a differentially colored 
block indicating whether a correct or an incorrect response 
was made by the confederate. Likewise, Taylor et al. (2012) 
required the participant not only to repeat the stimulus word 
(imitation) but also to place a “chip” under that word on a 
separate matching board, signifying that the participant was 
attending to the stimulus word. In Taylor and colleagues’ view, 
attending is a subset of a larger monitoring behavior repertoire 
that includes imitating a model and engaging in confirming 
responses to the model. 


Imitating 


Whether with discrete trial training (Smith, 2001), combining 
shaping, rehearsal, and physical guidance with imitation train- 
ing (Baer et al., 1967), adding verbal description to imitation 
training (Jahr & Eldevik, 2002), or practicing immediate or 
delayed imitation with adults or peers (Taylor & DeQuinzio, 
2012), teaching an imitation repertoire can be accomplished 
so that new, more complex repertoires of behavior develop. 
Taylor and DeQuinzio (2012) described how peers can be used 
to teach imitation: 


To teach imitation of peer responses, a teacher may 
initially have the child sit across from a peer at a 

table or in a play area. The teacher would then ask 

the peer to demonstrate actions (e.g., the peer is 
instructed to push a car back and forth) and would 
present an instruction to direct the child to imitate the 
peer (e.g., the teacher would say, “Do what John is 
doing”). If necessary, the teacher could guide the 
child to imitate the peer’s action. When the child does, 
the teacher would provide praise and a preferred snack 
or toy, to reinforce the imitative response. To promote 
generalization, the teacher would have the peer 
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model different actions each time, until the child 
can imitate novel actions without any prompting or 
reinforcement. (pp. 352, 353) 


Discriminating 


DeQuinzio and Taylor (2015) demonstrated an effective 
discrimination training procedure for students in whom this 
skill is lacking. Essentially, in their study, which included four 
students with autism, the researchers used a pool of unknown 
pictures as stimulus items. Across baseline, discrimination 
training, and 10-min posttest trials, they measured correct 
observed picture-labeling responses from reinforced and unre- 
inforced trials between the teacher (presenting the trial) and a 
confederate adult model. 

During a baseline condition, the teacher, who was seated 
across from an adult model and the participant, presented a 
picture with the direction, “What’s this?” If the confederate 
adult model said the name of the picture correctly, the teacher 
provided praise (i.e., “That’s right”) and delivered a reinforcer. If 
the confederate, by prior arrangement, answered incorrectly, the 
teacher said, “I’m sorry, that’s wrong,” and a reinforcer on the 
table was removed from sight. Whether the adult model stated a 
correct or incorrect reply, the teacher never provided the correct 
picture label as part of a consequence. During the discriminat- 
ing training phase, baseline conditions were repeated, except 
that the teacher, after providing a consequence to the model 
(“That’s right)”, turned immediately to the participant and said, 
‘“What’s this?” If the participant answered correctly, praise was 
delivered. If the participant emitted an incorrect response, the 
teacher provided error correction in the form of a statement, 
“Say what she says when she gets it right”” Correspondingly, 
after nonreinforced trials to the adult model, the picture was 
again presented to the participant immediately, with the state- 
ment “What’s this?” If the participant said, “I don’t know,” or 
didn’t say anything, that trial ended, and the next was initiated. 
Finally, generalization sessions were conducted approximately 
every three to five sessions, using different pictures than were 
used during baseline and discrimination training. 

Figure 21.3 shows the results of the study. Each partici- 
pant learned to discriminate reinforced responses of the confed- 
erate adult model and to say “I don’t know” in the presence of 
nonreinforced responses to that model. The discrimination per- 
formance of the participants was maintained, albeit at a limited 
level, for a period of time after training using different pictures, 
but improved when additional training for generalization was 
instituted. 

If any of the requisite skills for observational learning— 
attending, imitation, delayed imitation, and consequence dis- 
crimination—are lacking in a participant’s repertoire, these 
skills can be taught separately and then combined to improve or 
enhance observational learning. 
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Figure 21.3 The percentage of 
correct responses during test 
sessions conducted after baseline, 
discrimination training, and 
generalization sessions for three 0 
children with autism. 
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SUMMARY 


Imitation 


1. Imitation entails four criteria: (a) The imitative behavior 
is occasioned by another person’s model of the behavior 
(or symbolic representation thereof), (b) the imitative 
behavior has formal similarity with the model, (c), the 
imitative behavior follows the modeled behavior closely 
in time, and (d) the model is the primary controlling 
variable for the imitative behavior. 


2. A model is a live or symbolic demonstration of the target 
behavior that shows the learner exactly what to do. 


3. Formal similarity occurs when the imitator’s behavior 
physically resembles the model and is in the same mode. 


4. A brief latency between the model and the learner’s 
behavior is an important feature of imitation. 


5. Most instances of delayed imitation may be better under- 
stood with other behavioral principles and processes. 


6. The most important property defining imitation is the 
controlling relation between a model’s demonstration 
and a learner’s performance of topographically similar 
behavior. 


10. 


. Imitation training is a systematic, research-based set of 


steps for teaching a non-imitative learner to imitate models 
of novel behaviors. 


. Generalized imitation refers to a learner imitating a wide 


variety of unprompted, untrained, non-reinforced modeled 
behaviors in different settings and situations. 


. Striefel’s (1974) imitation training protocol for 


practitioners has the following components: (a) assess, 
and teach if necessary, prerequisite skills for imitation 
training; (b) select models for training; (c) pretest; 

(d) sequence models for training; and (e) conduct 
imitation training. 

Guidelines for imitation training include the following: 
* Keep sessions active and brief. 

* Reinforce both prompted and imitative responses. 


* Pair verbal praise and attention with tangible 
reinforcers. 


* If progress stalls, back up and move ahead slowly. 


* Fade out verbal response prompts and physical 
guidance. 


Chapter 21 * Imitation, Modeling, and Observational Learning 583 


* Measure and record the learner’s performance and : 


Models the learner considers to have prestige or 
review the data after each session. 


social status. 

Terminate imitation training when the learner meets * Models’ emphasis on critical aspects of the target 
specified performance criteria such as: imitates the behavior. 

first presentations of five consecutive novel models 
across three sessions, imitates several untrained 
behavior chains (e.g., washing hands, brushing teeth, 


* Instructions orienting and prompting the learner to 
attend to the model. 


fingerspelling). ¢ Model demonstrating the target behavior in an authentic 
context. 
onsen ¢ Rehearsal and feedback. 
oaellin : Core : 
] e The learner observing the model receiving reinforce- 
11. Modeling is a behavior change strategy in which learners ment for the target behavior; the learner receiving 
acquire new skills by imitating demonstrations of the skills reinforcement for imitating the target behavior. 


by live or symbolic models. 


faa é Observational Learning 
12. Video modeling is a technique that involves the demon- 


stration of desired behaviors through visual representa- 15. Observational learning entails another person’s behavior 
tions of the behavior. and its consequence(s) and using that information to 
determined whether or not to imitate the behavior. 
Observational learning typically involves imitation but 
does not require it. 


13. In video self-modeling, the learners observe a video of 
themselves successfully performing the target behavior 


and imitates their self-model. 
16. Three requisite skills are central to developing an 


observational learning repertoire: attending, imitating, 
¢ Models with characteristics similar to those of the and discriminating. If any of the skills are lacking, they 
learner. can be taught separately and then combined. 


14. The effectiveness of modeling may be enhanced by: 


KEY TERMS 


generalized imitation modeling video modeling 
imitation observational learning video self-modeling 


imitation training 


MULTIPLE-CHOICE QUESTIONS 


1. With an understanding of the imitation process, 3. Learning to purchase a soda from a vending machine by 
applied behavior analysts can use imitation as an first watching someone else purchase a soda would be an 
intervention to: example of: 

a. Eliminate old behaviors a. Planned echoic stimuli 


b. Evoke new behaviors b. Unplanned echoic stimuli 


c. Create teaching sequences c. Planned models 


d. Unplanned models 


d. Reinforce appropriate behaviors 
Piha Hint: (See “Imitation Defined”) 


Hint: (See “Introduction to the chapter”) | 
4. When a child picks up a fork immediately after observing 


2. The controlling variable for an imitative behavior is: her father pick up a fork, this imitative behavior has: 
a. An operative behavior a. Planned similarity 
b. An imitative response chain b. Unplanned similarity 
c. A model c. Formal similarity 
d. A pre-task d. Informal similarity 


Hint: (See “Imitation Defined’) Hint: (See “Imitation Defined”) 
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6. Potential learners cannot imitate if they do not have this 
prerequisite skill: 


5. Regardless of the behavior modeled, the objective of 
imitation training is for the learner to do what? 
a. What the model does a. The behavior in their repertoire 
b. Attending to the model 
c. Decrease in problem behavior 


d. Ability to define the behavior 


b. What the trainer asked or commanded 
c. What behavior best fits the situation 
d. What behavior will receive reinforcement 


Hint: (See “Imitation Training”’) Hint: (See “Imitation Training”) 


ESSAY-TYPE QUESTIONS 


1. Four behavior environment relations functionally define 
imitation. What are they and why must each occur? 


immediately after this model, the learner raises his hand. 


Is this an imitative behavior? Explain your answer. 
Hint: (See “Imitation Defined”’) Hint: (See “Imitation’’) 


2. A learner has played the game “Simon Says” many times 3. During imitation training, if a learner fails to respond, 


in the past (left foot up, right foot up, left hand up, right 
hand up, etc.). During a game of “Simon Says”, the trainer 
raises his right hand up without saying “Simon Says”; 


what should a trainer do and how will this help to 
increase the potential for imitation learning? 


Hint: (See “Imitation Training”’) 


SONAR TEIN NANNIES ANNIE NN II NOAA 


NOTES 


reinforcer (e.g., food allergies, swallowing difficulties) and should obtain 
permission from parents/guardians. 


1. Before using food as a reinforcer, practitioners should determine the 
learner has no medical conditions that might preclude delivering food as a 
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Shaping 


LEARNING OBJECTIVES 


Define and provide examples of shaping. 


Explain the process of shaping in terms of the behavioral contingencies used. 
Differentiate between shaping across different and within the same response topographies. 
Explain ways in which the efficiency of shaping can be increased. 


State the guidelines for implementing shaping effectively. 


An Amazing Display! 


In the summer of 1955 I was exploring some simple 
schedules of reinforcement in normal preschool children, 
using a variety of contingently delivered trinkets and bits 
of candies and cookies. The research was being conducted 
in a newly designed and equipped trailer laboratory at the 
University of Washington. In the far wall of the labora- 
tory room as one entered was a large one-way glass, and 
beneath it a lever for responding (actually an O’ Cedar 
mop handle). Above the lever was a recessed light bulb 
and below the lever was a small opening for delivering 
“goodies.” 

A child’s responses on the lever were carried to a 
circuit which was programmed so that each reinforced 
response was accompanied by a light flash and a short buzz 
from the dispenser motor, indicating that a goodie was 
being delivered. The experimenter operated behind the wall 
with the one-way glass and apparatus. 

One day I invited Bill Verplanck, a visiting profes- 
sor in the Department of Psychology at the University of 
Washington, to join me in observing the performances of 
the children. He was impressed with what he saw: children 
responding in a fairly orderly fashion to fixed and variable 
ratio and interval schedules of reinforcement. 

About halfway through the session, Verplanck sug- 
gested that we explore the shaping of operant behavior. We 
decided to see whether we could establish pulling the cord 
of the window shade on the window at the far side of the 
room, which was about fourteen feet away from the wall 
where the lever was situated. The procedure consisted of 
hand-shaping successive approximations to the final task, 
ie., reinforcing getting off the chair, moving farther and 
farther away from the lever, raising the hand, grasping and 
then pulling on the window shade cord. Initial movements 
away from the lever were a bit slow in developing because 
each time the child was reinforced for moving away, he 
had to turn back or run back to get the goodie lodged in the 
receptacle. In a relatively short time, however, the response 


was well established: After snatching up a goodie, the child 
would run across the room, pull on the shade cord, and as 
soon as he heard the motor buzz, he would dash back to the 
receptacle and retrieve the goodie. 

This was an amazing display! . . . Delivering rein- 
forcers contingent on the topography of successive behav- 
iors clearly showed how new operant behavior—any kind 
of new operant behavior—is probably established. I had 
just witnessed a dramatic demonstration of what we had 
for years been telling our students in introductory psychol- 
ogy, namely, that the implication of a scientific analysis of 
behavior is prediction and control of individual behavior. 

—Adapted from “In Memoriam: Sidney W. Bijou, 
1908-2009,” by M. Dougher, 2009, The Behavior Analyst, 
pp. 363-364. Originally appeared in Behavior Modifica- 
tion: Behavioral Approaches to Human Problems, by 

W. H. Redd, A. L. Porterfield, and B. L. Andersen, 1978. 
New York: Random House. Used with permission. 


haping is used in numerous situations to help learners 

acquire new behaviors. Wolf, Risley, and Mees (1964), 

for example, employed shaping in an applied setting to 
increase eyeglass wear for a child who was in danger of losing 
his eyesight if he did not wear corrective lens regularly. Athens, 
Vollmer, and St. Peter Pipken (2007) used shaping to improve 
the academic behavior of students with speech, language, and 
learning disabilities. Researchers and trainers have used shaping 
to teach functional behavior to animals (e.g., walking horses 
into a trailer without injury to the horses or the attendants), for 
appeal or utility purposes (e.g., teaching porpoises to execute 
show routines), for humanitarian reasons (e.g., conducting 
mine sweeping operations in formerly embattled combat areas), 
for security reasons (e.g., sniffing out explosives in airports, 
terminals, or high-traffic public venues), for medical assess- 
ments (e.g., detection of tuberculosis or necrotic tissue), and for 
assisting animals to remain calm during immunization protocols 
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and dental examinations and procedures (Poling et al., 2017; 
Pryor & Ramirez, 2014). 

Depending on the complexity of a given behavior, or the 
gap between the learner’s present level of performance and the 
desired target (terminal) behavior, shaping may require numer- 
ous successive approximations before the terminal behavior is 
achieved. Acquiring the terminal behavior with respect to time, 
trials, or direction is seldom predictable, immediate, or linear. 
It is possible that a learner—human or animal—might emit a 
closer approximation to the terminal behavior that a practitioner 
fails to detect and reinforce. In this situation, achievement of 
the terminal behavior will be delayed. However, if a systematic 
approach is used—that is, if each instance of a closer approxi- 
mation to the terminal behavior is detected and reinforced while 
simultaneously placing prior behaviors on extinction—progress 
can be attained more quickly. 

Although shaping a behavior can be time-consuming, it 
is an important tactic for teaching new behaviors (Fonger & 
Malott, 2018). Shaping should be in every behavior analyst’s 
repertoire, especially in those instances where it is unlikely that 
novel target behaviors will be learned by following instructions, 
gaining incidental exposure, or providing models, physical cues, 
or verbal prompts. 

This chapter defines shaping, differential reinforcement, 
and successive approximations; presents examples of how to 
shape behavior across and within different response topogra- 
phies; and suggests ways to improve shaping efficacy. Clicker 
training is illustrated as a method that trainers use to shape new 
behaviors in animals and humans. Next, emerging applications 
of shaping, and how shaping can be applied to large-scale social 
and humanitarian endeavors, is presented. Finally, we provide 
guidelines for implementing shaping and conclude the chapter 
with “how to learn to shape behavior.” 


SHAPING DEFINED 


Shaping consists of a three-part process whereby the analyst 
(a) detects a change in the learner’s environment, (b) makes 
a discriminated judgment about whether that change is a pro- 
gressively closer approximation to a terminal behavior of inter- 
est; and then (c) differentially reinforces that closer successive 
approximation. In short, shaping is defined as the differential 
reinforcement of successive approximations toward a terminal 
behavior. Detecting means that the shaper must use one or more 
of his or her physical senses to see, hear, or otherwise note that 
a change has occurred. Discriminated judgment means that the 
shaper must answer the following question: “Is what I just saw 
(or heard) a closer approximation to the desired terminal behay- 
ior or not?” Differential reinforcement, as applied in shaping, 
refers to presenting an unconditioned or conditioned reinforcer 
only to those emitted members of a response class that share a 
specified dimension or quality, while placing all other emitted 
response class members on extinction. ! 

In other words, the shaper, after detecting a change, must 
decide if that change is in the direction of an ever-evolving 
nearer approximation to the desired terminal behavior. If so, the 
analyst reinforces it immediately. If, after detecting a change, 


the analyst deems that the change occurred, but that it was not 
a closer approximation to the terminal behavior, then that pre- 
viously emitted behavior is placed on extinction, meaning the 
analyst does not reinforce it. 

For example, assume that a respiratory therapist (RT) is 
charged with increasing the lung capacity of a patient recovering 
from pneumonia. She uses a spirometer, which measures lung 
capacity volume in milliliters (ml). The physician directing the 
treatment sets the patient’s goal for 1750 ml, but the patient’s 
present lung volume breathing behavior registers only 250 ml 
(i.e., a volume gap of 1500 ml). To use shaping, the RT first 
closely observes the patient to note the occurrence of a breath- 
ing volume change—any change. If any progressive breathing 
attempt exceeds 250 ml by any volume level (e.g., 250.01, 251, 
255 ml) praise is delivered (“Well done, Yuri! That’s higher!”’). 
Any lower volume inhalation response level is placed on extinc- 
tion.” Figure 22.1 shows a series of four initial trials illustrating 
this process as the patient works his way toward his goal. 

Behavior analysts should not be deceived, however, by the 
apparent step-wise progression of improved breathing levels in 
Figure 22.1 (e.g., 300 ml, 325 ml, 500 ml). These levels were 
arbitrarily provided as a pedagogical illustration. Shaping, in its 
procedurally accurate application, is absolutely not a step-wise, 
fixed-gradient process that would require the patient to go from 
250 ml to 300 ml to 350 ml and so on upward to the goal of 
1750 ml. Any breathing behavior that moved the volume indi- 
cator level incrementally higher toward the physician-directed 
goal of 1750 ml (say, 250.1 ml to 276.5 to 339, etc.) would meet 
the criterion of being a successive approximation to 1750 ml and 
is reinforced. Any attempts lower than an already established 
higher level would be placed on extinction. 

In Science and Human Behavior, Skinner (1953) presented 
the concept of shaping with an analogy: 


Operant conditioning shapes behavior as a sculptor shapes 
a lump of clay... . The final product seems to have a 
special unity or integrity of design, but we cannot find a 
point at which this suddenly appears. In the same sense, an 
operant is not something which appears full grown in the 
behavior of the organism. It is the result of a continuous 
shaping process. (p. 91) 


Through careful and skillful manipulation of the original, 
undifferentiated lump of clay, the artisan keeps some aspects 
of the clay in its primary location, cuts other pieces away, 
and reforms and molds still other sections so that the form is 
slowly transfigured into the final sculpted design. Similarly, 
the skillful practitioner shapes novel forms of behavior from 
responses that initially may bear little resemblance to the final 
product. To restate, a practitioner using shaping differentially 
reinforces successive approximations toward a terminal behav- 
ior, while simultaneously placing on extinction attempts that 
do not progress toward the terminal behavior. The end product 
of shaping—a terminal behavior—can be claimed when the 
topography, rate, latency, duration, interresponse time, or mag- 
nitude of the target behavior reaches a predetermined criterion 
level (e.g., the physician-determined 1750-ml lung capacity 
level in our example). The two key procedural components of 
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that attempt would have 
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Figure 22.1 Illustrates the process by which a respiratory therapist (a) detects a change in breathing volume, (b) makes a 
judgment about whether that change is a successive approximation in the direction of the terminal behavior (i.e., 1750 ml); 
and (c) differentially reinforces that closer successive approximation. 


shaping—differential reinforcement and successive approxi- 
mations—are presented next. 


Differential Reinforcement 


When a heavy ball is thrown beyond a certain mark, when 
a horizontal bar is cleared in vaulting or in jumping, when 
a ball is batted over the fence (and when, as a result, a 
record is broken or a match or game won), differential 


reinforcement is at work. 
HB. FSkinner (1953, p. 97) 


To reiterate, differential reinforcement is a procedure in which 
reinforcement is delivered for responses that share a prede- 
termined dimension or quality, and reinforcement is withheld 
for responses that do not demonstrate that quality (i.e., these 
responses are placed on extinction). Differential reinforcement 
has three fundamental effects: (a) Responses within a response 
class similar in function to those that have been reinforced in 
the past are more likely to reoccur; (b) responses resembling 
the unreinforced members are not as likely to reoccur (i.e., 
these responses undergo extinction); and (c) when responses 
are unreinforced and undergo extinction, extinction-induced 
variability may occur. Extinction-induced variability is 
the phenomenon of non-reinforcement of a response class 
whereby a temporary increase in the changeability of the 
response class topography occurs, and which may produce, by 
chance alone, a closer approximation to the terminal behavior. 
Extinction-induced variability addresses a change in the form of 
the response, whereas extinction burst addresses a change in the 


rate of the response. 


When differential reinforcement is applied consistently 
within a response class, its triple effect results in a new response 
class composed primarily of responses sharing the characteris- 
tics of the previously reinforced subclass. This emergence of a 
new response class is called response differentiation. Catania 
(2013) provides a succinct description of how extinction- 
induced variability works with respect to establishing response 
differentiation. 


Shaping is based upon differential reinforcement. At 
successive stages, some responses are reinforced and 
others aren’t. In addition, the criteria for differential 
reinforcement change as responding changes, in successive 
approximations to the response to be shaped. The property 
of behavior that makes shaping effective is that behavior 
is variable. No two responses are the same, and reinforce- 
ment of one response produces a spectrum of responses, 
each differing from the reinforced response along such 
dimensions as topography (form), force, magnitude and 
direction. Of these responses, some are closer to the 
response to be shaped than others and may be selected to 
be reinforced next. Reinforcing these responses produces 
still others, some of which may come even closer to the 
response to be shaped. Thus, reinforcement can be used to 
change the spectrum of responses until the one to be shaped 
occurs. (p. 114) 


Successive Approximations 


A successive approximation refers to the gradual and pro- 
gressive criterion change for delivering reinforcement for 
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a behavior that is closer to the terminal behavior than the 
response it replaces. In other words, once a more incremen- 
tal behavior occurs that resembles the terminal behavior more 
closely, reinforcement is delivered. When an initially reinforced 
response is established, the practitioner shifts the criterion for 
reinforcement to responses that are even a closer approxima- 
tion of the terminal behavior. However, care must be applied in 
shifting the criterion. If the criterion is set too low, an excessive 
number of behaviors at the same behavior-performance level 
will be reinforced, stalling progress. If the criterion is set too 
high, extinction occurs and progress is thwarted (Hanley & 
Tiger, 2011; Noell, Call, & Ardoin, 2014). Skinner (1953) 
discussed the critical nature of successive approximations as 
follows: 


The original probability of the response in its final form is 
very low; in some cases, it may even be zero. In this way, 
we build complicated operants that would never appear in 
the repertoire of the organism otherwise. By reinforcing 

a series of successive approximations, we bring a rare 
response to a very high probability in a short time. This is 
an effective procedure because it recognizes and utilizes the 
continuous nature of a complex act. (p. 92) 


Figure 22.2 illustrates a progression of successive approx- 
imations that might be used to shape drawing uniformly spaced 
horizontal lines. Inset | contains products of the learner’s current 
repertoire (Skinner’s “undifferentiated lump’). Inset 6 shows 
products of the desired terminal behavior. Shaping begins with 


[=] 


S*+ longer scribbles 


the learner’s current behavior; in this case, by detecting and then 
reinforcing longer scribbles (star-marked scribbles in Inset 1) 
and placing shorter scribbles on extinction. The resulting 
response differentiation produces a new response class (i.e., 
longer scribbles) (see Inset 2). The shaper shifts the criteria 
for reinforcement to more closely approximate the terminal 
behavior—in this example, longer scribbles that do not cross 
themselves. Insets 3 through 5 illustrate the new response classes 
that might emerge from differential reinforcement applied to 
each previous response class. 


Shaping Different Dimensions of Performance 


Behavior can be shaped along any of its measurable and mallea- 
ble dimensions (see Table 22.1). 

Let’s focus on magnitude for a moment—the last dimension 
presented in Table 22.1—as it relates to teaching a child to talk 
within a conversational decibel range. Let us suppose a behavior 
analyst is working with a student who talks at such a low volume 
(e.g., below 45 decibels) that his teacher and peers have difficulty 
hearing him. Projected and planned intermediate approximations 
to 65 decibels (dB)—the amplitude of normal conversational 
speech—might be 45 dB, 55 dB, and ultimately 65 dB. With dif- 
ferential reinforcement, any response at or above 45 dB (45.1 
dB, 47.7 dB, 50.3 dB) would be reinforced, whereas speaking 
responses that occurred at 45 dB or lower would be placed on 
extinction. With successive approximations, the criterion for rein- 
forcement would be raised gradually and incrementally to 55 dB, 
and finally 65 dB. In effect, lower magnitude response levels are 


S*+ longer, uncrossed scribbles > 45° S*+ still longer, horizontal lines within a 
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Figure 22.2 Shaping scribbles into uniformly spaced horizontal lines. Stars mark reinforced responses. 


TABLE 22.1 Performance Dimensions That Can Be Shaped 
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Performance Dimension Example 


Topography (form of the behavior) 


Rate (number of responses per e 
unit of time) 


Latency (time between the onset e 
of the antecedent stimulus and the 
occurrence of the behavior) 


Refining motor movements associated with a golf swing, throwing motion, or vaulting behavior. 
Improving cursive or manuscript letter formation during handwriting exercises. 

Increasing the number of problems completed per minute during a math seatwork assignment. 
Increasing the number of correctly spelled and appropriately used words written per minute. 


Decreasing compliance time between a parental directive to “clean your room” and the 
onset of room-cleaning behavior. 


e Increasing the delay between the onset of an aggressive remark and retaliation by a student 


with severe emotional disabilities. 


Duration (elapsed time from the e 
onset of a response to its end point) 


Interresponse time (IRT) ° 


Increasing the time a student stays on task. 
e |ncreasing the number of minutes of engaged study behavior. 


Reducing rapid eating by increasing the time (IRT) between food bites. 


e Reducing smoking by increasing the time (IRT) between lighting a cigarette. 


Magnitude (response strength ° 
or force) 


not reinforced; higher magnitude responses (i.e., those that are 
successively above 45 dB, 55 dB, and 65 dB) are reinforced. 

Fleece and colleagues (1981) used shaping to increase the 
voice volume of two children enrolled in a private preschool for 
students with physical and developmental disabilities. Baseline 
data were collected on voice volume in the regular classroom. 
Voice volume was measured on a 0- to 20-point scale, with 0 indi- 
cating that the child’s voice level was inaudible, 10 indicating 
normal voice volume, and 20 indicating screaming. The shaping 
procedure consisted of having the children recite a nursery rhyme 
in the presence of a voice-activated relay device, whereby voice 
volume activated a light display. The intensity of the light corre- 
sponded to increased levels of voice volume: Higher voice vol- 
ume produced a brighter light, and lower voice volume produced 
a dimmer light. The teacher shaped voice volume by increasing 
the sensitivity threshold of the relay device. Each raising of the 
volume level required to activate the light represented a successive 
approximation to the terminal volume. 

Figure 22.3 shows an analysis of the children’s improved 
performance. Also, the children’s voice volume remained high 
after a 4-month period. Finally, according to anecdotal reports 
by school staff, the children’s higher voice volume generalized 
to other settings beyond the classroom. 

Applied behavior analysts can measure voice volume 
and apply differential reinforcement with advanced digital 
recording technology. Edgerton and Wine (2017), for example, 
used a digital tablet and app (Voice Meter Pro™) to measure 
the magnitude of a boy’s utterances that frequently fell below 
conversational volume. The app displayed a thermometer-style 
gauge and three emoji faces (e.g., neutral, positive, and negative) 
that coupled visually signaled calibrated changes in voice mag- 
nitude with text prompts (“Speak up, I can’t hear you!,” “That’s 
better,’ and “That’s too loud!’’). Although Edgerton and Wine 
did not use shaping—they implemented a package intervention 
of differential reinforcement, visual feedback, and corrective 
statements—the app they employed was adjustable and could 


be used in shaping voice volume. 


Increasing the projected voice volume from 45 dB to 65 dB. 
e Increasing the required torque on a ratchet wrench from 20 foot pounds to 30 foot pounds. 


SHAPING ACROSS AND WITHIN 
RESPONSE TOPOGRAPHIES 


Shaping behavior across different response topographies 
means that selected members of a response class are dif- 
ferentially reinforced, whereas members of other response 
classes are placed on extinction. Many complex behaviors 
comprise component response classes, each class a necessary, 
but insufficient, contributor to the required performance. 
Speaking, for example, involves coordinated tongue and lip 
movements, speech sounds, one-word utterances, phrase or 
sentence productions, and so forth. When shaping behavior 
across different response topographies, the practitioner grad- 
ually increases the criterion of performance before delivering 
reinforcement. 

Isaacs, Thomas, and Goldiamond (1960) reported a classic 
study showing how behaviors can be shaped across and within 
response topographies. They successfully shaped the vocal 
behavior of Andrew, a man diagnosed with catatonic schizo- 
phrenia, who had not spoken for 19 years despite numerous 
efforts to encourage speech production. Essentially, the shaping 
procedure was initiated when an astute psychologist noticed that 
Andrew’s passive expression changed slightly when a package 
of chewing gum inadvertently dropped on the floor. Realizing 
that gum might be an effective reinforcer for building behaviors 
in the response class of talking, the psychologist selected speech 
production as the terminal behavior. 

The next decision was to select an initial behavior to 
reinforce. Lip movement was chosen because the psychologist 
noted that slight lip movements had occurred in the presence 
of the pack of gum, and more important, lip movement was 
in the response repertoire of speech. As soon as lip move- 
ment was established by differential reinforcement, the psy- 
chologist waited for the next approximation of the terminal 
behavior. During this phase, lip movement alone was no lon- 
ger reinforced; only lip movements with sound produced the 
reinforcement. When Andrew began making guttural sounds, 
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vocalizations were differentially reinforced. Then the guttural 
sound itself was shaped (differential reinforcement within a 
response topography) until Andrew said the word gum. After 
the 6th week of shaping, the psychologist asked Andrew to 
say gum, to which Andrew responded, “Gum, please.” During 
that session and afterward, Andrew went on to converse with 
the psychologist and others at the institution about his identity 
and background. In this powerful demonstration of shaping, 
after selection of the terminal behavior and the initial starting 
point, each member of the response repertoire was shaped by 
differential reinforcement of successive approximations to the 
terminal behavior. 

Shaping a behavior within a response topography means 
that the form of the behavior remains constant, but differential 
reinforcement is applied to another measurable dimension of the 
behavior (Noell et al., 2014). To illustrate, let us suppose that in 
a college-level physical education class, the teacher is instruct- 
ing the students on water rescue. Specifically, she is teaching 
them how to throw a life preserver a given distance to a person 
struggling in the water. Since the important skill in this activity 
is to throw the life preserver near the person (i.e., within reach), 
the physical education teacher might shape accurate tossing by 
reinforcing successive approximations to a toss of a given dis- 
tance. In other words, each toss that is near the person (e.g., 
within 2 meters) will be praised, whereas tosses outside that 
range will not be reinforced. As students become more accu- 
rate, the area is reduced so that the terminal behavior—tosses 
within arm’s length of the person—is achieved. In this case, the 
magnitude of the behavior is being shaped; the form of the toss 
remains the same. It is important to remember that the behavior 
being shaped is not tossing per se. What is being shaped by 
differential reinforcement is the desired dimension of tossing 
behavior within the response class, namely, the magnitude of 
tossing the preserver. 
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Beneficial Aspects of Shaping 


Shaping uses a positive (i.e., reinforcing) approach to teach new 
behaviors. Also, it is systematic: Reinforcement is delivered con- 
sistently upon the occurrence of any successive approximations 
to the terminal behavior, whereas non-approximations are 
placed on extinction. Punishment or other aversive procedures 
are typically not involved in a shaping program. Further, 
shaping is usually combined with other established behavior 
change or behavior-building procedures (e.g., verbal prompts 
or chaining). 

Harrison and Pyles (2013) combined verbal prompts with 
shaping to improve tackling by high school football players. 
Figure 22.4 shows the results for three players during football 
practice and game conditions where speed was gradually shaped 
from walking to jogging to running. 

Shaping might further be combined with a chaining pro- 
cedure when a given behavior within the chain is nonexistent 
or underdeveloped.* Suppose that a behavior analyst designed 
a seven-step task analysis to teach a child to tie his shoes. 
However, the child was unable to complete Step 5 in the task 
analysis. Shaping could be used in isolation to teach a closer 
approximation to that step. Once Step 5 was learned through 
shaping, chaining would continue with the rest of the steps in 
the task analysis sequence. 


Limitations of Shaping 


At least five limitations of shaping can be identified. Practi- 
tioners should be aware of these limitations and be prepared to 
deal with them if they arise. 


1. Shaping new behavior can be time-consuming because 
numerous approximations may be necessary before the 
terminal behavior is achieved. 


PERCENTAGE OF CORRECT TRIALS 


(Tackling) 


Chapter 22 * Shaping 591 


SHAPING PROGRESSIVE SPEED 
BASELINE Si  S1-S2 S1-S3 SI-S4 WALK JOG RUN 
100 e3o 
0} | 4 Sma se vaca: 
80 
70 4 - < 
60 | 
50 4 eres ‘i 
40 4 
30 4 
20 4 
Datietae huni 
0 
arapie| ep T a T 1 a 
100 - al ee |ee eo |e 2 
so | Site < 
80 | 
70 4 
60 + 
50 | ° 
40 
20 | 
20 4 ett 
10 | DANIEL 
OR aa or aa | 1] i ae ts 
— 
100 - eo |ee jee (ee eo 
90 5 va Figure 22.4 Percentage of correct 
30 4 tackling trials across all training 
70 4 Whine . phases. Open data points represent 
60 generalization probes of a complete 
50 | “ tackle against a live ball carrier. (In the 
40 shaping procedure, S = skill.) 
30 ] From “The Effects of Verbal Instruction and 
20 © e Shaping to Improve Tackling by High School 
| Football Players,” by A. H. Harrison and D.A. . 
10 Pyles, 2013, Journal of Applied Behavior Analysis, 
Oo Sy 7h a See = 46(2), p. 521. Copyright by the Society for the 
Qi wea PGR GID Ze Tae OM TI8720 22424 2640.28 65305 132 Experimental Analysis of Behavior, Inc. Reprinted 
SESSIONS by permission. 
Progress toward the terminal behavior is not always lin- for performance at a given approximation continues lon- 
ear. That is, the learner does not always proceed from ger than necessary, progress toward the terminal behavior 
one approximation to the next in a continuous, progres- will be impeded. 
sive, or errorless sequence. Progress may be erratic. If 3. Shaping requires continuous monitoring to detect sub- 
the behavior is too irregular (i.e., not resembling a closer tle changes in the learner’s performance, indicating 
approximation to the terminal behavior), an approxima- the next approximation to the terminal behavior. Many 
tion may need to be further reduced, allowing for more practitioners—for example, teachers in busy or demand- 
reinforcement and progress. The skill of the practitioner in ing classrooms—are not able to monitor behavior closely 
noting and reinforcing the next smallest approximation to enough to note small changes. Consequently, shaping 
the terminal behavior is critical to the success of shaping. may be conducted inappropriately or at least inefficiently. 
If the practitioner fails to reinforce responses at the next 4. Shaping can be misapplied. Consider the child trying 


approximation—because of inexperience, preoccupation 
with other tasks, or neglect—the occurrence of similar 
responses may be few and far between. If reinforcement 


to obtain her father’s attention by emitting low-volume 
demands (e.g., “Dad, I want some ice cream.’’). The father 
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does not attend to the initial calls. As the child’s attempts 
become increasingly unsuccessful (1.e., not reinforced), 
she may become more determined to gain her father’s 
attention. In doing so, the number and magnitude of the 
calls may increase due to extinction-induced variability 
(e.g., “DAD, I WANT ICE CREAM!”). After listening 
to the crescendo of vocal demands, the father eventually 
provides the ice cream. The next time, however, it may 
come to pass that the child states what she wants in an 
even louder voice before gaining her father’s attention. 
In this scenario, the father has differentially reinforced an 
ever-rising level of attention-getting behavior and shaped 
higher levels of call-outs for ice cream. Using this exam- 
ple as a backdrop, Skinner (1953) anticipated the result 
of such a scenario: “Differential reinforcement applied 
by a preoccupied or negligent parent is very close to the 
procedure we should adopt if we were given the task of 
conditioning a child to be annoying” (p. 98). 


5. Harmful behavior can be shaped. For example, Rasey and 
Iversen (1993) showed that differential reinforcement shaped 
a rat’s behavior to the point that the rat fell from the edge of 
a platform. By differentially reinforcing the rat for sticking 
its nose farther over the edge for food, the rat eventually fell 
off the ledge.* It is not difficult to speculate how adolescent 
games such as Dare and Double Dare and videos that cap- 
italize on persons receiving differential reinforcement from 
peers or “likes” and encouraging comments on social media 
for increasingly higher levels of risk taking can lead to dan- 
gerous—and sometimes tragic—behavioral outcomes. 


Shaping Versus Stimulus Fading 


Shaping and stimulus fading change behavior gradually, albeit 
in much different ways. In shaping, the antecedent stimulus for 
the terminal behavior stays the same, while the response pro- 
gressively becomes more differentiated. In stimulus fading, the 
opposite occurs: The antecedent stimulus changes gradually, 
while the response stays essentially the same. 


INCREASING SHAPING EFFICIENCY 


In addition to showing how behaviors are shaped across and 
within response topographies, the Isaacs and colleagues (1960) 
study illustrates another aspect of shaping—efficiency. During 
the early stages of the program, the psychologist waited for the 
next approximation of the behavior to appear before delivering 
the reinforcer. Since waiting can be time-consuming and waste- 
ful, Isaacs and colleagues improved efficiency by using a verbal 
response prompt, “Say gum,” after the sixth training session. 
Presumably, if the psychologist had not used a verbal prompt, 
several additional sessions would have been necessary before a 
successful outcome was achieved. Shaping may become more 
efficient by using a discriminative stimulus, physical guidance, 
imitative prompts, and/or a percentile schedule. 


Discriminative Stimulus 


A discriminative stimulus (S?) may be combined with shaping. 
For example, when attempting to shape hand shaking as a greeting 


skill for an adult with developmental disabilities, the teacher might 
say, “Frank, hold out your arm.” Scott, Scott, and Goldwater 
(1997) used a discriminative stimulus in the form of a vocal prompt 
(“Reach”) as a university-level pole-vaulter ran a path to plant his 
pole in the vault box. The prompt was designed to focus the vault- 
er’s attention on extending his arms prior to take-off in the vault. 

Kazdin (2013) suggested that priming a response using 
any of a variety of prompting mechanisms (Ss) can be helpful, 
especially if the individual’s repertoire is weak and the likeli- 
hood of distinguishable successive approximations is low. “Even 
if the responses are not in the client’s repertoire of responses, the 
priming procedure can initiate early response components and 
facilitate shaping” (p. 277). 


Physical Guidance 


The second way to enhance shaping is through physical guid- 
ance. In the hand shaking example with Frank, cited earlier, the 
teacher might tap his arm manually, assist Frank with holding 
out his arm, move his arm forward with light pressure in a hand 
shaking motion, or hold his arm firmly through the full motion. 
Any physical guidance prompt used to enhance or speed up 
shaping should later be faded. 


imitative Prompts 


Using an imitative prompt to demonstrate arm extension (e.g., 
“Frank, hold out your arm like this.”) may improve shaping’s 
efficiency. As with physical guidance, any imitative prompt 
should ultimately be faded. 


Percentile Schedules 


Because shaping requires a skilled observer to (a) detect behav- 
ior changes, (b) apply differential reinforcement to ever-closer 
approximations to a terminal behavior, and (c) place non- 
approximations on extinction, shaping has been characterized 
as more “art than science” (Lattel & Neef, 1996). Galbicka 
(1994) stated that practitioners would benefit from moving from 
a qualitative-based, observer-detecting standard for determining 
if the next response in a shaping program should be reinforced 
to a quantitative standard. In his view, a percentile schedule 
enables more precise shaping procedures, allows implementa- 
tion by multiple trainers across settings, reduces the time it takes 
to complete a shaping sequence, and quantitatively moves closer 
to Skinner’s conception of shaping. 


Percentile schedules disassemble the process of shaping 
into its constituent components, translate those components 
into simple, mathematical statements, and then use these 
equations, with parameters specified by the experimenter or 
trainer, to determine what presently constitutes a criterional 
response and should therefore be reinforced. (p. 740) 


Using a mathematical equation (i.e.,k =(m+1)(1—w)) 
and a computer software program, a percentile schedule continu- 
ally calculates and adjusts the criterion for reinforcement based 
on recent observations. 


In this equation, w denotes the density of reinforcement, 
and m is a fixed number of recent observations. The k 


parameter specifies what response value out of m most 
recently observed response values an upcoming response 
must exceed to satisfy the criterion for reinforcement. 

For example, with a w value of .5, a response meeting the 
criterion for reinforcement for a given observation will be 
observed half the time, offering a mixture of reinforcement 
and extinction of responses. An m value of 10 means that 
the 10 most recent observations are ranked according to 
their ordinal value from least to most, keeping the shaping 
procedure in touch with the individual’s current repertoire. 
Solving the equation using these values gives a k value of 
5.5. In most cases, it is easier to implement the percentile 
schedule by rounding the k value to a whole number. In the 
example, rounding k to 5 keeps the reinforcement density 
at approximately half of all responses and makes the crite- 
rion for reinforcement slightly less stringent than a larger k 
value. The k value of 5 denotes the observation ranked fifth 
among the 10 most recently observed and ranked observa- 
tions is the value the current observation must exceed to 
meet the criterion for reinforcement. (Athens, et al., 2007, 
pp. 475-476) 


Several applied studies have evaluated percentile sched- 
ules with respect to shaping smoking cessation (Lamb, Kirby, 
Morral, Galbicka & Iguchi, 2010; Lamb, Morral, Kirby, 
Iguchi, & Galbicka, 2004), increased duration of eye contact 
(Hall, Maynes & Reiss, 2009), and increased engagement with 
academic tasks (Athens et al., 2007). In a parametric study of 
the effectiveness of percentile schedules in shaping academic 
task engagement, Athens and colleagues (2007) compared 
three values of m (5, 10, and 20). Their findings indicated that 
the percentile schedule was most effective when a relatively 
large number of previous observations (N = 20) were taken 
into account. In discussing the limitations of their study and 
comparing findings with those of other studies, Athens et al. 
noted that while “the procedure appears to be promising,” the 
“potential utility of a percentile schedule for clinicians and 
applied researchers remains to be seen” (p. 486). 

We agree with Athens et al. to the extent that a per- 
centile schedule is encouraging. However, for greater util- 
ity in applied settings, practitioners would need to be well 
versed in using their recommended mathematical equation 
and be proficient with using the computer software program 
in “real time.” Also, whether a percentile schedule increases 
shaping’s efficiency over conventional applications remains 
an empirical question. Additional experimental studies that 
examine efficiency as a dependent variable will contribute 
to establishing percentile schedules as a viable alternative to 
conventional applications. 


CLICKER TRAINING 


Pryor (1999) described clicker training as a science-based 
system for progressively shaping new behavior using positive 
reinforcement and extinction. The clicker is a handheld device 
that produces a clicking sound when a metal tab is pressed. 
Reinforcement is paired with the sound of the clicker so that 
the sound becomes a conditioned reinforcer. Initially used 
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to shape dolphin behavior without physical guidance (Pryor 
& Norris, 1991), clicker training was later used with other 
animals (e.g., cats and horses) and ultimately with humans 
to shape such complex behaviors as airplane pilot skills 
(M. Pryor, 2005). Other researchers and practitioners (e.g., 
Poling et al., 201 1a) have used clicker training as part of their 
overall shaping program to teach African giant pouched rats 
how to detect land mines. 


Clicker trainers focus on building behavior, not 
stopping behavior. Instead of yelling at the dog 

for jumping up, you click it for sitting. Instead of 
kicking the horse to make it go, you click it for 
walking. Then, click by click, you “shape” longer 

sits, or more walking, until you have the final results 
you want. Once the behavior is learned, you keep it 
going with praise and approval and save the clicker and 


treats for the next new thing you want to train. (Pryor, 
2002, p. 1) 


Figure 22.5 provides 15 tips for getting started with clicker 
training (Pryor, 2002). 


EMERGING APPLICATIONS OF SHAPING 


At least three emerging applications of shaping can be con- 
sidered: using computers to teach shaping skills, combining 
shaping procedures with robotics engineering, and applying 
telemedicine technology to implement shaping and differen- 
tial reinforcement programs in hard-to-access urban, rural, or 
remote settings. 


Using Computers to Teach Shaping 


Several options are currently available to teach shaping skills 
using computers, or a combination of specialized software with 
computer-based applications. For example, Sniffy the Virtual 
Rat is a computer-based, digitized animation of a white rat 
within a Skinner box. Students practice basic shaping on their 
own personal computers using this virtual simulation. 

Acknowledging the practical limitations of teaching behay- 
ior analysis principles (e.g., shaping) interactively to a large stu- 
dent body, Shimoff and Catania (1995) developed and refined 
computer simulations to teach the concepts of shaping.° In their 
simulation The Shaping Game, four levels of difficulty (easy, 
medium, hard, and very hard) were programmed over a series of 
refinements. Beginning with easier tasks and advancing to the very 
hard level, students are taught to detect successive approximations 
to bar-pressing behaviors that should be reinforced, and those that 
should not. In still later versions, students are expected to shape 
the movement of the animated rat from one side of a Skinner 
box to another. Shimoff and Catania made a case for computer 
simulations when they stated, “Computers can be important and 
effective tools for establishing sophisticated behavior that appears 
to be contingency shaped even in classes that do not provide real 
(unsimulated) laboratory experience” (pp. 315-316). 

Martin and Pear (2003) contended that computers might 
be used to shape some dimensions of behavior (e.g., topography) 
more quickly than humans because a computer can be calibrated 
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Push and release the springy end of the cee making a two- hee click. Then treat. Keep the treats small. Use a delicious 
treat at first: for a dog or cat, little cubes of roast chicken, not a lump of kibble. 

Click DURING the desired behavior, not after it is completed. The timing of the click is crucial. Don’t be dismayed if your pet 
stops the behavior when it hears the click. The click ends the behavior. Give the treat after that; the timing of the treat is not 
important. 

Click when your dog or other pet does something you like. Begin with something easy that the pet is likely to do on its 

own (ideas: sit, come toward you, touch your hand with its nose, lift a foot, touch and follow a target object such as a 
pencil or a spoon). 

Click once (in-out). If you want to express special enthusiasm, increase the number of treats, not the number 

of clicks. 

Keep practice sessions short. Much more is learned in three sessions of five minutes each than in an hour of boring 
repetition. You can get dramatic results, and teach your pet many new things, by fitting a few clicks a day here and 
there in your normal routine. 

Fix bad behavior by clicking good behavior. Click the puppy for relieving itself in the proper spot. Click for paws on the 
ground, not on the visitors. Instead of scolding for making noise, click for silence. Cure leash-pulling by clicking and treating 
those moments when the leash happens to go slack. 

Click for voluntary (or accidental) movements toward your goal. You may coax or lure the animal into a movement or 
position, but don’t push, pull, or hold it. Let the animal discover how to do the behavior on its own. If you need a leash for 
safety’s sake, loop it over your shoulder or tie it to your belt. 

Don't wait for the “whole picture” or the perfect behavior. Click and treat for small movements in the right direction. 
You want the dog to sit, and it starts to crouch in back: Click. You want it to come when called, and it takes a few steps 
your way: Click. 

Keep raising your goal. As soon as you have a good response—when a dog, for example, is voluntarily lying down, coming 
toward you, or sitting repeatedly—start asking for more. Wait a few beats, until the dog stays down a little longer, comes a 
little further, sits a little faster. Then click. This is called “shaping” a behavior. 


. When your animal has learned to do something for clicks, it will begin showing you the behavior spontaneously, trying to 
get you to click. Now is the time to begin offering a cue, such as a word or a hand signal. Start clicking for that behavior if it 
happens during or after the cue. Start ignoring that behavior when the cue wasn’t given. 


. Don't order the animal around; clicker training is not command-based. If your pet does not respond to a cue, it is not 
disobeying; it just hasn’t learned the cue completely. Find more ways to cue it and click it for the desired behavior. Try 
working in a quieter, less distracting place for a while. If you have more than one pet, separate them for training, and let 
them take turns. 


12. Carry a clicker and “catch” cute behaviors like cocking the head, chasing the tail, or holding up one foot. You can click for 
many different behaviors, whenever you happen to notice them, without confusing your pet. 


13. If you get mad, put the clicker away. Don’t mix scoldings, leash-jerking, and correction training with clicker training; you will 
lose the animal’s confidence in the clicker and perhaps in you. 


14. If you are not making progress with a particular behavior, you are probably clicking too late. Accurate timing is important. 
Get someone else to watch you, and perhaps to click for you, a few times. 


15. Above all, have fun. Clicker-training is a wonderful way to enrich your relationship with any learner. 


From Click to win! Clicker training for the show ring (Appendix A) by K. Pryor, 2002. Copyright © 2002 by Karen Pryor. Waltham, MA: Sunshine 
Books. Reprinted by permission. 


From K. Pryor & L. Chamberlain, 2016, https:/Awww.clickertraining.com/files/the-modern-principles-of- -shaping-SF-edits.pdf. Copyright © 2016 by 
Karen Pryor. Reprinted with permission. 


and programmed with specific decision rules for delivering rein- | Combining Shaping with Robotics Engineering 
forcement. They further suggested that in medical application 
cases (€.g., amputees, stroke victims), minute muscle move- 
ments may go undetected by a human observer, but a micropro- 
cessor could be calibrated to detect shapeable responses. In their 
view, “Computers . . . are both accurate and fast and may there- 
fore be useful in answering fundamental questions concerning 
which shaping procedures are most effective . .. computers may 
be able to shape at least some kinds of neheaios as effectively Given that shaping is a significant determinant of how 
as humans” (p. 133). a wide variety of organisms adapt to the changing 


Dorigo and Colombetti (1998) and Saksida, Raymond, and 
Touretzky (1997) have explored how shaping can be applied 
to robot training. Essentially, shaping is used as one method of 
programming robots to progress through initial, intermediate, 
and final goals to achieve more complicated sets of commands. 
According to Savage (1998): 


circumstances of the real world, then shaping may, 
indeed, have potential as a model for real-world 
robotic interactions. However, the success of this 
strategy depends on a clear understanding of the 
principles of biological shaping on the part of the 
roboticists, and the effective robotic implementation 
of these principles. (p. 321) 


Although Dorigo and Colombetti (1998) and Saksida and 
colleagues (1997) concede the difficulty of applying successive 
approximations to robots, this line of inquiry, combined with 
emerging knowledge on artificial intelligence, offers exciting 
prospects for the future. 


Applying Telemedicine Technology 

to Implement Shaping and Differential 
Reinforcement Programs in Hard-to-Access 
Urban, Rural, or Remote Settings 


Telemedicine technologies involve connecting host-institution 
training centers with off-site rural or remote settings through 
high-speed, Internet-based, interactive audio-video links. Each 
site—host and remote—is equipped with the appropriate com- 
puters, cameras, monitors, and routers to enable instantaneous 
two-way communications between trainers at the host site and 
trainees at the remote site. 

These technologies, originally developed to provide 
consultation and assessment services to families living in 
rural or remote locations (Barretto, Wacker, Harding, Lee, 
& Berg, 2006), have been expanded to provide additional 
services in the form of functional analyses and interventions, 
including differential reinforcement procedures (Peterson, 
Volkert, & Zeleny, 2015). Not only have telemedicine/ 
telehealth technologies been shown to produce effective 
results, but also the added cost savings with respect to 
travel time and efficiency of implementation bode well for 
wider implementation in emerging rural/remote locations in 
the future. 

Over the past decade, new and exciting applications have 
been explored to enhance the human condition and to increase 
safety and improve public welfare. Applications that com- 
bine shaping with discrimination training have been reported 
in several general areas: detecting land mines (Edwards, Cox, 
Weetjens, Tewelde, & Poling, 2015; Poling et al., 201 1a); 
uncovering contraband (Jezierski et al., 2014); searching for, 
and locating, humans trapped in simulated collapsed buildings 
(Edwards, La Londe, Cox, Weetjens, & Poling, 2016; La Londe 
et al., 2015); and detecting tuberculosis or necrotic tissue (Poling 
et al., 201 1b; Poling et al., 2017). Poling et al. (2017), for exam- 
ple, reported that trained African giant pouched rats detected 
2000 additional active tuberculous cases compared to cases that 
reported tuberculous using only traditional microscopic analysis 
for diagnosis. 

Likewise, in the case of land mine detection, Poling et 
al. (2011a) incorporated shaping, clicker training, and differ- 
ential reinforcement of African giant pouched rats, who, after 
training, successfully searched over 93,000 m? of land (i.e., 
approximately 23 acres),° locating upwards of 100 mines and 
other explosives. The rats were 100% accurate in discriminating 
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buried land mines from other materials buried beneath the soil 
surface, a finding replicated by Edwards et al. (2015) with 
remarkable consistency. 

It is worth noting that these field-based efforts provide 
stunning examples of how shaping can be combined with other 
procedures (namely, clicker and discrimination training) to help 
communities plagued by disease or the ravages of war to reclaim 
healthier and more productive lives. 


SHAPING GUIDELINES 


The practitioner should consider various factors before decid- 
ing to use shaping. First, assess the terminal behavior and the 
available resources. For example, assume that a fifth-grade 
teacher is interested in increasing the number of math problems 
performed by a student with learning disabilities. The student 
completes 5 problems per math period, with a range of 0 to 
10 problems. If the student has the ability to check her per- 
formance independently at the end of the class period, shap- 
ing could be implemented whereby the number of correctly 
completed math problems per period would be differentially 
reinforced. Reinforcement would be presented for 5, 7, 9, and 
progressively more problems answered correctly per session. 

Second, since shaping may require multiple approxima- 
tions and additional trained personnel, the analyst should assess 
whether time constraints, staff issues, or lack of resources mili- 
tate against the use of shaping. 

Third, some behaviors seem to preclude the use of 
shaping. For instance, if the manager of a community-based 
organization (e.g., Toastmasters International) is interested 
in increasing the public-speaking repertoires of participants, 
prompting, modeling, or peer tutoring might be more efficient 
than shaping. That is, telling or showing the participant how to 
use a gesture, voice inflection, or a metaphor would likely be 
much faster than attempting to shape each of these presentation 
skills alone. 

Finally, Pryor’s original Ten Laws of Shaping (1999) have 
evolved over time into what is now termed The Modern Principles 
of Shaping. These principles can assist the practitioner in imple- 
menting shaping in applied settings (see Figure 22.6). 


Select the Terminal Behavior 


Practitioners often work with learners having multiple behavior 
challenges. Consequently, they must identify the highest priority 
behavior to change as quickly as possible. The ultimate criterion 
in this decision is the individual’s expected independence after 
an intervention—that is, the likelihood of her attaining addi- 
tional reinforcers from the environment (Brown, McDonnell, 
Snell, & Brown, 2016). For example, if a student frequently 
roams the classroom poking other students, taking their papers, 
and verbally harassing them, a shaping procedure might best 
begin with a behavior that is incompatible with roaming the 
room because of the utility it would have for the individual and 
the other students. In addition, if in-seat behavior is developed, 
the staff with whom the student interacts will likely notice and 
reinforce it. In this case, shaping should differentially reinforce 
longer durations of in-seat behavior. 


596 Part 8 * Developing New Behavior 


Figure 22.6 Pryor’s The Modern Principles of Shaping. 


earn a reinforcer. 
click should not mean two different criteria. 


criteria for previously mastered skills. 


means keeping a high rate of reinforcement. 


that the animal can easily earn reinforcers. 


Karen Pryor. Reprinted with permission. 


It is also important to define the terminal behavior pre- 
cisely. For example, a behavior analyst might want to shape the 
appropriate sitting behavior of an individual with severe intel- 
lectual disabilities. To do so, sitting might be defined as being 
located upright in the chair, facing the front of the room with 
buttocks against the bottom of the chair and back against the 
chair rest during a 15-min morning activity. Using this defini- 
tion, the analyst can determine when the behavior is achieved, 
as well as what does not constitute the behavior—an important 
discrimination if shaping is to be conducted efficiently (e.g., the 
student might be half in and half out of the seat, or the student 
might be in the seat but turned toward the back of the room). 


Determine the Criterion for Success 


After identifying the terminal behavior, the practitioner should 
specify the criterion for success. Here, the practitioner decides 
how accurate, fast, intense, or durable the behavior must be 
before it can be considered shaped. Several measures can be 
applied to establish a criterion of success. Some of the more 
common include rate, magnitude, and duration. Depending on 
the terminal behavior, any or all of these dimensions can be 
used to assess achievement. For instance, a special education 
teacher might shape rate performance in solving mathematics 
calculations problems that would proceed from the student’s 
current level of performance (say, half a problem per minute) 
to a higher rate (e.g., four problems per minute). 

Norms for success can be determined by measuring the 
behavior in a similar peer group, consulting established norms in 


1. Be prepared before you start. Be ready to click/treat immediately when the training session begins. When shaping anew 
behavior, be ready to capture the very first tiny inclination the animal gives you toward your goal behavior. This is especially 
true when working with a prop such as a target stick or a mat on the ground. 


2. Ensure success at each step. Break behavior down into small enough pieces that the learner always has a realistic chance to 
3. Train one criterion at a time. Shaping for two criteria or aspects of a behavior simultaneously can be very confusing. One 
4. Relax criteria when something changes. When introducing a new criterion or aspect of the skill, temporarily relax the old 


5. If one door closes, find another. If a particular shaping procedure is not progressing, try another way. 


6. Keep training sessions continuous. The animal should be continuously engaged in the learning process throughout the 
session. He should be working the entire time, except for the moment he’s consuming/enjoying his reinforcer. This also 


7. Go back to kindergarten, if necessary. If a behavior deteriorates, quickly revisit the last successful approximation or two so 


8. Keep your attention on your learner. Interrupting a training session gratuitously by taking a phone call, chatting, or doing 
something else that can wait often causes learners to lose momentum and get frustrated by the lack of information. If you 
need to take a break, give the animal a “goodbye present,” such as a small handful of treats. 


9. Stay ahead of your learner. Be prepared to “skip ahead” in your shaping plan if your learner makes a sudden leap. 
10. Quit while you’re ahead. End each session with something the learner finds reinforcing. If possible, end a session on a 
strong behavioral response, but, at any rate, try to end with your learner still eager to go on. 


From K, Pryor & L. Chamberlain, 2016, https:/Awww.clickertraining.com/files/the-modern-principles-of-shaping-SF-edits.pdf. Copyright © 2016 by 


the literature, or complying with international standards for accu- 
racy. For instance, norms and guidelines for reading rate by grade 
level (Kame’enui, 2002), physical education skills for children 
by age group (President’s Council on Physical Fitness, 2016), 
homework guidelines per grade level (Heron, Hippler, & Tincani, 
2003), or accepted standards for land mine detection (Poling, 
201 1a; Edwards et al., 2015) can be referenced to validate prog- 
ress toward, or the achievement of, the terminal behavior. 

In the earlier illustration, the criterion for success could be 
the student's sitting in her seat appropriately 90% of the time during 
the morning activity for 5 consecutive days. In this example, two 
criteria are specified: the percentage of acceptable sitting behavior 
per session (i.e., 90%) and the number of days that the criterion 
must be met to establish the behavior (i.e., 5 consecutive days). 
Poling and colleagues (201 1a) used multiple criteria for determin- 
ing success with respect to the accurate performance trials per day, 
and consecutive days at increasingly higher performance levels. 


Assess the Response Repertoire 


There is a two-fold reason for assessing response repertoires. 
First, having established the desired terminal behavior, it is 
important to ascertain the “behavioral gap” between the cur- 
rent level of behavior and the desired target behavior. In our 
respiratory therapy example cited earlier, that gap constituted 
1500 ml of volume difference (i.e., goal was 1750 ml; present 
level was 250 ml). Second, assessing the response repertoire 
may help to project the successive approximations that might be 
emitted in the shaping sequence. When approximation behaviors 


are known—or anticipated—the practitioner is in a better posi- 
tion to observe and reinforce an approximation when it occurs. 
However, the practitioner must recognize that projected approx- 
imations are guesstimates of behaviors. In actuality, the learner 
may emit behaviors that are merely approximations of these 
projections. The practitioner must be able to make the profes- 
sional judgment that the previously emitted behavior either is, 
or is not, a closer approximation to the terminal behavior than 
the behaviors that occurred and were reinforced in the past. 
Galbicka (1994) provides a guideline for the behavior analyst 
with respect to assessing an existing response repertoire. 


The successful shaper must carefully ascertain characteris- 
tics of an individual’s present response repertoire, explicitly 
define characteristics the final behavior will have at the end 
of training, and plot a course between reinforcement and 
extinction that will bring the right responses along at the 
right time, fostering the final behavioral sequence while 
never losing responding altogether (p. 739). 


The relevant approximations across or within a response 
repertoire can be analyzed in several ways. First, practitioners can 
consult experts in the field to determine their views on the proper 
sequence of approximations for a given behavior (Brown et al., 
2016). For example, teachers who have taught three-digit multipli- 
cation for several years can be consulted to learn their experiences 
with prerequisite behaviors for performing this type of calcula- 
tion task. Second, as stated earlier, normative data from published 
studies may be used to provide an estimate of the approximations 
involved. Third, a digital recording can be used to analyze the 
component behaviors. A slow-motion digital replay would help 
the practitioner to see motion that might be undetectable in an 
initial faster-moving live performance, but become detectable as 
the practitioner becomes skilled with observing it. Finally, the 
practitioner can perform the target behavior himself, carefully 
noting the discrete behavioral components as he produces them. 

The ultimate determination of the approximations, the 
approximation order, the duration of time that reinforcement 
should be delivered at a given approximation, and the criteria 
for skipping or repeating approximations rests with the skill 
and the judgment of the practitioner. Ultimately, the learner’s 
performance should dictate when approximation size should be 
increased, maintained, or decreased, making consistent and vig- 
ilant monitoring by the practitioner essential. 


Identifying Behaviors to Reinforce 


At the outset of any shaping session, the shaper must observe 
the participant closely to detect any movement, slight though 
it might be, in the direction of the desired terminal behavior. 
At the instant that this behavior occurs, reinforcement is deliv- 
ered. So, if an occupational therapist was shaping the terminal 
behavior of a toddler to eat a bowl of potatoes with a spoon, any 
hand or finger movement, even a twitch, would be reinforced. 
Subsequently, successive hand or finger movements toward the 
spoon would be reinforced. Thereafter, movements that succes- 
sively touched the spoon, grasped the spoon, raised the spoon, 
directed the lifted spoon toward her mouth, and so on, would be 
reinforced, whereas other behaviors would undergo extinction. 
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Conversely, the analyst would not set an initial criterion of 
raising the spoon out of the bowl. To set the behavior criterion 
at this level would be counterproductive, as raising-the-spoon 
behavior may not be in the child’s present repertoire. 


Eliminate Interfering or Extraneous Stimuli 


Eliminating distractions during shaping enhances the process. 
For example, if a parent is interested in shaping one dimen- 
sion of her young daughter’s dressing behavior (e.g., the rate of 
dressing before school), but decides to begin the shaping proce- 
dure in her bedroom, where cartoons are playing on a morning 
television show, the shaping program may not be successful. 
The cartoons will compete for the daughter’s attention. It would 
be more efficient to choose a time and a location where such 
sources of distraction can be reduced or eliminated. 


Proceed Gradually 


The importance of proceeding toward the terminal goal gradu- 
ally cannot be overemphasized. The practitioner should antic- 
ipate changes in the rate of progress and be prepared to go 
from approximation to approximation as the learner’s behavior 
dictates. Each new occurrence of a successive approximation 
to the terminal behavior must be detected and reinforced. If 
not, shaping will be, at worst, unsuccessful or, at best, hap- 
hazard, requiring much more time. Furthermore, the occur- 
rence of a behavior at a given approximation does not mean 
the next response that approximates the terminal behavior will 
be immediately or accurately produced. 

The practitioner must also be aware that many trials may 
be required at a given approximation before the subject can 
advance to the next approximation. Figure 22.7 illustrates this 
point well. Scott and colleagues (1997) found that the percent- 
age of correct arm extensions by the pole-vaulter decreased ini- 
tially as the height of the bar was raised. Whereas a 90% correct 
arm extension was noted at the initial height, the vaulter’s cor- 
rect performance typically dropped to approximately 70% each 
time the bar was raised. After successive attempts at the new 
height, the 90% correct criterion was re-established. 


Limit the Number of Approximations 
at Each Level 


Just as it is important to proceed gradually from approximation to 
approximation, it is equally important to ensure that progress is 
not impeded by offering too many trials at a given approximation. 
This may cause the behavior to become firmly established, and 
because that approximation has to be extinguished before progress 
can begin again (Catania, 2013), the more frequently reinforce- 
ment is delivered at a given approximation, the longer the process 
can take. In general, if the learner is progressing steadily, reinforce- 
ment is being delivered at the correct pace. If numerous mistakes 
are being made, or behavior ceases altogether, the criterion for 
reinforcement has been raised too quickly. Finally, if the perfor- 
mance stabilizes at a certain level, the pace of shaping is too slow. 
Practitioners may experience all three of these conditions while 
shaping. Therefore, they must be vigilant and adjust the next suc- 
cessive approximation and reinforcement schedule as necessary. 
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When the terminal behavior is demonstrated, and reinforced, it is nec- 
essary to continue to reinforce it over time; otherwise, the behavior 
will extinguish, and performance will return to a lower level. Rein- 
forcement must continue until the criterion for success is achieved, 
and a maintenance schedule of reinforcement is established. 


LEARNING TO SHAPE 


Reading the narrative, illustrations, rules, and guidelines, such 
as those presented in this chapter, can help an emerging analyst 
become a skilled behavior shaper. Such reading is necessary, but 
not sufficient, with respect to gaining proficiency as a “shaper.” 
The best way to learn how to shape behavior proficiently is to 
practice shaping behavior! So, how does one go about practic- 
ing shaping behavior? Let’s explore a few options that can be 
arranged whether you prefer to practice by yourself, with a part- 
ner, Or in a group. Whatever the arrangement, however, seek the 
advice and direction of a skilled behavior shaper who can provide 
directive feedback and reinforcement during your training. 


By Yourself 


You could practice trying to shape a simple terminal behavior 
with an animal (e.g., teaching a rat to press a bar, a pigeon to 
peck a disc, or a dog to follow a “sit” command). The simpler 
the terminal behavior and the more “controlled” the setting, 
the better (e.g., a lab, a quiet room away from distractions). 
Previewing a well-conducted shaping session on a YouTube 
video that resembles the behavior you are trying to shape, or 
consulting with a skilled colleague using Skype or FaceTime, 
might expedite the learning process for you. 

Alternatively, where access to animals, controlled lab- 
oratories, or cooperative teachers is limited, consider using a 
simulation program such as Sniffy the Virtual Rat (Alloway, 
Wilson, & Graham, 2005). Sniffy, as mentioned earlier, is a 
computer-based, digitized, virtual animation of a white rat that 
can be accessed on a personal computer, which allows one to 
practice basic shaping at any convenient time or location. 
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Figure 22.7 Percentage of 
correct arm extensions at 
seven different heights. The 
numbers contained within 
each criterion condition 
refer to the height of the 
photoelectric beam. 

From “A Performance Improvement 
Program for an International-Level 
Track and Field Athlete,” by D. Scott, 
L. M. Scott, and B. Goldwater, 1997, 
Journal of Applied Behavior Analysis, 
30, p. 575. Copyright by the Soci- 
ety for the Experimental Analysis of 
Behavior, Inc. Reprinted by permission. 
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With a Partner 


Choose a friend or colleague and play “PORTL,” an acronym for 
Portable Operant Research and Teaching Laboratory (Hunter 
& Rosales-Ruiz, 2018). Box 22.1 outlines the rationale for this 
exercise, and explains how to play PORTL with a partner for 
maximum benefit.’ 


With a Group 


Keenan and Dillenburger (2000) described a classroom activ- 
ity that teaches students shaping principles. Keenan and 
Dillenburger’s “The Shaping Game,” though sharing an iden- 
tical name with Shimoff and Catania’s (1995) computer sim- 
ulation, is an in vivo experience shared by a group and it has 
a number of variations. The principal manner in which this 
game is played is to have a volunteer student initially exit 
the classroom temporarily while the remaining class members, 
guided by the instructor, reach consensus on the target behav- 
ior to shape when the volunteer re-enters the classroom. Target 
behaviors can be simple—walking to the front of the room and 
turning off a light switch—or be more elaborate movements 
(e.g., walking to the front of the room, standing on one foot 
while touching an elbow with the opposite hand). A volunteer 
“shaper” is also chosen.® Upon re-entry, the shaper clicks and 
then reinforces the first behavior that approximates the ter- 
minal behavior. Other behaviors or movements are placed on 
extinction. The Shaping Game concludes when the volunteer 
performs the terminal behavior targeted by the class. Accord- 
ing to Keenan and Dillenburger the upshot of The Shaping 
Game is to: 


1. Teach the basic principles involved in shaping. 
- Teach about the power of contingencies of reinforcement. 


2 
3. Demonstrate the operation of stimulus control. 
4 


- Demonstrate the value and difficulty of continuously mon- 
itoring the behavior of a single organism. 


5. Demonstrate how questionnaires produce an incomplete 
account of behavior (p. 22). 
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Learn Shaping by Playing PORTL 


PORTL (Portable Operant Research and Teaching Lab) is a 
second-generation table-top shaping game, originally devel- 
oped by canine trainer Kay Laurence and now extrapolated 
to human participants by Jose Rosales-Ruiz and his graduate 
students at the University of North Texas. 


Rationale for Playing PORTL 


PORTL teaches participants to shape behavior entirely through 
delivering reinforcement and changing environmental arrange- 
ments. Early training exercises help individuals practice shaping 
basics—observing behavior, clicking a clicker at the next suc- 
cessive approximation movement of their partner, reinforcing 
simple behaviors consistently, and collecting and recording data 
collection—without the benefit of verbal instructions, models, 
gestures, or prompts. During intermediate training, individu- 
als continue to learn additional concepts, including cueing and 
stimulus control, concept formation, chaining, schedules of rein- 
forcement, superstitious behavior, extinction, and resurgence, 
to name a few. Ultimately, individuals gain experiences being 
the teacher and the learner, which helps to build understanding 
and empathy as the learner relates to the challenges the game 
presents to his or her students. 


How Is PORTL Played? 


PORTL is played between two individuals, the teacher and the 
learner, using a collection of small objects, a clicker to “mark” 
behavior, and small tokens or blocks as reinforcers. 
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SUMMARY 
Shaping Defined 


1. Shaping consists of a three-part process whereby the ana- 
lyst (a) detects a change in the learner’s environment, (b) 
makes a discriminated judgment about whether that 
change is a progressively closer approximation to a 
terminal behavior of interest, and then (c) differentially 
reinforces that closer successive approximation. Shaping 
is defined as the differential reinforcement of successive 
approximations toward a terminal behavior. 


2. A terminal behavior—the end product of shaping—can be 
claimed when the topography, rate, latency, duration, inter- 
response time, or magnitude of the target behavior reaches 
a predetermined criterion level. 


3. Differential reinforcement refers to the act of presenting 
an unconditioned or conditioned reinforcer only to those 
emitted members of a response class that share a speci- 
fied dimension or quality, while placing all other emitted 
response class members on extinction. 


At the beginning of each exercise, the teacher presents 
the learner with a selection of objects. The learner is given 
only two instructions: “Your goal is to earn as many blocks as 
possible” and “Please interact with the objects.” 

When the learner performs the correct behavior or 
an approximation toward the correct behavior, the teacher 
clicks the clicker and delivers a contingent token. The learner 
places the token into a container and then resumes interacting 
with the objects. The teacher can rearrange the objects or add 
and remove objects to help guide the learner toward the goal 
behavior. 

The teacher and learner take breaks periodically to fill out 
datasheets. The teacher records what just happened and what 
she plans to teach during the next training period. The learner 
records what he thinks he is learning and his perception of the 
exercise. Breaks can be taken after a criterion number of tokens 
have been delivered (e.g., 10 tokens) or after a certain amount of 
time (e.g., 60 seconds). Breaks allow the teacher to continually 
assess the learner’s progress and to adjust the shaping plan. 

After each exercise, the teacher and learner compare data- 
sheets. In addition, the learner can provide valuable feedback 
about what the teacher did well, as well as insight regarding 
which parts of the exercise may have been confusing and why. 


Adapted from M. Hunter and J. Rosales-Ruiz, 2018, An introduction to 
PORTL: The Portable Operant Research and Teaching Lab. https://www. 
artandscienceofanimaltraining.org/tools/portl-shaping-game/. Used by 
permission. 


4. Differential reinforcement has three effects: Responses 
similar to those that have been reinforced are more likely 
to occur, responses resembling the unreinforced members 
are emitted less frequently (i.e., undergo extinction), and 
some of these emitted responses under extinction may be 
likely to resemble the next successive approximation to 
the terminal behavior and would therefore be targeted for 
reinforcement. 


5. When differential reinforcement is applied consistently 
within a response class, its triple effect results in a new 
response class composed primarily of responses sharing 
the characteristics of the previously reinforced subclass. 
This emergence of a new response class is called response 
differentiation. 


6. The gradually changing criterion for reinforcement 
during shaping results in a succession of new response 
classes, or successive approximations, each one closer 
in form to the terminal behavior than the response class 
it replaces. 
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7. Behavior can be shaped along measurable dimensions Clicker Training 
of topography (form), rate (number of responses 
per unit of time), latency (time between the onset of 
the antecedent stimulus and the occurrence of the 


14. Clicker training is a science-based system for progressively 
shaping behavior using positive reinforcement and extinction. 


behavior), duration (elapsed time from the onset of 15. The clicker, a handheld device that produces a clicking 
a response to its end point), interresponse time (time sound when a metal tab is pressed, provides the signal that 
between responses), and magnitude (response strength when a behavior is being performed in the presence of the 
or force). click, reinforcement follows. 
Shaping Across and Within Response Topographies Emerging Applications of Shaping 

8. Shaping behavior across different response topographies 16. Using computers to teach shaping, combining shaping pro- 
means that selected members of a response class are dif- cedures with robotics engineering, and applying telemedi- 
ferentially reinforced, whereas members of other response cine technology for humanitarian purposes for populations 
classes are placed on extinction. in hard-to-access settings serve as three emerging applica- 


9. Shaping a behavior within a response topography means Hope Orsay: 


that the form of the behavior remains constant, but dif- 
ferential reinforcement is applied to another measurable 
dimension of the behavior (e.g., magnitude). 17. Before deciding to use shaping, assess the nature of 

the behaviors to be learned, the staff requirements and 
resources available, and whether a different tactic alto- 
gether might be more efficient in terms of time and effort. 


Shaping Guidelines 


10. Shaping is a positive approach to teach new behavior 
and may be combined with other behavior change 


tactics. 
11. Shaping has at least five limitations that practitioners 18. After the decision has been made to use a shaping 
should consider before applying it. procedure, the practitioner performs the following 


steps: select the terminal behavior, decide the criterion for 
success, analyze the response class, identify the first behav- 
ior to reinforce, eliminate interfering or extraneous stimuli, 
proceed gradually, limit the number of approximations 

at each level, continue reinforcement when the termi- 

nal behavior is achieved, and learn to shape behavior. 


12. In shaping, the antecedent stimulus stays the same, while 
the response progressively becomes more differentiated. 
In stimulus fading, the response remains the same, while 
the antecedent stimulus changes gradually. 


Increasing Shaping Efficiency 


13. Shaping may become more efficient by using a Learning to Shape 


discriminative stimulus, physical guidance, imitative 19. To learn to shape behavior, consider shaping a simple 
prompts, and/or a percentile schedule. Any prompt that is behavior, working with a colleague, practicing with a 
introduced should be faded later. group, or using a computer-based simulator. 

KEY TERMS 

clicker training extinction-induced variability shaping 

differential reinforcement response differentiation successive approximation 
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MULTIPLE-CHOICE QUESTIONS 
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1. Shaping involves: 2. Which of the following is an example of shaping an indi- 
a. Reinforcing successive approximations to a terminal vidual to swing a golf club correctly (i.e., take the appro- 
behavior priate backswing, swing the club down to the ball, and 


: : Abe ing)? 
b. Creating response differentiation between responses that Patel tpentg Deiplic Syitiey: 


are closer to the terminal behavior and those that are not a. Stand behind the person and physically help make 

the correct golf swing. Over time, slowly provide less 
physical assistance until the individual makes a correct 
swing independently. 


c. Placing responses that are not a closer approximation of 
the terminal behavior on extinction 


d. All of these 
Hint: (See “Shaping defined’) 


b. Start with verbal prompts to do each step. If this is 
not effective, provide a model. If this is not effective, 


provide some physical assistance. If this is not effec- 


tive, stand behind the person and help the person make 
the correct golf swing. 


Cc 


Provide positive reinforcement for any swing that 
resembles a golf swing. Then, provide reinforcement 
for closer and closer approximations of a correct golf 
swing and withhold reinforcement for poorer approxi- 
mations of a correct golf swing. 


d. None of these is an example of shaping. 
Hint: (See “Shaping defined”’) 


. Response differentiation involves what two components? 


a. Differential reinforcement and changing the criterion 
for reinforcement 


b. Punishment and extinction 

c. Prompting and differential reinforcement 

d. Extinction and changing the criterion for reinforcement 
Hint: (See “Shaping defined’) 

. Ms. Anita Response wants to increase the number of 
words Sophie writes in her stories during creative writing 


time. She uses shaping to do so. What dimension of 
behavior is she shaping? 


a. Amplitude 

b. Frequency 

c. Duration 

d. Latency 

Hint: (See “Shaping defined”) 


. An example of shaping within a response topography is: 


a. Increasing the number of math problems an individual 
needs to complete during math class 


b. Increasing the duration of in-seat behavior during seat- 
work time 


c. Increasing the force of a tennis swing 

d. All of these are examples of shaping within a response 
topography 

Hint: (See “Shaping across and within Response Topographies”) 

. The text gives an example of shaping gum chewing, in 

which lip movement first received reinforcement, then lip 


movements with sound, then saying the word “gum,” and 
finally, saying “Gum please.” This is an example of: 


a. Shaping within a response topography. 

b. Shaping across different response topographies. 

c. Shaping the magnitude of a response. 

d. Shaping the duration of a response. 

Hint: (See “Shaping across and within Response 
Topographies”’) 

. Shaping can be a slow, time-consuming process. What is 
one way to effectively speed up the process? 

a. Add a punishment contingency for incorrect responses. 


10. 
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b. Each time you increase the criteria for reinforcement, 
do so by larger increments. 
c. Model the desired response for the individual. 


d. All of these are effective ways to make shaping more 
efficient. 


Hint: (See “Increasing Shaping Efficiency’’) 


. When is adding a prompt to the shaping process some- 


thing to consider? 


a. When the individual is progressing through the shaping 
procedure quickly. 


b. When the individual has mastered the terminal 
behavior. 


c. When the individual’s repertoire is weak and the 
likelihood of making a successive approximation 
is low. 


d. When you have limited staff and resources to imple- 
ment the shaping procedure. 


Hint: (See “Increasing Shaping Efficiency’) 


. What is meant by the statement that progress in shaping is 


rarely linear and often erratic? 


a. Most individuals are unsuccessful in shaping new 
responses. 


b. The amount of time it may take to shape a behavior is 
often unpredictable. 


c. Sometimes individuals may exceed your criteria for 
reinforcement, making closer approximations than you 
had expected. 


d. Sometimes individuals may get “stuck” on a response 
and have to “back up” to a previously-learned approxi- 
mation before making process. This can slow the process 
down more than expected. 


e. All of these 
f. None of these 
Hint: (See “Shaping Guidelines”’) 


One of the guidelines for effective shaping is to 
carefully define the terminal behavior and the criteria 
for successfully engaging in the terminal behavior. 
Effective ways for deciding what are the criteria for 
success: 


a. Read the research literature to see if there are published 
norms and guidelines available for the behavior you are 
trying to shape. 

b. Observe competent peers to see how well they perform 
the task, and set your criteria according to their perfor- 
mance level. 


c. Both of these are ways to determine the criteria for 
success. 


d. Neither of these are appropriate ways to determine 


criteria for success. 


Hint: (See “Shaping Guidelines”) 
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ESSAY-TYPE QUESTIONS 


I 
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Write the definition of shaping and provide an example of 
shaping that has not been discussed in class or presented 
in the textbook. When giving your example, analyze the 
response class and diagram the shaping process. 


Hint: (See “Shaping Defined”’) 

What are the behavioral processes involved in the shaping 
process? 

Hint: (See “Shaping Defined’) 

What is the key difference between shaping across 
response topographies and shaping within a response 
topography? 

Hint: (See “Shaping across and within Different Response 
Topographies’’) 


When applied to discrimination training, differential reinforcement requires 
(1) reinforcing responses in the presence of one stimulus condition (the S?) 
and (2) placing responses in the absence of the SP condition on extinc- 
tion. Refer to Chapters 17 and 24 for this discussion. 

Spirometers are not calibrated to show minute, incremental changes. They 
register units in larger whole numbers (500 ml, 1000 ml, etc.). The point 
remains that the therapist is actively (a) watching the spirometer to detect 
a change in breathing, (b) judging whether that change is a successive 
approximation higher than a previous attempt, and (c) reinforcing it if it is. 
Simultaneously, attempts that do not approximate the terminal behavior are 
placed on extinction. 


4. 
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Shaping a new response can take a long time. What is one 
way you can increase the efficiency of shaping and speed 
up the process? 


Hint: (See “Increasing Shaping Efficiency”) 


What key steps are to be taken to implement the shaping 
process? 


Hint: (See “Shaping Guidelines”’) 


Procedures for chaining a sequence of responses to a terminal outcome are 
described in Chapter 23. 

The researcher provided a safety net so that the rat was not injured. 

They also developed simulations for reinforcement schedules. 

An American-size football field, end-zone-to-end-zone and sideline-to- 
sideline, is approximately 1.32 acres. 

Accessing https://youtu.be/YSu7ghj4ib0 shows a video clip of PORTL 
being conducted. 

Some variations have the entire class serve as the shaper by having them 
clap collectively when an approximation occurs. 
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Chaining 


LEARNING OBJECTIVES 


Define and give examples of behavior chains. 
Identify why behavior chains are important. 


Conduct a task analysis and select appropriate assessment methods for evaluating performance on task analysis. 
Identify and define different chaining procedures and determine when each would be selected for use. 


Describe how behavior chains can be interrupted. 


At age 21, I was a newly employed counselor at a summer 
camp designed to provide educational, social, and recre- 
ational experiences to children with cerebral palsy. One of 
the campers—let’s call him “Joe”—wore flapjack shoes 
that featured a mechanical upper tongue that clamped down 
to secure the shoes to his feet. I soon learned that Joe wore 
the flapjacks not as a fashion statement, but because he 
hadn’t learned to tie his own shoelaces. A senior counselor 
challenged me to teach Joe to tie his shoes before the camp 
ended in approximately two weeks. I happily accepted the 
challenge, as an authentic Philadelphia cheesesteak sand- 
wich hung in the balance. 

I launched this project with my “go to” procedure— 
backward chaining. I used a practice jig that consisted 
of canvas cloth with holes at each border wrapped 
around a wooden board. I tied the laces completely, but 
loosely, and then held the jig out to Joe and said, “Tie 
your shoe.” Joe snugged the laces tight, the last behav- 
ior in the chain. I reset the jig to my initial position, and 
repeated my direction to Joe, “Tie your shoe.” Once Joe 
mastered the snugging task, the second behavior to teach 
was the next-to-the-last task in the shoe-tying sequence: 
drawing the looped laces down to the wooden template 
(before snugging them tight as before). And so the process 
continued until the first task in the sequence (i.e., crossing 
the laces) was completed. Each time Joe completed the 
chain, from performing the final step only in his very first 
trial to completing each step in sequence from the begin- 
ning, he contacted the natural reinforcer (a self-tied shoe) 
and social praise (“Good job, Jos”). 

Practice occurred for about 10 to 15 minutes per day, 
three days per week. In the remaining days of the camp, Joe 
practiced the entire chain a few times with an Oxford-style 
shoe. However, on the last day of the camp, and sporting a 
new pair of sneakers, the senior counselor asked Joe to tie his 
shoe. Joe responded with the precision of a Swiss watchmaker. 
I reveled in Joe’s success and enjoyed a delightful lunch to 


boot! Backward chaining chalked up another victory. 
—Tim Heron 


sk chapter defines a behavior chain, provides a 
rationale for establishing behavior chains in applied set- 
tings, and discusses the importance of task analysis in 
behavior chain training. The chapter presents a procedures for 
constructing and validating task analyses, along with procedures 
for assessing individual mastery levels. Next, four behavior 
change methods—forward chaining, total-task chaining, back- 
ward chaining, and backward chaining with leap aheads—are 
addressed, followed by guidelines for deciding which behavior 
chain procedure to use in applied settings. Techniques for inter- 
rupting and breaking inappropriate chains are also provided. The 
chapter concludes with an examination of factors affecting the 
performance of a behavior chain. 


BEHAVIOR CHAIN DEFINED 


A behavior chain is a linked sequence of responses leading to 
a terminal outcome. Each response produces a stimulus change 
that functions as conditioned reinforcement for that response 
and as a discriminative stimulus (SP) for the next response in the 
chain. Reinforcement for the last response in a chain maintains 
the reinforcing effectiveness of the stimulus changes produced 
by all previous responses in the chain. 

Reynolds (1975) provided this laboratory example of a 
behavior chain (illustrated in Figure 23.1): 


Assume that a pigeon is presented with a blue light key 

(S;). When the pigeon pecks the key (Rj), the blue light 
terminates, and a red light is activated. In the presence of the 
red light key (S3), the pigeon presses a pedal (R2) that turns 
the key to a yellow light. The yellow light (S3) serves as 

an S? to displace a bar (R3), which terminates the yellow 
light, changing it to a green light (Sq). Finally, during the 
green light (S4), pecks terminate the green light, and a grain- 
delivery mechanism is activated that dispenses an edible 
reinforcer. . .. Because each stimulus has a dual function, as 
discriminative stimulus and conditioner reinforcer, the links 
overlap. In fact, it is this dual function of the stimuli that 
holds the chain together. (pp. 59-60) 
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= 
S; Blue Light S) Red Light 
R, Key Peck R, Pedal Press 
(terminates blue light, (terminates red light, 
activates red light) activates yellow light) 


S3 Yellow Light 


R3 Bar Press 
(terminates yellow light, 
activates green light) 


S, Green Light 


Se eh 


Ry Key Peck 
. (food) 


(terminates green light, 
activates food magazine) 


S = Stimulus 
R = Response 
SR = Reinforcement 


Figure 23.1 Illustration of a behavior chain consisting of four components. 


Based on a chain described by Reynolds (1975, pp. 59-60). 


As Figure 23.1 shows, this chain includes four responses 
(Rj, Ro, R3, and Ry), with a specific stimulus condition 
(Sj, So, S3, and S4) associated with each response. The blue light 
(S,) is an S? to occasion the first response (Rj), a key peck that 
terminates the blue light and produces the onset of the red light 
(S>). The red light serves as conditioned reinforcement for R, 
and as an SP for Ro (a pedal press). The pedal press (R>) termi- 
nates the red light and produces the onset of the yellow light 
(S3), and so on. The last response produces food, thus complet- 
ing and maintaining the chain. 

Figure 23.2 shows the links of a behavior chain using 
a classroom example. In preparing a class of preschool stu- 
dents for recess, the teacher might say to one student, “Please 
put on your coat.” The teacher’s statement would serve as the 
Se (S;) evoking the first response (Rj) in the chain, securing 
the coat in hand from the closet. That response in the presence 
of the teacher’s statement terminates the teacher’s statement 
and produces the onset of the coat in the student’s hands (S,). 
The coat in the student’s hands serves as conditioned rein- 
forcement for obtaining the coat from the closet (R;) and as 
an SP (S2) for putting one arm through a sleeve (Rj). That 
response, putting one arm through the sleeve in the presence 
of the coat in both hands, terminates the stimulus condition 
of the coat in the student’s hands and produces the onset of 
one arm in a sleeve and one arm out (S3), a stimulus change 
that serves as conditioned reinforcement for putting one arm 
through a sleeve and as an SP (S3) for placing the second arm 
through the other sleeve (R3). That response terminates the 
condition of one arm in a sleeve and one arm out, and pro- 
duces the SP (S4) of the coat being completely on. The coat 


Discriminative Stimulus Response 
S; “Put on your coat.” 

S> Coat in hands 

S3 One arm in sleeve/one arm out 


S4 Coat on 


Figure 23.2 Discriminative stimuli, responses, and conditioned reinforcers in a four-response chain. 


R, Obtain coat from closet 

Rz Place one arm in sleeve 

R3 Place second arm in sleeve 
Rg Zip up the coat 


being fully on serves as a conditioned reinforcer for putting 
the second arm through the other sleeve and as an SP for zip- 
ping it (Ry). Finally, zipping up the coat completes the chain 
and produces teacher praise, and the natural consequence of 
wearing a fully zipped coat. 

A behavior chain has three important characteristics: 
(a) It involves the performance of a specific series of discrete 
responses; (b) the performance of each behavior in the sequence 
produces a stimulus change in the environment in such a way that 
it yields conditioned reinforcement for the preceding response, 
and serves as an S? for the next response; and (c) the responses 
within the chain must be performed in a specific sequence, and 
in close temporal succession. 


Behavior Chains with a Limited Hold 


A behavior chain with a limited hold is a chain that must 
be completed within a specified time to produce reinforce- 
ment. Behavior chains with limited holds are characterized by 
performance that is accurate and proficient. An assembly task 
on a production line exemplifies a chain with a limited hold. 
Suppose that to meet the production requirements of the job, 
an employee has to assemble 30 couplers onto 30 shafts within 
30 minutes (one per minute). If the coupler is placed on the 
shaft, the retaining clip applied, and the unit forwarded to the 
next person in line within the prescribed period, reinforcement 
is delivered. In a behavior chain with a limited hold, the person 
must not only have the prerequisite behaviors in his repertoire, 
but also must emit those behaviors in close temporal succession 
to obtain reinforcement. 


Conditioned Reinforcement 


Coat in hands 


One arm in sleeve 


Coat on 


Teacher praise 


When working with a client who completes a chain in the 
correct sequence, but fails to meet the limited hold criterion, 
the behavior analyst should measure the latency and duration 
of each component response. If this assessment reveals that the 
client is taking too much time to complete one or more compo- 
nent responses, the behavior analyst can implement contingen- 
cies focusing on the proficiency of performance. Setting a time 
criterion for the onset and completion of each response within 


the chain is one way to do this; another is to set a time criterion 
for completion of the total chain. 


RATIONALE FOR CHAINING 


Whereas behavior chain denotes a particular sequence of stim- 
uli and responses ending in reinforcement, chaining refers 
to various methods for linking specific sequences of stimuli 
and responses to form new performances. In forward chain- 
ing, behaviors are linked beginning with the first behavior in 
the sequence. In backward chaining, the behaviors are linked 
beginning with the /ast behavior in the sequence (Murphy & 
Lupfer, 2011). Both of these procedures and their variations are 
discussed in detail later in this chapter. 

New behavior chains are taught to people for several rea- 
sons. First, an important educational aspect of programs for 
students with development disabilities—and, for that matter, 
novice learners—is to increase independent-living skills (e.g., 
using public utilities, taking care of personal needs, executing 
travel skills, socializing appropriately). As these skills develop, 
the learner functions more effectively in least restrictive envi- 
ronments or participates in activities without adult supervision. 
Complex behaviors that are developed with chaining procedures 
allow individuals to function more independently. 

Second, chaining provides the means by which a series 
of discrete behaviors can be combined to form a sequence of 
responses that occasion the delivery of reinforcement. That is, 
chaining can be used to add behaviors to an existing behavioral 
repertoire. For example, assume that a person with a develop- 
mental disability consistently seeks assistance from a coworker 
or job coach while completing an assembly task. A chaining 
procedure could be used to increase the number of tasks in 
the assembly that must be performed before reinforcement is 
delivered. To do so, the teacher could give the individual a 
written or pictorial list of the parts that must be assembled to 
complete the job. When the first task is finished, the worker 
crosses off the first word or picture on the list and then pro- 
ceeds to the next task. 

In behavioral terms, the first word or picture on the list 
serves as the SP to occasion the response of completing the first 
task. That response, in the presence of the word or picture on the 
list, terminates the initial stimulus and produces the next stimu- 
lus, the second word or picture on the list. The completion of the 
second task serves as conditioned reinforcement for completing 
the task and produces the onset of the third SP. In this way, the 
chaining procedure enables simple behaviors to be combined 
into a longer series of complex responses. 

Finally, chaining can be combined with other behavior 
change tactics to build more intricate and adaptive repertoires. 
McWilliams, Nietupski, and Hamre-Nietupski (1990) taught three 
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students with moderate intellectual disabilities a 24-step chain 
of bedmaking skills with an intervention that combined chain- 
ing with modeling and error correction. Lambert and colleagues 
(2016) used a forward chaining procedure that incorporated 
prompting, instructions, and modeling to teach three complex 
chains of basketball skills to a middle school student with autism. 


ESTABLISHING BEHAVIOR CHAINS 
WITH TASK ANALYSIS 


Before individual behaviors can be linked in a chain, the behavior 
analyst must construct and validate a task analysis of the compo- 
nents of the behavioral sequence, and assess the learner’s mastery 
level of each behavior in the task analysis. Task analysis involves 
breaking a complex task into smaller, teachable units, the product 
of which is a series of sequentially ordered steps or tasks. 


Constructing and Validating a Task Analysis 


The purpose of constructing and validating a task analysis is 
to determine the sequence of behaviors that are necessary and 
sufficient to complete a given task efficiently. The sequence 
of behaviors that one person performs may not be identical to 
what another person needs to achieve the same outcome. A task 
analysis should be individualized according to the age, skill 
level, and prior experience of the learner (Brown, McDonnell, 
& Snell, 2016). Further, a few task analyses comprise a limited 
number of major steps, with each major step containing four to 
five subtasks. Figure 23.3 illustrates a task analysis for making 
a bed, developed by McWilliams and colleagues (1990). 

Researchers and practitioners typically construct and val- 
idate task analyses using one method or a combination of four 
methods: observe a competent performer, execute the task your- 
self, ask an expert, or trial and error. 


Observe a Competent Performer 


Test, Spooner, Keul, and Grossi (1990) produced a task analysis 
for using a public telephone by observing two adults perform 
the task, and then validating that sequence by training a person 
with developmental disabilities to use the task analysis. Based 
on the training, the researchers made subsequent modifications 
to the original sequence of tasks. 


Execute the Task Yourself 


A study by van den Pol and colleagues (1981) serves as a good 
example of how to determine and validate a task analysis yourself. 
To identify the behaviors required to eat in fast-food restaurants, 
their research team ate in a variety of restaurants and self-recorded 
their activities. The researchers next combined their individual 
findings into a 22-item task analysis that covered differences 
among restaurants and then organized the steps into four skills 
clusters: locating, ordering, paying and eating, and exiting (see 
Figure 30.1). The advantage of self-performing the task is three- 
fold: (1) It provides the opportunity to come into contact with all 
task demands of the sequence, (2) it generates a clearer idea of the 
behaviors to be taught and their associated SPs, and (3) it refines 
the description of the response topography necessary for the learner 
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Figure 23.3 Task analysis of bedmaking skills. 


Given an unmade bed that included a messed-up bedspread, blanket, pillow, 
and flat sheet and a fitted sheet, the student will: 


Section 1: bed preparation 
. remove pillow from bed 
. pull bedspread to foot of bed 
. pull blanket to foot of bed 
. pull flat sheet to foot of bed 
. smooth wrinkles of fitted sheet 


Section 2: flat sheet 


. pull top of flat sheet to headboard 
. Straighten right corner of flat sheet at foot of bed 


. repeat #7 for the left corner 


10. smooth wrinkles 


Section 3: blanket 
illic 


pull top of blanket to headboard 


9. make sheet even across top end of mattress 


12. straighten right corner of blanket at foot of bed 


13. repeat #12 for the left corner 


14. make blanket even with sheet across top end of mattress 


15. smooth wrinkles 


Section 4: bedspread 


16. pull top bedspread to headboard 


17. pull right corner of bedspread at foot to floor 


18. repeat #17 for left side 


19. make bedspread even with floor on both sides 


20. smooth wrinkles 


Section 5: pillow 


21. fold top of spread to within 4 inches of the pillow width 


22. place pillow on folded portion 


23. cover pillow with the folded portion 
24. smooth bedspread wrinkles on and around the pillow 


From “Teaching Complex Activities to Students with Moderate Handicaps Through the 
Forward Chaining of Shorter Total Cycle Response Sequences,” by R. McWilliams, 

J. Nietupski, & S. Hamre-Nietupski, 1990, Education and Training in Mental Retardation, 
25, p. 296. Copyright 1990 by the Council for Exceptional Children. Reprinted by 
permission. 


to use the sequence most efficiently. Figure 23.4 shows how an 
initial 7-step sequence for tying a shoe was expanded to 14 steps 
after self-performing the behavior (cf. Bailey & Wolery, 1992). 


Expert Input 


Persons or associations with recognized and demonstrated 
expertise can be a valuable resource to consult when validating 
a task analysis. For example, Stokes, Luiselli, and Reed (2010) 
based a 10-step task analysis for football tackling skills on 
recommendations by the American Football Coaches Association 
(1995). Stokes, Luiselli, Reed, and Fleming (2010) asked five 
college offensive line coaches to validate a 10-step task analysis 
for pass-blocking by offensive linemen (see Figure 23.5). 


Trial and Error 


A systematic trial-and-error procedure can assist the behavior 
analyst in developing a task analysis. Using a systematic tri- 
al-and-error method, an initial task analysis is generated and 


then refined and revised as it is being tested. With revisions 
and refinements obtained through field tests, a more functional 
and appropriate task analysis can be achieved. For instance, as 
mentioned earlier, Test and colleagues (1990) generated their 
initial task analysis for using a public telephone by watching two 
adults complete the task. They subsequently carried the process 
a step further by asking a person with developmental disabilities 
to perform the same task and then modified the task analysis. 
Regardless of the method used to sequence the steps, the 
SPs and corresponding responses must be identified. However, 
being able to perform a response is not sufficient; the individual 
must be able to discriminate the conditions under which a given 
response should be performed. Listing the discriminative stimuli 
and associated responses helps the trainer determine whether nat- 
urally occurring S?s will evoke different or multiple responses. 


Assessing Mastery Level 


Mastery level is assessed to determine which components 
of the task analysis the learner can perform independently. 
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Figure 23.4 Initial and expanded steps for teaching shoe tying. 


ee eee ECU UIFFIUIS IVI) AU soe + LE Saitou G pins toot On anthen D Srecnt tae a 


a 
Shorter sequence Longer sequence? 


. Partially tighten shoe laces. 8. Pinch lace. 
. Pull shoe laces tight—vertical pull. 9. Pull lace. 


. Cross shoe laces. 10. Hang lace ends from corresponding sides of shoe. 


1 
2 
3 
4. Tighten laces—horizontal pull. 11. Pick up laces in corresponding hands. 
5. Tie laces into a knot. 12. Lift laces above shoe. 
6. Make a bow. 13. Cross right lace over the left to form a tepee. 
7. Tighten bow. 14. Bring left lace towards student. 

15. Pull left lace through tepee. 

16. Pull laces away from each other. 

17. Bend left lace to form a loop. 

18. Pinch loop with left hand. 

19. Bring right lace over the fingers—around loop. 

20. Push right lace through hole. 

21. Pull loops away from each other. 


Sources: (a) Santa Cruz County Office of Education, Behavioral Characteristics Progression. Palo Alto, California, VORT 
Corporation, 1973. (b) Smith, D. D., Smith, J. O., & Edgar, E. “Research and Application of Instructional Materials 
Development.” In N. G. Haring & L. Brown (Eds.), Teaching the Severely Handicapped (Vol. 1). New York: Grune & 
Stratton, 1976. From Teaching Infants and Preschoolers with Handicaps, p. 47, by D. B. Bailey & M. Wolery, 1984, 
Columbus, OH: Charles E. Merrill. Used by permission. 


Figure 23.5 Task analysis for pass-blocking by offensive linemen. 


 ———————————— 


Step Description 


1 Stance Feet shoulder-width apart, 50% bend at the knees, buttocks down over heels, center-side 
hand touching the ground 


2 Split Sideline foot back slightly with toe to heel of center-side foot 

3 First step Make play-side bucket step while maintaining balance 

4 Helmet contact Helmet facemask makes contact with opponent at the top of jersey number 

5 Hand placement Pointer fingers at 11:00 o'clock and 1:00 o'clock with thumbs pointed towards opponent's 
Adam's apple 

6 Hand position Hands maintained inside opponent’s shoulder pads and torso frame 

y Arm extension Arms extended outward beyond 45 degrees 

8 Hip follow through Hips rolled towards opponent's belly button 

g Leg drive Maintain continuous running leg pump until whistle sounds 

0 Hand contact Hands in contact with opponent's body until whistle sounds 


From “Behavioral Coaching to Improve Offensive Line Pass-Blocking Skills of High School Football Athletes,” by J. V. Stokes, J. K. Luiselli, DAD Reed, 
and R. K. Fleming, 2010, Journal of Applied Behavior Analysis, 43, p. 465. Copyright 2010 by the Society for the Experimental Analysis of Behavior, 


Inc. Used by permission. 


There are two principal ways to assess a person’s mastery sequence. Lambert and colleagues (2016) used a single-opportu- 


level of task analysis behaviors prior to training—the single- nity method to assess target chains related to teaching functional 
opportunity method and the multiple-opportunity method. basketball skills to an adolescent with autism. If a behavior in 

the chain was emitted correctly, confederate players on the court 
Single-Opportunity Method repositioned themselves so as to serve as the SP for the next 


The single-opportunity method is designed to assess alearner’s _ behavior. If that behavior was not performed, the assessment 
ability to perform each behavior in the task analysis in correct probe terminated. 
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Figure 23.6 is an example of a form used to record a 
learner’s ability to insert a hearing aid. Specifically, a plus sign (+) 
or minus sign (—) is marked for each behavior that is correctly 
or incorrectly emitted. 

Assessment in this example began when the teacher 
said, “Tom, put on your hearing aid.” Tom’s responses to the 
steps in the task analysis were then recorded. The figure shows 
Tom’s data for the first 4 days of assessment. On Day 1, Tom 
opened the hearing aid container and removed the harness; 
each of these steps was performed correctly, independently, 
sequentially, and within a 6-sec time limit. However, Tom then 
attempted to put the hearing aid harness over his head (Step 
5) without first doing Steps 3 and 4. Because he continued to 
perform this behavior for more than 10 seconds, the teacher 
stopped the assessment and scored Steps 3 and 4 and all 
remaining steps as incorrect. On Day 2, Tom was stopped after 
Step | because he performed a subsequent step out of order. 
On Days 3 and 4, the assessment was discontinued after Step 
4 because Tom took more than 6 seconds to perform Step 5. 


Given a criterion for mastery of 100% accuracy within 6 
seconds over three consecutive probes, the data indicate that 
Tom only met the criterion for Step 1 (the three +’s for Step 2 
were not recorded consecutively). 


Multiple-Opportunity Method 


The multiple-opportunity method of task analysis assessment eval- 
uates the learner’s level of mastery across all the behaviors in the 
task analysis. If a step is performed incorrectly or out of sequence, 
or if the time limit for completing the step is exceeded, the behavior 
analyst completes that step for the learner and then positions her for 
the next step. Each step performed correctly is scored as a correct 
response, even if the learner erred on the previous steps. 

Figure 23.7 shows an example of the multiple-opportunity 
method for assessing a student’s (Marc’s) morning-arrival rou- 
tine across 17 tasks and 10 days. Brown et al. (2016) validated 
their task analysis by (a) watching other students complete the 
morning-arrival routine, (b) conferring with Marc’s teacher to 


Figure 23.6 Task analysis data sheet for single-opportunity assessment of 


inserting a hearing aid. 


Teacher: Christine 
Assessment method: Single opportunity 
Student: Tom 


Step Behavior 


1. Open container 
. Remove harness 


. Arm 1/Strap 1 

. Arm 1/Strap 2 

. Harness over head 

. Fasten harness 

. Unsnap pocket 

. Remove aid from container 
. Insert aid into pocket 


ee) Wey foo) Soy al SS AGU ISS) 


a 


. Snap pocket 


— 
—. 


. pick up earmold 


—s 
N 


. Insert earmold into ear 


— 
1S) 


. Turn on aid 
14. 


Percentage of Steps Correct 


set the control 


Response latency: 6 seconds 
Recording key: + (correct) — (incorrect) 


Materials: Hearing aid container, harness, earmold 


Task Analysis Assessment of Inserting a Hearing Aid 
Instructional cue: “Put on your hearing aid” 


10/1 10/2 10/3 10/4 
+ + + + 
- - - + 
- - - - 
- - + - 


14% 


Criterion: 100% correct performance for 3 consecutive days 


From “Teaching Severely Multinandicapped Students to Put on Their Own Hearing Aids,” by 
D. J. Tucker and G. W. Berry, 1980, Journal of Applied Behavior Analysis, 13, p. 69. Copyright 
1980 by the Society for the Experimental Analysis of Behavior, Inc. 
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gain her perspective on the tasks, and then (c) “piloting” the 
task analysis with Marc to ensure that it was individualized. 
After accomplishing these three validation procedures, it was 
noted that a couple of tasks were still too difficult for Marc, 
so each of those steps was further subdivided into two steps. 
Brown et al. recommend that practitioners developing a task 
analysis use a systematic process to do so (see Box 23.1). 
The key to using the multiple-opportunity method for a task 
analysis assessment is to ensure that teaching is not commingled 
with assessment. That is, if the learner is unable to perform a task, 
the behavior analyst would complete that step, position the learner 
for the next step, and continue with remaining steps. 
Single-opportunity and multiple-opportunity methods 
can be effective ways to determine mastery of initial skills in a 
behavior chain. Of the two, the single-opportunity method is the 
more conservative measure because the assessment terminates at 
the first step at which performance breaks down. It also provides 
less information to the teacher once instruction is initiated; but 
takes less time to conduct, especially if the task analysis is long, 
and it reduces the likelihood of learning taking place during 
assessment (Brown et al., 2016). The multiple-opportunity 
method takes more time to complete, but it provides the behavior 


analyst with more information. That is, the teacher could learn 
which steps in the task analysis the learner has already mastered, 
thus eliminating the need to instruct steps already in the learner’s 
repertoire. In the next section, we address the third component 
of chaining: teaching the individual to perform each step of the 
chain in close temporal succession. 


BEHAVIOR CHAINING METHODS 


After the task analysis has been constructed and validated and the 
criterion for success and the data collection procedures have been 
determined, the next step is to decide which chaining procedure 
to use in teaching the new sequence of behaviors. The behavior 
analyst has four options: forward chaining, total-task chaining, 
backward chaining, and backward chaining with leap aheads. 


Forward Chaining 


In forward chaining, the behaviors identified in the task anal- 
ysis are taught in their naturally occurring order. Specifically, 
reinforcement is delivered when the predetermined criterion for 
the performance of the first behavior in the sequence is achieved. 


How the Task Analysis for Marc's Morning Routine Was Developed 


Analyzing a chain of events and breaking these tasks down into 
teachable steps is not a trivial process. Consider how Marc’s 
teachers developed his task analysis and the data collection 
form for his morning routine (Figure 23.7). Ms. Wharton, the 
special education teacher, and the school’s autism specialist 
watched other children perform the morning routine tasks 
and then analyzed the steps. They identified the relevant 
stimuli Mare needed to attend to and the responses they 
wanted Marc to learn. Ms. Wharton also asked Marc’s 
kindergarten teacher, Ms. Kwan, to double-check the prelim- 
inary task analysis steps by using them when observing other 
students arrive in the morning. Ms. Wharton then piloted 
the task analysis with Marc to be sure the steps made sense 
for him. When she found that some of the steps were too 
hard for Marc to do alone (e.g., Step 2: Open the door of the 
building), she modified those steps by either breaking that 
step into two smaller steps or adding adult/peer assistance. 
Marc’s mother, who was familiar with the arrival routine, 
then reviewed the task analysis and added her own ideas. The 
speech-language pathologist suggested adding a greeting. 
The 17-step task analysis for Marc’s arrival routine reflects 
the cumulative input of the teachers, the speech therapist, 
and Marc’s mother, thus significantly enhancing the valida- 
tion process. 

Teams will develop more successful task analyses if they 
follow a systematic process: 


1. Select a needed skill by using ecological inventory 
results to identify a functional and age-appropriate skill 
that is an important target for a particular student. 


2. Define the target skill simply, including a description 
of the settings and materials most suited to the natural 
performance of the task. 

3. Perform the task and observe peers performing the task, 
using the chosen materials in the natural settings while 
noting the steps involved. 

4. Adapt the steps to suit the student’s disabilities and skill 
strengths; employ as needed the principle of partial par- 
ticipation, the only-as-specialized-as-necessary rule, and 
component analysis to design a task analysis that is both 
age appropriate and functional. 

5. Validate the task analysis by having the student perform 
the task, but provide assistance on steps that are unknown 
so that performance of all of the steps can be viewed. 

6. Revise the task analysis so that it works; explore adding 
simple, non-stigmatizing adaptations to steps that appear 
to be unreasonable in an unadapted form. 

7. Write the task analysis on a data collection form so that 
steps (a) are stated in terms of observable behavior: 
(b) result in a visible change in the product or process; 
(c) are ordered in a logical sequence; (d) are written in 
second-person singular so that they could serve as ver- 
bal prompts (if used); and (e) use language that is not 
confusing to the student, with the performance details 
that are essential to assessing enclosed in parentheses. 


SE ee ae ed 
Adapted from Instruction of Students with Severe Disabilities (8th ed.), pp. 158— 
159, by F. E. Brown, J. J. McDonnell, and M. E. Snell, 2016. Copyright 2016 by 
Pearson Education, Inc., Upper Saddle River, NJ. Used by permission. 


Thereafter, reinforcement is delivered for criterion completion 
of Steps | and 2. Each succeeding step requires the cumulative 
performance of all previous steps in the correct order. 

For example, when the first step, “Pinch lace.” is performed 
accurately 3 consecutive times, reinforcement is delivered (refer to 
Figure 23.4). Next, reinforcement would be delivered when that 
step and the next one, ‘Pull lace,” are performed to the same cri- 
terion. Then, “Hang lace ends from corresponding sides of shoe” 
would be added, and all three steps would have to be performed 
correctly before reinforcement is delivered. Ultimately, all 14 steps 
in the task analysis should be performed in a similar manner. How- 
ever, at any given training step a variety of response prompting and 
other strategies may be used to occasion the response. 

Longer chains of behaviors can be broken down into smaller 
chains, or skill clusters, each of which can be taught in a manner 
similar to that used with a single-response unit. When one skill 
cluster is mastered, it is linked to the next. The final response in the 
first skill cluster sets the occasion for the first response in the second 
skill cluster. Essentially, in this variation skill clusters become the 
analogue for units of behaviors, and these clusters are linked. 

McWilliams and colleagues (1990) combined skill clusters 
in forward chaining to teach bedmaking skills to three students 
with developmental disabilities. Based on a task analysis of bed- 
making responses, five skill clusters were identified, each con- 
taining four to five subtasks (see Figure 23.3). Once baseline was 
collected showing minimal accuracy with completing the entire 
chain in the presence of the cue, “Show me how to make the bed,” 
an instructional procedure was chosen that involved teaching the 
clusters one at a time by breaking the complex behaviors into 
smaller chains (see Figure 23.8). 

Initial training began with the teacher demonstrating the 
chain of tasks for Sections 1 and 2. Subsequent training involved 
teaching the prior section, the current section targeted for 
instruction, and the next section in the sequence. For example, 
if training was to be conducted on Section 2, Sections 1, 2, and 
3 were demonstrated. Next, the students practiced the targeted 
sequences 2 to 5 times. When the sequence was performed cor- 
rectly, praise was delivered. When an error occurred, a three-part 
correction procedure involving verbal redirection, redirection 
plus modeling, and/or physical guidance was initiated until the 
trial ended on a correct response. When the first skill cluster 
was mastered (S,), the second skill cluster (S) was introduced, 
followed by the third (S3), the fourth (S4), and so on. 

The results indicated that the forward chaining procedure 
was effective in teaching the students bedmaking skills. That iS, 
all the students were able to make their beds independently, or 
with only minimal assistance, when the teacher directed them 
to do so. Furthermore, all of them were able to make their beds 
in the generalization setting (i.e., the home). 

McWilliams and colleagues (1990) and Guercio and 
Cormier (2015) illustrate three main advantages of forward 
chaining: (a) It can be used to link smaller chains into larger 
ones; (b) it is easy, so teacher implementation in the classroom 
is enhanced; and (c) it can be combined with other behavior 
change procedures (e.g., fading). 

Despite its advantages, forward chaining has potential 
limitations. For example, it is possible that the analyst, after 
noting the learner has performed the targeted training steps, may 
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Figure 23.8 Number of correctly performed task steps across 
baseline, intervention, and generalization probe trials. 


From “Teaching Complex Activities to Students with Moderate Hand- 
icaps Through the Forward Chaining of Shorter Total Cycle Response 
Sequences,” by R. McWilliams, J. Nietupski, & $. Hamre-Nietupski, 1990, 
Education and Training in Mental Retardation, 25, p. 296. Copyright 
1990 by the Council for Exceptional Children. Reprinted by permission. 


eliminate opportunities for additional practice of these steps, 
thus risking extinction of the chain due to lack of reinforcement 
or the weakening of an SP. Conversely, the analyst may retain 
the learner in forward chaining intervention longer than nec- 
essary, thereby hampering progress on additional skills to be 
learned. To mitigate either or both of these limitations, Lambert 
et al. (2016) suggest that analysts probe the entire chain period- 
ically to assess mastery and retention. 

Our discussion thus far has focused on teaching behavior 
chains through direct teacher instruction. However, evidence 
suggests that chained responses can also be learned through 
observation (Wolery, Ault, Gast, Doyle, & Griffen, 1991). 
Griffen, Wolery, and Schuster (1992) used a constant time delay 
(CTD) procedure to teach students with intellectual disabilities 
4 series of chained tasks in food preparation behaviors. During 
CTD with one student to whom the chained response was taught, 
two other students watched. Results showed that the two observ- 
ing students learned at least 85% of the correct steps in the chain 
even though direct instruction on those steps had not occurred. 


Total-Task Chaining 


Total-task chaining (sometimes called total-task presentation or 
whole-task presentation) is a variation of forward chaining in which 
the learner receives training on each step in the task analysis during 
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every session. The trainer provides assistance for any steps the 
learner is unable to perform independently, and the chain is trained 
until the learner is able to perform all the behaviors in the sequence 
to the predetermined criterion. Depending on the complexity of 
the chain, the learner’s repertoire, and available resources, physical 
assistance and/or graduated guidance may be incorporated. 

Werts, Caldwell, and Wolery (1996) used peer models and 
total-task chaining to teach skills such as operating an audiotape, 
sharpening a pencil, and using a calculator to three elementary- 
level students with disabilities who were enrolled in general 
education classrooms. Response chains were individualized for 
each student based on teacher-recommended sequences that the 
students had to learn. Each session was divided into three parts: 
(a) The students with disabilities were probed on their ability 
to perform the total-task response chain; (b) a competent peer 
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model demonstrated the chain in its entirety while simulta- 
neously describing each step; and (c) the student partner was 
probed again to determine performance of the chain. During the 
probes before and after peer modeling, the students with disabili- 
ties were directed to complete the chain. If a student was success- 
ful, a correct response was noted, but no feedback was delivered. 
If a student was unsuccessful, the student’s view was blocked 
temporarily while the teacher completed that step in the chain, 
and then the student was redirected to complete the remaining 
steps. Each behavior in the response chain was scored. 

All three students learned to complete the response chain 
after peer modeling, and reached criterion on the response 
chain (100% correct for 2 out of 3 days) over the course of the 
study. The results for Charlie, one of the three students in the study, 
are shown in Figure 23.9. 
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Figure 23.9 Percentage of steps performed correctly by Charlie for the three response chains. Triangles represent 
percentage of steps correct in initial probes; open circles represent percentage of steps correct in imitation probes. 


From “Peer Modeling of Response Chains: Observational Learning by Students with Disabilities,” by M. G. Werts, N. K. Caldwell. and 
M. Wolery, 1996, Journal of Applied Behavior Analysis, 29, p. 60. Copyright 1996 by the Society for the Experimental Analysis of Behavior, Inc. 


Reprinted by permission. 


Test and colleagues (1990) used total-task chaining with a 
least-to-most prompting procedure to teach two adolescents with 
severe intellectual disabilities to use a public telephone.! After 
identifying and validating a 17-step task analysis, baseline data 
were obtained by giving the students a 3 X 5 index card with 
their home phone number and two dimes, and directing them 
to “phone home.” During training, a least-to-most prompting 
procedure consisting of three levels of prompts (verbal, verbal 
plus gesture, and verbal plus guidance) was implemented when 
errors occurred on any of the 17 steps that made up the task anal- 
ysis. Each instructional session consisted of two training trials 
followed by a probe to measure the number of independent steps 
completed. In addition, generalization probes were conducted in 
two other settings at least once per week. 

Results showed that the combination of total-task chaining 
plus prompts increased the number of steps in the task analy- 
sis performed correctly by each student and that the skills gen- 
eralized to two community-based settings (see Figure 23.10). 
Test and colleagues (1990) concluded that total-task chaining, 
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Figure 23.10 Number of correctly performed steps in 

the public-telephone task analysis by two students during 
baseline and total-task chaining (TT) plus response prompts. 
Data points for settings 2 and 3 show performance on 
generalization probes. 

From “Teaching Adolescents with Severe Disabilities to Use the Public 
Telephone,” by D. W. Test, F. Spooner, P. K. Keul, and T. A. Grossi, 1 990, 
Behavior Modification, 14, p. 165. Copyright 1990 by Sage Publications. 
Reprinted by permission. 
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especially as used within the context of a two-per-week training 
regimen, offered benefits to practitioners working with learners 
in community-based settings. 


Backward Chaining 


When a backward chaining procedure is used, all the behav- 
iors identified in the task analysis are initially completed by 
the trainer, except for the final behavior in the chain. When 
the learner performs the final behavior in the sequence at the 
predetermined criterion level, reinforcement is delivered. Next, 
reinforcement is delivered when the last and the next-to-last 
behaviors in the sequence are performed to criterion. Subse- 
quently, reinforcement is delivered when the last three behaviors 
are performed to criterion. This sequence proceeds backward 
through the chain until all the steps in the task analysis have 
been introduced in reverse order and practiced cumulatively. 

Pierrel and Sherman (1963) conducted a classic demon- 
stration of backward chaining at Brown University with a 
white rat named Barnabus. Pierrel and Sherman taught Barn- 
abus to climb a spiral staircase, push down and cross a draw- 
bridge, climb a ladder, pull a toy car by a chain, enter the car 
and pedal through a tunnel, climb a flight of stairs, run through 
an enclosed tube, enter an elevator, raise a miniature replica of 
the Brown University flag, exit the elevator, and finally press 
a bar for which he received a pellet of food. Barnabus became 
so famous for his performance of this elaborate sequence that 
he acquired the reputation of being “the rat with a college edu- 
cation.” This chain of 11 responses was trained by initially 
conditioning the last response in the sequence (the bar press) 
in the presence of a buzzer sound, which was established as 
the S? for the bar press. Then the next-to-last response in the 
sequence (exiting the elevator) was conditioned in the presence 
of the elevator at the bottom of the shaft. Each response in the 
chain was added in turn so that a discrete stimulus served as 
the SP for the next response and as the conditioned reinforcer 
for the preceding response.” 

To illustrate backward chaining with a classroom example, 
let us suppose that a preschool teacher wants to teach a student 
to tie his shoes. First, the teacher conducts a task analysis of shoe 
tying and puts the component behaviors in logical sequence. 


1. Cross over laces on the shoe. 


2. Tie a knot. 


3. Make a loop with the lace on the right side of the shoe, and 
hold it in the right hand. 


. With the left hand, wrap the other lace around the loop. 


nn 


. Use the index or middle finger of the left hand to push the 
left lace through the opening between the laces. 


6. Grasp both loops, one in each hand. 
7. Draw the loops snug. 


The teacher begins by training with the last step in the sequence, 
Step 7, until the student is able to complete it without mistakes 
for three consecutive trials. After each correct trial at Step 7, 
reinforcement is delivered. The teacher then introduces the 
next-to-last step, Step 6, and begins training the student to per- 
form that step and the final step in the chain. Reinforcement is 
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contingent on the successful performance of Steps 6 and 7. The 
teacher then introduces Step 5, making sure that the training 
step and all previously learned steps (i.e., Steps 5, 6, and 7) 
are executed in the proper sequence prior to reinforcement. 
The teacher can use supplemental response prompts to evoke 
correct responding on behavior at any step. However, any sup- 
plemental response prompts—verbal, pictorial, demonstration, 
or physical—introduced during training must be faded later in 
the program so that the student’s behavior comes under stimulus 
control of the naturally occurring SPs. 

With backward chaining, the first behavior the learner 
performs independently produces the terminal reinforcement: 
The shoe is tied. The next-to-last response produces the onset 
of a stimulus condition that reinforces that step and serves as 
an S for the last behavior, which is now being established in 
the learner’s behavioral repertoire. This reinforcing sequence is 
repeated for the remainder of the steps. 

Hagopian, Farrell, and Amari (1996) combined backward 
chaining with fading to reduce the life-threatening behavior of 
Josh, a 12-year-old boy with autism and intellectual disabili- 
ties. Because of medical complications associated with a variety 
of gastrointestinal factors, frequent vomiting, and constipation 
during the 6 months prior to the study, Josh refused to ingest any 
food or liquids by mouth. Indeed, when he did ingest food by 
mouth previously, he expelled it. 

Over the 70-day course of Josh’s treatment program, data 
were collected on liquid acceptance, expulsions, swallows, and 
avoidance of the target behavior: drinking water from a cup. 
After baseline was collected on Josh’s ability to swallow 10 cc 
of water, a swallow condition without water was implemented. 
An empty syringe was placed and depressed in his mouth, and 
he was directed to swallow. Next, the syringe was filled with a 
small volume of water, and reinforcement delivered when Josh 
swallowed the water. In subsequent phases, the volume of water 
in the syringe was gradually increased from 0.2 cc to 0.5 cc to 
1 cc, and ultimately to 3 cc. To earn reinforcement in subsequent 
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From “Treating Total Liquid Refusal with Backward 

Chaining and Fading,” by L. P. Hagopian, D. A. Farrell, 

and A. Amari, 1996, Journal of Applied Behavior Analysis, 0 
29, p. 575. Copyright 1996 by the Society for the Experi- 

mental Analysis of Behavior, Inc. Used with permission. 


3 Swallow, no water 


conditions, Josh had to emit the target chain using at first 3 cc of 
water from a cup to ultimately 30 ce of water. At the end of the 
study, a successful generalization probe was implemented with 
90 cc of a mixture of water and juice. 

The results showed that the chaining procedure improved 
Josh’s ability to emit the target chain (see Figure 23.11). Hago- 
pian and colleagues explained their procedure this way: 


We began by targeting a preexisting response (swallowing), 
which was the third and final response in the chain of behav- 
iors that constitute drinking from a cup. Next, reinforcement 
was delivered for the last two responses in the chain (accept- 
ing and swallowing). Finally, reinforcement was delivered 
only when all three responses in the chain occurred. (p. 575) 


A primary advantage of backward chaining is that the 
learner contacts the chain’s terminal reinforcer on every instruc- 
tional trial. As a direct outcome of reinforcement, the stimulus 
that is present at the time of reinforcement increases its dis- 
criminative properties. Further, the repeated reinforcement of 
all behaviors in the chain increases the discriminative capability 
of all the stimuli associated with these behaviors and with the 
reinforcer. The main disadvantage of backward chaining is that 
the potential passive participation of the learner in earlier steps 
in the chain may limit the total number of responses made during 
any given training session. 


Backward Chaining with Leap Aheads 


Spooner, Spooner, and Ulicny (1986) described a variation 
of backward chaining they called backward chaining with 
leap aheads. Backward chaining with leap aheads follows 
essentially the same procedures as backward chaining, except 
that not every step in the task analysis is trained. Selected 
steps are simply probed. The purpose of the leap-aheads mod- 
ification is to decrease the total training time needed to learn 
the chain. With conventional backward chaining, the stepwise 
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Figure 23.12 Essential features of forward, total-task, backward, and backward chaining with leap aheads. 


Forward Chaining 


Behaviors identified in the 
task analysis are taught 

in their naturally occur- 
ring order. Specifically, 
reinforcement is deliv- 
ered when the predeter- 
mined criterion for the 
performance of the first 
behavior in the sequence 
is achieved. Trainer com- 
pletes the remaining tasks 
in the chain, or assists the 
learner in doing so. Next, 
reinforcement is delivered 
for criterion completion of 


Total-Task Chaining 


A variation of forward 
chaining in which the 
learner receives training 
on each step in the task 
analysis during every ses- 
sion. The trainer provides 
assistance for any steps 
the learner is unable to 
perform independently, 
and the chain is trained 
until the learner is able to 
perform all the behaviors 
in the sequence to the 
predetermined criterion. 


Backward Chaining 


Tasks are taught in a “backward” 
(last-to-first) sequence. All the 
task-analysis-identified behaviors are 
initially completed by the trainer, 
except for the final behavior in the 
chain. When the learner performs the 
final behavior in the sequence at the 
predetermined criterion level, rein- 
forcement is delivered. Next, reinforce- 
ment is delivered when the combined 
next-to-last and last behaviors in the 
sequence are performed to criterion. 
Subsequently, reinforcement is deliv- 
ered when the combined sequence of 
the last three behaviors are performed 


Backward Chaining with 
Leap Aheads 


ee Oe EEE eee 


This type of chaining follows 
essentially the same proce- 
dures as backward chaining, 
except that not every step 
in the reverse execution of 
the task analysis is trained. 
Selected steps are simply 


probed. Tasks that the learner 


can perform are skipped, and 
the trainer “leaps ahead” to 
the next unmastered task, 
thereby decreasing total 
training time for the chain. 


Steps 1 and 2. Succeeding 
steps require the cumu- 
lative performance of 

all previous steps in the 
correct order. 


repetition of the behaviors in the sequence may slow down the 
learning process, especially when the learner has mastered 
a few of the steps in the chain. For instance, in the previous 
shoe-tying illustration, the child might perform Step 7, the 
last behavior in the sequence, and then leap ahead to Step 4 
because Steps 5 and 6 are in her repertoire. It is important to 
remember, however, that the learner must still perform Steps 5 
and 6 correctly and in sequence with the other steps to receive 
reinforcement. 

Figure 23.12 describes the essential features of for- 
ward, total-task, backward, and backward chaining with leap 
aheads. 


CHOOSING A CHAINING METHOD 


Forward chaining, total-task chaining, backward chaining, 
and backward chaining with leap aheads have each shown 
effectiveness with a wide range of self-care, vocational, rec- 
reational, and independent-living behaviors. Which chaining 
procedure should be the method of first choice? Research 
conducted to date does not suggest a definitive answer. 
Furthermore, research does not suggest a clear protocol for 
whether unfinished links in a chain should be completed by 
the teacher or the student, or left unfinished until the next 
training trial. Bancroft, Weiss, Libby, and Ahearn (201 1) 
showed that whereas the majority of students in their study 
learned a chain faster under the student condition, this result 
was not universal for all students. 

After examining evidence reported between 1980 and 
2001, Kazdin (2001) concluded: “Direct comparisons have 
not established that one [forward chaining, backward chain- 
ing, or total-task chaining] is consistently more effective than 


to criterion. This series proceeds in 
reverse fashion through the chain until 
all the steps in the task analysis have 
been introduced in reverse order and 
practiced cumulatively. 


the other” (p. 49). A study by Slocum and Tiger (2011) found 
forward and backward chaining procedures to be similarly 
effective. 

Seaver and Bourret (2014) stated, “When all teaching 
procedures are equally efficient for a particular individual, a 
practitioner may take student preference . . . teacher prefer- 
ence, and procedural integrity into consideration” (p. 791). 
Anecdotal evidence, logical analysis, and the behavior analyst's 
experience with the learner may also provide direction. Milten- 
berger (2001) and Test et al. (1990) concluded that using total- 
task chaining may be warranted (a) when the student can 
perform several of the tasks in the chain, but needs to learn 
them in sequence; (b) when the student has an imitative reper- 
toire; (c) when the student has moderate to severe disabilities; 
and/or (d) when the task sequence or cycle is not very long or 
complex. Likewise, Bancroft et al. (2011) advised considering 
other factors—the learner’s ability to tolerate physical guid- 
ance, the attention span of the learner, or the total amount of 
training time available to devote to learning the chain—when 
deciding which method to use. 

Uncertainty about which approach to follow may be mini- 
mized by conducting a personalized task analysis for the learner, 
systematically applying the single- or multiple-opportunity 
method to determine the starting point for instruction, and rely- 
ing on empirically sound data-based studies in the literature. 
Further, doubt can be reduced by ascertaining if an individual’s 
past history with any of the chaining procedures or variations 
favors one method over another, and/or by collecting formative 
evaluation data (i.e., measurements of the target behavior during 
treatment) and summative evaluation data (i.e., measurement of 
behavior after treatment) on the proposed approach, to deter- 
mine its efficacy for that person. 
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DISRUPTING AND BREAKING 
BEHAVIOR CHAINS 


Thus far, our discussion has focused on procedures for building 
behavior chains—that is, constructing a sequence of stimulus 
conditions and associated responses such that when individual 
components are joined, the chain of behavior results that pro- 
duces reinforcement. As stated earlier, each stimulus linking two 
responses in a chain serves the dual functions of conditioned 
reinforcer for the response that produced it and discriminative 
stimulus for the next response in the chain. 

Practitioners have used chaining to expand the repertoires 
of a wide range of learners. Knowing how to link behaviors into 
a practical chain, however, is not always sufficient. Behavior ana- 
lysts should also know how to disrupt or break behavior chains that 
are maladaptive or inappropriate (e.g., consuming too much food, 
texting while driving). Behavior chains can be disrupted or broken 
with six methods: extinction, satiation, unchaining, interrupting the 
chain, substituting the SP. and extending the chain with time delays. 

In a series of three experiments, Kuhn, Lerman, Vorndran, 
and Addison (2006) evaluated the effects of extinction, satiation, 
and unchaining on responding by children with disabilities in a 
previously taught two-response chain. The chain in each exper- 
iment went like this: S1 (therapist shows child a small box con- 
taining a preferred food item and closes the box) — R1 (child 
signs “open”’) — S2 (therapist opens box of food) — R2 (child 
signs “eat” or “popcorn”) — (child receives food). 


Extinction 


To break or interrupt a chain using extinction, the practitioner 
withholds reinforcement for one or more responses in the chain. 


Kuhn et al. (2006) applied extinction to the final response in the 
chain: “R1 (‘open’) resulted in the box opening as before, but 
the box was closed and no food was delivered contingent on 
R2 (‘eat’ or ‘popcorn’)” (p. 273). Extinction for R2 resulted in 
immediate decreases in both R1 and R2 by all four participants 
(Figure 23.13 shows results for two children). Extending their 
findings to a two-response chain of aggressive behavior con- 
sisting of arm grabbing and hair pulling, Kuhn and colleagues 
suggested that treating hair pulling with extinction would likely 
result in decreased arm grabbling. 


Satiation 


To break or interrupt a chain using satiation, the practitioner 
provides unlimited access to the reinforcer (an abolishing oper- 
ation) for the last response in the chain. As a result, the chain’s 
final response will likely abate because of the decreased effec- 
tiveness of the reinforcer. Responses earlier in the chain would 
be expected to decrease as well because the effectiveness of 
the stimulus changes as conditioned reinforcers for earlier 
responses depends upon their association with the chain’s ter- 
minal reinforcer. 

Kuhn and colleagues (2006) evaluated this satiation 
condition. Prior to each session, the child was given free 
access to food for the duration of “satiation periods” rang- 
ing from 25 to 30 minutes. Presession access to the chain’s 
terminal reinforcer decreased both responses in the chain 
(see Figure 23.14)-.R2 (“eat”) decreased as a direct result 
of presession access to food; R1 (“open”) decreased because 
presession access to food weakened an opened food box as a 
conditioned reinforcer. 


Figure 23.13 Frequency of 
appropriate “open” and “eat” 
(“popcorn” instead of “eat” for 
Timmy) responses during the 
extinction evaluation for Bonnie 
(top) and Timmy (bottom). 20 
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Figure 23.14 Frequency of “open” and “eat” responses during the satiation evaluation for 


Leroy (top), Sammy (middle), and Don (bottom). 


From “Analysis of Factors That Affect Responding in a Two-Response Chain in Children with Developmental Disabil- 
ities,” by S. A. C. Kuhn, D. C. Lerman, C. M. Vorndran, and L. Addison, 2006, Journal of Applied Behavior Analysis, 
39, p. 272. Copyright 2006 by the Society for the Experimental Analysis of Behavior, Inc. Used with permission. 


Unchaining 


Unchaining—a term introduced by Kuhn et al. (2006) for a pro- 
cedure described by Michael (2000)—occurs when, in the case 
of a two-response chain, R2 produces reinforcement not only 
in the presence of S2 (as usual) but also in the absence of S2. 
In Michael’s example, a food-deprived rat’s first response (R1) 
turns on a tone, in the presence of which a second response (R2) 


produces food. In the unchaining procedure (Michael called it 
unpairing), R1 continues to produce the tone, but R2 results 
in food regardless of the tone’s presence. In theory, R1 will 
decrease because its immediate consequence, the tone, loses its 
reinforcing effectiveness as a conditioned reinforcer because it 
no longer functions as a discriminative stimulus (SP) for food. 
Thus, unchaining disrupts R1 while leaving R2 intact. 
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The unchaining procedure used by Kuhn and colleagues 
(2006) was implemented as follows: 


Reinforcement was delivered contingent on R2 regardless 
of when it occurred. That is, the child received a small 
piece of food contingent on completing the chain (1.e., R1, 
then R2) and contingent on R2 alone. Thus, R2 produced 
food regardless of whether the box was open. The purpose 
of this condition was to look at the effects of unchaining 
R1 (.e., communicating to open the box) and R2 (i.e., com- 
municating for food) on the response chain. Initially for all 
children, the reinforcer for signing “eat” (or “popcorn” for 
Timmy) outside the chain “open-eat” was placed behind 
the therapist’s back, as well as in the box, so that the ther- 
apist could deliver the reinforcer without opening the box. 
Beginning with Session 16 (first unchaining phase) and 
Session 42 (second unchaining phase), the reinforcer was 


placed on a plate on the table in front of Don, next to the 
box containing additional reinforcers. The reinforcer on the 
plate was provided contingent on occurrences of the “eat” 
response outside the context of the chain. (p. 274) 


Figure 23.15 shows results for the three children who 
participated in the experiment. Unchaining had little effect on 
Bonnie, as she continued to emit both responses in the chain 
throughout the unchaining condition. In contrast, after three 
or four sessions of unchaining, Timmy no longer emitted R1. 
Unchaining had no effect on Don’s behavior until researchers 
modified the procedure by making the food visible on a plate. 
This resulted in a decrease in R1 (“open’’) responses. 

Although unchaining may show promise as a method for 
disrupting earlier responses in a chain, Kuhn et al.’s (2006) data 
do not show that the effects of unchaining are as powerful, or as 
immediate, as extinction or satiation methods. 
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Interrupting the Chain 


The behavior chain interruption strategy (BCIS) relies on the 
participant’s skill to initially perform all the critical elements of 
a chain independently, but the chain is then interrupted, or a link 
in the chain is made unavailable, at a predetermined link so that 
another behavior can be emitted or prompted. Initially developed 
to increase speech and oral responding (Goetz, Gee, & Sailer, 1985; 
Hunt & Goetz, 1988), the BCIS has been extended into pictorial 
communication systems (Roberts-Pennell & Sigafoos, 1999), 
manual signing (Romer, Cullinan, & Schoenberg, 1994), switch 
activation (Gee, Graham, Goetz, Oshima, & Yoshioka, 1991), and 
mand training (Albert, Carbone, Murray, Hagerty, & Sweeney- 
Kerwin, 2012; Hall & Sundberg, 1987). 

The BCIS works as follows. First, an assessment is 
conducted to determine if the person can independently com- 
plete a chain of two or more components. Figure 23.16 shows 
an example of how a toast-making chain, divided into five 
steps, was assessed across degrees of distress and attempts to 
complete when specific steps were blocked by an observer. 
Degree of distress was ranked on a 3-point scale, and attempts 
were registered using a dichotomous scale (1.e., yes or no). 

With respect to using BCIS in applied settings to 
increase behavior, a chain is selected based on the assumption 
that when it is interrupted, it will not be unduly distressful 
for the individual. In other words, care is taken that the inter- 
ruption does not precipitate the person becoming episodic or 
self-injurious. After collecting baseline on a target behavior 
(e.g., vocalizations), the person is directed to start the chain 
(e.g., “Make toast.”). At a predetermined step in the chain— 
say, Step 3 (e.g., “Push down”)—the individual’s ability to 
complete that step is restricted. For example, the practitioner 
might passively block access to the toaster momentarily. 
The person would then be prompted, “What do you want?” 
A vocalization, previously not in the learner’s repertoire, 
would be required for the chain to be completed. That is, the 
person would be prompted to say, “Push down.” 
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A procedural variation of BCIS, the interrupted chain 
procedure (ICP), entails arranging the environment such that 
the learner is unable to continue the chain at a predetermined 
step. A practitioner implementing the ICP in the toast-making 
chain might conceal the toaster and, after the learner has 
obtained a slice of bread, ask, “What do you want?” The verbal 
prompt may function as a motivating operation that evokes the 
learner’s response, “Toaster, please,” to obtain the toaster and 
complete the chain. 

Although the exact behavioral mechanisms responsible 
for improved performance using BCIS and ICP are still sub- 
ject to scientific inquiry, analysis, and explanation, field-based 
research efforts, including those examining generalized out- 
comes (Grunsell & Carter, 2002), have established them as 
efficacious approaches, especially for persons with severe dis- 
abilities. For example, Carter and Grunsell’s (2001) review of 
the literature on the efficacy of BCIS shows that it is positive 
and beneficial. In their view, 


The BCIS may be seen as empirically supported and com- 
plementary to other naturalistic techniques. ... A small, but 
growing body of research literature has demonstrated that 
individuals with severe disabilities have acquired request- 
ing with the BCIS. In addition, research demonstrates 

that implementation of the BCIS may increase the rate of 
requesting. (p. 48) 


The assumption underlying BCIS assessment is that 
“persistence in task completion and emotional response to 
the interruption would serve as operational definitions of high 
motivation for task completion” (Goetz et al., 1985, p. 23). 
Because momentarily interrupting a chain detains the learner 
from obtaining the reinforcer for completing the chain, the pro- 
cedure may function as a conditioned motivation operation that 
evokes additional behavior required to continue and complete 
the chain (see Chapter 16). “By interrupting the chains at the 
point at which these additional objects (or actions) are needed, 


Chain Pretesting 


Student's Name; Chuck 
Date; 5/2. 


Attempts to 


Sequence of steps Degree of distress complete 


1. Pull bread from bag* 1 (2) 3 
(low) (high) 


2. Put in toaster 1 2 3} yes no 


1 (2) 3 yes no 


3. Push down* 


Chain; making toast 


*interrupt at start of step 


Comments 


tried to touch knob 


Figure 23.16 Sample score sheet used to 
evaluate chains. 


From “Using a Chain Interruption Strategy to Teach 


4. Take toast out 1 2 3 y Ie Communication Skills to Students with Severe 
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Handicaps, 1985, 13(1), p. 24. Copyright by the 


X distress: fe 23 Total 2/3 = 66% 


Association for Persons with Severe Handicaps. 
Used by permission. 
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practitioners can momentarily establish them as reinforcers 
and contrive opportunities to teach a wider variety of mands” 
(Albert et al., 2012, p.73). 

In any case, the literature is becoming increasingly clear 
that the BCIS/ICP consists of several components and features 
that make it useful as a behavior change tactic in natural settings. 
Participants exposed to interrupted chain procedures can learn to 
perform nontrained tasks and responses, thereby increasing the 
efficiency of the procedure (Albert et al., 2012; Carter & Grunsell, 
2001). As Albert el al. state, “By interrupting the chains at the 
point at which these additional objects (or actions) are needed, 
practitioners can momentarily establish them as reinforcers and 
contrive opportunities to teach a wider variety of mands” (p. 73). 
Figure 23.17 reviews key components and features of BCIS/ICPs. 


Substituting an Initial S° for 
an Alternative Chain 


When a specific S” sets an inappropriate chain in motion, block- 
ing that SP completely and substituting the initial SP for a pre- 
ferred chain may be effective. For example, if an ashtray next 
to a favorite chair (S,) evokes a chain of finding a pack of cig- 
arettes (R;), finding a lighter (R>), and firing up a smoke (R3), 


Figure 23.17 Key components and features of the BCIS. 


replacing the ash tray with a TV remote, a newspaper, or a book 
may occasion one or more desirable sequences of responses. 


Extending the Chain with Time Delays 


Martin and Pear (2016) suggested that a behavior chain that 
reinforces excessive food consumption can be broken by intro- 
ducing links requiring the individual to place the eating utensil 
on the table between bites, or introducing a 3- to 5-sec time 
delay before the next bite can be initiated. In an undesirable 
chain, “the person gets ready to consume the next bite before 
even finishing the present one” (p. 111). A more desirable chain 
separates these components and introduces brief delays. 


TROUBLESHOOTING CHAINS 


With respect to separating chain components, let us con- 
sider the case of a restaurant trainee who has been taught 
to bus tables. Upon completion of the training program, the 
trainee was able to perform each of the necessary behaviors 
in the table-busing chain accurately and proficiently (see 
Figure 23.18, upper panel). However, on the job site, the 
new employee started to emit an inappropriate sequence of 


Instruction begins in the middle of the chain sequence, not the start, differing from forward and backward chaining 
procedures. Because the instruction begins in the middle of the sequence, interrupting the sequence might function as a 
transitive conditioned motivating operation or negative reinforcer, the removal of which increases behavior. 


Procedurally, the BCIS is predicated on an assessment that verifies that the person is able to complete the chain independently, 
but experiences moderate distress when the chain is interrupted in the middle of the sequence. 


Verbal prompts are used at the point of interruption (e.g., “What do you want?”), but a full range of response prompts— 


modeling and physical guidance—has also been employed. 


Interruption training occurs in the natural setting (e.g., a water basin for washing hair; the microwave oven for making 


cookies). 


Maintenance, generalization, and social validity data, although not overwhelmingly evident in every study, are sufficiently 
robust to suggest that the BCIS be included with other interventions (e.g., mand-model, time delay, and incidental teaching). 
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Figure 23.18 Illustration of an original behavior chain (upper panel) and its revision (lower panel) 


to break an inappropriate response sequence. 


behaviors. Specifically, in the presence of dirty dishes on the 
empty table, the individual scraped excess food scraps onto 
the table itself, instead of placing the dirty items on the food 
cart. In other words, the initial SP for the table-busing chain, 
the empty table with the dirty plates, was setting the occasion 
for a response (dish scraping) that should have occurred later 
in the chain. 

To troubleshoot this inappropriate chain and determine a 
course of action, the behavior analyst should consider several 
possible sources of difficulty, including (a) re-examining the S?s 
and responses S;-R,-S-R»; (b) determining whether similar S?s 
cue different responses; (c) analyzing the job setting to identify 
relevant and irrelevant Ss; (d) determining whether SPs in the 
job setting differ from training Ss; and (e) identifying the pres- 
ence of novel stimuli in the environment. 


Re-examine the S° and Response Sequence 


Kuhn et al. (2006) indicated that appropriate chains can be 
broken inadvertently and the entire chain, or key links within 
the chain, can be compromised when parents, teachers, and/or 
caregivers unwittingly reinforce an inappropriate sequence. 
They stated, 


...if a series of appropriate behaviors are inadvertently 
linked in a chain and the contingencies change for just 
some members of the chain (e.g., if a new caregiver begins 
to reinforce the terminal response whether or not it fol- 
lowed earlier responses in the chain), earlier appropriate 
behaviors could be eliminated (p. 265). 


So, by re-examining the SP and response sequence, a more 
appropriate arrangement might be contrived. 


Determine Whether Similar S°s Cue 
Different Responses 


Figure 23.18 (upper panel) shows two similar SPs—dirty dishes 
on the empty table (S;) and dirty dishes on the cart (S>)—which 
might have contributed to the scraping of food onto the table. In 
other words, R» (scrape dishes) might have come under the con- 
trol of S; (dirty dishes on table). Figure 23.18 (lower panel) shows 
how the behavior analyst corrected the sequence by rearranging 
the SPs and their associated responses. Scraping the dishes is now 
the fifth response in the chain (Rs), and it occurs at the kitchen 
sink, an area away from the restaurant tables and diners. Thus, any 
potential confusion would be reduced or eliminated. 


Analyze the Natural Setting to Identify Relevant 
and Irrelevant S°s 


The training program should be designed so that the learner 1s 
trained to discriminate the relevant (i.e., critical) components of 
a stimulus from the irrelevant variations. Figure 23.18 (lower 
panel) shows at least two relevant characteristics of S;: an empty 
table and the presence of dirty dishes on that table. An irrelevant 
stimulus might be the location of the table in the restaurant, the 
number of place settings on the table, or the arrangement of 
the place settings. A relevant characteristic of Ss, the presence of the 
kitchen sink, would be water faucets, the sink configuration, or 
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dirty dishes. Finally, irrelevant stimuli might be the size of the 
kitchen sink and the type or style of faucets. 


Determine Whether S°s in the Natural Setting 
Differ from Training S°s 


It is possible that variations of the S?s cannot be taught during 
simulated training phases because natural conditions cannot be 
replicable or governmental regulations are prohibitive. For exam- 
ple, instrument pilots executing a complex chain of approach 
and landing maneuvers for a “circling approach” to an airport 
as part of a competency check, can only do that maneuver in a 
real airplane at an airport. Simulated approaches are not allowed. 
Hence, it is recommended that the final training session, at least 
for selected chains, be conducted in the natural setting where the 
behavior chain is expected to be performed. This allows any dif- 
ferences that exist to be recognized by the trainer and subsequent 
discrimination training to be further refined on site. 


Identify the Presence of Novel Stimuli 
in the Environment 


The presence of a novel stimulus unexpected in the original train- 
ing situation can also prompt the occurrence of an inappropri- 
ate chain. In the restaurant example, the presence of a crowd of 
customers may set the occasion for the chain to be performed 
out of sequence. Likewise, distracting stimuli (e.g., customers 
coming and going, tips left on the table) may set the occasion for 
an inappropriate chain. Also, a coworker might unwittingly give 
contradictory instructions to the trainee. In any of these situations, 
the novel stimuli should be identified and the learner taught to 
discriminate them along with other SPs in the environment. 


FACTORS AFFECTING THE PERFORMANCE 
OF BEHAVIOR CHAINS 


Several factors affect the performance of a behavior chain. The 
following sections outline these factors and provide recommen- 
dations for addressing them. 


Completeness of the Task Analysis 


The more complete and accurate the task analysis, the more 
progress the person can make through the sequence. If the ele- 
ments making up the chain are not sequenced appropriately, or 
if the corresponding SPs are not identified for each response, 
learning the chain will be more difficult. 

The behavior analyst should remember two key points 
when attempting to develop an accurate task analysis. First, 
planning must occur before training. The time devoted to 
constructing and validating the task analysis is well spent. 
Second, after the task analysis is constructed, training should 
begin with the expectation that adjustments in the task anal- 
ysis, or the use of more intrusive prompts, may be needed 
at various steps in the task analysis. For example, McWil- 
liams and colleagues (1990) noted that one of their students 
required extensive trials and more intrusive prompts with a 
few steps in the task analysis before improved performance 
was obtained. 
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Length or Complexity of the Chain 


Longer or more complex behavior chains take more time to 
learn than shorter or less complex behavior chains. Likewise, if 
two or more chain clusters are linked, practitioners can antici- 
pate that training will be longer. 


Schedule of Reinforcement 


When a reinforcer is presented subsequent to the performance 
of a behavior in a chain, it affects each of the responses making 
up the chain. However, the effect on each response is not iden- 
tical. For example, in backward chaining, responses performed 
at the end of the chain are strengthened faster than responses 
made earlier in the chain because they are reinforced more 
frequently. The behavior analyst is advised to remember two 
points: (a) A chain can be maintained if an appropriate schedule 
of reinforcement is used (see Chapter 13), and (b) the number of 
responses in a chain may need to be considered when defining 
the schedule of reinforcement. 


Stimulus Variation 


Bellamy, Horner, and Inman (1979) provided an excellent picto- 
rial representation of how stimulus variation affects a learner’s 
performance of a chain. The upper photograph in Figure 23.19 


shows a cam switch bearing before and after its placement on a 
cam switch axle. The lower photograph shows the four different 
types of bearings that can be used in the assembly process. Any 
bearing sharing two critical features—a 1.12-cm circular hole in 
the center and one or more hex nut slots on its face—should evoke 
placing it on a cam switch axle. Bearings without these features, 
regardless of the presence of irrelevant features (e.g., color, mate- 
rial composition, weight), should not occasion the response. 

If possible, the behavior analyst should introduce all pos- 
sible variations of the S? that the learner will encounter. Regard- 
less of the behavior chain, presentation of stimulus variations 
increases the probability that the correct response will occur 
in their presence. This includes, for example, in an assembly 
task, various canisters and shafts; with dressing skills, different 
fasteners, zippers, and buttons; and with tooth brushing, assorted 
toothpaste tubes and pumps and brushes. 


Response Variation 


Often when stimulus variations occur, response variation also 
must occur to produce the same effect. Again, Bellamy and 
colleagues (1979) provided an illustration with the cam shaft 
assembly. In the upper-left photograph of Figure 23.20, the 
bearing has been placed on the cam shaft, and the retaining clip 
is being positioned for placement with a pair of nose pliers. 


A. B. 


A cam switch bearing before and after placement on a cam switch axle. 


Figure 23.19 Upper panel: A cam switch bearing 
before and after placement on a cam switch axle. 
Lower panel: Four different types of bearings used in 
cam switch assembly. 

From Vocational Habilitation of Severely Retarded Adults, pp. 40 & 


42, by G. T. Bellamy, R. H. Horner, and D. P. Inman, 1979, Austin, 
TX: PRO-ED. Copyright 1979 by Pro-Ed. Reprinted by permission. 


Placement with push tool 


The upper-right photograph shows the clip in position. The 
lower-left photograph shows a different bearing configuration 
(i.e., the S? is different) requiring a different response. Instead 
of the retaining clip being lifted over the bearing cap with 
pliers, it must be pushed over the cap with a wrench-type tool. 


SUMMARY 
Behavior Chain Defined 
1. A behavior chain is a linked sequence of responses leading 

to a terminal outcome. Each response, and its associated 
stimulus condition, produces a change that functions as 
conditioned reinforcement for that previous response 
and as a discriminative stimulus for the next response in 
the chain. Reinforcement for the last response in a chain 


maintains the reinforcing effectiveness of the stimulus 
changes produced by all previous responses in the chain. 


2. Each stimulus that links two sequential responses in a 
chain serves dual functions: It is a conditioned reinforcer 
for the response that produced it and an SP for the next 
response in the chain. 


3. A behavior chain has three important characteristics: 
(a) It involves the performance of a specific series of 
discrete responses; (b) the performance of each behav- 
ior in the sequence produces a stimulus change in the 
environment in such a way that it yields conditioned 
reinforcement for the preceding response and serves as an 
SP for the next response; and (c) the responses within the 
chain must be performed in a specific sequence, and in 
close temporal succession. 


4. Inabehavior chain with a limited hold, a sequence of behaviors 
must be performed correctly and within a specific time for 
reinforcement to be delivered. Proficient responding is a dis- 
tinguishing feature of chains with limited holds. 
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Figure 23.20 Two ways that retaining rings are 
applied to affix bearings to cam axles. 


From Vocational Habilitation of Severely Retarded Adults, p. 44, 
by G. T. Bellamy, R. H. Horner, and D. P. Inman, 1979, Austin, 
TX: PRO-ED. Copyright 1979 by Pro-Ed. Reprinted by permission. 


The response of lifting or pushing has changed, as has the 
response of selecting the appropriate tool. Thus, the behavior 
analyst should be aware that when stimulus variation is intro- 
duced, training, or retraining of responses, within the chain 
may be required. 


5. If a learner takes too much time to complete one or 
more component responses, the behavior analysis can 
implement contingencies focusing on the proficiency of 
performance. Setting a time criterion for the onset and 
completion of each response within the chain is one way 
to do this; another is to set a time criterion for completion 
of the total chain. 


Rationale for Chaining 


6. Whereas a behavior chain denotes a particular sequence of 
stimuli and responses ending in reinforcement, chaining 
refers to various methods for linking specific sequences of 
stimuli and responses to form new performances. 


7. Chaining refers to the way these specific sequences of 
stimuli and responses are linked to form new perfor- 
mances. In forward chaining, behaviors are linked begin- 
ning with the first behavior in the sequence. In backward 
chaining, the behaviors are linked beginning with the last 
behavior in the sequence. Forward and backward chaining 
have variations available. 


g. A behavior analyst should be skilled in building behavior 
chains for three reasons: (a) Chains can be used to 
improve independent-living skills; (b) chains can provide 
the means by which other behaviors can be combined into 
more complex sequences; and (c) chains can be combined 
with other procedures to build behavioral repertoires in 
generalized settings. 
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Establishing Behavior Chains with Task Analysis on. The primary advantage of backward chaining is that 
the learner contacts the chain’s terminal reinforcer on 


9. Task analysis involves breaking a complex skill into every instructional trial. 


smaller, teachable units, the product of which is a series of 

sequentially ordered steps or tasks. 15. Backward chaining with leap aheads follows essentially 
the same procedures as in backward chaining, except that 
not every step in the task analysis is trained. The leap- 
aheads modification provides for probing or assessing 
untrained behaviors in the sequence. Its purpose is to 
speed up training of the behavior chain. 


10. The purpose of constructing and validating a task analysis 
is to determine the sequence of stepwise critical behaviors 
that constitute the complete task and that would lead to it 
being individualized for the learner and performed effi- 
ciently. Task analyses can be constructed by observing a 
competent performer, by executing the task yourself, by Choosing a Chaining Method 

asking an expert, and by trial and error. 

16. The decision to use forward chaining, total-task chain- 
ing, backward chaining, or backward chaining with leap 
aheads should be based on the results of a task analysis 
assessment; empirically sound data-based studies; and 
a functional evaluation, taking into consideration the 
cognitive, physical, motoric, and attention abilities of 
the individual. 


11. The purpose of assessing mastery level is to determine 
which components of the task analysis can already be per- 
formed independently. Assessment can be conducted with 
the single-opportunity or multiple-opportunity method. 


Behavior Chaining Methods 


12. In forward chaining, the behaviors identified in the task 
analysis are taught in their natural sequence. Specifically, Disrupting and Breaking Behavior Chains 
reinforcement is delivered when the predetermined cri- 
terion for the first behavior in the sequence is achieved. 
Thereafter, reinforcement is delivered for criterion 
completion of Steps 1 and 2. With each successive step, 
reinforcement is delivered contingent on the correct per- 
formance of all steps trained thus far. 


17. Behavior chains can be interrupted or broken in at least 
six ways. These include extinction, satiation, unchaining, 
interrupting the chain, substituting an >, or extending the 
chain and building in time delays. 


ery vm Troubleshooting Chains 
13. Total-task chaining is a variation of forward chaining in 


which the learner receives training on each step in the 18. To troubleshoot an inappropriate chain, the behavior analyst 
task analysis during every session. Trainer assistance is should consider (a) re-examining the $°s and responses 
provided using response prompts with any step the indi- S;-R,-Sp-Ro, (b) determining whether similar S°s cue dif- 
vidual is not able to perform. The chain is trained until ferent responses, (c) analyzing the job setting to identify 

the learner performs all the behaviors in the sequence to relevant and irrelevant Ss, (d) determining whether Ss in 
criterion. the job setting differ from training Ss, and (e) identifying 


oh ' : the presence of novel stimuli in the environment. 
14. In backward chaining, all the steps identified in the task 


analysis are completed by the trainer, except the last one. 


Retr ana ter va iho eq beac mipertocmedin cae Factors Affecting the Performance of Behavior Chains 


terion, reinforcement is delivered. Next, reinforcement is 19. Factors affecting the performance of a behavior chain 
delivered when the next-to-last step and the last step are include (a) completeness of the task analysis, (b) length 
performed. Subsequently, the individual must perform the or complexity of the chain, (c) schedule of reinforcement, 
last three steps before reinforcement is delivered, and so (d) stimulus variation, and (e) response variation. 


KEY TERMS 
backward chaining behavior chain interruption strategy forward chaining 
backward chaining with leaps ahead behavior chain with a limited hold task analysis 


behavior chain chaining total-task chaining 
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MULTIPLE-CHOICE QUESTIONS 


1. Each of the following is an example of a behavior except: b. You could watch people who know how to do the task 


a. Mowing the lawn 

b. Brushing your teeth 

c. Blinking your eye 

d. Writing your name 

Hint: (See “Behavior Chain Defined”) 


. A behavior chain with a limited hold means: 

a. The number of steps in the chain is limited. 

b. The learner must “hold” his/her responses for a 
specified period of time. 

c. The entire chain or steps within the chain must be 
completed in a given period of time. 

d. Accuracy of the performance of steps is not important. 

Hint: (See “Behavior Chain Defined”’) 


. Behavior chains and chaining procedures are important 
because: 


a. There are many behaviors that are considered behavior 
chains, and these behaviors lead to more independence 
in life. 

b. Chaining can be combined with many other teaching 
procedures so that behaviors can be taught in a variety 
of contexts. 

c. Chaining is an efficient training procedure-it can be 
used to combine already existing repertoires of 
behavior into new behaviors. 

d. All of these 


Hint: (See “Rationale for Chaining”’) 


. How are forward and backward chaining different? 

a. Forward chaining means the learner learns the steps 
to the skill in the correct order, and backward chaining 
means the learner learns the steps in the reverse order. 

b. Forward chaining means the learner learns the first step 
first in the sequence, and backward chaining means the 
learner learns the last behavior in the sequence first (but 
the steps are still learned in the correct sequence). 

c. Forward chaining is a much quicker way to learn a task 
than backward chaining. 

d. None of these answer choices accurately differentiates 
between forward and backward chaining. 


Hint: (See “Rationale for Chaining”) 
. If you want to create a task analysis for a behavior you have 
no idea how to do yourself, how could you construct it? 


a. You could find people who are good at the behavior 
and ask them how to do it. 


and observe what they do. 


c. You could go to the literature to see if there are any 
published task analyses of the behavior. 
d. All of these. 


Hint: (See “Establishing Behavior Chains with Task 
Analysis”’) 


. What is a risk of choosing the single-opportunity 


method of assessment for assessing mastery of a task 
analysis? 
a. You may underestimate the learner’s abilities. 


b. You may inadvertently be teaching the individual the 
behavior during assessment. 


c. You may frustrate the learner because of the time- 
consuming nature of this form of assessment. 
d. All of these. 


Hint: (See “Establishing Behavior Chains with Task 
Analysis’) 


. Breaking down longer chains into smaller chains and/or 


skill clusters, teaching each skill cluster, and then chaining 
each skill cluster is a variation of which form of behavior 
chaining? 

a. Forward chaining 

b. Backward chaining 

c. Total task chaining 

d. Backward chaining with leap aheads 

Hint: (See “Behavior Chaining Methods”) 


. Peer models have been used to teach children with dis- 


abilities to perform complex tasks using which form of 
chaining? 

a. Forward chaining 

b. Backward chaining 

c. Total task chaining 

d. Backward chaining with leap aheads 

Hint: (See “Behavior Chaining Methods”) 


. The behavior chain interruption strategy (BCIS) can be 


used to: 
a. Teach a complex new behavior chain 


b. Increase speech, picture communication, and micros- 
witch activation 


c. Replace the initial SDs in a chain 
d. All of these 
Hint: (See “Disrupting and Breaking Behavior Chains”) 
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10. Kara has been learning to board a public bus, put money in likely reason that Kara wasn’t able to perform steps in the 
the slot, and sit in her seat, waiting for her stop. Then, she chain she had learned so well at home? 
pushes the button to indicate when she wants to get off. She a. The S?s on the bus were too similar to S?s on the bus 
has practiced this skill in her small hometown, where there at home, and so they triggered different responses. 
ee ee ee ae ames Gus Bae Waxpled b. The steps of the task were different on this bus than on 


to Chicago with her parents to visit her aunt. She rode the 
bus with her parents in Chicago and was unable to do many 
steps in the chain. For example, she wasn’t able to put her 
money in the slot, and she had difficulty choosing a seat. 


the one at home. 


c. There were novel stimuli present in Chicago that weren’t 
present at home and that interfered with Kara’s chain. 


Her parents noticed that she was very distracted by all the d. All of these are reasonable explanations for Kara’s 
people on the bus, as compared to the few people who are trouble. 
on the bus at home. Which of the following is the most Hint: (See “Disrupting and Breaking Behavior Chains”) 
ESSAY-TYPE QUESTIONS 
1. Define and give an example of a behavior chain. 4, Explain how forward, backward, and total-task chaining 
Hint: (See “Behavior Chain Defined”) are conducted. 


2. Explain why it is important to teach behavior chains to Fe a Doe oe Meio) 


people. 5. If Marcus tends to have pizza binges when he is stressed 
and wants to stop this behavior chain, how could he go 
about it? 


Hint: (See “Disrupting and Breaking Behavior Chains’’) 


Hint: (See “Rationale for Chaining”’) 


3. Assume you want to conduct a task analysis of playing 
a video game. What are the three different ways you 
could approach developing that task analysis? 


Hint: (See “Establishing Behavior Chains with Task 
Analysis’’) 


SIE STE APEC AP ES RN OSPR COSI 


NOTES 


1. See Chapter 17 for prompting procedures. 
2. To see Barnabus in action, refer to Barnabus: A study in complex behavior 
chaining [Film]. (1979). Boston: Northeastern University, 1979. 


PART 


Decreasing Behavior with 
Nonpunishment Procedures 


Part 9 describes nonpunishment interventions to decrease or eliminate problem behaviors. Chapter 24 
details extinction procedures for problem behaviors linked to behavior maintained by positive 
reinforcement, negative reinforcement, and automatic reinforcement. This chapter includes sections 
on extinction effects, variables affecting resistance to extinction, guidelines for using extinction 
effectively, and when not to use extinction. Chapter 25 describes the four most frequently researched 
variations of differential reinforcement for decreasing problem behavior: differential reinforcement 
of (a) incompatible behavior (DRI), (b) alternative behavior (DRA), (c) other behavior (DRO), and 
(d) low rates of responding (DRL). Examples of applications and guidelines for the effective use of 
each of these differential reinforcement procedures are provided. Chapter 26 defines, gives examples, 
and offers guidelines for treating problem behavior by changing the environment before the behavior 
occurs. Antecedent interventions directed toward problem behaviors are described with respect to 
function, stimulus control, and contingency independence. 
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Corea ER 24. 


Extinction 


LEARNING OBJECTIVES 


e State the definition of extinction. 


¢ Compare and contrast the functional and procedural forms of extinction. 


e Identify three misuses of the term extinction. 


¢ Compare and contrast the use of extinction with behaviors maintained by positive, 


negative, and automatic reinforcement. 


Plot and label an extinction curve on a graph. 
e Discuss the phenomenon of spontaneous recovery. 


e Explain the effect of continuous and intermittent reinforcement on behavior's resistance 


to extinction. 


¢ Discuss how establishing operations affect resistance to extinction. 


¢ List several behavioral examples that demonstrate how response effort influences a 


behavior’s resistance to extinction. 


e List and discuss the ten guidelines for the use of extinction. 


Pavlov had studied a process that was in a sense the reverse 
of conditioning and took place more slowly. He called 

it “extinction.” In my early notes I sometimes called it 
“adaptation,” because it resembled the slow disappearance 
of the unconditioned response to a click. My first extinction 
curve turned up by accident. A rat was pressing the lever 

in an experiment on satiation when the pellet dispenser 
jammed. I was not there at the time, and when I returned 

I found a beautiful curve. The rat had gone on pressing 
although no pellets were received, at first more rapidly than 
usual since no time was lost in eating, but then more and 
more slowly as time wore on. Some oscillation between 
high and low rates made the cumulative record wavelike. 

The change was more orderly than the extinction 
of a salivary reflex in Pavlov’s setting, and I was terribly 
excited. It was a Friday afternoon and there was no one in 
the laboratory whom I could tell. All that weekend I crossed 
streets with particular care and avoided all unnecessary risks 
to protect my discovery from loss through my death. 

In my new laboratory I collected more extinction 
curves, all with roughly the same wavelike form—the rats 
pressing rapidly at first but then more and more slowly 
until they stopped. 

—B. F. Skinner (1979, p. 95) 


ithholding reinforcement for a previously reinforced 
behavior decreases the rate of that behavior: atactic, process, 
and principle known as extinction. As a behavior change 
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tactic, extinction provides zero probability of reinforcement. As a 
behavioral process, extinction refers to the diminishing rate of a 
previously reinforced behavior when that behavior no longer produces 
reinforcement. As a principle, extinction refers to the functional 
relation between withholding reinforcement for a previously 
reinforced behavior and the resultant diminishing rate of response. 

Practitioners have used extinction effectively in a wide 
variety of settings with problem behaviors ranging from mild 
disruptions to aggressive and self-injurious behavior. The 
effectiveness of extinction depends primarily on identifying 
and consistently withholding the reinforcer maintaining the 
target behavior. Although executing an extinction procedure 
would appear simple, implementing it effectively in applied 
settings can be difficult. This chapter describes three function- 
based extinction tactics, addresses common misuses of the term 
extinction, identifies secondary effects of extinction, and explains 
variables affecting resistance to extinction. The chapter concludes 
with guidelines for using extinction effectively and a discussion 
of when not to use extinction as a sole behavior reduction tactic. 


EXTINCTION DEFINED 


Extinction as a behavior change tactic occurs when reinforce- 
ment of a previously reinforced behavior is discontinued; as a 
result, the occurrence of that behavior decreases in the future. ! 
Keller and Schoenfeld (1950) defined extinction this way: 
“Conditioned operants are extinguished by severing the rela- 
tion between the act and the effect. . . . The principle of Type R 
[operant] extinction may be put as follows: The strength of 
a conditioned operant may be diminished by withholding its 


reinforcement” (pp. 70-71). Similarly, Skinner (1953) wrote, 
“When reinforcement is no longer forthcoming, a response 
becomes less and less frequent in what is called ‘operant extinc- 
tion.’” (p. 69). 

Note that an extinction procedure does not prevent the 
target behavior from occurring. Rather, extinction terminates 
the response-reinforcer relation by changing the environment 
so that the behavior no longer produces reinforcement. 

When the third-grade teacher instructed the class to begin 
a written exercise, Moira frequently shouted, jumped out of her 
seat, and knocked her materials to the floor. Moira’s teacher 
would ask her to calm down, tell her to return to her seat, explain 
patiently why her outbursts distracted her classmates, describe 
how she should behave, and offer to let Moira do another task. 
Moira would smile and agree to behave appropriately, but her 
annoying disruptions persisted. Suspecting that Moira disrupted 
the class to get his attention, the teacher decided he would offer 
Moira an alternative activity, but he would no longer ask her 
to calm down, tell her to return to her seat, explain matters, or 
describe better behavior. After 4 days, during which her dis- 
ruptions produced no teacher attention, or attention from her 
classmates, Moira’s disruptions decreased to zero. 


Procedural and Functional Forms of Extinction 


Vollmer and Athens (2011) noted, “There appears to have 
been a strong presumption in the early applied behavior ana- 
lysis literature that problem behavior is always maintained 
by attention” (p. 319). As a result, many behavior analysts 
and practitioners relied on a procedural form of extinction 
(i.e., just ignore it, and it will go away) as treatment for prob- 
lem behavior. Procedural extinction was used in the previous 
classroom example. The teacher suspected that Moira dis- 
rupted the class to get his attention and began ignoring her 
disruptions. In this scenario, Moira’s problem behavior was 
maintained by positive reinforcement in the form of contingent 
teacher attention (e.g., explaining why the outbursts distracted 
classmates, describing how she should behave), and when the 
teacher consistently withheld that attention, the disruptions 
decreased. However, simply defaulting to “ignore it” as a 
method for applying extinction may not withhold the rein- 
forcer maintaining the problem behavior. 

Current best practice for treating problem behavior 
begins with an assessment to determine the behavior’s function 
for the client (see Chapter 27). If extinction is indicated as a 
treatment option, its form is then matched to the function of 
the problem behavior. Had a functional assessment shown that 
Moira’s disruptive behavior was maintained by socially medi- 
ated attention, her teacher’s withholding attention following 
disruptions (i.e., not asking her to calm down and return to her 
seat, not explaining why her disruptions distracted the class, and 
not describing how she should behave) would have constituted 
a function-based application of extinction. 

If a functional assessment revealed that Moira’s out- 
bursts were maintained by escape from instructional demands 
(i.e., negative reinforcement), teacher attention would not have 
constituted an extinction condition and most likely would have 
been ineffective. Extinction for negatively reinforced behavior 
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requires the discontinuation of escape or avoidance from the 
stimulus conditions that evoke the behavior. In this case, Moira’s 
disruptions would no longer enable her to avoid the workbook 
exercise in favor of a preferred activity. 

The clarification between procedural and functional varia- 
tions of extinction has led to more effective treatments (Lerman & 
Iwata, 1996a). When the form of extinction matches the function 
of the problem behavior, the intervention is usually effective 
(Vollmer & Athens, 2011). Detailed procedures for implement- 
ing function-based extinction are described later in this chapter. 


Extinction: Misuses of a Technical Term 


With the possible exception of negative reinforcement, extinc- 
tion is the most misunderstood and misused technical term in 
applied behavior analysis. As a technical matter, extinction 
should be used only to identify (a) withholding the reinforcer for 
a previously reinforced behavior (the procedure), (b) a decreas- 
ing response rate under an extinction procedure (the process), or 
(c) the functional relation between withholding reinforcement 
for a previously reinforced behavior and the resultant diminish- 
ing rate of response (the principle). Four common misuses of 
extinction terminology are described in this section. 


Using Extinction to Refer to Any Decrease in Behavior 


A common error is using extinction to refer to any decrease in 
responding. Describing decreased responding that is due to any 
behavior change tactic or principle other than extinction (e.g., an 
abolishing operation, punishment, reinforcement for an alternative 
behavior) with a statement such as “the behavior is extinguishing” 
is conceptually and technically incorrect. Another common error 
is saying “the behavior extinguished” in reference to any reduc- 
tion in behavior that reaches a zero count of occurrence. 


Confusing Forgetting and Extinction 


A misuse of the term extinction occurs when the speaker con- 
fuses extinction with forgetting. In forgetting, a behavior is 
weakened by the passage of time during which the person has 
no opportunity to emit the behavior. In extinction, behavior is 
weakened because it does not produce reinforcement. 


Confusing Response Blocking and Sensory Extinction. 


Applied behavior analysts have used objects such as goggles, 
gloves, helmets, and wrist weights to mask the automatic posi- 
tive reinforcement received from sensory stimulation during sen- 
sory extinction. The applications of response blocking to reduce 
problem behaviors appear similar to sensory extinction. Response 
blocking, however, is not an extinction procedure. With all extinc- 
tion procedures, including sensory consequences, the persons can 
emit the problem behavior, but that behavior will not produce 
reinforcement. By contrast, response blocking prevents the occur- 
rence of the target behavior (Vollmer & Athens, 2011). 


Confusing Noncontingent Reinforcement 
and Extinction 


Noncontingent reinforcement (NCR) is another tactic for 
decreasing behavior that is sometimes confused with extinction. 
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With NCR, stimuli with known reinforcing properties are deliv- 
ered on a fixed-time or variable-time schedule independent of 
responding (see Chapter 26). NCR does not entail withholding 
the reinforcer that maintains the problem behavior. 

NCR is an important and effective intervention for dimin- 
ishing problem behavior, but NCR operates on behavior in a 
different way than does extinction. Although both procedures 
produce a decrease in behavior, the behavioral effects result 
from different controlling variables. Extinction diminishes 
behavior by eliminating the reinforcing consequence; NCR 
diminishes behavior by creating an abolishing operation (see 
Chapter 16). 

Misusing extinction terminology may be more harmful 
than a conceptual or technical language error. Such mistakes can 
lead to erroneous assumptions and faulty treatment decisions. 
Box 24.1: “Extinction Semantics” provides examples of correct, 
incorrect, and questionable statements inferring extinction with 


respect to treatments intended to decrease behavior, observed 
effects, and the principles responsible for the behavior change. 


EXTINCTION PROCEDURES 


Specifically, extinction procedures take three distinct forms pe 
are linked to behavior maintained by positive reinforcement, 
negative reinforcement, and automatic reinforcement. 


Extinction of Behavior Maintained by Positive 
Reinforcement 


Behaviors maintained by positive reinforcement are placed on 
extinction when those behaviors do not produce the reinforcer. 
In a classic extinction study, Williams (1959) described the 
effects of removing positive reinforcement on the tyrant-like 
behavior of a 21-month-old boy. The child had been seriously ill 


Statement 


We used extinction to treat 
Yoshiaki’s inappropriate call-outs. 


Yoshiaki’s call-outs are 
undergoing extinction. 


Extinction decreased Yoshiaki’s 
call-outs. 


Audrey's second-language skills 
have extinguished. She has not 
spoken Italian since her last 
language class 3 years ago. 


Because a functional analysis 
suggests that Lefki’s problem 
behavior is maintained by negative 
reinforcement, | recommend escape 
extinction as treatment. When you 
present a task to Lefki and she 
begins to engage in the problem 
behavior, physically guide her 
through the task, but do not let her 
escape from the task situation. 


We have been extinguishing 
Jeremy's eye rubbing by blocking 
hand movements toward his eyes, 
and it’s working. 


BOX 24.1 


Extinction Semantics 


Meaning 


Refers to extinction as a behavior change procedure. 

The statement is correct if the procedure withheld the previous sources of reinforcement for 
Yoshiaki’s call-outs. 

The statement is incorrect if the procedure withheld presumed sources of reinforcement for 
Yoshiaki’s call-outs (e.g., “We ignored Yoshiaki’s call-outs.”). 


Refers to extinction as a behavioral process. 


The statement is correct if describing a decreasing rate of response during a treatment condi- 
tion in which previous sources of reinforcement for Yoshiaki’s call-outs are withheld. 

The statement is incorrect if describing a decreasing rate of response during a treatment 
condition in which presumed sources of reinforcement for Yoshiaki’s call-outs are withheld. 
If that is the case, the decreased rate of responding is due to other factors. 


Refers to extinction as a principle of behavior. 


The statement is correct only if an experimental analysis demonstrated a functional relation 
between the extinction procedure and the decreased occurrence of the behavior. 


Attributing this lack of behavior to extinction is correct only if Audrey's attempts to speak 
Italian since her language class produced no reinforcement. 


If Audrey's Italian was weakened by the passage of time during which she had no 
opportunities to speak the language, then the statement confused extinction with forgetting. 


This statement correctly identifies escape extinction as the function-based form of extinction 
and describes a procedure for implementing it. 


This statement probably confuses response blocking with extinction. If Jeremy's eye rubbing 
was maintained by automatic reinforcement in the form of sensory stimulation, an extinc- 
tion procedure would allow Jeremy to rub his eyes, but the responses would not produce 
the reinforcing stimulation. 


for the first 18 months of his life but had completely recovered 
when the study began. The boy demanded special attention from 
his parents, especially at bedtime, and responded with temper 
tantrums (e.g., screaming, fussing, crying) when his parents did 
not provide the attention. A parent spent !/, to 2 hours each 
bedtime waiting in the bedroom until the boy went to sleep. 

Although Williams did not speculate about how the tan- 
trums developed, a likely explanation is not difficult to imagine. 
Because the child had been seriously ill for much of his first 
18 months, crying had been his signal that he was experiencing 
discomfort or pain or needed help. In effect, his crying may 
have been reinforced by his parents’ attention. Crying became 
a high-frequency behavior during the extended illness and con- 
tinued after the child’s health improved. The parents realized 
eventually that their child cried at bedtime to gain their attention 
and tried to ignore the crying. The crying increased in intensity 
and emotion when they did not stay in the room. As the days 
and weeks went by, the child’s demands for attention at bed- 
time grew worse. The parents decided again not to stay with the 
child at bedtime; but the intensity of the crying increased, and 
some new tantrum behaviors occurred. The parents returned to 
the room and, by giving in to the tantrums, taught their child to 
behave like a tyrant. 

After 3 months of the tyrant-like behavior, the parents 
decided they must do something about their child’s tantrums. It 
appeared that parental attention maintained the tantrums; there- 
fore, they planned to apply an extinction procedure. The parents, 
in a leisurely and relaxed manner, put the child to bed, left the 
bedroom, and closed the door. The parents recorded the duration 
of screaming and crying from the moment they closed the door. 

The solid data path in Figure 24.1 shows the duration of 
tantrums for the 10 days following the first extinction interven- 
tion. The child had a tantrum for 45 minutes the first time he was 
put to bed without his parents staying in the room. The tantrums 
gradually decreased until the 10th session, when the child “no 
longer whimpered, fussed, or cried when the parents left the 


—— First Extinction 
——-— Second Extinction 


Duration of Crying in Minutes 


(ses Gua a 9) it 
Times Child Put to Bed 
Figure 24.1 Two extinction series showing duration of 
crying as a function of being put to bed. 


Based on “The Elimination of Tantrum Behavior by Extinction Procedures” 
by C. D. Williams, 1959, Journal of Abnormal and Social Psychology, 59, 
p. 269. Copyright 1959 by the American Psychological Association. 
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room. Rather, he smiled as they left. The parents reported that 
their child made happy sounds until he dropped off to sleep” 
(Williams, 1959, p. 269). 

No tantrums occurred for approximately | week, but they 
resumed after his aunt put him to bed. When the child started 
to tantrum, the aunt returned to the bedroom and stayed with 
the child until he went to sleep. Tantrums then returned to the 
previous high level and had to be decreased a second time. 

The dashed data path in Figure 24.1 during the second 
extinction intervention following the aunt’s intervention. The 
data curve is similar to that of the first removal of parent atten- 
tion. The duration of the tantrums was somewhat greater during 
the second removal, but reached zero by the ninth session. 
Williams reported that the child had no further bedtime tantrums 
during 2 years of follow-up. 


Extinction of Behavior Maintained 
by Negative Reinforcement 


Behaviors maintained by negative reinforcement are placed on 
extinction (also called escape extinction) when those behaviors 
do not produce a removal of the aversive stimulus, meaning that 
the person cannot escape from the aversive situation. 

Anderson and Long (2002) provided treatment to reduce 
the problem behaviors of Drew, an 8-year-old boy with autism 
and moderate to severe intellectual disabilities. Drew’s prob- 
lem behaviors consisted of self-injurious behavior aggression, 
and disruptions. Results of a functional behavior assessment 
led Anderson and Long to hypothesize that escape from task 
situations maintained the problem behaviors. Based on this 
hypothesis, the speech therapist used escape extinction to 
diminish the problem behaviors that occurred while Drew 
worked on match-to-sample and receptive language tasks 
(these tasks evoked the highest rates of problem behavior). 
The speech therapist provided instructional prompts during 
the tasks. When Drew emitted problem behaviors following 
the instructional prompt, the speech therapist physically guided 
him to complete the task. Escape extinction produced a sig- 
nificant reduction in problem behaviors during the match-to- 
sample and receptive language tasks. Figure 24.2 shows the 
number of problem behaviors emitted per minute during base- 
line (i.e., escape) and escape extinction conditions. 

A study by Dawson and colleagues (2003) provides 
another excellent example of escape extinction. They used 
escape extinction to help Mary, a 3-year-old whose total food 
refusal resulted in her admission to an outpatient program for 
the treatment of severe behavior problems. Her medical history 
included gastroesophageal reflux, delayed gastric emptying, and 
gastrostomy tube dependence, among other medical issues. Her 
food refusal included head turning when presented with a bite of 
food, putting her hand on the spoon or therapist’s hand or arm, 
and using her hands or bib to cover her face. 

The escape extinction procedure followed 12 sessions of 
food refusal. If Mary refused the bite of food, the therapist held 
the spoon to Mary’s mouth until she took the bite. When Mary 
expelled the bite, the food was re-presented until swallowed. 
With this procedure, Mary’s refusals did not produce an escape 
from the food. The therapist terminated the session after Mary 


632 Part9 * Decreasing Behavior with Nonpunishment Procedures 


Figure 24.2 Rate of problem behaviors by Drew during 
baseline and escape extinction conditions. 


Based on “Use of a Structured Descriptive Assessment Methodology to 
Identify Variables Affecting Problem Behavior” by C. M. Anderson & 

E. S. Long, 2002, Journal of Applied Behavior Analysis, 35(2), p. 152. 
Copyright 2002 by Society for the Experimental Analysis of Behavior. 


had swallowed 12 bites of food. Mary accepted no food during 
the 12 baseline sessions; but after 2 sessions of escape extinc- 
tion, Mary’s food acceptance increased to 100% compliance. 


Extinction of Behavior Maintained 
by Automatic Reinforcement 


Some behaviors produce natural sensory consequences that 
reinforce the behaviors that produce them. Rincover (1981) 
described a naturally occurring sensory reinforcer as a stimulus 
that “sounds good, looks good, tastes good, smells good, feels 
good to the touch, or the movement itself is good” (p. 1). Behav- 
iors maintained by automatic positive reinforcement are placed 
on extinction (also called sensory extinction) by masking or 
removing the sensory consequence (Vollmer & Athens, 2011). 

Automatic reinforcement can maintain self-injurious behav- 
iors and persistent, nonpurposeful, repetitive self-stimulatory 
behaviors (e.g., flipping fingers, head rocking, toe walking, hair 
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Based on “Functional Analysis and Treatment 
of Hair Twirling in a Young Child” by 20- 
C. M. Deaver, R. G. Miltenberger, and 
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Behavior Analysis, 34, p. 537. Copyright 2001 
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Problem Behavior 


Mittens 


6 Baseline Escape Extinction Baseline Escape Extinction 
: : : 
vu ' ' ' 
ee 
= t ' ' 
=3 
2 : : : 
u | 
0+ +1 — > 
2 AT Coors 10 sidemlain Uhr nloweeenerae 24) 20 
Sessions 


pulling, fondling body parts). Kennedy and Sousa (1995) reported 
that a 19-year-old man with profound disabilities had poked his 
eyes for 12 years, which resulted in visual impairment in both 
eyes. Eye poking was thought to serve a sensory stimulation 
function because it occurred most frequently when he was alone. 
When Kennedy and Sousa used goggles to mask contact with his 
eyes, eye-poking behavior decreased markedly. 

Deaver, Miltenberger, and Stricker (2001) used extinction 
to decrease hair twirling. Hair twirling is a frequent precursor to 
hair pulling, a serious self-injury. Tina, at 2 years and 5 months of 
age, received treatment for hair twirling and pulling. A functional 
analysis documented that Tina did not twirl or pull hair to obtain 
attention, and that hair twirling most often occurred when Tina 
was alone at bedtime. The sensory extinction procedure consisted 
of Tina wearing thin cotton mittens on both hands at naptime 
during day care and during bedtime at home. Figure 24.3 shows 
that sensory extinction diminished hair twirling to near zero levels 
at home and at day care. 
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Rincover, Cook, Peoples, and Packard (1979) and 
Rincover (1981) provided the following examples of applying 
extinction with automatic reinforcement: 


1. A child persisted in flipping a light switch on and off. The 


visual sensory consequence was removed by disconnect- 
ing the switch. 


2. A child persistently scratched his body until it bled. The 
tactile (touch) sensory consequence was removed by put- 
ting a thin rubber glove on his hand so that he could not 
feel his skin. Later, the glove was faded by gradually cut- 
ting off portions of it. 


3. A child would throw up and then eat the vomit. The gusta- 
tory (taste) extinction procedure consisted of adding lima 
beans to the vomit. The child did not like lima beans; 
therefore, the vomit did not taste as good, and the positive 
sensory consequence was masked. 


4. A child received kinesthetic stimulation (i.e., stimulation 
of muscles, tendons, and joints) by holding his arms out 
to his side and incessantly flapping his fingers, wrists, and 
arms. The extinction procedure consisted of taping a small 
vibratory mechanism on the back of his hand to mask the 
kinesthetic stimulation. 


5. A child produced auditory stimulation by persistently 
twirling an object, such as a plate, on a table. Carpet- 
ing the surface of the table he used for spinning objects 
masked the auditory stimulation from his plate spinning. 


SECONDARY EFFECTS OF EXTINCTION 


When a previously reinforced behavior is emitted but not fol- 
lowed by the usual reinforcing consequences, the occurrence of 
that behavior will decrease to its pre-reinforcement level or stop 
entirely. However, numerous unreinforced responses can occur 
during an extinction procedure. The decrease in rate of response 
during extinction tends to be sporadic, with a gradual increase 
in pauses between responses. 

In addition to extinction’s primary effect of decreasing 
response rate, behaviors undergoing extinction often show other 
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predictable characteristic effects, which can make the extinction 
procedure difficult for practitioners to apply: (a) extinction burst, 
(b) response variation, (c) initial increase in response magnitude, 
(d) spontaneous recovery, (e) resurgence, and (f) emotional out- 
bursts and aggression. (Lattal, St. Peter Pipkin, & Escobar, 2013; 
Lerman & Iwata, 1995, 1996a; Murphy & Lupfer, 2014; Vollmer 
& Athens, 2011). These extinction effects have strong generality 
across species, response classes, and settings. Although applied 
behavior analysts almost always apply extinction as part of a 
treatment package that includes components that limit or prevent 
these extinction effects, practitioners must be aware of environ- 
mental conditions and competing contingencies that may exac- 
erbate unwanted effects. 


Extinction Burst 


A common effect of the extinction procedure is an immediate 
increase in the rate of the response after removing the positive, 
negative, or automatic reinforcement. The behavioral literature 
uses the term extinction burst to identify this initial increase in 
rate of response. Figure 24.4 presents an illustration of an extinc- 
tion burst. Operationally, Lerman, Iwata, and Wallace (1999) 
defined extinction burst as “an increase in responding during any 
of the first three treatment sessions above that observed during 
all of the last five baseline sessions or all of baseline” (p. 3). The 
extinction burst is well documented in basic research, but not 
well documented in applied research (Lerman & Iwata, 1995, 
1996a). When reported, the bursts have occurred for only a few 
sessions without notable problems. 

Goh and Iwata (1994) provided data showing an extinc- 
tion burst. Steve was a 40-year-old man with profound devel- 
opmental disabilities. He had been referred for evaluation for 
self-injury (head banging, head hitting). A functional analysis 
showed that self-injury was reinforced by escape from instruc- 
tions. Goh and Iwata used extinction for the treatment of Steve’s 
self-injury. The top panel in Figure 24.5 shows extinction bursts 
occurring with the onset of each of the two extinction phases. 

Problem behaviors can worsen during extinction before 
they show improvement. For example, practitioners should 
anticipate an initial increase in disruption during extinction. 


Figure 24.4 Hypothetical data illustrating an 
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extinction burst and spontaneous recovery. 
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“Behavioral Persistence and Variability during 
Extinction of Self-Injury Maintained by Escape”, 
by H.-L. Goh and B. A. Iwata, 1994, Reproduced 
with permission of John Wiley & Sons Inc. 


Thereafter, problem behaviors should begin to decrease and 
should return to their pre-reinforcement level. Applied behavior 
analysts need to anticipate extinction bursts and be prepared 
to consistently withhold the reinforcing consequence. Extinc- 
tion bursts usually suggest that the reinforcer (or reinforcers) 
maintaining the problem behavior was (or were) successfully 
identified, indicating that there is a good chance of an effective 
intervention. 


Response Variation 


Diverse and novel forms of behavior are sometimes observed during 
the extinction process (Kinloch, Foster, & McEwan, 2009; Peleg, 
Martin, & Holth, 2017). Commenting on this extinction-induced 
variability, Vollmer and Athens (2011) noted: 


From a layperson’s perspective, it appears that the individual 
is “trying” to find new or at least other ways of obtaining 
reinforcement. For example, if a child asks politely for a 
reinforcer and the request is denied, he or she may begin to 
whine or attempt to “steal” the item. (p. 323) 


Vollmer and Athens (2011) also raised an important consid- 
eration for applied behavior analysts. Although novel behay- 
iors induced by extinction are often undesirable (e.g., whining, 
stealing, hitting), other novel behaviors can be desirable and 
appropriate. The behavior analyst can reinforce the appropriate 
novel behavior as an alternative behavior. 


Initial Increase in Response Magnitude 


An increase in response magnitude also may occur during the 
early stages of extinction. Parents who begin to ignore a child’s 
bedtime tantrums might experience an increase in the loud- 
ness of screaming and the force of kicking before the tantrums 
begin to diminish. The extinction procedure is often difficult 
for teachers and parents to apply if there is an initial increase in 
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response magnitude. For example, parents may be unwilling to 
ignore tantrum behavior for a sufficient amount of time because 
tantrums are aversive to parents and disruptive to other fam- 
ily members or siblings trying to go to sleep. Rolider and Van 
Houten (1984) presented a tactic for this practical problem. They 
suggested teaching parents to ignore gradually increasing dura- 
tions of bedtime crying. They used baseline data to assess how 
long the parents could ignore bedtime crying comfortably before 
attending to their child. Then, the parents gradually increased 
their duration of ignoring. Every 2 days they waited an addi- 
tional 5 minutes before attending to the child until a sufficient 
total duration of time was achieved. 


Spontaneous Recovery 


A behavior will typically continue a decreasing trend during 
extinction until it reaches a pre-reinforced level or ultimately 
ceases. However, a phenomenon commonly associated with the 
extinction process is the reappearance of the behavior after it has 
diminished to its pre-reinforcement level or stopped entirely. 
Basic researchers commonly report this extinction effect and 
call it spontaneous recovery (Rescorla, 2004). With sponta- 
neous recovery, the behavior that diminished during the extinc- 
tion process recurs even though the behavior does not produce 
reinforcement. Spontaneous recovery is short-lived and limited 
if the extinction procedure remains in effect (see Figure 24.4). 
Therapists and teachers need to know about spontaneous recov- 
ery, or they might conclude erroneously that the extinction 
procedure is no longer effective. 


Resurgence 


As Winterbauer, Lucke, and Bouton (2013) state: “An operant 
behavior that has undergone extinction can return (‘resurge’ ) 
when a second operant that has replaced it itself undergoes 
extinction” (p. 60). Resurgence refers to the reoccurrence of 
a previously reinforced behavior when reinforcement for an 


alternative behavior is terminated or decreased and to the three- 
phase procedure that produces the effect: (1) A target behavior 
is reinforced, (2) the target behavior is placed on extinction and 
reinforcement provided for an alternative behavior, and (3) both 
responses are placed on extinction (Lattal & St. Peter Pipkin, 
2009; Lieving & Lattal, 2003). During the final phase, the pre- 
viously reinforced target behavior re-emerges, even though it 
remains under extinction condition. 

The three-phase resurgence sequence might play out in an 
applied scenario as follows. (1) A student’s rude remarks to other 
children on the playground produce reinforcement in the form 
of teacher attention. (2) The teachers supervising recess ignore 
the rude remarks (i.e., place the problem behavior on extinction) 
and attend to and praise all instances of appropriate interactions 
by the student (i.e., reinforce an alternative behavior). The 
intervention of extinction plus differential reinforcement of 
alternative behavior produces the desired effect: Occurrences of 
the student’s inappropriate playground behavior decrease to zero 
and appropriate interactions increase. (3) Believing they have 
solved the problem, the teachers provide less and less attention 
to the alternative behavior. The resurgence of rude remarks by 
the student surprises the teachers. 

Resurgence is not necessarily a negative phenomenon. 
Desirable resurgence occurs when a previously reinforced 
adaptive behavior recurs following an extinction period and 
during a period when other behaviors in the repertoire are 
unsuccessful (Epstein, 1991; Hoffman & Falcomata, 2014). 
Lattal and St. Peter Pipkin (2009) described a scenario in 
which the desirable resurgence of appropriate academic behav- 
ior might occur. 


Consider a situation in which a student taking an examina- 
tion suddenly remembers information related to a topic. 
This could develop as follows. Assume that a teacher 
praises the use of a particular strategy to solve problems 
(similar to the reinforcement phase), but that this strategy 
is unsuccessful during subsequent lessons that involve 
structurally different problems (similar to the alternative 
reinforcement phase). With neither response now rein- 
forced (extinction; for example, during the examination, 
when immediate praise would not be available for engaging 
in either response), the previously effective strategy may 
resurge. (pp. 255-256) 


Although resurgence is often temporary, especially 
if those responses are not reinforced and some instances 
of the alternative behavior contact reinforcement (Wacker 
et al., 2010), it can also be a robust effect (Winterbauer 
et al., 2013). Basic and translational research has identified 
numerous factors that impact resurgence, including the target 
behavior’s original reinforcement history prior to extinction 
(Winterbauer & Bouton, 2010), schedules of reinforcement 
for the alternative behavior (Winterbauer & Bouton, ZOU), 
reinforcing the alternative behavior in a different context 
from that in which the problem behavior had previously been 
reinforced (Mace et al., 2010), reinforcing multiple alterna- 
tive behaviors (Bloom & Lambert, 2015), response effort 
(Wilson, Glassford, & Koerkenmeier, 2016), the presence 
of discriminative stimuli during the resurgence test (King & 
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Hayes, 2016), and lapses in treatment integrity (Fuhrman, 
Fisher, & Greer, 2016). To learn more about resurgence and 
its relevance for applied behavior analysts, see Lattal and 
St. Peter Pipkin (2009), St. Peter Pipkin (2015), and Kestner 
and Peterson (2017). 


Emotional Outbursts and Aggression 


Placing a behavior on extinction may occasion an increase in 
intensity or force of the response, and may evoke other emotional 
or aggressive behaviors (Vollmer & Athens, 2011). Common 
emotional outbursts and aggressive behaviors include escape, 
rage, prolonged whining, and crying. Other examples for the 
emotional outbursts and aggression may include behaviors such 
as escape behavior, rage, and crying. 


VARIABLES AFFECTING RESISTANCE 
TO EXTINCTION 


Behavior analysts refer to continued responding during an 
extinction procedure as resistance to extinction. Resistance to 
extinction is a relative concept. Behavior that continues to occur 
during extinction is said to have greater resistance to extinc- 
tion than behavior that diminishes quickly. Three measures of 
resistance to extinction appear in the behavior analysis literature: 
(a) declining rate of response, (b) total count of responses 
emitted before responding ceases or attains some final low level, 
and (c) duration of time required for the behavior to reach a 
predetermined criterion (Lerman, Iwata, Shore, & Kahng, 1996; 
Reynolds, 1968). 


Continuous and Intermittent Reinforcement 


Continuous and intermittent schedules of reinforcement and 
their effects are described in Chapter 13. We can make three 
tentative statements about resistance to extinction as it relates 
to continuous and intermittent reinforcement. (a) Intermit- 
tent reinforcement (INT) may produce behavior with greater 
resistance to extinction than behavior previously reinforced 
by continuous reinforcement (CRF) (Keller & Schoenfeld, 
1950). (b) Some intermittent schedules may produce more 
persistent responding during extinction than others (Ferster & 
Skinner, 1957). For example, variable schedules of reinforce- 
ment (variable ratio [VR], variable interval [VI]) may yield 
behavior more resistant to extinction than fixed schedules 
(fixed ratio [FR], fixed interval [FI]). (c) To a degree, the thin- 
ner the schedule of reinforcement, the greater the resistance 
to extinction. 

Applied behavior analysts and practitioners should con- 
sider these statements as tentative guidelines, as researchers 
continue to discover schedule factors that influence extinction 
behavior. For example, recent density of reinforcement may pre- 
dict continued responding during extinction (e.g., Lerman et al., 
1996; Mace et al., 2011). MacDonald, Ahearn, Parry-Cruwys, 
and Bancroft (2013) found that four boys with autism persisted 
in problem behavior more so in 5-min extinction sessions that 
followed CRF than in extinction sessions immediately following 
INT reinforcement. Lerman et al. (1996) found that extinction 
bursts were more likely following CRF than INT. 
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Motivating Operations 


All stimuli that function as reinforcers require a minimum level 
of an establishing operation (i.e., motivation must be present; 
see Chapter 16). The strength of the establishing operation (EO) 
above the minimum level will influence resistance to extinction. 
Basic research has shown that “resistance to extinction is greater 
when extinction is carried out under high motivation than under 
low” (Keller & Schoenfeld, 1950, p. 75). 


Number, Magnitude, and Quality 
of Reinforcement 


The number of times a behavior has produced reinforcement may 
influence resistance to extinction. A behavior with a long history 
of reinforcement may have more resistance to extinction than a 
behavior with a shorter history of reinforcement. If bedtime tan- 
trums have produced reinforcement for | year, they might have 
more resistance to extinction than tantrums that have produced 
reinforcement for 1 week. Also, the magnitude and quality of a 
reinforcer will likely influence resistance to extinction. A reinforcer 
of greater magnitude and quality might produce more resistance to 
extinction than a reinforcer with less magnitude and quality. 


Number of Previous Extinction Trials 


Successive applications of conditioning and extinction may 
influence resistance to extinction. Sometimes problem behaviors 
diminish during extinction and then are accidentally strength- 
ened with reinforcement. When this happens, applied behavior 
analysts can reapply the extinction procedure. Typically, 
behavior diminishes with fewer total responses during a reappli- 
cation of extinction. This effect is additive when the participant 
can discriminate the onset of extinction. With each successive 
application of extinction, decreases in behavior become increas- 
ingly rapid until only a single response occurs following the 
withdrawal of reinforcement. 


Response Effort 


Applied researchers have produced data on response effort and 
its effect on resistance to extinction (e.g., Lerman & Iwata, 
1996a; Wilson et al., 2016). The effort required for a response 
apparently influences its resistance to extinction. A response 
requiring greater effort diminishes more quickly during extinc- 
tion than a response requiring less effort. 


USING EXTINCTION EFFECTIVELY 


Numerous guidelines for using extinction effectively have been 
published, with most authors providing similar recommenda- 
tions. Presented below are nine guidelines for the successful 
implementation of extinction. 


Withhold All Reinforcers Maintaining the 
Problem Behavior 


A first step in using extinction effectively is to identify and 
withhold all sources of reinforcement for the target behavior. 


The effectiveness of extinction depends on the correct identi- 
fication of the consequences that maintain the problem behay- 
ior. Functional behavior assessments have improved greatly 
the application and effectiveness of using extinction in applied 
settings (Vollmer & Athens, 2011; Chapter 27). 

Applied behavior analysts collect data on antecedent 
and consequent stimuli that are temporally related to the prob- 
lem behavior and provide answers to questions such as the 
following: 


1. Does the problem behavior occur more frequently when 
something happens in the environment to occasion the 
behavior (e.g., a demand or request)? 


2. Is the frequency of the problem behavior unrelated to 
antecedent stimuli and social consequences? 


3. Does the problem behavior occur more frequently when it 
produces attention from other persons? 


If the answer to the first question is yes, the problem behavior 
may be maintained by negative reinforcement. If the answer 
to the second question is yes, the applied behavior analyst will 
need to consider withholding tactile, auditory, visual, gustatory, 
olfactory, and kinesthetic consequences alone or in combina- 
tion. If the answer to the third question is yes, the behavior may 
be maintained by positive reinforcement in the form of social 
attention. 

The consequence that maintains the problem behay- 
ior appears obvious in some applied settings. In the 
Williams (1959) study described earlier in this chapter, paren- 
tal attention appeared to be the only source of reinforcement 
maintaining the tyrant-like behavior at bedtime. However, 
behaviors are frequently maintained by multiple sources of 
reinforcement. The class clown’s behavior might be main- 
tained by the teacher’s reaction to the disruptive behavior, 
by attention from peers, or by a combination of both. When 
Johnny’s parent brings him to preschool, he might cry to 
escape from preschool, to keep his parents with him, to 
occasion his teacher’s concern and attention, or to achieve 
some combination. of all three. When multiple sources of 
reinforcement maintain problem behavior, identifying and 
withholding one source of reinforcement may have minimal 
or no effect on behavior. If teacher and peer attention main- 
tain inappropriate classroom behavior, then withholding only 
teacher attention may produce little change in the problem 
behavior. To effectively apply the extinction procedure, the 
teacher must withhold her attention and reinforce the other 
students in the room when they ignore their classmate’s 
clownish behavior. 


Withhold Reinforcement Consistently 


When the reinforcing consequences have been identified, 
teachers must withhold them consistently. All behavior 
change procedures require consistent application, but consis- 
tency is essential for extinction; otherwise, the behavior may 
inadvertently be placed on an intermittent schedule of reinforce- 
ment, thereby making it more resistant to extinction. Teachers, 
parents, therapists, and applied behavior analysts often report 


that consistency is the single most difficult aspect in using 
extinction. The error of not withholding reinforcement consis- 
tently negates the effectiveness of the extinction procedure, and 
this point cannot be overemphasized. 


Combine Extinction with Other Procedures 


Although extinction can be an effective singular intervention, 
it is rarely recommended as such (Vollmer & Athens, 2011; 
Vollmer et al., 1998). We also recommend that applied behav- 
ior analysts always consider combining extinction with other 
treatments, especially the reinforcement of alternative behav- 
iors. Two reasons support this recommendation. First, the 
effectiveness of extinction may increase when it is combined 
with other procedures, especially positive reinforcement. 
By combining extinction with differential reinforcement of 
appropriate behaviors, for example, the applied behavior ana- 
lyst alters the environment by reinforcing appropriate alterna- 
tive behaviors and placing problem behaviors on extinction. 
During intervention, the extinction procedure should not 
reduce the overall number of positive consequences received 
by a participant. Second, differential reinforcement and anteced- 
ent procedures may reduce extinction bursts and aggression 
(Lerman et al., 1999). 

Rehfeldt and Chambers (2003) combined extinction 
with differential reinforcement for the treatment of verbal per- 
severation maintained by positive reinforcement in the form 
of social attention. The participant was an adult with autism. 
Rehfeldt and Chambers presented social attention contingent 
on appropriate and non-perseveration speech, and applied 
attention extinction by not responding to the participant’s inap- 
propriate verbal behavior. This combined treatment produced 
an increase in appropriate verbal responses and a decrease in 
perseveration. These data suggest that contingencies of rein- 
forcement might maintain the unusual speech of some individ- 
uals with autism. 

The bedtime pass developed by Friman et al. (1999) as 
treatment for pediatric resistance to bedtime is another exam- 
ple of an intervention combining extinction with reinforce- 
ment for appropriate alternative behavior. Upon being put to 
bed, the child is given a small notecard exchangeable for one 
brief trip out of the bedroom for specific activity (e.g., use 
the bathroom, one more hug from Mom or Dad). The parents 
then return the child to bed without comment and ignore any 
further crying or calling out. In a systematic replication of the 
bedtime pass with four 3-year-olds, Freeman (2006) reported 
the intervention eliminated bedtime resistance without produc- 
ing extinction bursts. 


Use Instructions 


Contingencies of reinforcement affect the future frequency 
of behavior automatically. It is not necessary for people to 
know, to describe, or even to perceive that contingencies are 
affecting their behavior. However, behaviors sometime dimin- 
ish more quickly during extinction when practitioners describe 
the extinction procedure to clients. For example, teachers 
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frequently provide small-group instruction while other stu- 
dents do independent seatwork. When seatwork students ask 
questions, they interrupt instruction. Many teachers correct 
this problem by placing question asking on extinction. They 
simply ignore student questions until after the end of small- 
group instruction. This tactic is often effective. The extinction 
procedure, however, tends to be more effective when teachers 
inform the class that they will ignore all student questions until 
after the end of small-group instruction. 


Plan for Extinction-produced Aggression 


Behaviors that occurred infrequently in the past may become 
more prominent during extinction. These extinction-produced 
behaviors are often emotional and aggressive (Lerman et al., 
1999; Vollmer & Athens, 2011). Skinner (1953) interpreted the 
change in response topographies (e.g., side effects) as emotional 
behaviors, including the aggression that sometimes accompanies 
extinction. 

Goh and Iwata (1994) provided a convincing demon- 
stration showing how aggression-induced behaviors occurred 
when a target behavior was placed on extinction. Steve was 
a 40-year-old man with profound intellectual disabilities. He 
had been referred for evaluation for self-injury (head banging, 
head hitting). A functional analysis showed that self-injury was 
reinforced by escape from instructions. Goh and Iwata used 
extinction for the treatment of Steve’s self-injury. Steve rarely 
slapped or kicked other persons (i.e., aggression) during the 
two baseline phases, but aggression increased with the onset 
of the two extinction phases (see Figure 24.5, bottom panel). 
Aggression essentially stopped by the end of each extinction 
phase when self-injury was stable and low, even though Steve’s 
aggression remained untreated during the baseline and extinc- 
tion conditions. 

Applied behavior analysts need to manage aggressive 
behaviors when they occur as side effects of extinction. It is 
critical that extinction-produced aggression not produce rein- 
forcement. Frequently, parents, teachers, and therapists react to 
the aggression with attention that may function as reinforcement 
of the extinction-produced aggression. For instance, a teacher 
decides to ignore Moira’s questions while providing small-group 
instruction. When Moira interrupts the teacher with a question, 
the teacher does not respond. Moira then starts disrupting other 
seatwork students. To quiet Moira down, the teacher responds, 
“Oh, all right, Moira, what did you want to know?” In effect, 
the teacher’s behavior may have reinforced Moira’s interruptions 
during small-group instruction and the accompanying inappro- 
priate disruptions of the other seatwork students. 

At times, extinction-produced aggression takes the 
form of verbal abuse. Often, teachers and parents do not 
need to react to it. If extinction-produced aggression results 
in reinforcement, individuals simply use other inappropriate 
behavior, such as verbal abuse, to produce reinforcement. How- 
ever, teachers and parents cannot and should not ignore some 
forms of aggression and self-injurious behavior. They need to 
know (a) that they can ignore some aggression, (b) when they 
need to intervene on the aggression, and (c) what they will do 
for the intervention. 
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Increase the Number of Extinction Trials 


An extinction trial occurs each time the behavior does not pro- 
duce reinforcement. Whenever possible, applied behavior ana- 
lysts should increase the number of extinction trials for the 
problem behaviors. Increasing the number of trials improves 
efficiency by accelerating the extinction process (i.e., the cumu- 
lative series of instances in which the behavior—controlling 
variable—primary effect sequence occurs). Applied behavior ana- 
lysts can increase extinction trials when increased occurrences of 
the problem behavior can be tolerated during intervention. For 
example, Keith’s parents used an extinction procedure to reduce 
his tantrums. His parents noticed that Keith had tantrums most 
often when he did not get his way about staying up late, eating a 
snack, and going outside. For the purpose of the program, they 
decided to set up several additional situations each day in which 
Keith did not get his way. Keith was then more likely to emit the 
inappropriate behavior more often, thereby giving his parents 
more occasions to ignore it. As a result, his tantrums decreased 
in a shorter period than they would have if left at their usual 
occurrence. 


Include Significant Others in Extinction 


It is important that other persons in the environment not rein- 
force undesirable behavior. A teacher, for example, needs to 
share extinction plans with other people who might help in the 
classroom—parent volunteers, grandparents, music teachers, 
speech therapists, industrial arts specialists—to avoid their rein- 
forcing inappropriate behaviors. For extinction to be maximally 
effective, meaning having few, if any, instances of extinction 
bursts, significant increases in response magnitude, and so forth, 
all persons in contact with the learner must withhold reinforce- 
ment for the problem behavior. 


Maintain Extinction-decreased Behavior 


Applied behavior analysts leave the extinction procedure in 
effect permanently for maintaining the extinction-diminished 
behavior. A permanent application of escape extinction and 
attention extinction is a preferred procedure. Applied behavior 
analysts can use a permanent application also with some sen- 
sory extinction procedures. For example, the carpeting on a table 
used for spinning objects could remain in place indefinitely. 
Some applications of sensory extinction appear inappropriate 
and inconvenient if kept permanently in effect. For example, 
requiring Tina to permanently wear thin cotton mittens on both 
hands at naptime during day care and during bedtime at home 
appears inappropriate and inconvenient (see Figure 24.3). In 
such a case, the applied behavior analyst can maintain treat- 
ment gains by gradually fading out the sensory extinction 
procedure—for example, cutting off 1 inch of the glove palm 
every 3 to 4 days until the glove palm is removed. The analyst 
may then remove the fingers and thumb one at a time to gradu- 
ally remove the mittens. 


Guard Against Unintentional Extinction 


Desirable behaviors are often unintentionally placed on extinc- 
tion. A beginning teacher confronted with one student who is 
on task and many other students who are not will probably 
direct most of his attention to the majority and will provide lit- 
tle or no attention to the student who is working. It is common 
practice to give the most attention to problems (“the squeaky 
wheel gets the grease’”’) and to ignore situations that are going 
smoothly. However, behaviors must continue to be reinforced 
if they are to be maintained. Teachers must be vigilant to “catch 
them being good” and deliver reinforcement accordingly. 


WHEN NOT TO USE EXTINCTION 


Although most interventions designed to decrease behavior 
will include withholding reinforcement for the target behav- 
ior, we envision at least four circumstances when applied 
behavior analysts should not use extinction as a singular 
intervention. 


The Behavior Is Harmful 


Some behaviors are so self-injurious, harmful to others, or 
destructive to property that they must be controlled with the 
most rapid and humane procedure available. Extinction as a sin- 
gular intervention is not recommended in such situations. 


All Sources of Reinforcement Cannot 
Be Withheld 


Practitioners and significant others may discontinue reinforce- 
ment for previously reinforced behavior at every instance. If, 
however, the target behavior contacts other sources of reinforce- 
ment, the extinction procedure is doomed at the outset. A classic 
case in point is the previous scenario of the class clown obtain- 
ing attention from classmates while his teacher dutifully ignores 
his antics. 


A Rapid Reduction in Response 
Rate Is Required 


Practitioners often do not have the luxury of time to change 
behavior gradually. A vocational student who talks back to 
supervisors On a community job site may get fired before an 
intervention relying on extinction eliminates the problem 
behavior. 


Others Are Likely to Imitate the Problem Behavior 


Some behaviors can be tolerated if only one person emits them 
for a while. Those same behaviors may be intolerable if several 
persons begin emitting them. An extinction-only intervention is 
ill advised if others are likely to imitate the behavior placed on 
extinction (“Hey, if Thad can toss papers, turn over desks, or 
scream expletives, so can we!’’), 
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Defined 9. Behaviors maintained by automatic negative reinforcement 


1. Withholding reinforcement for a previously reinforced ae EAGROR euncHon RYCROL TEM OvARES REARS BNO: 
behavior decreases the rate of that behavior: a procedure, apiaeier cls eset cry seneron. 


process, and principle known as extinction. 
Secondary Effects of Extinction 


As a behavior change procedure, extinction provides 


zero probability of reinforcement. 10. Behaviors undergoing extinction are usually asso- 
* As a behavioral process, extinction refers to the dimin- ciated with effects in occurrence and topography of 
ishing rate of a previously reinforced behavior when that TESPONSe: 
behavior no longer produc infor inction i 
oe g | p es reinforcement. 11. Common effects of extinction include (a) extinction burst, 
° As a principle, extinction refers to the functional rela- (b) response variation, (c) initial increase in response 
tion between withholding reinforcement for a previously magnitude, (d) spontaneous recovery, (e) resurgence, 
reinforced behavior and the resultant diminishing rate of and (f) emotional outbursts and aggression. 
response. 
2. Functional behavior assessments (see Chapter 27) enable Variables Affecting Resistance to Extinction 


behavior analysts to distinguish clearly between the proce- 12. Behavior that continues to occur during extinction is said 


dural pratt of extinction (1.e., most commonly—*Just to have greater resistance to extinction than behavior that 

o 29 a B= . ‘he te . E 5 . . . 
FURS : aly uv will GOS ), and the functional varla- diminishes quickly. Resistance to extinction is a relative 
tions of extinction (i.e., identifying the specific reinforcer concept. 


maintaining the behavior). 
sae ' 13. Intermittent schedules of reinforcement may produce 
3. Behavior analysts should use extinction only to identify behavior with greater resistance to extinction than the 


(a) withholding the reinforcer for a previously reinforced resistance produced by a continuous reinforcement 
behavior (the procedure), (b) a decreasing response rate schedule. 

under an extinction procedure (the process), or (c) the 
functional relation between withholding reinforcement 
for a previously reinforced behavior and the resultant 
diminishing rate of response (the principle). 


14. Variable schedules of reinforcement (e.g., VR, VI) may 
have more resistance to extinction than fixed schedules 
(e/2., PR: FI). 


15. To a degree, the thinner the intermittent schedule of 
reinforcement is, the greater the resistance to extinc- 
tion will be. 


4. Extinction (a) does not refer to any decrease in 
behavior, (b) is not the same as forgetting, (c) is 
not response blocking, and (d) is not noncontingent 


reinforcement. 16. Resistance to extinction is likely to increase with the 
strength of the establishing operation (EO) for the rein- 
Extinction Procedures forcer being withheld. 
5. Procedures for extinction take three distinct forms that are 17. The number, magnitude, and quality of the reinforcer may 
linked to behavior maintained by positive reinforcement, affect resistance to extinction. 


negative reinforcement, and automatic (positive and nega- 


; : 18. Successive applications of conditioning and extinction 
tive) reinforcement. 


may influence the resistance to extinction. 


6. Behaviors maintained by positive reinforcement are placed 19. The response effort will likely influence resistance to 
on extinction when those behaviors do not produce the econ 
reinforcer. 

7, Behaviors maintained by negative reinforcement are Using Extinction Effectively 


placed on extinction (escape extinction) when those 
behaviors do not produce a removal of the aversive stim- 
ulus, meaning that the individual cannot escape from the 
aversive situation. 


20. Ten guidelines for using extinction are as follows: 


¢ Withhold all reinforcers maintaining the problem 
behavior. 


é : ou : ¢ Withhold reinforcement consistently. 
8. Behaviors maintained by automatic positive reinforcement 


are placed on extinction (sensory extinction) by masking 
or removing the sensory consequence. ¢ Use instructions. 


¢ Combine extinction with other procedures. 
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¢ Plan for extinction-produced aggression. When Not to Use Extinction 


* Increase the number of extinction trials. 21. Extinction should not be used as a singular intervention 
in the following circumstances: when the target behavior 


* Include significant others in the intervention. 
is harmful, when all sources of reinforcement cannot be 


pe ciiard against Warplenionakes LnCtons withheld, when a rapid reduction in behavior is required, 
¢ Maintain extinction-decreased behavior. or when others are likely to imitate the problem behavior. 
escape extinction extinction burst sensory extinction 
extinction (operant) resistance to extinction spontaneous recovery 
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MULTIPLE-CHOICE QUESTIONS 


1. A procedure in which reinforcement of a previously 5. Delivering a stimulus with known reinforcing properties 
reinforced behavior is discontinued and as a result, to an individual on a fixed time or variable time 
occurrences of that behavior decrease in the future is: schedule independently of responding is known as which 
a. Punishment procedure? 

b. Extinction a. Extinction 

c. Reinforcement b. Noncontingent reinforcement (NCR) 
d. Establishing Operation c. Differential reinforcement 

Hint: (See “Definition of Extinction’) d. Punishment 


: ~ : — int: “Extinction: Mis f a Technical Term” 
2. What is the probability of reinforcement when utilizing an ETAL SOR Ts Baia ee Bp Aly eRe a 


extinction procedure? 6. What is the main effect of an extinction procedure? 
a. 0% a. Increase in behavior 

b. 15% b. No change in behavior 

c. 50% c. Behavior decreases and/or stops entirely 

d. Variable d. The effects are unknown 

Hint: (See “Definition of Extinction’) Hint: (See: “Secondary Effects of Extinction”) 

3. What form of extinction is used when the reinforcer 7. An immediate increase in frequency and/or amplitude 
(or reinforcers) maintaining a behavior is withheld? of the response after the removal of the maintaining 
a. Procedural reinforcer is called 
b. Functional a. Extinction burst 
c. Standard b. Reinforcement 
d. Respondent c. Noncontingent reinforcement 
Hint: (See “Procedural and Functional Forms of d. Punishment 
Extinction’’) Hint: (See “Extinction Burst’’) 

4. What are the common misuses of the term extinction? 8. The reappearance of a behavior after it has diminished to 
a. Confusing forgetting and extinction its pre-reinforced level and/or stopped entirely is called: 
b. Confusing response blocking and sensory extinction a. Extinction burst 
c. Confusing noncontingent reinforcement and b. Spontaneous recovery 

extinction c. Differential reinforcement 
d. All of these d. Procedural extinction 


Hint: (See “Extinction: Misuses of a Technical Term’’) Hint: (See “Spontaneous Recovery”) 
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ESSAY-TYPE QUESTIONS 
1. 
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Continued responding during the extinction process is 
known as: 


a. Resistance to extinction 
b. Extinction burst 

c. Reinforcement 

d. Spontaneous recovery 


Hint: (See “Variables Affecting Resistance to 
Extinction’’) 


Resistance to extinction is greater when carried 
out under: 


a. Low motivation 

b. High motivation 

c. No motivation 

d. None of these 

Hint: (See “Establishing Operation’) 
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Identify and briefly discuss four common misuses of 
the term extinction. 


Hint: (See “Extinction: Misuses of a Technical Term’’) 


. Compare and contrast the use of extinction with behaviors 


maintained by positive, negative, and automatic reinforcement. 
Hint: (See “Extinction of Behavior Maintained by 
Positive Reinforcement’) 

Discuss the phenomenon of spontaneous recovery. 


Hint: (See “Spontaneous Recovery”) 


The term extinction is also used with respondent conditioned reflexes (see 
Chapter 2). Presenting a conditioned stimulus (CS) again and again without 
the unconditioned stimulus until the CS no longer elicits the conditioned 
response is called respondent extinction. 


11. 
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What suggestions did the authors give for assuring the 
effectiveness of extinction? 
a. Increasing the number of extinction trials 


b. Withholding all reinforcers maintaining the problem 
behavior 


¢. Withholding reinforcement consistently 
d. Using instruction 

e. All of these 

Hint: (See “Using Extinction Effectively”) 


12. When is it not appropriate to utilize an extinction procedure? 


Sa CER EERE SSS RETO 


a. When inappropriate behaviors are likely to be imitated 
by others 


b. When behaviors are harmful to self or others 
c. Both of these 

d. None of these 

Hint: (See “When Not to Use Extinction’’) 


4. Explain the effect of continuous and intermittent 


reinforcement on behavior’s resistance to 
extinction. 


Hint: (See “Continuous and Intermittent 
Reinforcement’) 


5. List several behavioral examples that demonstrate 


9 


how response effort influences a behavior’s resistance 
to extinction. 


Hint: (See “Response Effort’) 
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Extinction linked to behaviors maintained by positive reinforcement is 
sometimes called attention extinction, especially in the functional behavior 
assessment (FBA) literature, in which reinforcement by social attention is 
one of the conditions or hypotheses explored by FBA. 
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Differential Reinforcement 


LEARNING OBJECTIVES 


¢ Define and give examples of differential reinforcement. 


° Define, give examples of, and explain how to use differential reinforcement of 


incompatible/alternative behavior. 


* Define, give examples of, and explain how to use differential reinforcement of other behavior. 
° Define, give examples of, and explain how to use differential reinforcement of low rates of 


behavior. 


Does Love or Differential Reinforcement Open Doors? 


There is a story about Queen Victoria and Prince Albert, 
which, although charming, has not been verified as histor- 
ical fact (Strachey, 1921). Whether true or not, the story 
illustrates the use of differential reinforcement. 

As the story goes, shortly after their marriage, Prince 
Albert and Queen Victoria had a quarrel. Albert walked 
(or stomped, depending on the telling) out of the room 
and locked himself in his study. Victoria ran after him 
and pounded on the door. “Who’s there?” called Albert. 
“The Queen of England, and she demands to be admitted.” 
Albert did not respond. Victoria pounded again. 

One version of the story has it that Victoria shouted, “I am 
the Queen of England, Scotland, Wales and Ireland, the 
Empress of India and of the entire British Commonwealth, 
I am the Commander-in-Chief of all the British armed 
forces, and I order you hereby to open this door!!!” Albert 
still did not respond. More furious knocking was followed 
by a pause. Then there was a gentle tap. “Who’s there?” 
Albert asked. “Your wife, Albert,” the Queen replied. 

The prince immediately opened the door. 

This story is told on various Internet sites to extol the 
virtues of love or to make the point that it is more important 
to resolve conflict through nonviolent than violent means. But 
for behavior analysts, it is a fairly simple example of Prince 
Albert using good old-fashioned differential reinforcement 
(of alternative behavior) on the Queen’s behavior, although 
we must assume her calmer, more solicitous behavior actually 
occurred more often under similar circumstances. Moreover, 
the immediate increase in the magnitude of her behavior (i.e., 
the extinction burst) suggests that such behavior had been rein- 
forced in the past, either by Albert, or, more likely, by others. 

And what about the effect of her capitulation on his 
behavior? When they quarreled in the future, was he more 
likely to storm out of the room and hole himself up in his 
study? We don’t know, but we might predict so. 

—H. D. Schlinger (2007, p. 385) 
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for decreasing or eliminating problem behaviors. Although 

interventions based primarily on extinction or punishment 
are often effective, unwanted side effects may occur. Maladaptive 
emotional behavior and higher-than-usual rates of responding are 
commonly observed when a behavior with a long and consistent 
history of reinforcement is placed on extinction. Escape, avoid- 
ance, aggression, and other forms of undesirable counter-control 
may be evoked by punishment. In addition to unwanted side effects, 
another limitation of extinction and punishment as primary methods 
for reducing problem behaviors is that neither approach strength- 
ens or teaches adaptive behaviors with which individuals can attain 
reinforcers. Beyond the possibility of unwanted side effects and 
their lack of educative value, interventions that rely on extinction 
alone or punishment in any of its forms raise important ethical and 
legal concerns. To dispel these limitations and concerns, applied 
behavior analysts have developed procedures that use differential 
reinforcement to diminish or eliminate problem behaviors. 


| ae can choose from a wide range of procedures 


DIFFERENTIAL REINFORCEMENT DEFINED 


Differential reinforcement entails reinforcing one response class 
and withholding reinforcement for another response class. When 
used as a reductive procedure for problem behavior, differential 
reinforcement consists of (a) providing reinforcement contingent on 
a behavior other than the problem behavior or the problem behavior 
occurring at a reduced rate and (b) withholding reinforcement for 
the problem behavior.' Differential reinforcement can be imple- 
mented with positive reinforcement or negative reinforcement. 
As Cowdery, Iwata, and Pace (1990) noted, 


differential reinforcement does not involve extended inter- 
ruptions of ongoing activities (e.g., time-out), contingent 
removal of positive reinforcers (e.g., response cost), or 
presentation of aversive stimuli (e.g., punishment). These 
characteristics make differential reinforcement the least 
intrusive of all behavior interventions and probably account 
for its widespread popularity. (p. 497) 


Three well-researched tactics for using differential rein- 
forcement for decreasing inappropriate behavior are differential 
reinforcement of alternative behavior (DRA), differential rein- 
forcement of other behavior (DRO), and differential reinforcement 
of low rates (DRL). This chapter defines, presents examples, and 
suggests guidelines for the effective use of each of these differen- 
tial reinforcement procedures for decreasing problem behavior. 


DIFFERENTIAL REINFORCEMENT 
OF ALTERNATIVE BEHAVIOR (DRA) 


A practitioner using differential reinforcement of alternative 
behavior (DRA) reinforces occurrences of a desirable alterna- 
tive to the problem behavior and withholds reinforcement for 
the problem behavior. DRA has the dual effects of strengthening 
the alternative behavior and weakening problem behavior (see 
Petscher, Rey, & Bailey, 2009 for a review). DRA can be con- 
ceptualized as a concurrent schedule of reinforcement in which 
two response classes receive reinforcement at different rates: 
(a) the alternative behavior on a dense schedule of high-quality 
reinforcement and (b) the problem behavior on extinction or a 
very lean schedule of reinforcement. Because the concurrent 
schedule favors the alternative behavior, the client allocates 
more responding to the alternative behavior and less respond- 
ing to the problem behavior, as described by the matching law 
(Borrero et al., 2010; see Chapter 13). 

Dixon, Benedict, and Larson (2001) used DRA to treat the 
inappropriate verbal behavior of Fernando, a 25-year-old man 
with moderate developmental disabilities and a psychotic disorder. 
Inappropriate vocalizations included utterances not related to the 
current context, sexually inappropriate remarks, illogical place- 
ment of words within a sentence, and “psychotic” statements (e.g., 
“There’s a purple moose on my head named Chuckles,” p. 362). 
The researchers defined appropriate verbal behavior as any vocal 
utterance that did not meet the defining characteristics of inappro- 
priate vocalization. A functional analysis revealed that the inap- 
propriate verbal statements were maintained by social attention. 

During the DRA condition, the researchers attended to 
Fernando’s appropriate statements with 10 seconds of comments 
relevant to his statements and ignored his inappropriate verbal 
behavior. For example, if Fernando said something about an 
activity he liked to do, the experimenter told him it was inter- 
esting and hoped he could do it again soon. Following an initial 
phase of treatment with DRA, the researchers created a com- 
parison “baseline” condition by reversing the contingencies for 
the two response classes: Inappropriate vocalizations resulted 
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in attention and appropriate verbal behavior was ignored. Data 
from the B-A-B-A-B design shows that the DRA intervention 
reduced Fernando’s inappropriate verbal behavior and increased 
appropriate utterances (Figure 25.1). 

In most published studies evaluating DRA, the alterna- 
tive behavior and the problem behavior are mutually exclusive 
response Classes whose different topographies make it impos- 
sible for the participant to emit both behaviors at the same 
time. In the Dixon et al. (2001) study, Fernando could not utter 
appropriate and inappropriate verbalizations at the same time. 
Similarly, a learner cannot simultaneously be in and out of his 
seat (Friman, 1990), politely ask “Can I have that please?” and 
scream (Schlichenmeyer, Dube, & Vargas-Irwin, 2015), comply 
with a therapist’s instruction to complete a task and destroy 
the materials needed for the task (Ringdahl et al., 2002), or 
put an inedible item in a trashcan and in her mouth (Slocum, 
Mehrkan, Peters, & Vollmer, 2017). When DRA is implemented 
by reinforcing a behavior that cannot occur simultaneously 
with the problem behavior, the procedure is sometimes called 
differential reinforcement of incompatible behavior (DRI). 

The effectiveness of DRA does not depend upon an 
incompatible alternative behavior. An alternative behavior that 
occupies the participant when he might ordinarily engage in 
the problem behavior may be sufficient. Ideally, however, the 
alternative behavior serves a greater purpose than time filler. 
Adaptive activities or skills are the most beneficial alternative 
behaviors. For example, a classroom teacher could assign two 
students who frequently argue with each other to work on a class 
project together and reinforce cooperative behaviors associated 
with project development. Working together on a class project 
is not incompatible with arguing; the two response classes could 
occur together. However, the two students are likely to argue less 
when engaged in cooperative behaviors. 

When differential reinforcement uses escape from a task 
or demand situation as a reinforcer, the procedure is sometimes 
identified as differential negative reinforcement of alternative 
behavior (DNRA) (e.g., Marcus & Vollmer, 1995; Piazza, 
Moses, & Fisher, 1996; Vollmer & Iwata, 1992). Applying 
DNRA to reduce the occurrence of a problem behavior main- 
tained by escape from a task or demand context consists of 
providing negative reinforcement of the alternative behavior in 
the form of brief periods of escape from the task and escape 
extinction for the problem behavior. 

With DNRA, positive reinforcement for the alternative 
behavior is often provided as well. For example, Lalli, Casey, 
and Kates (1995) allowed participants to escape from a task for 
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30 seconds contingent on using an alternative response 
(e.g., giving the therapist a card with “BREAK” printed on it, or 
saying “no”) as an alternative to their aberrant behavior. The 
researchers also provided praise for using the alternative 
responses. As training progressed, the participants’ escape from 
a task was contingent on using the trained verbal responses and 
completing a gradually increasing number of required steps of 
the task. The DNRA intervention increased the use of the trained 
verbal response and decreased the problem behavior.” 


Guidelines for Using DRA 


Teachers, therapists, and parents have a long history of using 
DRA in education, treatment, and everyday social interactions 
and usually find DRA the easiest of the three differential rein- 
forcement procedures to apply. Practitioners often use DRA 
as interventions for a wide range of problem behaviors. Many 
researchers and practitioners have discovered that the following 
guidelines improve the effectiveness of DRA. 


Select Alternative Behavior 


Ideally, the alternative behavior (a) already exists in the learner’s 
current repertoire; (b) requires less effort, but never more, than 
the problem behavior; (c) occurs frequently enough to provide 
sufficient opportunities for reinforcement; and (d) is likely 
to receive reinforcement in the learner’s natural environment 
after thinning the DRA schedule. Selecting alternative behav- 
iors meeting these criteria will increase the initial effectiveness 
of DRA and facilitate the maintenance and generalization of 
behavior changes after terminating the intervention. 

When possible, the practitioner should differentially 
reinforce alternative behaviors that will lead to or increase the 


learner’s opportunity to acquire useful skills. When no behav- 
iors meeting those criteria can be identified, the practitioner can 
select an alternative behavior that can be easily taught or con- 
sider using a DRO procedure as presented in the next section of 
this chapter. Figure 25.2 shows some examples of alternative 
behaviors (Webber & Scheuermann, 1991). 


Select Potent Reinforcers and Deliver 
Them Consistently 


Presenting stimulus changes that the practitioner only assumes 
function as reinforcement can limit or even negate the effec- 
tiveness of any reinforcement-based intervention. Providing 
consequences for the alternative behavior identified by stimu- 
lus preference and reinforcer assessments (see Chapter 11) and 
functional behavior assessments (see Chapter 27) will increase 
the effectiveness of DRA. In addition, reinforcers should be 
selected that are relevant to the motivating operations that 
naturally exist in the treatment setting or that can be created 
(e.g., by deprivation prior to treatment sessions). 

The consequence that maintained the problem behav- 
ior prior to intervention is often the most effective reinforcer 
for the alternative behavior. Results from indirect functional 
assessments have been used to identify reinforcers for DRA 
interventions (e.g., Durand, Crimmins, Caufield, & Taylor, 
1989; McDonough, Johnson, & Waters, 2017). Functional 
analyses are most often used to identify maintaining conse- 
quences (e.g., Athens & Vollmer, 2010; Dixon et al., 2001; 
Wilder, Masuda, O’Conner, & Baham, 2001). 

The magnitude of the reinforcer used in a differential rein- 
forcement intervention is probably less important than its con- 
sistent delivery and control. Lerman, Kelley, Vorndran, Kuhn, 
and LaRue (2002) provided positive reinforcement (access to 


Figure 25.2 Examples of alternative behaviors for common classroom behavior problems. 


Problem Behavior Alternative Behavior 


Talking back 


Positive response such as “Yes, sir” or “OK” or “| understand”; or acceptable question 


such as “May | ask you a question about that?” or “May | tell you my side?” 


Cursing 

Being off-task 
Being out of seat 
Noncompliance 


Following directions within 


Acceptable exclamations such as “darn,” “shucks.” 
Any on-task behavior: looking at book, writing, looking at teacher, etc. 
Sitting In seat (bottom on chair, with body in upright position). 


seconds (time limit will depend upon age of student); 


following directions by second time direction is given. 


Talking out 
Turning in messy papers 


Hitting, pinching, kicking, 
pushing/shoving 


Tardiness 


Self-injurious or self-stimulatory 
behaviors 


Inappropriate use of materials 


Raising hand and waiting quietly to be called on. 
No marks other than answers. 


Using verbal expression of anger; pounding fist into hand; sitting or standing next to other 
students without touching them. 


Being in seat when bell rings (or by desired time). 


Sitting with hands on desk or in lap; hands not touching any part of body; head up and 
not touching anything (desk, shoulder, etc.). 


Holding/using materials appropriately (e.g., writing only on appropriate paper, etc.). 


Adapted from “Accentuate the Positive . . . Eliminate the Negative,” by J. Webber and B. Scheuermann, 1991, Teaching Exceptional Children, 24(1) 
p. 15. Copyright 1991 by Council for Exceptional Children. Used by permission. j 


toys) and negative reinforcement (escape from demands) in 
magnitudes ranging from 20 seconds to 300 seconds. They 
found that reinforcement magnitude only affected responding 
minimally during treatment and had little effect on maintenance 
after treatment was discontinued. However, a series of experi- 
ments conducted by Athens and Vollmer (2010) “showed the 
effectiveness of DRA that provided some combination of more 
immediate, longer duration, or higher quality of reinforcement 


for appropriate behavior relative to reinforcement for problem 
behavior” (p. 586). 


Reinforce Alternative Behavior Immediately 
and Consistently 


In addition to their potential effectiveness as reinforcers, the 
stimulus changes selected by the practitioner must be ones that 
can be delivered immediately and consistently when the alter- 
native behavior occurs. A continuous schedule of reinforcement 
(CRF) should be used initially for the alternative behavior and 
then transition to an intermittent schedule. The reinforcer should 
be presented consistently and immediately following each 
occurrence of the alternative behavior. After firmly establishing 
the alternative behavior, the practitioner should gradually thin 
the reinforcement schedule. 


Withhold Reinforcement for the Problem Behavior 


The effectiveness of DRA as an intervention for problem behav- 
ior depends on the alternative behavior yielding a higher rate 
of reinforcement than the problem behavior. Maximizing the 
difference between rates of reinforcement obtained by the two 
response classes entails withholding all reinforcement for the 
problem behavior (i.e., extinction schedule). 

Ideally, the alternative behavior would always receive 
reinforcement (at least initially) and the problem behavior would 
always be on extinction. As Vollmer, Roane, Ringdahl, and 
Marcus (1999) noted, “Perfect implementation of differential 
reinforcement entails providing reinforcers as immediately as 
possible after an appropriate behavior occurs (e.g., within 5 s). 
Treatment effects may degrade as the delay to reinforcement 
increases, especially if inappropriate behavior is occasionally 
reinforced” (p. 21). However, practitioners must often imple- 
ment DRA schedules in less than optimal conditions, in which 
some occurrences of the alternative behavior do not produce 
reinforcement, and the problem behavior sometimes inadver- 
tently produces the reinforcer. 

Results from a study by Vollmer and colleagues (1999) 
suggest that even when such treatment “mistakes” occur, dif- 
ferential reinforcement may still be effective. These researchers 
compared a “full implementation” of DRA, in which 100% of 
instances of alternative behavior were reinforced (CRF) and 0% 
of instances of aberrant behavior were reinforced (extinction), 
with various levels of “partial implementation.” For example, 
only one in every four instances of appropriate behavior pro- 
duced reinforcement with a 25%/75% schedule, whereas a 
reinforcer followed three of every four instances of inappropri- 
ate behavior. As expected, full implementation of differential 
reinforcement produced the greatest effects, with inappropriate 
behavior being “virtually replaced by appropriate behavior, and 
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lower levels of implementation eventually reducing treatment 
efficacy if the schedule of reinforcement favored inappropriate 
behavior” (p. 20). Figure 25.3 shows the results for Rachel, a 
17-year-old student with profound developmental disabilities 
who engaged in self-injurious behavior (SIB; e.g., head hitting 
and hand biting) and aggression toward others (e.g., scratching, 
hitting, hair pulling). 

An important and perhaps surprising finding of the 
Vollmer and colleagues (1999) study was that partial imple- 
mentation was effective under certain conditions. During partial 
implementation, the participants’ behavior showed a “dispro- 
portional tendency toward appropriate behavior” (p. 20) if they 
had prior exposure to full implementation. The researchers con- 
cluded that this finding suggests the possibility of intentionally 
thinning implementation levels prior to extending differential 
reinforcement interventions to settings in which maintaining 
treatment fidelity will be difficult. Recognizing that treatment 
effects might erode over time, they recommended that full 
implementation booster sessions be conducted periodically to 
re-establish the predominance of appropriate behavior. 


Watch for Treatment Relapse 


A problem behavior previously reduced by DRA may occur 
more frequently when the reinforcement schedule for the alter- 
native behavior is thinned to levels that can be maintained in 
the client’s everyday environment, when treatment shifts from a 
clinical setting to the natural settings, or when new therapists and/ 
or caregivers begin working with the client. Problem behavior 
treated with DRA may ultimately prove more resistant to extinc- 
tion when the DRA contingency is weakened or compromised 
(Nevin, Tota, Torquato, & Shull, 1990). Mace and colleagues 
(2010) conducted basic and translational research experiments 
exploring the persistence-strengthening effects of DRA and a 
possible solution. 

Resurgence refers to the reoccurrence of a previously rein- 
forced behavior when reinforcement for an alternative behay- 
ior is terminated or decreased (Lattal & St. Peter Pipkin, 2009; 
Lieving & Lattal, 2003). Fuhrman, Fisher, and Greer (2016) 
described resurgence in the context of FCT “as the recurrence 
of a previously reinforced response (e.g., destructive behavior) 
when alternative reinforcement is challenged (e.g., extinction or 
schedule thinning)” (p. 884). Also, spontaneous recovery (i.e., the 
reappearance of the problem behavior after it has diminished to 
its pre-reinforcement level or stopped entirely) and response vari- 
ation (i.e., the emergence of novel behaviors during the extinction 
process) relate to resurgence (see Chapter 24). Research has iden- 
tified several factors that may influence resurgence during appli- 
cations of DRA, such as magnitude of reinforcement, thinning 
the schedules of reinforcement, response effort (e.g., Wilson, 
Glassford, & Koerkenmeier, 2016), and lapses in treatment integ- 
rity (e.g., Fuhrman et al., 2016). 

The resurgence evoked by extinction might not be the 
problem behavior; alternative behaviors can resurge as well 
(Hoffman & Falcomata, 2014). Bloom and Lambert (2015) 
recommend that clinicians reduce resurgence of the problem 
behavior by reinforcing multiple alternative behaviors. An 
analog experiment by Lambert, Bloom, Samaha, Dayton, and 
Rodewald (2015) provides some empirical support for this 
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suggestion. Response rates by three women with developmental 
disabilities on five topographically different devices (i.e., toggle 
light switch, rocker light switch, slide light switch, cord switch, 
and personal alarm button) were measured. The researchers 
arbitrarily identified one switch as the target response (i.e., 
problem behavior) and all other switches were designated as 
alternative behaviors. 

After an initial reinforcement phase of a high-prefer- 
ence edible item on a fixed ratio (FR) | schedule developed 
independent consistent responding on the target response, an 
elimination phase was implemented in which the target response 
was placed on extinction and an alternative response (or series 
of three alternative responses) was reinforced on an FR 1 
schedule. In a final resurgence phase, all responses were placed 
on extinction. The procedure used in the elimination phase to 
reinforce one alternative behavior was analogous to traditional 
DRA intervention. The researchers dubbed the procedure for 
reinforcing a series of alternative behaviors serial DRA. The 
results are encouraging: “In all cases, the percentage of total 
responding allocated toward target response resurgence was 
less in the serial DRA component than in the traditional DRA 
component . . . Our data provide preliminary evidence suggest- 
ing that serial DRA may produce more durable and desirable 
outcomes than traditional DRA” (p. 765). 


Combine DRA with Other Procedures 


Considering that DRA interventions do not specifically pro- 
vide consequences for the problem behavior, a practitioner 
will seldom apply DRA as a single intervention if the problem 
behavior is destructive, is dangerous to the learner or to oth- 
ers, or interferes with health and safety. In these situations, the 
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practitioner might combine DRA with other reductive proce- 
dures (e.g., response blocking, time-out, stimulus fading, differ- 
ential reinforcement of other behavior) to produce a more potent 
intervention (e.g., Goh, Iwata, & Kahng, 1999; Jessel & Borrero, 
2014; Patel, Piazza, Kelly, Oschsher, & Santana, 2001). 

For example, Ringdahl and colleagues (2002) compared 
the effects of DRA, with and without instructional fading, on the 
frequency of problem behaviors of an 8-year-old girl, Kristina, 
who had been diagnosed with autism and functioned in the mod- 
erate range of developmental disabilities. Kristina’s problem 
behaviors during academic tasks—destruction: throwing, tearing, 
or breaking work materials; aggression: hitting, kicking, biting; 
and self-injurious behavior: biting herself—had become so 
severe that she had been admitted to a hospital day treatment 
program. Because a previous functional analysis revealed that 
Kristina’s problem behavior was maintained by escape from 
instructions to complete tasks, the DRA procedure in both 
conditions consisted of giving Kristina a 1-min break from the 
task and access to leisure materials contingent on independent 
completion of the task (e.g., counting items, matching cards 
with the same colors or shapes) without exhibiting problem 
behavior. Kristina received instructions to complete a work task 
at approximately one instruction every other minute during the 
DRA without instructional fading condition. During the DRA 
with instructional fading, Kristina received no instructions during 
the first three sessions, then one instruction every 15 minutes. 
The number of instructions given gradually increased until it 
equaled the number in the DRA without the fading condition. 
Problem behavior was high at the outset of the DRA without the 
instructional fading condition but decreased across sessions to a 
mean of 0.2 responses per minute over the final three sessions 


(see Figure 25.4). During DRA with instructional fading, prob- 
lem behavior was low from the outset and occurred during only 
two sessions (Sessions 9 and 14). Over the final three sessions 
of DRA with instructional fading, Kristina exhibited no problem 
behavior, and the instruction rate was equal to the DRA without 
instructional fading (0.5 instruction per minute). 


DIFFERENTIAL REINFORCEMENT OF OTHER 
BEHAVIOR (DRO) 


Reynolds (1961) described the differential reinforcement of other 
behavior as “reinforcement for not responding” (p. 59). A prac- 
titioner applying differential reinforcement of other behavior 
(DRO) delivers reinforcement contingent on the problem behav- 
ior not occurring throughout intervals of time (interval DRO) or 
at specific moments of time (momentary DRO). 

Researchers have demonstrated the effectiveness of DRO 
in reducing a wide range of problem behaviors in a variety of 
subject populations, including disruptive classroom behavior by 
typically developing students (Austin, Groves, Reynish, & Francis, 
2015), tics in youth with Tourette syndrome (Capriotti et al., 
2017), stereotypic vocalizations and pica in children with autism 
(Rozenblat, Brown, Brown, Reeve, & Reeve, 2009; Slocum et al., 
2017), and self-injurious behavior by adults with severe intellectual 
disabilities (Lindberg, Iwata, Kahng, & DeLeon, 1999). Behavior 
analysts have used DRO to reduce biting, chewing, and pawing by 
horses (Fox, Bailey, Hall, & St. Peter, 2012; Fox & Belding, 2015). 

Four behavioral processes have been suggested as poten- 
tial explanations for the effects of DRO (Jessel & Ingvarsson, 
2016; Poling & Ryan, 1982). 


First, the presentation of the reinforcer could function as an 
abolishing operation (i.e., satiation) and temporarily reduce 
or eliminate target responding. Second, the disruption of the 
response-reinforcer contingency could result in extinction. 
Third, delaying access to reinforcers could suppress target 
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responding through punishment. Fourth, other behavior 
could be strengthened by adventitious reinforcement, result- 
ing in displacement of the target behavior. 

It is likely that each of these four processes could 
be operating on different occasions, or multiple processes 
could operate at the same time, depending on the DRO 
arrangement and the reinforcer used. (Jessel & Ingvarsson, 
2016, p. 991-992) 


Applied behavior analysts implement DRO with two pri- 
mary methods. With interval DRO, if the problem behavior did not 
occur during a predetermined time interval, a reinforcer is deliv- 
ered; any instance of the problem behavior during that interval 
resets the interval and postpones reinforcement.* With momentary 
DRO, absence of the problem behavior at predetermined moments 
in time produces reinforcement; instances of the target behavior 
at other points in time do not alter when the next opportunity for 
reinforcement becomes available. The combinations of an inter- 
val or momentary omission requirement implemented on either a 
fixed or a variable schedule yield four basic DRO arrangements 
shown in Figure 25.5 (Lindberg et al., 1999). 


Interval DRO 


The following sections introduce procedures for applying 
fixed-interval DRO, variable-interval DRO, fixed-momentary 
DRO, and variable-momentary DRO schedules. 


Fixed-interval DRO (FI-DRO) 


Most applications of interval DRO apply the omission require- 
ment at the end of successive time intervals of equal duration. To 
apply a fixed-interval DRO (FI-DRO) schedule, a behavior ana- 
lyst (a) establishes a standard interval of time; (b) delivers rein- 
forcement at the end of that interval if the problem did not occur 
during the interval; and (c) upon any occurrence of the problem 
behavior, immediately resets the timer to begin a new interval. 
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For example, Allen, Gottselig, and Boylan (1982) applied 
an FI-DRO procedure to decrease the disruptive classroom 
behaviors of third-grade students. They set a kitchen timer to 
5-min intervals that continued to run as long as no disruptive 
behaviors occurred. The timer was reset and a new 5-min inter- 
val began if any student engaged in disruptive behavior at any 
time during the 5-min interval. The class received | minute of 
free time when the timer signaled the end of a 5-min interval 
without disruptive behavior, which they accumulated and used 
at the end of the class period.* 

Behavior analysts can gradually thin the DRO interval 
and treatment session length as the problem behavior improves 
(Della Rosa, Fellman, DeBiase, DeQuinzio, & Taylor, 2015). 
Cowdery and colleagues (1990) used FI-DRO to help Jerry, a 
9-year-old who never attended school and spent most of his time 
hospitalized. He scratched and rubbed his skin so often and hard 
that he had open lesions all over his body. During treatment, 
the experimenter left the room and watched Jerry through an 
observation window. The researchers initially made praise and 
token reinforcement contingent on 2-min “‘scratch-free” inter- 
vals, which were determined by baseline assessment. If Jerry 
scratched during the interval, the experimenter entered the 
room and told him she regretted he had not earned a penny and 
asked him to try again. Jerry’s SIB immediately decreased to 
zero with the introduction of DRO. Jerry’s SIB remained low as 
the DRO interval and session length were gradually thinned— 
from three 2-min intervals to three 4-min intervals. Following 
a brief return to baseline conditions in which Jerry’s scratch- 
ing quickly increased, Cowdery and colleagues reinstated DRO 
with five intervals per session and increased session length in 
1-min increments. Jerry could now earn a total of 10 tokens per 
session, a token for each interval in which he did not scratch plus 
a 5-token bonus if he refrained from scratching for all five inter- 
vals. Jerry’s SIB gradually reduced to zero as session length and 
DRO intervals were extended. Session 61 consisted of a single 
15-min DRO interval, which was followed by one 25- and three 
30-min intervals (see Figure 25.6). 

Cook, Rapp, and Schulze (2015) used differential nega- 
tive reinforcement of other behavior (DNRO) with an 8-year- 
old boy with autism who exhibited problem behavior when a 
medical alert bracelet was placed on his wrist (e.g., yelling, 
demands to remove the bracelet, flopping on the floor, hitting 
others, biting his hand, pulling on and biting the bracelet). For 
a description of the DNRO procedure and graph of the results, 
see Chapter 9. 


Figure 25.5 Four basic variations of DRO that can 
be created by combining the omission requirement 
(interval or momentary) and the schedule of 
reinforcement (fixed or variable). 


Schedule 
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Schedules” by J. S. Lindberg, B. A. Iwata, S. W. Kahng, & 

|. G. DeLeon, 1999. Journal of Applied Behavior Analysis, 32, 
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Variable-interval DRO (VI-DRO) 


To apply a variable-interval DRO (VI-DRO) schedule, a 
behavior analyst (a) establishes a random sequence of varied 
intervals of time and (b) delivers reinforcement at the end of 
each interval if the problem behavior did not occur at any time 
during the interval. For example, a VI-DRO 10-sec schedule 
delivers reinforcement contingent on the omission of the target 
behavior throughout intervals of varying duration that average 
10 seconds (e.g., a random sequence of 2-sec, 8-sec, 5-sec, 
20-sec, and 15-sec intervals). 

Chiang, Iwata, and Dorsey (1979) used VI-DRO to decrease 
the inappropriate behaviors of a 10-year-old student with develop- 
mental disabilities during bus rides to and from school. The boy’s 
disruptive behaviors included aggression (e.g., slapping, poking, 
hitting, kicking), being out of seat, stereotypic behaviors, and 
inappropriate vocalizations (e.g., screaming, yelling). Instead of 
basing the DRO intervals on elapsed time, the researchers used an 
interesting procedure they described as a “distance-based” DRO 
schedule. The driver divided the bus route into sections designated 
by landmarks such as stop signs and traffic lights and mounted a 
four-digit hand counter on the dashboard of the bus. The counter 
was within reach of the driver and within view of the student. At 
each predetermined landmark, the bus driver praised the student’s 
behavior and added one point to the hand counter if no disruptive 
behaviors had been emitted during the DRO interval. Upon arrival 
at home or school, the driver recorded the number of points earned 
on a card and gave the card to the boy’s foster father or teacher. 
The student exchanged the points earned for rewards (e.g., access 
to toys, home and school privileges, small snacks). 

Chiang and colleagues used a two-tier multiple baseline 
across settings design to evaluate the VI-DRO intervention. 
During baseline, disruptive behavior ranged from 20% to 100% 
with an average of 66.2% for the afternoon rides and 0% to 92% 
with an average of 48.5% in the mornings. When implement- 
ing the VI-DRO procedure during the afternoon rides only, dis- 
ruptive behavior decreased immediately to an average of 5.1% 
(range of 0% to 40%) during the afternoon rides but remained 
unchanged during the morning rides. Presenting the intervention 
also during the morning rides eliminated all disruptive behavior. 

Progar and colleagues (2001) implemented VI-DRO as 
one component of a conjunctive schedule of reinforcement to 
reduce the aggressive behaviors of Milty, a 14-year-old boy with 
autism. A conjunctive schedule delivers reinforcement when the 
requirements of two or more schedule components have been 
met (see Chapter 13)—in this case, an FR3 schedule for task 
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completion and a VI 148-sec schedule for the absence of aggres- 
sion. Milty received an edible reinforcer each time he completed 
three components of a task (e.g., making his bed, vacuuming, 
straightening objects in his room) in the absence of aggres- 
sion for the duration of the variable interval. An occurrence of 
aggression prior to completing the FR 3 requirement reset both 
components of the conjunctive schedule. This intervention pro- 
duced a substantial reduction of aggressive behavior. 


Momentary DRO 


On a momentary DRO schedule, reinforcement is contingent 
on the absence of the problem behavior at the exact time each 
interval ends, rather than throughout the entire interval, as with 
interval DRO. Occurrences of the problem behavior between 
those moments have no effect. 


Fixed-momentary DRO (FM-DRO) 


To implement a fixed-momentary DRO (FM-DRO) sched- 
ule, the practitioner (a) establishes a standard interval of time; 
(b) observes the participant at the moment each interval ends; 
and (b) delivers reinforcement at the end of each interval if the 
problem behavior is not occurring at that moment. 

In their review of research on momentary DRO sched- 
ules, Hammond, Iwata, Fritz, and Dempsey (2011) concluded 
that FM-DRO schedules “generally have been ineffective as 
treatment for problem behavior. Because most early research 
on FM DRO included presentation of a signal at the end of 
the DRO interval, it is unclear whether the limited effects of 
FM-DRO were due to (a) the momentary response requirement 
of the schedule per se or (b) discrimination of the contingency 
made more salient by the signal” (p. 69).° The researchers 
addressed this question by comparing the effects of signaled 
versus unsignaled FM-DRO. Four students with special needs 
served as participants—(1) Seth, a deaf and learning-impaired, 
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Figure 25.6 Percent intervals of SIB 

by Jerry during baseline and DRO 

and duration of DRO sessions (right 
y-axis). Other data points during the 
first baseline phase show the results 
of functional analysis, confirming that 
Jerry's SIB occurred most often when 
he was alone. 


“Effects and Side Effects of DRO as Treatment 
for Self-injurious Behavior” by G. Cowdery, 

B. A. lwata, and G. M. Pace, 1990, Journal 

of Applied Behavior Analysis, 23, p. 500. 
Reproduced with permission of John Wiley & 
Sons Inc. 


Session Length (Minutes) 


aggressive 6-year-old; (2) Alex, with autism and seizure dis- 
order, an aggressive 14-year-old; (3) Curtis, with autism, an 
aggressive 13-year-old; and (4) Abby, a 19-year-old with pro- 
found intellectual disabilities, autism, and seizure disorder. 
Functional analyses revealed that positive reinforcement in the 
form of access to preferred items maintained the participants’ 
problem behaviors. 

The four participants had access to preferred items during 
baseline. The signaled and unsignaled conditions were identical 
except for the signal. 


Signaled FM DRO. The experimenter covertly held the 
reinforcer to ensure that it could be delivered quickly 
but that no additional movements (e.g., reaching for 
the reinforcer) might inadvertently signal the end of an 
interval. Three seconds prior to the end of an interval, the 
experimenter (positioned directly in front of or next to the 
subject) signaled impending reinforcement delivery by 
holding the reinforcer above her head. If no problem behav- 
ior was occurring at the precise second the interval clocked 
out, the reinforcer (30-s access to the toy box or video, or 
one edible item) was delivered. If problem behavior was 
occurring at that second, the signal was removed, and the 
reinforcer was withheld until the next scheduled interval. 
Unsignaled FM DRO. This condition was similar 
to the signaled condition, except that the experimenter 
did not signal impending reinforcer delivery. Rather, the 
experimenter quickly delivered the reinforcer contingent 
on the absence of problem behavior when the interval 
ended. (p. 75) 


Seth and Alex had rapid reductions in aggression during 


the signaled FM-DRO conditions (see Figure 25.7). However, 


the signaled FM-DRO conditions produced different results for 


Curtis and Abby (see Figure 25.8). Curtis’s aggression increased 
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during the first introduction of the signaled FM-DRO condi- 
tion, then greatly decreased by the end of the first introduction. 
His aggression did not decrease during the second signaled 
FM-DRO condition, but decreased to zero during the reintro- 
duction of the unsignaled FM-DRO condition. Similar to Cur- 
tis’s behavior, Abby’s head hitting decreased under the second 
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and final unsignaled DRO phase, all the while earning a large 
proportion of available reinforcers. “[T]hese data indicate that 
momentary DRO should be used with caution, because very 
subtle therapist responses (e.g., reaching for the programmed 
reinforcer) may function as signals that could render FM DRO 
ineffective” (Hammond et al., 2011, p. 80). 
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Variable-momentary DRO (VM-DRO) 


In a variable-momentary DRO (VM-DRO) schedule, the prac- 
titioner (a) establishes a random sequence of varied intervals of 
time and (b) delivers reinforcement at the end of each interval 
if the problem behavior is not occurring at that precise moment. 
Lindberg and colleagues (1999) used a VM-DRO intervention to 
help Bridget, a 50-year-old woman with severe developmental 
disabilities, who often banged her head and body. A functional 
analysis indicated that social-positive reinforcement maintained 
Bridget’s self-injurious behaviors (SIB). During the VM-DRO 
intervention, SIB was placed on extinction and Bridget received 
3 to 5 seconds of attention from the therapist if she did not bang 
her head or hit herself when each interval ended. A VM-DRO 
15-sec schedule was implemented for five sessions following 
baseline, and Bridget’s SIB decreased abruptly to almost zero 
(see Figure 25.9). Following a second baseline phase in which 
Bridget’s SIB increased, the researchers reintroduced VM-DRO 
with a mean interval of 11 seconds. SIB did not decrease as 
quickly during the second treatment condition. After several ses- 
sions in which SIB occurred at low rates, the interval length was 
increased to 22 seconds and eventually to the maximum target 
interval of 300 seconds. 

Interval DRO has been used more widely than momen- 
tary DRO. Some researchers have found that interval DRO is 
more effective than momentary DRO for suppressing problem 
behavior, and that momentary DRO might be most useful in 
maintaining reduced levels of problem behavior produced by 
interval DRO (Barton, Brulle, & Repp, 1986; Repp, Barton, & 
Brulle, 1983). Lindberg and colleagues (1999) noted two poten- 
tial advantages of VM-DRO schedules over FI-DRO schedules. 
First, the VM-DRO schedule appears more practical because the 
practitioner need not monitor the participant’s behavior at all 
times. Second, data obtained by these researchers showed that 
participants obtained higher overall rates of reinforcement with 
VM-DRO than with FI-DRO. 
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Guidelines for Using DRO 


In addition to the importance of selecting potent reinforcers, we 
recommend the following guidelines for the effective use of DRO. 


Set Initial DRO Intervals That Ensure 
Frequent Reinforcement 


Practitioners should establish an initial DRO time interval that 
ensures that the learner’s current level of behavior will contact 
reinforcement when the DRO contingency is applied. For exam- 
ple, Cowdery and colleagues (1990) initially made praise and 
token reinforcement contingent on 2-min “scratch-free” inter- 
vals because that was “the longest we had previously seen Jerry 
refrain from scratching while left alone” (p. 501). Beginning 
with an interval that is equal to or slightly less than the mean 
baseline interresponse time (IRT) usually will make an effective 
initial DRO interval. To calculate a mean IRT, the practitioner 
divides the total duration of all baseline measurements by the 
total number of responses recorded during baseline. For exam- 
ple, if a total of 90 responses were recorded during 30 minutes 
of baseline measurement, the mean IRT would be 20 seconds 
(i.e., 1800 seconds + 90 responses). Using these baseline data 
as a guide, the practitioner could set the initial DRO interval at 
30 seconds or less. 


Do Not Inadvertently Reinforce Other 
Undesirable Behaviors 


Reinforcement on a “pure” DRO schedule is contingent on a 
very general response class that is defined only by the absence 
of the targeted problem behavior. Consequently, a “pure” DRO 
is typically used as a treatment of choice for serious behavior 
problems that occur at very high rates by people whose current 
repertoires provide few, if any, other behaviors that might func- 
tion as alternative behaviors and for whom just about anything 
else they might do is less of a problem than the target behavior. 


Figure 25.9 Responses per minute of self-injurious 
behavior (SIB) during baseline and treatment 
conditions. 

“DRO Contingencies: An Analysis of Variable-momentary 
Schedules” by J. S. Lindberg, B. A. lwata, S. W. Kahng, and 
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p. 131. Reproduced with permission of John Wiley & Sons Inc. 
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Because DRO does not require any certain behaviors 
to be emitted for reinforcement, whatever the person is doing 
when reinforcement is delivered is likely to occur more often 
in the future (Jessel, Borrero, & Becraft, 2015). Therefore, 
practitioners using DRO must be careful not to strengthen 
inadvertently other undesirable behaviors. When using DRO, 
the practitioner should deliver reinforcement at the intervals or 
moments in time specified by the schedule contingent on the 
absence of the problem behavior and the absence of any other 
significant inappropriate behaviors. 


Gradually Increase the DRO Interval 


After the initial DRO interval effectively controls the problem 
behavior, the practitioner can increase the DRO interval, by 
increasing the interval through a series of initially small and 
gradually increasing increments. 

Poling and Ryan (1982) suggested three procedures for 
increasing the duration of the DRO interval. 


1. Increase the DRO interval by a constant duration of time. 
For example, a practitioner could, at each opportunity 
for an interval increase, lengthen the DRO interval by 
15 seconds. 


2. Increase intervals proportionately. For example, a prac- 
titioner, at each opportunity to change the interval, could 
increase the DRO interval by 10%. 


3. Change the DRO interval each session based on the learn- 
er’s performance. For example, a practitioner could set the 
DRO interval for each session equal to the mean IRT from 
the preceding session. 


If problem behavior worsens when a longer DRO interval 
is introduced, the practitioner can decrease the duration of the 
interval to a level that again controls the problem behavior. The 
DRO interval can then be extended with smaller, more grad- 
ual durations after previously obtained reductions in problem 
behavior have been re-established. 


Extend the Application of DRO to Other Settings and 
Times of Day 


When the frequency of the problem behavior is reduced sub- 
stantially in the treatment setting, the DRO intervention can 
be introduced during other activities and times in the person’s 
natural environment. Having teachers, parents, or other care- 
givers begin to deliver reinforcement on the DRO schedule 
will help extend the effects. For example, Cowdery and col- 
leagues (1990) began to implement the DRO intervention at 
other times and places during the day after Jerry’s SIB was 
under good control during treatment sessions (see Figure 
25.6). At first, various staff members would praise Jerry and 
give him tokens for refraining from scratching during single 
30-min DRO intervals during leisure activities, instructional 
sessions, or free time. Then they began adding extra 30-min 
DRO intervals until the DRO contingency was extended to all 
of Jerry’s waking hours. Jerry’s SIB decreased sufficiently to 
allow him to be discharged to his home for the first time in 2 
years. Jerry’s parents were taught how to implement the DRO 
procedure at home. 


Combine DRO with Other Procedures 


DRO can be used as a single intervention. However, as with 
DRA, including DRO in a treatment package with other behav- 
ior-reduction procedures often yields more efficient and effec- 
tive behavior change. De Zubicaray and Clair (1998) used a 
combination of DRO, DRI, and a restitution (punishment) pro- 
cedure to reduce the physical and verbal abuse and physical 
aggression of a 46-year-old woman with intellectual disabili- 
ties who lived in a residential facility for people with chronic 
psychiatric disorders. Further, the results of three experiments 
by Rolider and Van Houten (1984) showed that an intervention 
consisting of DRO plus reprimands was more effective than 
DRO alone in decreasing a variety of problem behaviors by 
children (e.g., physical abuse by a 4-year-old girl toward her 
baby sister, thumb sucking by a 12-year-old boy, and bedtime 
tantrums). 

DRO can also be added as a supplement to an intervention 
that has produced insufficient results. McCord, Iwata, Galensky, 
Ellingson, and Thomson (2001) added DRO to a stimulus fading 
intervention that was “having limited success” in decreasing the 
problem behaviors of Sarah, a 41-year-old woman with severe 
intellectual disabilities and visual impairment. Her problem 
behaviors included severe self-injurious behavior, property 
destruction, and aggression evoked by noise. Gradually increas- 
ing the noise volume by 2 dB when DRO was in effect, “a ther- 
apist delivered a preferred edible item (half a cheese puff) to 
Sarah following each 6-s interval in which problem behavior 
was not observed. If problem behavior occurred, the food was 
withheld and the DRO interval was reset” (p. 455). Each time 
Sarah completed three consecutive |-min sessions with no 
occurrences of the problem behavior, the researchers increased 
the DRO interval by 2 seconds. This schedule thinning contin- 
ued until Sarah could complete the session with no occurrences 
of the problem behavior. The therapist then delivered the edible 
item only at the end of the session. The addition of DRO to the 
intervention produced an immediate decrease in Sarah’s prob- 
lem behavior, which remained at or near zero in the treatment 
setting for more than 40 sessions (see Chapter 6, the bottom 
graph of Figure 6.6). Probes conducted in noisy conditions after 
treatment ended showed that the treatment effects had main- 
tained and generalized to Sarah’s home. 


Recognize the Limitations of DRO 


Although DRO can be highly effective for reducing problem 
behaviors, it is not without shortcomings. With interval DRO, 
reinforcement is delivered contingent only on the absence of 
problem behavior during the interval, even though another 
inappropriate behavior might have occurred during that time. 
For instance, let us assume that a 20-sec interval DRO is to 
be implemented to reduce the facial tics of an adolescent with 
Tourette syndrome. Reinforcement will be delivered at the end 
of each 20-sec interval containing no facial tics. However, if 
the adolescent engages in cursing behavior at any time during 
the interval or at the end of the interval, reinforcement will stil] 
be delivered. It is possible that reinforcement delivered con- 
tingent on the absence of facial tics will occur in close tem- 
poral proximity to cursing, thereby inadvertently strengthening 


another inappropriate behavior. In such cases, the length of the 
DRO interval should be shortened and/or the definition of the 
problem behavior expanded to include the other undesirable 


behaviors (e.g., reinforcement contingent on the absence of 


facial tics and cursing). 


With momentary DRO, reinforcement is delivered contin- 
gent on the problem behavior not occurring at the end of each 
interval, even though the inappropriate behavior might have 
occurred during the interval. Continuing the previous example, 
on an FM-DRO 20-sec schedule, reinforcement is delivered at 
each successive 20th second if facial tics are not occurring at 
those precise moments, even if tics occurred for 50%, 75%, 
or even 95% of the interval. In such circumstances, the prac- 
titioner should use interval DRO and reduce the length of the 
DRO interval. 

Another consideration is the number of resources that 
may be available for labor-intensive procedures. Rozenblat and 
colleagues (2009) found that if DRO intervals are extremely 
short—in their case, as low as | sec—it might be clinically 
impossible for a single analyst or practitioner to implement the 
procedure. Working alone and trying to conduct a session with 
other students, time a series of intervals, collect data and record 
performance, and deliver reinforcers is likely to exceed a sin- 
gle person’s capability. Thus, practitioners would be advised to 
assess staff resources before launching a DRO program. 


DIFFERENTIAL REINFORCEMENT OF LOW 
RATES OF RESPONDING (DRL) 


A practitioner applying differential reinforcement of low rates 


(DRL) delivers reinforcement (a) following each occurrence of 


the target behavior that is separated from the previous response 
by a minimum amount of time or (b) contingent on the num- 
ber of responses within a period not exceeding a predetermined 
criterion. In laboratory research, Ferster and Skinner (1957) 
found that delivering a reinforcer following a response that had 
been preceded by increasingly longer intervals of time without a 
response reduced the overall rate of responding. Catania (2013) 
describes typical DRL behavior found in laboratory research. 


In DRL performance, responding is unlikely to extinguish, 
because decreasing low rates produce even more reinforc- 
ers. Once some responding has been reinforced, response 
rate typically increases, but once it has done so responses 
follow each other more closely so fewer meet the sched- 
uled criterion, and response rate decreases once more, 

and so on. Typically the responding stabilizes at some 
value, oscillating between increased rates accompanied by 
decreased reinforcement and decreased rates accompanied 
by increased reinforcement. A pigeon’s pecking can be 
maintained over long periods by such contingencies. Thus, 
DRL performance is highly resistant to change, though its 
rate remains relatively low. It therefore again illustrates the 
independence of response strength and response rate (Nevin 
& Grace, 2000). (p. 209) 


Because reinforcement with a DRL schedule is delivered 
following an occurrence of the target behavior—as contrasted 
with DRO, in which reinforcement is contingent on the absence 
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of the behavior—behavior analysts use DRL to decrease the rate 
of a behavior that occurs too frequently, but not to eliminate the 
behavior entirely. A practitioner may identify a certain behavior 
as a problem behavior not because of its form, but because the 
behavior occurs too often. For example, a student asking for help 
by raising his hand or coming to the teacher’s desk during inde- 
pendent seatwork is appropriate at a limited rate, but a problem 
behavior if it occurs too frequently. 

Some learners may emit a behavior less frequently when 
told to do the behavior less often, or when given rules stating 
an appropriate rate of response. When instruction alone does 
not reduce the behavior to an acceptable rate, the practitioner 
may need to use a consequence-based intervention. DRL offers 
practitioners an intervention for diminishing problem behaviors 
that is stronger than instruction alone but still falls short of more 
restrictive consequences (e.g., punishment). Deitz (1977) named 
and described three DRL procedures: full-session DRL, interval 
DRL, and spaced-responding DRL. 


Full-session DRL 


In a full-session DRL schedule, reinforcement is delivered 
when responding during an entire instructional or treatment 
session is equal to or below a predetermined criterion. If the 
number of responses exceeds the specified limit during the ses- 
sion, reinforcement is withheld. For example, a teacher applying 
full-session DRL with a criterion limit of four disruptions per 
class period would provide reinforcement at the end of the class 
period contingent on four or fewer disruptions. Full-session 
DRL is an easy-to-apply intervention for problem behaviors in 
education and treatment settings. 

Deitz and Repp (1973) demonstrated the efficacy and 
manageability of full-session DRL for diminishing classroom 
misbehavior. They decreased the talk-outs of an 11-year-old 
boy with developmental disabilities. During 10 days of base- 
line, the student averaged 5.7 talk-outs per 50-minute session. 
With the introduction of full-session DRL, the student was 
allowed 5 minutes for play at the end of the day contingent on 
3 or fewer talk-outs during a 50-minute session. Talk-outs during 
the DRL condition decreased to an average of 0.93 talk-out per 
50 minutes. A return to baseline slightly increased the talk-outs 
to an average of 1.5 per session. 

Austin and Bevan (2011) implemented a variation of 
full-session DRL with three female elementary students who 
engaged in excessive attention-seeking behaviors that affected 
work completion in a timely manner. The researchers set the 
number of requests for assistance each student was allowed to 
“reflect not only a reduction in baseline rates but also the num- 
ber of requests the teacher thought was reasonable for each child 
to make in a 20-min work period” (p. 454). The DRL interven- 
tion was implemented as follows: 


At the beginning of independent work sessions, each child 
received a small index card with boxes corresponding to 
the number of requests allowed during the session plus 
one (e.g., Elin was allowed three bids for attention, so her 
card had four boxes). Each time the child requested atten- 
tion from the teacher, the teacher responded to the child’s 
request (up to the number of responses allocated for the 
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session) and initialed one of the boxes on the child’s card. 
If the child exceeded the number of allowable requests, the 
teacher was instructed to initial the remaining box but not 
to interact with the child. However, none of the children 
ever exceeded the number of responses eligible for atten- 
tion during the DRL conditions. If the child had at least 
one blank box at the end of the work session, she earned 

a point for her team. No additional points were awarded if 
more than one box was left empty, because we wanted the 
children to use the opportunities for assistance or attention 
that were available to them. (p. 454) 


Because the children’s requests within the DRL-imposed 
limit were reinforced as they occurred during the session, an 
appropriate procedure given the target behavior, we identify 
Austin and Bevan’s (2011) intervention as a variation of full- 
session DRL. A reversal design demonstrated the intervention’s 
effectiveness in reducing each student’s rate of attention-seeking 
behavior (see Figure 25.10). The procedure earned high marks 
on social validity assessments conducted at the end of the study. 
The teacher found the DRL schedule easy to implement and 
integrate into ongoing classroom routines and reported she 
planned to continue using it. The children indicated they were 
happy to use the check-box cards, liked earning points for their 
group, and wanted their teacher to continue using the cards. 


Interval DRL 


To apply an interval DRL schedule of reinforcement, the prac- 
titioner divides the total session into a series of equal intervals 
of time and provides reinforcement at the end of each interval 
in which the number of occurrences of the problem behavior 
was equal to or below a criterion limit. If the learner exceeds 
the criterion number of responses during an interval, the prac- 
titioner removes the opportunity for reinforcement and begins 
a new interval. 

Deitz and colleagues (1978) used an interval DRL 
schedule of reinforcement to reduce the disruptive behaviors 
of a student with learning disabilities. The 7-year-old student 
had several difficult classroom misbehaviors (e.g., running, 
shoving, pushing, hitting, throwing objects). The student 
received a sheet of paper ruled into 15 blocks, with each block 
representing a 2-min interval. A star was placed in a block each 
time the student completed 2 minutes with | or no misbehav- 
iors. Each star permitted the student to spend | minute on the 
playground with the teacher. If the student emitted 2 misbe- 
haviors during the interval, the teacher immediately started a 
new 2-min interval. 


Spaced-responding DRL 


Using a spaced-responding DRL schedule of reinforcement, 
the practitioner delivers a reinforcer following an occurrence 
of a response that is separated from the previous response by 
at least a specified amount of time.° As you will recall from 
Chapter 4, interresponse time (IRT) is the technical term for 
the duration of time between two responses. IRT and rate of 
responding are directly correlated: The longer the IRT, the lower 
the overall rate of responding; shorter IRTs correlate with higher 
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Figure 25.10 Rates of requesting attention across conditions. 
The solid horizontal line in the DRL phases represents the 
limit on responding specified by the DRL schedule. 


“Using Differential Reinforcement of Low Rates to Reduce Children’s 
Requests for Teacher Attention” by J. L. Austin and D. Bevan, 2011, 
Journal of Applied Behavior Analysis, 44, p. 456. Used by permission. 
Reproduced with permission of John Wiley & Sons Inc. 


response rates. When reinforcement is contingent on longer 
IRTs, response rate will decrease. 

Favell, McGimsey, and Jones (1980) used a spaced- 
responding DRL schedule of reinforcement and response 
prompts to decrease the rapid eating of four persons with profound 
developmental disabilities. At the start of treatment, reinforcement 
was contingent on short independent pauses (IRTs) between bites 
of food. As treatment progressed, gradually increasing longer and 
longer pauses between bites were required for reinforcement. The 
researchers also manually prompted a separation between bites 
of food and faded out those response prompts when a minimum 
of 5-sec pauses occurred independently between approximately 
75% of all bites. Finally, Favell and colleagues gradually thinned 
food reinforcement and praise. The frequency of eating was 
decreased from a pretreatment baseline average of 10 to 12 bites 
per 30 seconds to 3 to 4 bites per 30 seconds during the spaced- 
responding DRL condition. 


Singh, Dawson, and Manning (1981) used a spaced- 
responding DRL intervention to reduce the stereotypic respond- 
ing (e.g., repetitive body movements, rocking) of three teenage 
girls with profound developmental disabilities. During the first 
phase of spaced-responding DRL intervention, the therapist 
praised each girl whenever she emitted a stereotypic response at 
least 12 seconds after the last response. One of the experiment- 
ers timed the intervals and used a system of automated lights to 
signal the therapist when reinforcement was available. After the 
DRL 12-sec IRT resulted in an abrupt decrease in stereotypic 
behavior by all three girls (see Figure 25.11), Singh and col- 
leagues systematically increased the IRT criterion to 30, 60, and 
then 180 sec. The spaced-responding DRL procedure not only 
produced substantial reductions in stereotypic responding in all 
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three subjects but also had the concomitant effect of increasing 
appropriate behavior (e.g., smiling, talking, playing). 

Most behavior-reduction procedures that manipulate 
consequences have the potential to reduce behavior to zero 
occurrences. Spaced-responding DRL is unlikely to do that. 
For that reason, spaced-responding DRL is an important inter- 
vention for diminishing behavior that should still continue but 
at a slower rate. A spaced-responding DRL contingency tells 
learners that their behavior is acceptable, but they should do it 
less often. For example, a teacher could use a spaced-respond- 
ing DRL schedule to diminish a student’s problem behavior of 
asking too many questions. The questions occurred so frequently 
that they interfered with the classroom learning and teaching. To 
intervene, the teacher could respond to the student’s question if 
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Figure 25.11 Effects of spaced-responding DRL on stereotypic 
responding of three teenage girls with profound developmental 


disabilities. 


“Effects of Spaced Responding DRL on the Stereotyped Behavior of Profoundly 
Retarded Persons” by N. N. Singh, M. J. Dawson, and P. Manning, 1981. 
Journal of Applied Behavior Analysis, 38, p. 524. Reproduced with permission 


of John Wiley & Sons Inc. 


656 Part9 * Decreasing Behavior with Nonpunishment Procedures 


the student had not asked a question during a preceding mini- 
mum of 5 minutes. This spaced-responding DRL intervention 
could decrease, yet not eliminate, asking questions. 


Guidelines for Using DRL 


Several factors influence the effectiveness of the three DRL 
schedules in diminishing problem behaviors. The following 
guidelines address those factors. 


Choose the Most Appropriate DRL Procedure 


Full-session, interval, and spaced-responding DRL schedules 
provide different levels of reinforcement for learners. Of the 
three DRL procedures, only spaced-responding DRL delivers 
reinforcement immediately following the occurrence of specific 
response; a response must be emitted following a minimum IRT 
before reinforcement. Practitioners use spaced-responding DRL 
to reduce the occurrences of a behavior while maintaining those 
behaviors at lower rates. 

With full-session and interval DRL, a response does not 
need to occur for the participant to receive the reinforcer. Practi- 
tioners may apply full-session or interval DRL when it is accept- 
able that the rate of the problem behavior reaches zero or as an 
initial step toward the goal of eliminating the behavior. 

Spaced-responding and interval DRL usually produce 
reinforcement at a higher rate than full-session DRL does. 
Arranging frequent contact with the reinforcement contingency 
is especially appropriate, and most often necessary, for learners 
with severe problem behaviors. 


Recognize the Limitations of DRL 


If a practitioner needs to reduce an inappropriate behavior 
quickly, DRL would not be the method of first choice. Reducing 
the inappropriate behavior to appropriate levels with DRL may 
take more time than the practitioner can afford. Further, DRL 
should not be used with self-injurious, violent, or potentially 
dangerous behaviors. Finally, from a practical standpoint, using 
DRL means that the practitioner must focus on the inappropriate 
behavior. If a teacher, for example, is not cautious, he may 
unintentionally give too much attention to the inappropriate 
behavior, inadvertently reinforcing it. 

Effective application of interval DRL and spaced-respond- 
ing DRL procedures requires continuous monitoring of problem 
behavior, careful timing, and frequent reinforcement. Without 
the help of an assistant, practitioners may have difficulty apply- 
ing the interval DRL procedure in group settings. Interval and 
spaced-responding DRL procedures are quite reasonable for 
one-on-one instruction or when competent assistance is avail- 
able. Jessel and Borrero (2014) offer guidance for the selection 
and application of DRL schedules. 


Evaluation and implementation of spaced-responding 
DRL and full-session DRL should be considered in 
the context of clinical or research goals. The more 
effortful approach (spaced responding DRL) may 

be required when the goal is to sustain respond- 

ing, albeit at rates lower than those produced under 
baseline conditions (e.g., applications to rapid eating, 


excessive hand raising, or tattling). In addition, the 
limitations of using spaced-responding DRL schedules 
(e.g., requires individual timers for each student to 

be reset following responding below the programmed 
IRT) may be assuaged when considering the possibility 
of the use of widespread technology such as tablets 

in classroom settings. Applications could be created 
on tablets with which teachers could select the IRT 

for each student immediately before the start of class- 
room activities. The teacher would only have to click 
on the child’s name after each response to determine 
whether or not he or she met the criteria for reinforcer 
presentation. (p. 321) 


Use Baseline Data to Guide the Selection 
of the Initial Response or IRT Limits 


Practitioners can use the mean number of responses emitted 
during baseline sessions, or slightly lower than that average, 
as the initial full-session DRL criterion. For example, 8, 13, 
10, 7, and 12 responses recorded per session over five baseline 
sessions equals a mean of 10 responses per session. Therefore, 
a limit of 8 to 10 responses per session would be an appropriate 
initial full-session DRL criterion. 

Interval DRL and spaced-responding DRL initial time cri- 
teria can be set at the baseline mean or slightly lower. For exam- 
ple, | response per 15 minutes makes an acceptable initial interval 
DRL criterion, as calculated from a baseline mean of 4 responses 
per 60-min session. With the same baseline data (1.e., 4 responses 
per 60-min session), it appears reasonable to use 15 minutes as the 
initial IRT criterion for a spaced-responding DRL schedule. That 
is, a response will produce reinforcement only if it is separated 
from the prior response by a minimum of 15 minutes. 


Gradually Thin the DRL Schedule 


Practitioners should gradually thin the DRL schedule to achieve 
the desired final rate of responding. Practitioners commonly use 
three procedures for thinning the initial DRL time criterion. 


1. With full-session DRL, the practitioner can set a new 
DRL criterion using the participant’s current DRL per- 
formances. Another option is to set a new DRL criterion 
at slightly less than the mean number of responses emitted 
during recent DRL sessions. 


2. With interval DRL, the practitioner can gradually decrease 
the number of responses per interval if the current crite- 
rion is more than one response per interval; or, gradually 
increase the duration of the criterion interval if the current 
criterion is one response per interval. 


3. With spaced-responding DRL, the practitioner can adjust 
the IRT criterion based on the mean IRT of recent sessions, 
or slightly less than that average. For example, Wright and 
Vollmer (2002) set the IRT for the DRL component of 
a treatment package that successfully reduced the rapid 
eating of a 17-year-old girl at the mean IRT of the previ- 
ous five sessions. The researchers did not exceed an IRT 
of 15 seconds because it was not necessary to reduce the 
girl’s eating below a rate of four bites per minute. 


. Practitioners who successfully thin DRL schedules of 
reinforcement make gradual, but systematic, changes in time 
and response criteria associated with full, interval, and space- 
responding DRL variations. 


Two possible decision rules for thinning the DRL sched- 
ule are as follows: 


Rule I: Practitioners may want to change the DRL crite- 
rion whenever the learner meets or exceeds the criterion 
for three consecutive sessions. 


Rule 2: Practitioners may want to change the DRL cri- 
terion whenever the learner receives reinforcement for 


at least 90% of the opportunities for three consecutive 
sessions. 


Provide Feedback 


The effectiveness of a DRL procedure can be enhanced by 
feedback to help the learner monitor her rate of responding. 
Full-session, interval, and spaced-responding DRL procedures 
provide different levels of feedback for participants. The 
most accurate feedback comes with spaced-responding DRL 
because reinforcement follows each response that meets the 
IRT criterion. When a response occurs that does not meet the 
IRT criterion, reinforcement is withheld, the time interval is 
immediately reset, and a new interval begins. This process 
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SUMMARY 


Differential Reinforcement Defined 


1. Differential reinforcement entails reinforcing one response class 
and withholding reinforcement for another response class. 


2. When used as a reductive procedure for problem behav- 
ior, differential reinforcement consists of (a) providing 
reinforcement contingent on a behavior other than the 
problem behavior (or the problem behavior occurring at a 
reduced rate) and (b) withholding reinforcement as much 
as possible for the problem behavior. 


Differential Reinforcement of Alternative 
Behavior (DRA) 


3. A practitioner applying DRA reinforces occurrences 
of a desirable alternative to the problem behavior and 
withholds reinforcement for the problem behavior. 


4. DRA can be conceptualized as a concurrent schedule of rein- 
forcement in which two response classes receive reinforcement 
at different rates: (a) the alternative behavior on a dense sched- 
ule of high-quality reinforcement and (b) the problem behavior 
on extinction or a very lean schedule of reinforcement. 


5. The learner allocates more responding to the alternative 
behavior and less responding to the problem behavior, as 
described by the matching law. 


6. When DRA entails reinforcing a behavior that cannot 
occur simultaneously with the problem behavior, the 
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provides learners with immediate feedback on responding 
per interval. 

Interval DRL also provides a high level of feedback, 
although it is less than that of spaced-responding DRL. Inter- 
val DRL provides the learner with two types of feedback. The 
first problem behavior does not provide feedback. The second 
response, however, resets the time interval, providing a conse- 
quence for the problem behavior. Reinforcement occurs at the 
end of the interval when one or no problem behaviors occurred 
during the interval. These two types of feedback improve the 
effectiveness of the interval DRL intervention (Deitz et al., 1978). 

Applied behavior analysts can arrange full-session DRL 
with or without feedback. The usual arrangement does not 
provide feedback concerning moment-to-moment accumu- 
lation of responses. Deitz (1977) stated that with full-session 
DRL, learners would respond only to the DRL criterion: Once 
the learner loses the opportunity for reinforcement during the 
session, he could then emit high rates of misbehavior without 
consequence. When the schedule is arranged without moment- 
to-moment feedback, learners usually stay well below the DRL 
limit. Full-session DRL, without moment-to-moment feedback, 
may not be as effective for learners with severe problem behav- 
iors as spaced-responding and interval DRL. The effectiveness 
of full-session DRL relies heavily on an initial verbal description 
of the contingencies of reinforcement (Deitz, 1977). 


procedure is sometimes called differential reinforcement 
of incompatible behavior (DRI). 


7. When using DRA, practitioners should: 


¢ Select alternative behaviors that are present in the 
learner’s repertoire, that require equal or less effort than 
the problem behavior, that are being emitted prior to 
intervention with sufficient frequency to provide oppor- 
tunities for reinforcement, and that are likely to produce 
reinforcement when the intervention ends. 


¢ Select potent reinforcers that can be delivered when 
the alternative behavior occurs and withheld following 
instances of the problem behavior. The consequences 
that maintain the problem behavior can function as a 
reinforcer for the alternative behavior. 


Reinforce the alternative behavior on a continuous rein- 
forcement schedule initially and then gradually thin the 
schedule of reinforcement. 


¢ Be alert to increased occurrences of the problem behav- 
ior when treatment shifts from a clinical to a natural 
setting, when new therapists and/or caregivers begin 
working with the client, and when the reinforcement 
schedule for the alternative behavior is thinned to 
levels that can be maintained in the client’s everyday 
environment. 


¢ Combine DRA with other reductive procedures to 
produce a more potent intervention. 
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Differential Reinforcement of Other Behavior (DRO) separated from the previous response by a minimum amount 
: of time or (b) contingent on the number of responses within 

8. A practitioner applying DRO delivers reinforcers contin- a period not exceeding a predetermined criterion. 
gent on the problem behavior not occurring throughout 
intervals of time (interval DRO) or at specific moments of 14. DRL is used to decrease the rate of a behavior that . 
time (momentary DRO). occurs too frequently, but not to eliminate the behavior 

rk entirely. 

9. With interval DRO, the omission of the problem behavior 
throughout a predetermined time interval produces rein- 15. Reinforcement on a full-session DRL schedule is delivered 
forcement, and any instance of the problem behavior during when responding during an entire instructional or treat- 
the interval resets the interval and postpones reinforcement. ment session is equal to or below a criterion limit. 

10. With momentary DRO, absence of the problem behavior at 16. On an interval DRL schedule of reinforcement, the practi- 
predetermined moments in time produces reinforcement; tioner divides the total session into a series of equal inter- 
instances of the target behavior at other points in time vals of time and provides reinforcement at the end of each 
do not alter when the next opportunity for reinforcement interval in which the number of responses was equal to or 
becomes available. below a criterion limit. 

11. Availability of reinforcement with interval and momen- 17. Reinforcement on a spaced-responding DRL schedule 
tary DRO procedures can occur on fixed- or variable-time follows each occurrence of the target behavior that is 
schedules. separated from the previous response by a minimum 


: Bt interresponse time (IRT). 
12. When using DRO, practitioners should: 


¢ Establish an initial DRO time interval that ensures the 18. When using DRL, practitioners should: 
learner’s current level of behavior will produce frequent ¢ Not use DRL if a problem behavior needs to be reduced 
reinforcement when the DRO contingency is applied. quickly. 

¢ Be careful not to reinforce inadvertently other inappro- ¢ Not use DRL with self-injurious or other violent 
priate behaviors. behaviors. 

¢ Deliver reinforcement at the intervals or moments in * Select the most appropriate DRL schedules: full-session 


time specified by the DRO schedule contingent on the 
absence of the problem behavior and the absence of any 
other significant inappropriate behaviors. 


or interval DRL when it is acceptable that the rate of 
the problem behavior reaches zero or as an initial step 
toward the goal of eliminating the behavior; spaced- 


¢ Increase the DRO interval gradually based on decreases responding DRL for reducing the rate of a behavior to 
in the problem behavior. be maintained in the learner’s repertoire. 
¢ Extend DRO in other settings and times of day after the * Use baseline data to guide the selection of the initial 
problem behavior is substantially reduced in the treat- response or IRT limits. 
PSH SELLS ¢ Thin the DRL schedule gradually to achieve the desired 
¢ Combine DRO with other reductive procedures. final rate of responding. 
OTR ROCIO Treen TOTO Dotoe ay ° nae peak to help the learner monitor the rate of 
Responding (DRL) rst aie 
13. A practitioner applying DRL delivers reinforcement (a) 
following each occurrence of the target behavior that is 
KEY TERMS 
differential reinforcement of alternative differential reinforcement of other interval DRL 
behavior (DRA) behavior (DRO) spaced-responding DRL 
differential reinforcement of incompatible fixed-interval DRO (FI-DRO) variable-interval DRO (VI-DRO) 
behavior (DRI) AD : 
ao fixed-momentary DRO (FM-DRO) variable-momentary DRO (VM-DRO) 
differential reinforcement of low full-session DRL 


rates (DRL) 
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MULTIPLE-CHOICE QUESTIONS 


1. Joni works in a group home, with a group of men with 5. DRO interventions: 
disabilities. She has a good relationship with each of the 


a. Provide reinforcement for a specific alternative 
men, but one of the residents continues to make sexually- ‘ 


: : ; behavior 

Inappropriate comments to her. Joni typically replies, } 

ipyaeetoptealiinte mie tat! vou'lnow ldon i likeitand b. Provide reinforcement for slowly increasing occur- 
it’s not appropriate,” as she turns red in the face. An infor- RGSS She enone 

mal functional behavior assessment indicates this behavior c. Provide reinforcement for incompatible behaviors 
is maintained by Joni’s attention. The group home staff d. Provide reinforcement for not responding 


decides to implement a DRA intervention. Which of the 


Hint: (See “Differential Reinforcement of Other Behavior’) 
following would be an appropriate example of DRA in 


this case? 6. A limitation of DRO is: 
a. Providing enthusiastic attention whenever Frank says a. Other inappropriate behaviors may be reinforced 
anything appropriate to Joni, while ignoring him if he accidentally 
makes inappropriate remarks b. One may accidentally punish all responding in the 
b. Providing enthusiastic attention whenever Frank makes individual. 
an appropriate greeting to Joni (such as “You look nice c. It often produces an extinction burst effect. 
today, Joni.”), while ignoring Frank if he uses inappro- d. The effects have been shown to be slow and gradual. 


rlate ereetings : ; : ; 
P g g Hint: (See “Differential Reinforcement of Other 


e 


Providing increased enthusiastic attention to Frank 


Behavior’) 
throughout the day 
: : . pela: 7. A DRL intervention is useful when the practitioner 
d. Ignoring all of Frank’s inappropriate verbalizations eet oe : Make 
Hint: (See “Differential Reinforcement Defined” : ; 
( ) a. Get rid of a target behavior all together 


2. Differential reinforcement always includes which two b 


D he. . Teach a new response to replace the problem behavior 
principles of behavior? 


c. Decrease the overall rate of a problem behavior, but not 


a. Reinforcement and punishment get rid of it all together 


b. Reinforcement and stimulus control d. Increase the overall rate of a target behavior 

c. Reinforcement and extinction Hint: (See “Differential Reinforcement of Low Rates of 
d. Positive and negative reinforcement Responding’) 

Hint: (See “Differential Reinforcement Defined”) 8. If the mean interresponse time (IRT) of a target problem 


behavior is 12 responses per hour, and you are implement- 
ing a DRL procedure, what should the initial DRL crite- 
rion be? 


3. In DRA, a practitioner: 


a. Reinforces a desirable alternative to the problem 


behavior 
‘ ee a. More than 12 responses per hour 
b. Places the problem behavior on extinction 


: ; vee : b. No more than 11 responses per hour 
c. May choose to reinforce a behavior that is incompatible 


with the problem behavior c. No more than | response per hour 


d. All of these are correct d. No more than 4 responses per hour 
Hint: (See “Differential Reinforcement of Low Rates of 


Hint: (See “Differential Reinforcement of Alternative we 
Responding’) 


Behavior’) 

4. The behavior selected as an alternative behavior in a DRA: 
a. Should already be in the learner’s repertoire 
b. Should require more effort than the problem behavior 


c. Should be emitted at a very low rate prior to 
intervention 

d. Should require extensive training for practitioners to 
learn how to reinforce it 

Hint: (See “Differential Reinforcement of Alternative 

Behavior’) 
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ESSAY-TYPE QUESTIONS 


1. 


ee eee connec aces cen: Score 


The introduction to this chapter states that the procedures 
described in this chapter are reinforcement-based proce- 
dures for decreasing problem behavior. But, doesn’t any 
reinforcement-based procedure, by definition, increase 

a target behavior and not decrease a target behavior? 
Explain how a differential reinforcement procedure can be 
used to decrease a target behavior. 


Hint: (See “Differential Reinforcement Defined’’) 


When implementing a DRA intervention, it is useful to 
select reinforcers that are powerful and can be delivered 
consistently. Explain the importance of selecting reinforc- 
ers for such interventions, and explain the best method for 
selecting reinforcers for such interventions. 


Hint: (See “Differential Reinforcement of of Alternative 
Behavior’) 


NOTES 


ib 


Differential reinforcement is also used to increase desired behavior, as in 
the differential reinforcement of high rates (DRH) (see Chapter 13) and 
shaping (see Chapter 22). Various forms of differential reinforcement con- 
tingencies are also used as experimental control procedures (see Chapter 8 
and Thompson & Iwata, 2005). 


. DRA isa primary component of functional communication training (FCT), 


an intervention in which the alternative behavior serves the same communi- 
cative function as the problem behavior (e.g., saying “Break, please” results 
in the same reinforcement as aggression or tantrums did previously). FCT 
is described in detail in Chapter 26. 


. Because instances of target problem behavior in DRO and DRL (to be 


discussed later in this chapter) often postpone reinforcement, some behav- 
ior analysts conceptualize DRO and DRL as negative punishment pro- 
cedures rather than reinforcement techniques. For example, Van Houten 
and Rolider (1990) consider DRO and DRL variations of a procedure they 
called contingent reinforcement postponement. 


3. Explain the difference between a momentary and inter- 


val DRO and when you might use each. 
Hint: (See “Differential Reinforcement of Other Behavior’) 


. Explain the difference between full-session, interval, and 


spaced-responding DRL and when you might use each. 


Hint: (See “Differential Reinforcement of Low Rates of 
Responding”’) 


. DRO can also be applied using data from permanent product measurement. 


For example, Alberto and Troutman (2013) described a procedure in which 
a teacher provided reinforcement each time a student submitted a paper that 
contained no doodles. If the papers were basically of the same length or 
type, the procedure would be a variation of the fixed-interval DRO. 


. Hammond and colleagues (2011) reported several examples of different 


signals used with FM-DRO: turning away, prompts, and praise (Harris & 
Wolchik, 1979), gesture (Repp, Barton, & Brulle, 1983), tone (Conyers, 
Miltenberger, Romaniuk, Kopp, & Himle, 2003), and music (Sisson, Van 
Hasselt, Hersen, & Aurand, 1988). 


. Because reinforcement in a spaced-responding DRL immediately follows 


an instance of the target behavior, it is the applied variation of DRL that 
most closely resembles the DRL schedules of reinforcement described by 
Ferster and Skinner (1957). 
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Antecedent Interventions 


LEARNING OBJECTIVES 


¢ Define and discuss applications of antecedent interventions. 

¢ Define and discuss applications of noncontingent reinforcement. 

¢ Define and discuss applications of high-probability request sequences. 
¢ Define and discuss applications of functional communication training. 


The owner of a private elementary school decided to ded- 
icate her school to the applications of direct instruction, 
precision teaching, and behavior analysis. The school 
served students with mild to moderate intellectual disabili- 
ties, learning disabilities, and problem behaviors. To assist 
with the school’s change of focus, the owner contacted 

a professor at a local university with expertise in applied 
behavior analysis to conduct weekly after-school in-service 
trainings for her teachers. The school’s owner wanted the 
in-services to concentrate on (a) principles of behavior, 

(b) measurement and analysis, and (c) strategies and tactics 
for application. 

The in-service progressed well. The teachers devel- 
oped a school-wide token economy that included a store 
where students could exchange their tokens based on their 
performance. However, the teachers reported a growing 
problem during recess—four boys were hitting and shoving 
other students. 

In subsequent discussions with the teachers, several 
intervention approaches were presented, including response 
cost or time-out, to address the inappropriate playground 
behavior. One teacher made this suggestion. 

Let’s not use punishment with these boys. I suggest 
we develop a playground safety patrol, with the four boys 
serving as the patrol officers, and expand the use of token 
economy during recess. We could meet with the boys, tell 
them about the new safety patrol, and that they have been 
selected to serve as the first patrol officers. Then, we give 
them their safety patrol badges to wear. We will need to 
teach the new officers (a) how to identify good playground 
behavior—we can describe examples of good playground 
behavior, and role-play this playground behavior; (b) how 
to praise good playground behavior; and, finally, (c) how to 
present tokens for good playground behavior of other chil- 
dren during recess. 

The group embraced this option and implemented 
it. Within 2 weeks of implementation, inappropriate play- 
ground behavior decreased, and the teachers gradually 
faded out the presence of the playground safety patrol. 


Anecdotally, the teachers reported that whereas the 
four boys did not have many positive playground interac- 
tions with other students before the antecedent intervention, 
they appeared to be making friends on the playground 
subsequent to it. 


n the early days of applied behavior analysis, researchers 

and practitioners emphasized the three-term contingency: 

how consequences affect behavior and how differential 
consequences produce stimulus discrimination and stimulus 
control. Seldom did applied behavior analysts attempt to change 
behavior by manipulating antecedent events. This situation 
changed dramatically following three groundbreaking publica- 
tions in the 1980s: Jack Michael’s (1982) conceptual analysis 
of establishing operations; Brian Iwata and colleagues’ demon- 
stration of a method to assess functional relations between 
self-injury and specific antecedent events (Iwata, Dorsey, 
Slifer, Bauman, & Richman, 1982); and Ted Carr and Mark 
Durand’s (1985) experiment showing that teaching children to 
emit an appropriate communicative response during conditions 
that previously occasioned disruptive behavior reduced misbe- 
havior to near zero. Building upon the concepts and techniques 
introduced in these discipline-altering articles, behavior analysts 
have developed a wide array of antecedent interventions that 
diminish problem behaviors and promote adaptive behaviors in 
social, academic, leisure, and work environments. 


DEFINING AND CLASSIFYING ANTECEDENT 
INTERVENTIONS 


An antecedent intervention is implemented prior to and inde- 
pendent of the target behavior’s occurrence. Some antecedent 
interventions manipulate motivational variables, some make 
the target behavior more or less effortful, some entail training 
an alternative behavior, some include differential consequences 
for responding, and some change the environment such that the 
opportunities to engage in a problem behavior are restricted (or, 
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conversely, opportunities made more prevalent for a desired 
behavior). Table 26.1 shows a sampling of antecedent inter- 
ventions practitioners might use to address common behavior 
problems and promote positive behavior. 

Although identifying such varied interventions with a 
single term is economical, recognizing the basic principles at 
the heart of each intervention is the first step toward effective 
application. Antecedent interventions based on stimulus control 
(discriminative stimuli [ S?s ]) evoke behavior because they have 
been correlated with increased availability of reinforcement. The 
evocative function of motivating operations (MOs), however, is 
independent of the differential availability of reinforcement. For 
example, an establishing operation will increase the current rate 
of certain types of behavior even when an effective reinforcer 
is not available. Failure to distinguish the different functions of 
antecedent events leads to conceptual confusion and ineffective, 
or possibly even unethical, service to clients. 

In addition to improved conceptual clarity and consis- 
tency, understanding the different reasons for the evocative 
functions of Ss and MOs has important applied implications. 
Antecedent treatments involving stimulus control must include 
manipulating the differential availability of reinforcement in 
the presence and absence of the S?. Behavior change strategies 
based on motivating operations must change antecedent events. 
Understanding these differences may improve the development 
of more effective and efficient behavior change efforts involving 
antecedent events. 


Classifying Antecedent Interventions 


Antecedent interventions can be classified in several ways. 
For example, Smith (2011) identified two broad categories of 
antecedent interventions for problem behavior. “Function-based 
antecedent interventions directly manipulate at least one compo- 
nent of a particular operant contingency that has been identified 
to maintain the problem behavior” (p. 297). With this defini- 
tion, antecedent interventions based on stimulus control and 
those that involve motivating operations would be considered 
function-based. Smith classified antecedent-based interventions 


“whose effects do not depend on specific identification of the 
variables that set the occasion for and maintain the problem 
behavior” as default interventions (p. 297). 

Antecedent interventions can also be classified by whether 
or not differential consequences are required for behavior 
change. The effects of a contingency-dependent antecedent 
intervention depend on differential consequences for the target 
behavior (or alternative behavior) in the presence or absence 
of the antecedent. Antecedent interventions based on stimulus 
control are contingency dependent. For example, in the presence 
of 2 + 2 = ?, a student responds 4, not because of the stim- 
ulus 2 + 2 = ?, but because of a past reinforcement history 
for saying 4, including perhaps a history of non-reinforcement 
for responses other than 4. Chapter 17 addresses contingency- 
dependent antecedent interventions based on stimulus control. 

The effects of a contingency-independent antecedent inter- 
vention do not depend on differential consequences for the target 
behavior. Smith’s (2011) default antecedent interventions are 
contingency independent, as are antecedent interventions based 
on motivating operations. The evocative and abative effects of 
establishing and abolishing operations occur independent of 
differential consequences (see Chapter 16). Table 26.2 provides 
examples of abolishing operations to decrease the effectiveness 
of the reinforcer maintaining various problem behaviors and a 
corresponding reduction of those behaviors. Chapter 16 dis- 
cusses motivating operations to effect behavior change. 

Smith and Iwata (1997) remind us that the effects of MOs 
are temporary and may not produce permanent improvements in 
behavior. However, practitioners can supplement MO manipu- 
lations with extinction and differential reinforcement to change 
behavior on a more permanent basis (Phillips, Iannaccone, 
Rooker, & Hagopian, 2017). 

The next sections of this chapter describe three 
contingency-independent antecedent interventions with estab- 
lished scientific evidence of their effectiveness: noncontingent 
reinforcement, high-probability instructional sequence, and 
functional communication training. The chapter closes with 
brief descriptions of three default antecedent interventions: 
restraint, enriched environment, and antecedent exercise. 


TABLE 26.1 Examples of Challenging Behavior and Antecedent Interventions That Might Address Them 


Challenging Behavior Situation 


When directed to do homework or take a bath, the child is 
noncompliant. 


When requested to complete a math worksheet with 25 prob- 
lems, student disrupts class. 


After entering the classroom and waiting for the first lesson to 
begin, several students misbehave. 


After staying on task for several minutes, a nonvocal verbal 
employee terminates the work activity by crying and throwing 
materials to the floor. 


While creeping and crawling around the house, an infant may 
encounter many potentially dangerous events. 


Antecedent Intervention 


Parents provide a choice: “Do you want to do your homework 
first or take a bath first?” 


Teacher breaks 25 problems into 5 worksheets of 5 problems each. 


Teacher pins an index card for each student on the bulletin 
board. Each card contains a personalized question. Upon enter- 
ing the classroom, students remove their cards from the bulletin 
board, go to their desks, and write an answer on the card. 


Supervisor teaches the employee to sign “Break please” 
when tired or frustrated, and provides a brief break when the 
employee does so. 


Attach gates to stairways, plug electrical outlets, put locks on 
cabinet doors, and remove table lamps. 
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TABLE 26.2 Examples of Antecedent Interventions Using Abolishing Operations 


Abolishing Operation Example 


Provide corrective prompts as an 


a ne ee oS ee ee ee eth at 6s VOOR GINIAE TD gene FOR 
Antecedent corrective academic prompts reduced destructive behavior to zero (Ebanks & 


antecedent event. Fisher, 2003). 


Provide presession exposure to 
stimuli that function as reinforcers. 


Provide free access to leisure 
activities. 


Reduce noise levels. 


Change levels of social proximity. 
Offer choices. 


A father-son playtime preceding compliance sessions improved son’s compliance with father’s 
requests (Ducharme & Rushford, 2001). 


Manipulation of leisure items effectively competed with self-injurious behavior (SIB) maintained 
by automatic reinforcement (Lindberg, lwata, Roscoe, Worsdell, & Hanley, 2003). 


Reducing noise levels decreased stereotypic covering of ears with hands (Tang, Kennedy, 
Koppekin, & Caruso, 2002). 


Low levels of distant proximity reduced aggressive behaviors (Oliver, Oxener, Hearn, & Hall, 2001). 
Escape-maintained problem behaviors decreased when students had opportunities to choose 


among tasks (Romaniuk et al., 2002). 


Increasing response effort. 
& Abt, 2002). 


NONCONTINGENT REINFORCEMENT 


Noncontingent reinforcement (NCR) is an antecedent inter- 
vention in which stimuli with known reinforcing properties are 
delivered on a fixed-time (FT) or variable-time (VT) schedule 
independent of the learner’s behavior.'! Noncontingent reinforce- 
ment may effectively diminish difficult-to-manage behaviors 
because the reinforcers that maintain the behavior are available 
freely and frequently, thereby functioning as an abolishing oper- 
ation (AO) that reduces the motivation to engage in the behavior. 
NCR is an important and effective antecedent intervention for the 
treatment of severe problem behavior displayed by persons with 
intellectual and/or developmental disabilities (Phillips et al., 2017). 

A functional analysis of the reinforcing contingency 
maintaining the problem behavior determines whether NCR is 
implemented with positive reinforcement, negative reinforce- 
ment (escape), or automatic reinforcement. 


NCR with Positive Reinforcement 


Kahng, Iwata, Thompson, and Hanley (2000) provided an excel- 
lent example of applying NCR with positive reinforcement. 
A functional analysis showed that social-positive reinforcement 
maintained self-injurious behavior (SIB) or aggression in three 
adults with developmental disabilities. During baseline, each 
occurrence of SIB or aggression produced attention for two of the 
adults and a small bit of food for the third adult. During the initial 
sessions of NCR, the adults received attention or small bits of 
food on an initial fixed-time (FT) schedule (e.g., 5 seconds). Later, 
the schedule was thinned to a terminal criterion of 300 seconds. 
Figure 26.1 presents the baseline and NCR performances of the 
three participants and shows that the NCR procedure effectively 
decreased occurrences of SIB and aggression. 


NCR Escape 


Kodak, Miltenberger, and Romaniuk (2003) analyzed the 
effects of NCR escape on the instructional task compliance 
and problem behaviors of Andy and John, 4-year-old boys with 
autism. Andy’s task was to point to cards with teacher-specified 


Increasing response effort for pica produced reductions in pica (Piazza, Roane, Kenney, Boney, 


pictures, words, or letters on them. John used a marker to trace 
each letter of written words. Problem behaviors included 
resisting prompts, throwing materials, and hitting. During 
baseline, the therapist gave an instruction for task engagement 
and, contingent on problem behavior following the instruction, 
removed the task materials and turned away from the child for 
10 seconds. During the NCR escape condition, the therapist 
used an initial FT 10-sec schedule for escape, meaning that 
the student had a break from the instructional requests every 
10 seconds of the session. The initial 10-sec FT schedule was 
thinned each time the boy achieved a criterion for two con- 
secutive sessions: 10 seconds to 20 seconds, to 30 seconds, to 
1 minute, to 1.5 minutes, and finally to a terminal criterion of 
2 minutes. The NCR escape procedure increased compliance 
and decreased problem behaviors. 


NCR with Automatic Reinforcement 


Lindberg, Iwata, Roscoe, Worsdell, and Hanley (2003) used 
NCR as a treatment to decrease the self-injurious behavior (SIB) 
of two women with profound intellectual disability. A functional 
analysis documented that automatic reinforcement maintained 
their SIB. The NCR procedure provided Julie and Laura with 
free access to a variety of home-based, highly preferred leisure 
items (e.g., beads, string) that they could manipulate throughout 
the day. Figure 26.2 shows that NCR object manipulation of pre- 
ferred leisure items effectively diminished SIB, and the effects 
were maintained up to a year later. This experiment is important 
because it showed that NCR object manipulation could compete 
with automatic reinforcement to reduce the occurrence of SIB. 
However, treating problem behaviors maintained by automatic 
reinforcement often will require adding other treatments (e.g., 
response blocking) to accomplish clinically important effects 
(Phillips et al., 2017). 


Using NCR Effectively 


The following procedural recommendations identify three 
key elements for enhancing the effectiveness of NCR. (a) The 
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Responses per Minute 


Figure 26.1 Number of SIB or aggressive responses per 
minute during baseline and NCR by three adults with 
developmental disabilities. 


From “A Comparison of Procedures for Programming Noncontin- 
gent Reinforcement Schedules,” by S. W. Kahng, B. A. Iwata, 

|. G. DeLeon, and M. D. Wallace, 2000, Journal of Applied Behavior 
Analysis, 33, p. 426. Used by permission. Copyright 2000 by the 
Society for the Experimental Analysis of Behavior. Reproduced by 
permission. 


% of Intervals 


Figure 26.2 Levels of SIB and object manipulation 
exhibited by Julie and Laura during observations at 
home while NCR was implemented daily. 


From “Treatment Efficacy of Noncontingent Reinforcement 
During Brief and Extended Application,” by J. S. Lindberg, 

B. A. Iwata, E. M. Roscoe, A. S. Worsdell, and G. P. Hanley, 
2003, Journal of Applied Behavior Analysis, 36, p. 14. Copyright 
2003 by the Society for Experimental Analysis of Behavior. Repro- 
duced by permission. 


SIB (Responses per Minute) 


amount and quality of stimuli with known reinforcing properties 
influence the effectiveness of NCR. (b) Most treatments include 
extinction with NCR interventions. (c) Reinforcer preferences 
can change during intervention. That is, the NCR stimuli may 
not continue competing with the reinforcers that maintain the 
problem behavior. DeLeon, Anders, Rodriguez-Catter, and 
Neider (2000) recommended periodically using a variety of 


Noncontingent Reinforcement 


Baseline 


Extinction 


Sessions 


Baseline Noncontingent Reinforcement 


SIB 


Object Manipulation 


Object Manipulation (% Intervals) 


40 
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available stimuli with the NCR intervention to reduce problems 
of changing preferences. 


Functional Behavior Assessment 


The effectiveness of using NCR depends on the correct iden- 
tification of the positive, negative, or automatic reinforcers 
maintaining the problem behavior. Advances in conducting 


functional behavior assessments have greatly improved the 
effectiveness of NCR by accurately pinpointing the contingen- 
cies maintaining the behavior.” 


Emphasizing NCR 


Applied behavior analysts can enhance the effectiveness of an 
NCR intervention by increasing the number of known rein- 
forcing stimuli over the non-NCR condition. For example, 
Ringdahl, Vollmer, Borrero, and Connell (2001) found that 
NCR was ineffective when the baseline condition and the NCR 
condition contained a similar number of reinforcers. NCR was 
effective, however, when the NCR schedule was denser than the 
baseline schedule. Applied behavior analysts can use the rates 
of reinforcement during baseline to establish an initial NCR 
schedule to ensure a discrepancy between baseline and NCR 
conditions. 

Ringdahl and colleagues (2001) suggested three pro- 
cedures for emphasizing reinforcement during the NCR 
intervention: (a) increase the delivery of stimuli with known 
reinforcing properties, (b) use a different schedule of rein- 
forcement at treatment onset (e.g., continuous delivery or 
availability of stimuli with known reinforcing properties), 
and (c) combine differential reinforcement of other behavior 
(DRO) with the NCR treatment package. DRO will decrease 
the adventitious reinforcement of the problem behavior from 
the time-based NCR schedule. 


Time-based NCR Schedules 


Most applications of NCR use a fixed-time schedule (FT) 
for the delivery of stimuli with known reinforcing proper- 
ties. A fixed-time schedule (FT) means that the interval of 
time for the presentation of these stimuli remains the same 
from delivery to delivery. When applied behavior analysts 
program the NCR time interval to vary across deliveries, it 
is called a variable-time schedule (VT). For example, an 
NCR VT schedule of 10 seconds means that, on the aver- 
age, stimuli with known reinforcing properties are presented 
every 10 seconds. This VT schedule could use time intervals 
such as 5, 8, 10, 12, or 15 seconds, arranged to occur in ran- 
dom sequence to produce effective results (Carr, Kellum, & 
Chong, 2001). 

Setting the initial NCR time schedule is an important 
aspect of the NCR procedure. The initial schedule can have an 
impact on the effectiveness of the intervention (Kahng, Iwata, 
DeLeon, & Wallace, 2000). Researchers consistently recom- 
mend an initial dense FT or VT schedule (e.g., Van Camp, 
Lerman, Kelley, Contrucci, & Vorndran, 2000). The therapist 
can set a dense time value (e.g., 4 seconds) arbitrarily. Usually, 
however, it is more effective to set the initial time value based on 
the number of occurrences of the problem behavior, which will 
ensure frequent contact with the NCR stimuli. 

The following procedure can be used to determine an 
initial NCR schedule: Divide the total duration of all baseline 
sessions by the total number of occurrences of the problem 
behavior recorded during baseline, and set the initial interval at 
or slightly below the quotient. For example, if the participant 


Chapter 26 * Antecedent Interventions 665 


emitted 300 aggressive acts during 5 days of the baseline, 
and each baseline session was 10 minutes in duration (i.e., 
600 seconds), then 3000 seconds divided by 300 responses 
produces a quotient of 10 seconds. Accordingly, these baseline 
data suggest an initial FT interval of 7 to 10 seconds. Analysts 
then thin the schedule by adding small time increments to the 
NCR interval. However, thinning a time-based schedule is best 
begun only after the initial NCR interval has produced a reduc- 
tion in the problem behavior. 

Applied behavior analysts have used three procedures 
to thin NCR schedules: (a) constant time increase, (b) propor- 
tional time increase, and (c) session-to-session time increase 
or decrease (Hanley, Iwata, & Thompson, 2001; Van Camp 
et al., 2000). A therapist, for example, can increase the FT or 
VT schedule intervals by using a constant duration of time, 
and decrease the amount of time that the learner has access 
to the NCR stimul by a constant time factor. So, a therapist 
might increase the schedule interval by 7 seconds at each 
opportunity, and each time decrease access to the stimuli by 
3 seconds. 

Likewise, a therapist can increase the FT or VT sched- 
ule intervals proportionately, meaning that each interval is 
increased by the same percentage of time. With this procedure, 
each time interval might be increased by 5%. If the initial time 
interval was 600 seconds, then the next interval would be 630 
seconds (5% X 600 = 30 seconds; 600 + 30 = 630 seconds) 
and so on. 

Finally, a therapist can use the learner’s performance to 
change the schedule interval on a session-to-session basis. For 
example, at the end of a session, the therapist establishes a new 
NCR time interval for the next session by dividing the number of 
problem behaviors that occurred in that session by the duration 
of the session and using that quotient as the next session’s FT 
interval. 

A therapist should adjust the interval if the problem 
behavior worsens during schedule thinning. The duration of 
the NCR interval can be readjusted after control of the problem 
behavior has been re-established. The readjustment, however, 
should proceed more gradually. 


Setting Terminal Criteria 


Applied behavior analysts usually select an arbitrary terminal 
criterion for NCR schedule thinning. Kahng and colleagues 
(2000) concede that research has not established a de facto ter- 
minal criterion for NCR schedule thinning. While a variety of 
time intervals have been used—3 minutes, 5 minutes, 10 min- 
utes—it seems that a S-min FT schedule has found the most 
favor in applied settings. The 5-min FT schedule seems to be 
both practical and effective. 

Box 26.1 describes guidelines for using NCR in the class- 
room suggested by Coy and Kostewicz (2018). 


Considerations for Using NCR 


NCR makes an effective intervention. It has advantages in addi- 
tion to effectiveness, and some disadvantages. Table 26.3 lists 
the advantages and disadvantages of using NCR. 
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BOX 26.1 


Implementing NCR in the Classroom 


Establish context. The first step involves using data 
collection and observation to identify and place a well- 
defined problem behavior in context. Definitions should 
avoid large categories of behavior and the absence 
of behavior, such as “off task,” “disruptive,” or “not 
listening.” Behavioral definitions should include exam- 
ples and nonexamples; for instance, calling out without 
permission includes all instances when the student yells 
or says something without being called on but does not 
include instances of choral responding. After defining 
the behavior, teachers then determine what commonly 
occurs before and after the target behavior. This con- 
textual information provides better insight into why the 
behavior occurs. In addition to anecdotal or observa- 
tional information, more formalized methods can pro- 
vide a well-rounded picture of the target behavior. After 
determining the context surrounding the target behavior, 
teachers collect baseline data on the target behavior, col- 
lecting data at times when the behavior is most likely 
to occur. Typical types of data collection include (a) 
frequency (how many times the behavior occurs within 
a certain amount of time), (b) duration per occurrence 
(how long the behavior occurs), (c) latency (how long it 
takes for the behavior to initiate following a direction or 
request), or (d) interresponse time (IRT; time between 
the end of one instance of the behavior and the beginning 
of another). 

Identify potential reinforcers. NCR relies on careful 
and purposeful use of positive reinforcers (e.g., items 
or events the student enjoys or will work for). The rein- 
forcer should match the common consequence for the 
target behavior. Using a functional reinforcer (such as a 
break from task demands for escape-maintained behav- 
iors or praise for attention-seeking behaviors) gives NCR 
the greatest chance of success. 


3. Deliver reinforcers. During NCR, the functional rein- 
forcer should be provided on a continuous basis and 
not in response to the occurrence of the target behavior. 
Teachers enrich the environment by providing reinforce- 
ment more often than the student currently receives for 
the problem behavior. 

4. Finalize the plan. Any formal behavioral intervention 
must be well organized prior to implementation. Written 
behavior plans include background information, current 
levels of performance, implementation guidance, and 
monitoring or evaluation sections. NCR interventions 
maintain the same requirements. 

5. Implement and evaluate. After reviewing the student's 
baseline behavior data and completing the NCR Planning 
Worksheet, it is time to put the plan into action. Prac- 
titioners implementing NCR must follow the schedule 
closely and carefully. Electronic technologies (e.g., cell 
phone applications) can assist with the implementation of 
NCR. Teachers should be aware of a few considerations 
when implementing NCR. The student may engage in 
the problem behavior just before the end of an interval. 
Instead of immediately providing the reinforcer, with- 
hold the presentation of the reinforcer for approximately 
10 seconds or until the student no longer displays the 
target problem behavior. Once the reinforcer is provided, 
the next NCR interval starts. Practitioners should also 
be prepared for brief escalations in the problem behav- 
ior during NCR. The teacher monitors yet ignores these 
escalations and provides the reinforcer according to the 
current schedule of reinforcement. 


Adapted from “Noncontingent Reinforcement: Enriching the Classroom 
Environment to Reduce Problem Behaviors,” by J. N. Coy and D. E. Kostewicz, 
2018, Teaching Exceptional Children, 50, 301-309. Copyright 2018 by Council 
for Exceptional Children. Used by permission. 


TABLE 26.3 Possible Advantages and Disadvantages of Noncontingent Reinforcement 


ns ee eS 
Advantages 


NCR is easier to apply than positive reductive techniques that require monitoring the client's behavior for the contingent delivery 
of the reinforcer (Phillips et al., 2017). 


NCR helps create a positive learning environment, which is always desirable during treatment. 
A package treatment that includes NCR with extinction may reduce extinction-induced response bursts (Van Camp et al., 2000). 
Chance pairings of appropriate behavior and NCR delivery of stimuli with known reinforcing properties could strengthen and 


maintain those desirable behaviors (Roscoe, Iwata, & Goh, 1998) 


Disadvantages 


e Free access to NCR stimuli may reduce motivation to engage in adaptive behavior. 


¢ Chance pairings of problem behavior and NCR delivery of stimuli with known reinforcing properties could strengthen the problem 
behavior (Van Camp et al., 2000). 


e NCR escape can disrupt the instructional process. 


HIGH-PROBABILITY INSTRUCTIONAL 
SEQUENCE 


Student compliance provides opportunities for the develop- 
ment of many important behaviors. Noncompliance, however, 
is a prevalent problem with persons who have developmen- 
tal disabilities and behavior disorders. The high-probability 
instructional sequence is a nonaversive procedure for improv- 
ing compliance and diminishing escape-maintained problem 
behaviors. The high-probability instructional sequence may 
decrease excessively slow responding to instructions and 
thereby reduce overall time for completing tasks (Mace 
et al., 1988). 

When using a high-probability instructional sequence 
(high-p), the practitioner presents two to five easy-to-follow 
instructions for which the participant has a history of compli- 
ance. When the learner complies with several high-p instruc- 
tions in sequence, the practitioner immediately gives the target 
instruction (1.e., low-p). Engelmann and Colvin (1983) provided 
one of the first formal descriptions of the high-p instructional 
sequence in their compliance training procedure for managing 
severe behavior problems. They used the term hard task in the 
following dressing anecdote. 

Typical Instructional Sequence 
Teacher: ‘Please put on your shirt.” (low-p request) 
Student: Throws a tantrum to avoid or escape the hard 
task request. 
Improved Instructional Sequence with Embedded High-p 
Instructional Sequence 

Teacher: ‘Give me five!” (high-p request) 
Student: 
Teacher: “Allright, nice job! Now, take this ball and put 

it in your pocket.” (high-p request) 
Student: 


Slaps teacher’s outstretched hand appropriately. 


Puts the ball in pocket. 

Teacher: “Great! That is right! Now, please put on the 
shirt.” (low-p request) 

Student: Puts on the shirt without the tantrum. (p. 13, 
text in parentheses added) 


The reductive effects of the high-p instructional sequence 
on noncompliance resemble the abative effects of an AO that 
reduces the current potency of reinforcement for noncompliance 
with the low-probability requests (i.e., reducing the value of 
escape from requests). In effect, “(djuring the high-p sequence, 
compliance with several high-p instructions, and receipt of 
reinforcement contingent upon this compliance, may therefore 
increase compliance with the subsequent low-p instruction” 
(Lipschultz & Wilder, 2017, p. 424). Some behavior analysts 
use behavioral momentum to describe a behavior’s resistance 
to change following an alteration in reinforcement conditions, 
and the term is sometimes used to describe the effects produced 
by the high-p instructional sequence (Mace et al., 1988; Nevin, 
Mandell, & Atak, 1983).° 

Research has demonstrated the high-p instructional 
sequence to be an effective treatment for a wide variety of 
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behavior problems exhibited by participants ranging from pre- 
schoolers to adults, with and without disabilities. Behavioral 
improvements produced by the high-p instructional sequence 
have included compliance by preschoolers (Normand, Kestner, 
& Jessel, 2010; Wilder, Majdalany, Sturkie, & Smeltz, 2015), 
compliance by school-age students with teachers’ academic 
instructions (Axelrod & Zank, 2012; Lee, Belfiore, Scheeler, 
Hua, & Smith, 2004), food acceptance by children with pediatric 
feeding disorders (Dawson et al., 2003; Ewry & Fryling, 2016; 
Patel et al., 2006), increased social interactions (Jung, Sainato, 
& Davis, 2008), and compliance with medical examination 
requests by children with autism (Riviere, Becquet, Peltret, 
Facon, & Darcheville, 2011). 

Killu, Sainato, Davis, Ospelt, and Paul (1998) evaluated 
the effects of the high-p instructional sequence on compliant 
responding to low-p instructions and occurrences of problem 
behaviors by three preschool children with developmental 
delays. The researchers used a compliance criterion of 80% or 
higher for selecting high-p requests for two children and a 60% 
criterion for the third child. Compliance of less than 40% was 
used to select the low-p instructional. 

The request sequence began with the experimenter or a 
trainer presenting three to five high-p requests. When a child 
complied with at least three consecutive high-p requests, a 
low-p request was immediately presented. Praise was provided 
immediately following each compliant response. Figure 26.3 
shows the children’s performances before, during, and after the 
high-p sequence. The sequence delivered by two different train- 
ers increased compliant responding to the low-p requests of the 
three children. Compliant responding maintained across time 
and settings. 


Using the High-p Instructional Sequence 
Effectively 


Select High-p Instructions from Current Repertoire 


The tasks selected for high-p instructions should be in the 
learner’s current repertoire, occur regularly, and have a very 
short duration. Ardoin, Martens, and Wolfe (1999) selected 
high-p instructions by (a) creating a list of requests that corre- 
sponded to student compliance, (b) presenting each request on 
the list for five separate sessions, and (c) selecting as high-p 
requests only those tasks that the student complied with 100% 
of the time. A criterion of 80% compliance or greater for 
high-p instructions is commonly used (e.g., Belfiore, Basile, 
& Lee, 2008; Mace et al., 1988). 

Axelrod and Zank (2012) empirically identified high-p 
and low-p instructions for Charles and Thomas, two fifth- 
grade students who emitted high rates of noncompliance and 
received special education services in a general education 
classroom. The students’ teacher created a list of 40 typi- 
cal classroom instructions and then presented each student 
each instruction 10 times randomly during reading and seat- 
work sessions over a period of 10 days. Instructions with 
80% or greater compliance were categorized as high-p, and 
those with 40% compliance or less were categorized as low- 
p. Commands complied with between 40% and 80% of the 
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Number of Responses 


Figure 26.3 Number of compliant responses to 
low-probability requests delivered by the investigator 
and second trainer across sessions and conditions. 
Participants were given 10 low-p requests each 
session. Dashed lines indicate student absences. 

From “Effects of High-probability Request Sequences on 
Preschoolers’ Compliance and Disruptive Behavior,” by K. Killu, 
D. M. Sainato, C. A. Davis, H. Ospelt, and J. N. Paul, 1998, Jour- 
nal of Behavioral Education, 8, p. 358. Used by permission. 


time were not used in the study. Table 26.4 shows Thomas’ 
percentage compliance with teacher commands. 

Mace (1996) reported that the effectiveness of the high-p 
sequence correlates with the number of increases in high-p requests. 
A high-p sequence with five requests may be more effective than a 
sequence with two requests; but the increase in effectiveness may 
have a trade-off in efficiency. For example, if the same, or nearly 
the same, effectiveness can be obtained with two or three high-p 
instructions as can be obtained with five or six, a teacher might 
select the shorter sequence because it is more efficient. When par- 
ticipants consistently comply with the low-p requests, the trainer 
should gradually reduce the number of high-p requests. 


Present Requests Rapidly 


The high-p requests should be presented in rapid succession, 
with short inter-request intervals. The first low-p request should 
immediately follow the reinforcer for high-p compliance (Davis 
& Reichle, 1996). 


Acknowledge Compliance 


The learner's compliance should be acknowledged immedi- 
ately. Notice how the teacher in the previous dressing example 
acknowledged and praised the student’s compliance (“All right, 
nice job!”) before presenting the next request. 
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Use Potent Reinforcers 


Individuals may emit aggression and self-injurious behaviors to 
escape from low-p request demands. Mace and Belfiore (1990) 
cautioned that social praise may not increase compliance if 
motivation for escape behavior is high. Therefore, high-quality 
positive stimuli immediately following compliance will increase 
the effectiveness of the high-p intervention. 

Table 26.5 summarizes guidelines for using the high-p 
instructional sequence. 


FUNCTIONAL COMMUNICATION TRAINING 


Functional communication training (FCT) establishes an 
appropriate communicative behavior to compete with problem 
behaviors evoked by a motivating operation (MO). In contrast 
to NCR and the high-p instructional sequence that alter current 
MOs, FCT develops communicative behaviors that are sensitive 
to existing MOs. The practitioner uses differential reinforce- 
ment of alternative behavior (DRA) (see Chapter 25) to teach 
an adaptive response that produces the same reinforcer that has 
maintained the problem behavior, thereby making the alternative 
behavior functionally equivalent to the problem behavior. The 
alternative communicative responses can take many forms, such 
as vocalizations, signs, communication boards, word or picture 
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TABLE 26.4 Thomas’ Percentage Compliance with Teacher Commands During Identification Period 


Instruction Category Instruction Compliance During 


Identification Period (%) 


ee ee eS I ere areerkeiitt nit Pig SS ee WSO Woes 


High-p Give teacher high five 100 
Give teacher fist pound 100 
Put hands on desk 90 
Pick up writing utensil (e.g., pencil, pen, color pencil) 90 
Put writing utensil in desk 80 
Put down writing utensil 80 
Put hands on lap 80 
Stand up 80 
Slide chair closer to desk 80 
Walk over to group 80 
Sit down with group 80 

Low-p Stop talking and continue working on assignment 10 
Put inappropriate materials (e.g., toy, drawing paper, 10 
color pencil box) in desk 
Stop talking and raise your hand 20 
Walk back to desk after unauthorized leave 20 
Read sentence or passage from text 30 
Write name on paper 30 
Begin working on assignment 30 
Take writing utensil out of mouth 30 
Walk over to the teacher's desk 40 


Based on “Increasing Classroom Compliance: Using a High Probability Command Sequence with Noncompliant Students” by M. J. Axelrod 
and A. J. Zank, 2012, Journal of Behavioral Education, 21, p. 125. 


TABLE 26.5 Guidelines for Using the High-p Instructional Sequence 


¢ Empirically identify high-p and low-p instructions before using the procedure. Compliance with high-p instructions should be 
80% or greater; with low-p, 40% or less. 

© Do not use the high-p request sequence just after an occurrence of the problem behavior. The student might learn that 
responding to a low-p request with the problem behavior will produce a series of easier requests. 

° Present the high-p request sequence at the beginning and throughout the instructional period to reduce the possibility of problem 
behaviors producing reinforcement. 

° Present high-p instructions rapidly, with brief intertrial intervals (1-5 sec) and reinforce compliance with each high-p instruction. 

e {If the participant does not comply with high-p instructions, stimuli associated with the low-p instruction should be identified and 
eliminated, or other high-p instructions should be used. 

© Present first low-p instruction immediately following compliance with three to five high-p instructions. 

© Present high-quality reinforcers for compliance with high-p instructions. 

e Fade the ratio of high-p to low-p instructions to 1-to-1. 

© Be alert to treatment drift in which practitioners present only high-p instructions to avoid escape-motivated aggression and 
self-injury evoked by low-p requests. 

e |f the procedure is ineffective, consider adding an extra intervention component. 


Sources; Adapted from Belfiore, Basile, and Lee (2008); Davis and Reichle (1996); Dawson et al. (2003); Horner, Day, Sprague, O’Brien, and Heathfield 
(1991); Lipschultz and Wilder (2017); Mace (1996); Mace et al. (1988); Normand, Kestner, and Jessel (2010); Penrod, Gardella, and Fernand (2012); Pitts and 


Dymond (2012); and Wilder, Majdalany, Sturkie, and Smeltz (2015). 
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cards, vocal output systems, or gestures (Brown et al., 2000; 
Shirley, Iwata, Kahng, Mazaleski, & Lerman, 1997). 

Carr and Durand (1985) described functional communi- 
cation training as a two-step process: (1) conduct a functional 
behavior assessment to identify the stimuli with known reinforc- 
ing properties that maintain the problem behavior and (2) use 
those stimuli as reinforcers to develop an alternative behavior to 
replace the problem behavior. FCT is an effective treatment for 
many problem behaviors maintained by social attention. 

FCT-based interventions typically involve several behavior 
change tactics in addition to teaching the alternative communica- 
tive response. For example, applied behavior analysts often use a 
combination of response prompting, time-out, physical restraint, 
response blocking, redirection, and extinction to address the dif- 
ficult behavior. 

Durand (1999) used FCT in school and community settings 
to reduce the challenging behaviors of five students with severe 
disabilities. Durand first completed functional behavior assessments 
to identify the objects and activities maintaining the problem behav- 
iors. Following the functional assessments, the students learned to 
use a communication device that produced digitized speech to 
request objects and activities identified during the functional behav- 
ior assessments. The five students reduced the occurrences of their 
problem behaviors in school and community settings in which they 
used digitized speech to communicate. Figure 26.4 presents data on 
the percentage of intervals of problem behaviors occurring in com- 
munity settings for each student. These data are socially significant 
because they show the importance of teaching skills that recruit 
reinforcement in natural settings, thereby promoting generalization 
and maintenance of intervention effects. 


Effective Use of FCT 


A number of factors enhance the use of FCT, an appropriate 
and effective antecedent intervention for decreasing behavior. 
We discuss the principal factors in the next sections. How- 
ever, we recognize Andzik, Cannella-Malone, and Sigafoos’s 
(2016) point that few studies have analyzed practitioners’ 
fidelity in implementing FCT. Their review sheds light on 
this important issue. 


Dense Schedule of Reinforcement 


The alternative communicative response should produce the 
reinforcers that maintain the problem behavior on a continuous 
schedule of reinforcement during the early stages of communi- 
cation training. 


Decreased Use of Verbal Prompts 


While teaching the alternative communicative response, verbal 
prompts such as saying “look” or “watch me” are used often. After 
the communicative response is established firmly, the trainer 
should gradually reduce any verbal prompts and, if possible, elim- 
inate them altogether to remove any prompt dependence associ- 
ated with the intervention (Miltenberger, Fuqua, & Woods, 1998). 


Behavior Reduction Procedures 


The effectiveness of FCT is likely to be enhanced if it is indi- 
vidualized within a treatment package consisting of extinction 
or time-out (Shirley et al., 1997). 
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Figure 26.4 Percentage of intervals of challenging behavior 
by each of the five participants in baseline and FCT in 

community settings. The hatched bars show the percentage 
of intervals of unprompted communication by each student. 


From “Functional Communication Training Using Assistive Devices: Recruit- 
ing Natural Communities of Reinforcement,” by V. M. Durand, 1999, 
Journal of Applied Behavior Analysis, 32, p. 260. Copyright 1999 by the 
Society for Experimental Analysis of Behavior. Reproduced by permission. 


Schedule Thinning 


Thinning the reinforcement schedule for a firmly established com- 
municative response is an important part of the FCT treatment 
package. Time-based procedures, such as constant time increase, 
proportional time increase, and session-to-session time increase, 
are not appropriate for schedule thinning with the alternative com- 
municative response. They are incompatible with the methods 
used to differentially reinforce the alternative communicative 
behavior because the FCT intervention does not alter the MO 
that evokes the problem behavior. The alternative communicative 
behavior must remain sensitive to the evocative function of the 
MO to compete with the problem behavior. For example, con- 
sider a child with developmental disabilities who has a history 
of self-stimulatory behavior when presented with a difficult task. 
A therapist teaches the child to ask for assistance with these tasks 
(i.e., the alternative communicative behavior), which reduces the 
self-stimulatory behavior. After firmly establishing asking for 
assistance, the therapist or caregiver should not decrease instances 
of providing that assistance because such a decrease might break 
the alternative communicative behavior—reinforcer contingency, 
and risk the re-emergence of self-stimulation. 


Hanley, Piazza, Fisher, Contrucci, and Maglieri (2001) 
recommended a procedure for schedule thinning that used 
a dense fixed-interval schedule of reinforcement (6 .o0h] 
2 sec, FI 3 sec) during the initial teaching of the alternative 
communicative response. Once the communicative response 
was established, they suggested gradually thinning the FI 
schedule. This procedure, in contrast to time-based procedures, 
maintained the contingency between responding and reinforce- 
ment. They cautioned that thinning the FI schedule during 
FCT interventions could produce undesirable high rates of the 
alternative communicative response that could disrupt home or 
classroom settings. Hanley and colleagues further suggested the 
use of picture cues and external “clocks” to announce when rein- 
forcement is available, to control the undesirable high rate of the 
communicative response. 

For further review of research on FCT schedule thinning 
and recommendations for practice, see Greer, Fisher, Saini, Owen, 
and Jones (2016) and Hagopian, Boelter, and Jarmolowicz (2011). 


Table 26.6 summarizes the advantages and disadvantages 
of using FCT. 


DEFAULT INTERVENTIONS 


Smith (2011) defined default antecedent interventions as treat- 
ments ‘“‘whose effects do not depend on specific identification of 
the variables that set the occasion for and maintain the problem 
behavior” (p. 297) and described three principal types: anteced- 
ent exercise, enriched environment, and restraint. Smith and 
Iwata (1997) noted that antecedent exercise is used regularly for 
self-injury, aggression, and stereotypy; enriched environment is 
most frequently applied for problem behaviors maintained by 
automatic positive reinforcement; and restraint is an appropriate 
intervention for severe behavior problems in limited selected 
situations. 


Antecedent Exercise 


Antecedent exercise requires the individual to engage in some 
effortful form of aerobic activity (e.g., walking, jogging, danc- 
ing, calisthenics, roller skating) prior to receiving a low-p task 
to complete. Upon completion of the exercise, the individual 
is directed to complete the task, and performance is recorded. 
Research over a few decades has reported that anteced- 
ent exercise has decreased many maladaptive behaviors, 
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such as SIB and aggression, as well as diverse behaviors 
such as inappropriate vocalizations, repetitive movements, 
and talking-out, out-of-seat, and stereotypic behaviors. “It 
is important to note that in contrast to other interventions 
involving effortful activity, such as overcorrection (Foxx & 
Azrin, 1972, 1973), antecedent exercise is implemented inde- 
pendently of occurrences of the problem behavior” (Smith, 
2011, pp. 298-299). 

Celiberti, Bobo, Kelly, Harris, and Handleman (1997) 
used a multielement design to analyze the effects of two lev- 
els of antecedent exercise on the self-stimulatory behavior of a 
5-year-old boy with autism. 


The exercise conditions were applied immediately 

before periods of academic programming. Maladap- 

tive self-stimulatory behaviors were separately tracked, 
enabling identification of behaviors that were more sus- 
ceptible to change (e.g., physical self-stimulation and “out 
of seat” behavior) versus those that were more resistant 
(e.g., visual self-stimulation). Examination of temporal 
effects indicated a decrease in physical self-stimulation 
and “out of seat” behavior, but only for the jogging condi- 
tion. In addition, sharp reductions in these behaviors were 
observed immediately following the jogging intervention 
and gradually increased but did not return to baseline lev- 
els over a 40 min period (p. 139). (See Figure 26.5.) 


Enriched Environment 


An enriched environment (EE) intervention provides noncon- 
tingent access to preferred sources of reinforcement (e.g., toys, 
games, social and recreation activities). This noncontingent 
access to preferred sources of reinforcement arranges a compe- 
tition between the EE intervention and the stimulation provided 
by the problem behavior (Horner, 1980). When effective, this 
competition decreases the problem behavior. 

Vollmer, Marcus, and LeBlanc (1994) examined inter- 
ventions for three participants following inconclusive func- 
tional analyses. These participants included Ron and Korey, 
3-year-old boys with severe disabilities and self-injurious 
behaviors (SIB), and Rhonda, a 4-year-old-girl, also with 
severe disabilities, as well as chronic hand mouthing. During 
a stimulus preference assessment (see Chapter 11), Vollmer 
et al. arranged an array of teacher- or parent-suggested items 


TABLE 26.6 Possible Advantages and Disadvantages of Functional Communication Training 


Advantages 


e Excellent chance for generalization and maintenance of the alternative communicative response because the communicative 
response often functions to recruit reinforcement from significant others (Fisher, Kuhn, & Thompson, 1998). 


e May have high social validity. Participants report preferences for FCT over other procedures to diminish behavior 


(Hanley et al., 1997). 


Disadvantages 


e FCT treatment packages usually include extinction, which may produce undesirable effects (see Chapter 22). 

e The extinction procedure is difficult to use consistently, allowing for intermittent reinforcement of problem behaviors. 

e Participants may emit inappropriately high rates of the alternative communicative response (Fisher et al., 1998). 

e Recruitment of reinforcement can occur at inconvenient or impossible times for the caregiver (Fisher et al., 1998). 

© FCT leaves intact the environment that evoked the problem behavior, which may limit its overall effectiveness (McGill, 1999). 
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Figure 26.5 Mean percentage of intervals of physical self- 
stimulatory behaviors and visual self-stimulatory behaviors 
and mean frequency of “out of seat” behavior across four 
10-min blocks of time after implementation of treatment 
conditions for baseline (open square), walking (open 
triangle), and jogging (filled circle). 
From “The Differential and Temporal Effects of Antecedent Exercise on 
the Self-Stimulatory Behavior of a Child with Autism,” by D. Celiberti, 


H. E. Bobo, K. S. Kelly, S. L. Harris, and J. S. Handleman, 1997, Research 
in Developmental Disabilities, 18, p. 147. Used by permission. 


for each individual, including food, drinks, toys, and balls. 
Using standard assessment protocols, the researchers dis- 
covered the preferred and differential reinforcers for each 
child. For example, Rhonda preferred sound-making toys and 
cookies, while the other children preferred different items. 


The EE interventions of preferred stimuli decreased the 
aberrant behaviors of the three children. 


Restraint 


Restraint involves physically limiting, prohibiting, or securing 
an individual in such a manner that the target behavior cannot 
occur. Three forms of restraint as an antecedent intervention 
appear in the behavior analysis literature: personal restraint, 
protective equipment restraint, and self-restraint. 


Personal Restraint 


Personal restraint is “implemented in situations where problem 
behavior is highly likely to occur [and] involves caregivers phys- 
ically securing and holding body parts, so that problem behavior 
cannot occur” (Smith, 2011, p. 300). Personal restraint as an 
antecedent intervention should not be confused with response 
blocking, which entails stopping a problem behavior that has 
been initiated (see Chapter 14). 


Protective Equipment Restraint 


Protective equipment restraints (e.g., padded helmets, arm 
splints, boxing gloves, padded hand mitts, safety belts, safety 
straps) are used to protect individuals from serious self-injury 
and to reduce the frequency of SIB (Smith, 2011). For exam- 
ple, Mazaleski, Iwata, Rogers, Vollmer, and Zarcone (1994) 
used oven mitts for treating the chronic hand mouthing of two 
women with profound intellectual disabilities. Marty was 33 
years old, and Ava was 34 years old. Both lived in a pub- 
lic residential facility. Marty and Ava emitted chronic hand 
mouthing that produced mild tissue damage. In addition to 
hand mouthing, Ava also engaged in tongue pulling, which 
resulted in periodic irritation and ulcers. After collecting base- 
line on hand mouthing, two conditions were implemented. In a 
noncontingent-equipment condition, one participant wore oven 
mitts throughout an entire session. In a contingent-equipment 
condition, mitts were placed on the participant only following 
hand mouthing episodes. Results showed that both noncontin- 
gent and contingent mitts produced a large decrease in the rate 
of hand mouthing. 


Self-restraint 


Persons with self-injurious behavior often apply self-restraint. 
Wendy, a 12-year-old girl with Down syndrome, received 
treatment for hand-to-head hitting that caused vision loss and 
other facial injuries. Wendy applied self-restraint by “sitting on 
her hands or placing her arms between folded legs or wrapped 
in clothing. Wendy’s caregivers reported that self-restraint 
was encouraged and seldom blocked, but that self-restraint inter- 
fered with adaptive skills” (Scheithauer, O’Connor, & Toby, 
2015, p. 908). 

The unique relationship between SIB and self-restraint 
raises important questions related to the controlling variables 
during treatment (Fisher & Iwata, 1996). Will the same rein- 
forcement contingency maintain both behaviors? Do SIB and 


self-restraint have a shared history, but different contingencies 
of reinforcement? Will access to SIB (or self-restraint) function 
as reinforcement for self-restraint (or SIB)? Research has begun 
to address these questions. For example, Smith, Lerman, and 
Iwata (1996) suggested that self-restraint provided positive rein- 
forcement for SIB. Fisher, Grace, and Murphy (1996) proposed 
that self-restraint was maintained by negative reinforcement by 
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escape from blocking SIB (e.g., pain). Derby, Fisher, and Piazza 
(1996) found that self-restraint and SIB were members of the 
same functional response class (both behaviors were maintained 
by social attention) and suggested noncontingent reinforcement 
as a possible treatment for self-restraint and SIB. Much research 
will be required to fully understand the variables that account for 
the effects of self-restraint. 


SUMMARY 


Defining and Classifying Antecedent Interventions 


1. An antecedent intervention is implemented prior to and 
independent of the target behavior’s occurrence. 


2. Smith (2011) identified two broad categories of antecedent 
interventions: Function-based antecedent interventions 
manipulate at least one component of the contingency 
identified as maintaining the problem behavior; default 
interventions do not depend on identifying the variables 
that evoke and maintain the problem behavior. 


3. Antecedent interventions can also be classified by whether 
or not differential consequences are required for behavior 
change. Contingency-independent antecedent interventions 
do not depend on differential consequences for the target 
behavior; contingency-dependent antecedent interventions 
depend on differential consequences for the target behav- 
ior (or alternative behavior) in the presence or absence of 
the antecedent. 


Noncontingent Reinforcement 


4. NCR is an antecedent intervention in which stimuli with 
known reinforcing properties are delivered on a fixed-time 
(FT) or variable-time (VT) schedule independent of the 
learner’s behavior. 


5. An NCR-enriched environment may function as an abol- 
ishing operation (AO) that reduces motivation to engage in 
the problem behavior. 


6. A functional analysis of the reinforcing contingency main- 
taining the problem behavior determines whether NCR is 
implemented with positive reinforcement, negative rein- 
forcement (escape), or automatic reinforcement. 


High-probability Instructional Sequence 


7. The basic procedure for implementing the high-probability 

instructional sequence (high-p) is as follows: 

¢ First, present two to five instructions for which the par- 
ticipant has a history of compliance and reinforce each 
compliant response. 

* Second, immediately after the participant has complied 
with several high-p instructions in sequence, present a 
target (i.e., low-p) instruction. 


¢ Third, gradually fade the ratio of high-p to low-p 
instructions to 1-to-1. 


8. The reductive effects of the high-p instructional sequence 
on noncompliance resemble the abative effects of an 
AO that reduces the current potency of reinforcement 
for noncompliance with the low-probability requests 
(i.e., reducing the value of escape from requests). 


9. Behavioral momentum is sometimes used to describe 
the effects produced by the high-p instructional 
sequence. 


Functional Communication Training 


10. FCT establishes an appropriate communicative behavior to 
compete with problem behaviors evoked by a motivating 
operation (MO). 


11. The practitioner uses DRA to teach an adaptive response 
that produces the same reinforcer that has maintained the 
problem behavior, thereby making the alternative commu- 
nicative response functionally equivalent to the problem 
behavior. 


Default Interventions 


12. Default antecedent interventions do not depend on 
identifying the variables that evoke and maintain the 
problem behavior. 


13. Antecedent exercise requires the participant to engage in 
some form of aerobic activity prior to receiving an instruc- 
tion or entering an environment in which the problem 
behavior has frequently occurred. 


14. The noncontingent access to preferred sources of rein- 
forcement provided by an enriched environment interven- 
tion arranges a competition between the intervention and 
the stimulation provided by the problem behavior. 


15. Three methods of restraint—physically limiting, pro- 
hibiting, or securing an individual in such a manner 
that the problem behavior cannot occur—have been 
evaluated in the applied behavior analysis literature: 
personal restraint, protective equipment restraint, and 
self-restraint. 
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antecedent intervention functional communication noncontingent reinforcement (NCR) 
behavioral momentum training (FCT) variable-time schedule (VT) 
fixed-time schedule (FT) high-probability (high-p) request 

sequence 
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MULTIPLE-CHOICE QUESTIONS 


1. Functional communication training is considered to be: 
a. Consequent intervention 
b. Punishment strategy 
c. Antecedent intervention 
d. Behavioral momentum technique 


Hint: (See “Functional Communication Training”’) 


2. Behaviors selected for a high-p request sequence should: 
a. Be a part of the learner’s current repertoire 
b. Be new behavior(s) the learner has to acquire 
c. Have a very long duration of occurrence 


d. Be those behaviors with which the learner is not 
compliant 


Hint: (See “High-Probability Instructional Sequence”’) 
3. To effectively implement a high-p request sequence, it 1s 
important to: 


a. Select behaviors not in the learner’s current behavior 
repertoire 


b. Not acknowledge compliance 
c. Use reinforcers with limited reinforcing value 
d. Present requests in a rapid sequence 


Hint: (See “High-Probability Instructional Sequence’) 


4. Noncontingent reinforcement involves the presentation of: 


a. Stimuli with unknown reinforcing properties that are 
delivered on a fixed-time or variable time schedule 
independent of the learner’s behavior 


b. Stimuli with known reinforcing properties that are 
delivered on a fixed-time or variable-time schedule 
independent of the learner’s behavior 


Cc 


Stimuli with known reinforcing properties that are 
delivered on a fixed-ratio or variable-ratio schedule 
independent of the learner’s behavior 


d. Stimuli with known reinforcing properties that are 
delivered on a fixed-time or variable-time schedule 
dependent on the learner’s behavior 


Hint: (See “Noncontingent Reinforcement’) 
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. Noncontingent reinforcement may effectively decrease 


problem behavior because: 

a. Reinforcers that maintain the problem behavior are 
available freely and frequently. 

b. Reinforcers that maintain the problem behavior are 
withheld and made contingent upon the occurrence of 
alternative behaviors. 


Reinforcers that maintain the problem behavior are 
presented only when the problem behavior occurs. 


© 


d. Reinforcers do not maintain the problem behavior 
presented regardless of the behaviors. 


Hint: (See “Noncontingent Reinforcement’) 


. Functional communication training: 


Develops alternative behaviors that are sensitive to 
abolishing operations 


= 
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Develops alternative behaviors that create changes in 
motivating operations 


c. Develops alternative behaviors that are sensitive to 
establishing operations 


d. Develops alternative behaviors that produce punishing 
effects 


Hint: (See “Functional Communication Training”) 


. Antecedent interventions are also known as: 


a. Antecedent procedures 

b. Antecedent control 

c. Antecedent manipulations 
d. All of these 


Hint: (See “Defining and Classifying Antecedent 
Interventions”’) 
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ESSAY-TYPE QUESTIONS 


1. What is an antecedent intervention? 


Hint: (See “Defining and Classifying Antecedent 
Interventions”) 


2. Describe noncontingent reinforcement (NCR) and its 


NOTES 


il 


application as an antecedent intervention strategy. 


Hint: (See “Noncontingent Reinforcement”) 


Interpreting the NCR procedure as presenting a reinforcer is techni- 
cally inconsistent with the definition of reinforcement, which requires a 
response-reinforcer relation (Poling & Normand, 1999). We use NCR to 
describe the time-based delivery of stimuli with known reinforcing proper- 
ties because applied behavior analysts have continued its use, and the term 
NCR serves a descriptive purpose. 


3. Describe high-probability (high-p) request sequence and 


its application as an antecedent intervention strategy. 


Hint: (See “High-Probability Instructional Sequence”) 


. Describe functional communication training (FCT) and its 


application as an antecedent intervention strategy. 


Hint: (See “Functional Communication Training”) 


. Chapter 27 provides a detailed description of functional behavior 
assessment. 


. To learn more about behavioral momentum theory describing response rate 


and resistance to change as two separate aspects of operant behavior, see 
Craig, Nevin, and Odum (2014); Greer, Fisher, Romani, and Saini (2016); 
Nevin (1992); and Shahan and Sweeney (2011). 


ead pte dh celiathigen ee taaat ia ant 
romana nee sihld 
enont arias ee aE (high pldeue 


“word lern Nehavicd hoonse 
‘ er pve ean La ass SY 8 ; ihe 
Weems, " “gy re me ea shebes 


Poulin ia he! 


Fis 


are 
shemnitive ‘behave “~ 
- Rein hinders thar Die has ih the problen i 


— ‘ vceac ited on! ¥ when heatoh heat Tae! ii 


aT Ties SconTeut: peperiiatiy | 
r(x) the learner lean 1 eee rrr = meintnnoers te fiw yulinitiore thie pnb mt 

ing ot anausi i oroented megardlies OF the betaviorn. 
iT tt rena - a 


_ . ~ junk: (See “Noncunten wat Rein fcecemvenet 
ieee With Which the learner is eo < rg a 


» Cpectawat someumicalide tranny: i 
bability Instructional Sequetar ) - = De velop aie ates bela viaes hat we semey 
fi , ie ale stisaae Tipevacns ~ “a — 
armen i) NE mp request uch & 7) @= , ae bus 
&® Develo 4 alternate soles vrOre tueh-eeaie Cf ages 
ww MVALAY CE COPPOC eli ree - ace 


mae oi The lenrritt 4 current behav ie 
Des — alerfumve belay line that are ‘ecngitive 10 
. Geledige Comuptionce 7 a Hbtuhig ope +. (i 
ae a , Se ih Baewatops ateinadive bela "prac 


ie teintorcers Wh lifted reinforoayg value - —% 


eit Fequests i a hip science : 


he fé nt “| Fel es iui wil 4 ‘mununicatian Th ining’ i 
ee igth- Pr ebarality inatrectianal . scoucKCE J 2 
= eee ary alod Renate ase ian 
oe) igen! feintercemem mvilvey the presdmaing of;- 


oa el The 
Ds Mal will umkne we reinleo Atte pertics (hatwe — 


. Verudenl ptoshie _ — 


Geliveral cna fixedime wr artable mic acheitidte 7 ? ye ae ; = —— 
aieperwentat tre hovetet’s Shalon : C. VOMOCC RIOT ATUL - aa 

Se, Siinuli with known feel cing propretics that are * Ati af Greve : wih 
deterred on @ fiked-linw or eariable time schedule tea Wwe ‘Dreit ning emi iesasty wig Quscedent ote % 
jndeiendernt of (ie barnes § behavior Lad ow Arise) <% 


& Sool with known veintieriny properties thar are 


Palivered an « fyted-catio dr eanable-ratio scheelule 

, 
independent of the leorner + bchovics 

7 ; 7 

: a. Stimwil, With Viney iiowrCioa properi¢es thatare 

cee icipvedics!l tu «! vai t imedin chile “a 
jes ert cm ity 44 : i _ 
f i if reciuto iracert) 


PART 10 


Functional Assessment 


Changing behavior is often challenging, perplexing, and frustrating. The behavior analyst’s task 

of determining what to do and how to do it can be particularly difficult when helping people with 
chronic problem behaviors that have been resistant to change. In Chapter 27 Stephanie Peterson and 
Nancy Neef describe an assessment process that explores the function (or purpose) a behavior serves 
for a person. Functional behavior assessment enables behavior analysts to make empirically based 
hypotheses for why problem behaviors occur, information that can point to the design of effective 
interventions. Peterson and Neef describe the basis for functional behavior assessments and their role 
in treating and preventing problem behaviors, and present case examples illustrating three methods 
for conducting functional behavior assessments. 
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Functional Behavior Assessment 


Stephanie M. Peterson and Nancy A. Neef 


LEARNING OBJECTIVES 


e Name the functions that problem behavior can serve. 


° Describe the role functional behavior assessment plays in preventing problem behavior and 


developing interventions for problem behavior. 


State the different methods of conducting a functional behavior assessment. 


e State the primary characteristics of and rationale for conducting a descriptive assessment. 


° Describe various methods for gathering descriptive assessment data and under what 


circumstances each is appropriate. 


* Given a set of descriptive data, interpret the data to form a hypothesis regarding the possible 


function of problem behavior. 


e State the primary characteristics of and rationale for conducting a functional analysis as a 


form of functional behavior assessment. 
¢ Describe how to conduct a functional analysis. 


° Given a set of data from a functional analysis, interpret the data to determine the function 


of problem behavior. 


When it is time to wash hands before lunch, Stella turns the 
handles of the faucet and places her hands under the running 
water, but Flo screams and tantrums. The teacher is confused 
about why Stella performs so well in this context and Flo 
performs so poorly. Consequently, she is at a loss as to how 
to respond when Flo screams and tantrums. The crying and 
tantrums could occur for a variety of reasons. To understand 
why they are occurring, the teacher could perform a func- 
tional behavior assessment of Flo’s problem behaviors. This 
assessment may provide information valuable for creating an 
effective behavior intervention. Consistent with the scientific 
precept of determinism described in Chapter 1, behaviors— 
including problem behaviors—are lawfully related to other 
events in the environment. Functional behavior assessment 
(FBA) enables hypotheses about the relations among specific 
types of environmental events and behaviors. Specifically, 
FBA is designed to obtain information about the purposes 
(functions) a behavior serves for a person. This chapter 
describes the basis for FBA, its role in the intervention and 
prevention of behavior difficulties, and alternative approaches 
to functional assessment. We will revisit Flo’s problem 
behavior throughout this chapter to illustrate. 


FUNCTIONS OF BEHAVIOR 


Evidence from decades of research indicates that both desir- 
able and undesirable behaviors, whether washing hands or 
screaming and tantrumming, are learned and maintained 
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through interaction with the social and physical environment 
(see Schlinger & Normand, 2013, for a review). As explained in 
Chapters 11 and 12, these behavior—environment interactions are 
described as positive or negative reinforcement contingencies. 
Behaviors can be strengthened by either “getting something” or 
“getting out of something.” 

FBA is used to identify the type and source of reinforce- 
ment for challenging behaviors as the basis for intervention efforts 
designed to decrease the occurrence of those behaviors. FBA can 
be thought of as a reinforcer assessment of sorts. It identifies the 
reinforcers currently maintaining problem behavior. Those rein- 
forcers might be positive or negative social reinforcers provided by 
someone who interacts with the person, or automatic reinforcers 
produced directly by the behavior itself. The idea behind FBA is 
that if these reinforcement contingencies can be identified, then 
interventions can be designed to decrease problem behavior and 
increase adaptive behavior by altering these contingencies. FBA 
fosters proactive, positive interventions for problem behavior. 
Although reinforcement contingencies are discussed in other 
chapters, a brief review of their role in FBA is warranted. 


Positive Reinforcement 


Social Positive Reinforcement (Attention) 


Problem behavior often results in immediate attention from 
others, such as head turns; surprised facial expressions; repri- 
mands; attempts to soothe, counsel, or distract; and so on. These 
reactions can positively reinforce problem behavior (even if 


inadvertently), and the problem behavior is then more likely to 
occur in similar circumstances. Problem behavior maintained by 
positive reinforcement in the form of reactions from others can 
often occur in situations in which attention is otherwise infre- 
quent, whether because the person does not have a repertoire to 
gain attention in desirable ways or because others in the envi- 
ronment are typically otherwise occupied. 


Tangible Reinforcement 


Many behaviors result in access to reinforcing materials or other 
stimuli. Just as pressing a button on the television remote changes 
the channel to a desired television show, problem behaviors can 
produce reinforcing outcomes. A child may cry and tantrum until 
a favorite television show is turned on; stealing another child’s 
candy produces access to the item taken. Problem behaviors may 
develop when they consistently produce a desired item or event. 
This often occurs because providing the item temporarily stops 
the problem behavior (e.g., tantrum), although it can have the 
inadvertent effect of making the problem behavior more probable 
in the future under similar circumstances. 


Automatic Positive Reinforcement 


Some behaviors do not depend on the action of others to provide an 
outcome; some behaviors directly produce their own reinforcement. 
For example, thumb sucking might be reinforced by physical 
stimulation of either the hand or the mouth. Hand flapping while 
looking at a light might be reinforced by the visual stimulation of 
the light being blocked and revealed, as the hand flapping might 
simulate a flashing light. Swinging a tennis racket correctly as 
a tennis ball approaches and hits the head of the racket might 
be reinforced by the “pop” sound the ball makes when it hits 
the “sweet spot” of the racket. Placing hands under the faucet 
might be reinforced by the warm sensation of water on the skin. 
A behavior is assumed to be maintained by automatic reinforcement 
only after social reinforcers have been ruled out (¢.g.., when the 
behavior occurs even when the individual is alone). 


Negative Reinforcement 


Social Negative Reinforcement (Escape) 


Many behaviors are learned as a result of their effectiveness in ter- 
minating or postponing aversive events. Hanging up the phone ter- 
minates interactions with a telemarketer; completing a task or chore 
terminates requests from others to complete it or the demands asso- 
ciated with the task itself. Problem behaviors can be maintained in 
the same way. Behaviors such as aggression, self-injurious behav- 
ior (SIB), and bizarre speech may terminate or avoid unwanted 
‘nteractions with others. For example, noncompliance postpones 
engagement in a nonpreferred activity, and disruptive classroom 
behavior often results in the student being sent out of the class- 
room, thereby allowing escape from instructional tasks or teacher 
demands. All of these behaviors can be strengthened by negative 
reinforcement to the extent that they enable the individual to escape 
or avoid difficult or unpleasant tasks, activities, or interactions. 


Automatic Negative Reinforcement 


Aversive stimulation, such as a physically painful or uncom- 
fortable condition, is a motivating operation that makes its 


Chapter 27 * Functional Behavior Assessment 679 


termination reinforcing. Behaviors that directly terminate 
aversive stimulation are therefore maintained by negative 
reinforcement that is an automatic outcome of the response. 
Automatic negative reinforcement can account for behaviors 
that are either appropriate or harmful. For example, putting 
calamine lotion on a poison ivy rash can be negatively reinforced 
by alleviation of itching, but intense or prolonged scratching that 
breaks the skin can be negatively reinforced in the same manner. 
Some forms of SIB may distract from other sources of pain, 
which may account for their correlation with specific medical 
conditions (e.g., DeLissovoy, 1963). 


Function Versus Topography 


Several points can be made from the previous discussion of the 
sources of reinforcement for behavior. It is important to rec- 
ognize that environmental influences do not make distinctions 
between desirable and undesirable topographies of behavior; 
the same reinforcement contingencies that account for desirable 
behavior can also account for undesirable behavior. For example, 
Stella, the child who washes and dries her hands before lunch, 
has probably received praise for doing so. Flo, the child who 
frequently engages in tantrums in the same situation, may have 
received attention (in the form of reprimands). Both forms of 
attention have the potential to reinforce the respective behaviors. 

Likewise, the same topography of behavior can serve dif- 
ferent functions for different individuals. For example, tantrums 
may be maintained by positive reinforcement in the form of 
attention for one child and by negative reinforcement in the form 
of escape for another child (e.g., Kennedy, Meyer, Knowles, & 
Shukla, 2000). 

Because different behaviors that look quite different 
can serve the same function, and behavior of the same form 
can serve different functions under different conditions, the 
topography, or form, of a behavior often reveals little useful 
information about the conditions that account for it. Identify- 
ing the conditions that account for a behavior (its function), in 
contrast, suggests what conditions need to be altered to change 
the behavior. Assessing the function of a behavior can therefore 
yield useful information with respect to intervention strategies 
that are likely to be effective. 


ROLE OF FUNCTIONAL BEHAVIOR 
ASSESSMENT IN INTERVENTION 
AND PREVENTION 


FBA and Intervention 


If a causal relation between environmental events and a problem 
behavior can be determined, and that relation can be altered, the 
problem behavior will occur less often. Interventions informed 
by FBA primarily consist of three strategic approaches: alter 
antecedent variables, alter consequent variables, and teach alter- 
native behaviors. 


Alter Antecedent Variables 


FBA can identify antecedents that might be altered so the prob- 
lem behavior is less likely to occur. Altering the antecedents 
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for problem behavior can change and/or eliminate either (a) the 
motivating operation for problem behavior or (b) the discrimi- 
native stimuli that trigger problem behavior. For example, the 
motivating operation for tantrums when Flo is asked to wash her 
hands before lunch could be modified by changing the charac- 
teristics associated with lunch so that the avoidance of particular 
events is no longer reinforcing. Depending on the function of 
the tantrums, this might involve initially reducing table-setting 
demands, altering seating arrangements to minimize taunts from 
a sibling or peer, or reducing snacks before lunch and offering 
more preferred foods during lunch. Alternatively, if the FBA 
shows that running water is the discriminative stimulus that trig- 
gers problem behavior when Flo is asked to wash her hands, she 
might be given a waterless antibacterial hand gel instead. In this 
case, the discriminative stimulus for problem behavior has been 
removed, thereby decreasing problem behavior. 


Alter Consequent Variables 


FBA can also identify a source of reinforcement to be eliminated 
for the problem behavior. For example, an FBA that indicates 
Flo’s tantrums are maintained by social negative reinforcement 
(avoidance or escape) suggests a variety of treatment options, 
which, by altering that relation, are likely to be effective. Flo’s 
tantrums can be placed on extinction by ensuring that the rein- 
forcer (e.g., avoidance of the activities associated with setting 
the table for snack or lunch) no longer follows problem behavior 
(tantrums). Another approach could be to provide the reinforcer 
for alternative behaviors instead of for the problem behavior. 
Finally, the schedule might be modified so that hand washing 
follows—and thereby provides escape from—an event that is 
even less preferred. 


Teach Alternative Behaviors 


FBA can also identify the source of reinforcement to be provided 
for appropriate replacement behaviors. Alternative appropriate 
behaviors that have the same function (i.e., produce the same 
reinforcer) as tantrums could be taught. For example, if Flo’s tan- 
trums are maintained by escaping lunchtime activities, she might 
be taught to touch a card communicating “later” after washing 
her hands, to produce a delay in being seated at the lunch table. 


FBA and Default Technologies 


Interventions based on an FBA may be more effective than those 
selected without consideration of behavioral function (e.g., Ervin 
et al., 2001; Iwata et al., 1994b). Understanding why a behavior 
occurs often suggests how it can be changed for the better. In 
contrast, premature efforts to treat problem behavior before seek- 
ing an understanding of the purposes it has for a person can be 
inefficient, ineffective, and, in some cases, harmful. For example, 
suppose that a time-out procedure is implemented in an attempt 
to attenuate Flo’s tantrums when she is asked to wash her hands 
before lunch. Flo is removed from the hand-washing activity to 
a chair in the corner of the room. It may be, however, that the 
events typically following hand washing (those associated with 
lunch time, such as the demands of arranging chairs or setting 
the table or interactions with others) are aversive for Flo, and 
tantrumming has allowed her to avoid those events. In this case, 


the intervention would be ineffective because it has done nothing 
to alter the relation between tantrums and the consequence of 
avoiding the aversive events associated with lunch. In fact, the 
intervention may exacerbate the problem if it produces a desired 
outcome for Flo. If stopping the hand-washing activity and hav- 
ing Flo sit on a chair as “time-out” for tantrumming enables her 
to avoid the aversive lunchtime events—or to escape them alto- 
gether—tantrums may be more likely under similar circumstances 
in the future. When the time-out intervention proves unsuccessful, 
other interventions might be attempted. Without understanding 
the function that the problem behavior serves, however, the effec- 
tiveness of those interventions cannot be predicted. 

At best, a trial-and-error process of selecting and imple- 
menting interventions without consideration of behavioral func- 
tion can be lengthy and inefficient. At worst, such an approach 
may cause the problem behavior to become more frequent or 
severe. As a result, caregivers might resort to increasingly intru- 
sive, coercive, or punishment-based interventions, which are 
often referred to as default technologies. 

FBA can decrease reliance on default technologies and 
contribute to more effective interventions in several ways. When 
FBAs are conducted, reinforcement-based interventions are 
more likely to be implemented than are those that include pun- 
ishment (Pelios, Morren, Tesch, & Axelrod, 1999). In addition, 
the effects of interventions based on FBAs are likely to be more 
durable than those that do not take the function of problem behay- 
ior into account. If contrived contingencies are superimposed on 
unknown contingencies that are maintaining the behavior, their 
continuation is often necessary to maintain improvements in the 
behavior. If those superimposed contingencies are discontinued, 
the behavior will continue to be influenced by the unchanged 
operative contingencies. 


FBA and Prevention 


By furthering understanding of the conditions under which 
certain behaviors occur, FBA can also contribute to the pre- 
vention of difficulties. Although problem behavior may be 
suppressed without regard to its function by using punishment 
procedures, additional behaviors not subject to the punishment 
contingencies may emerge because the motivating operations 
for problem behavior remain. For example, contingent loss of 
privileges might eliminate tantrums that occur whenever Flo is 
asked to wash her hands, but it will not eliminate avoidance as 
a reinforcer or the conditions that establish avoidance as a rein- 
forcer. Thus, other behaviors that result in avoidance, such as 
aggression, property destruction, or running away may develop. 
These unintended effects are less likely to occur with interven- 
tions that address (rather than override or compete with) the 
reinforcing functions of problem behavior. 

On a broader scale, the accumulation of FBA data may 
further assist in prevention efforts by identifying the conditions 
that pose risks for the future development of problem behay- 
iors. Preventive efforts can then focus on those conditions. For 
example, based on data from 152 analyses of the reinforcing 
functions of self-injurious behavior (SIB), Iwata and colleagues 
(1994b) found that escape from task demands or other aver- 
sive stimuli accounted for the behavior in the largest proportion 
of cases. The authors speculated that this outcome might have 


been an unintended result of a move toward providing more 
aggressive treatment. For example, if Flo tantrums when she is 
required to wash her hands, the teacher might assume that she 
does not know how to wash her hands. The teacher might decide 
to replace playtime with a period of intensive instruction on 
hygiene. Rather than decreasing problem behavior, such inter- 
ventions may exacerbate it. The data reported by Iwata and col- 
leagues suggest that preventive efforts should be directed toward 
modifying instructional environments (such as providing more 
frequent reinforcement for desirable behavior, opportunities for 
breaks, or means to request and obtain help with difficult tasks) 
so that they are less likely to function as sources of aversive 
stimulation (motivating operations) for escape. 


OVERVIEW OF FBA METHODS 


FBA methods can be classified into three types: (a) functional 
(experimental) analysis, (b) descriptive assessment, and (c) 
indirect assessment. The methods can be arranged on a contin- 
uum with respect to considerations such as ease of use and the 
type and precision of information they yield (see Figure 27.1). 
Selecting the method or combination of methods that will best 
suit a particular situation requires consideration of each meth- 
od’s advantages and limitations. We discuss functional analysis 
and its variations first because descriptive and indirect methods 
of functional assessment developed as an outgrowth of func- 
tional analysis. As noted later, functional analysis is the only 
FBA method that allows practitioners to confirm hypotheses 
regarding functional relations between problem behavior and 
environmental events. 
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Functional (Experimental) Analysis 
Basic Procedure 


Functional analysis procedures were first pioneered by Iwata, 
Dorsey, Slifer, Bauman, and Richman (1994a). In a functional 
analysis, antecedents and consequences representing those in 
the person’s natural environment are arranged so that their sep- 
arate effects on problem behavior can be observed and mea- 
sured. This type of assessment is often referred to as an analog 
because antecedents and consequences similar to those occurring 
in the natural routines are presented in a systematic manner, but 
the analysis is not conducted in the context of naturally occur- 
ring routines. Analog conditions are often used because they 
allow the behavior analyst to better control the environmental 
variables that may be related to the problem behavior than can 
be accomplished in naturally occurring situations. Analogs refer 
to the arrangement of variables rather than the setting in which 
assessment occurs. Research has found that functional analyses 
conducted in natural environments (e.g., classroom settings) often 
yield the same (and, in some cases, clearer) results compared to 
those conducted in simulated settings (Noell, VanDerHeyden, 
Gatti, & Whitmarsh, 2001). 

Functional analyses (FA) typically comprise four condi- 
tions: three test conditions—contingent attention, contingent 
escape, and alone—and a control condition, in which prob- 
lem behavior is expected to be low because reinforcement is 
freely available and no demands are placed on the individual 
(see Table 27.1). However, it should be understood that there is 
no “standard” FA. FAs should be flexible and individualized. 
The practitioner can implement myriad conditions to address 


Functional 
Analysis 


Specialized expertise & training 
Time & effort to complete 


Prediction, control, & confidence 


Indirect Functional Assesment (e.g., 
interviews, checklists, rating scales, 
questionnaires) 


Figure 27.1 FBA methods. The amount or value of each dimension or outcome identified within the 


arrows increases from the bottom to the top of the pyramid. 


Diagram contributed by Rebecca Eldridge. 
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TABLE 27.1 
Functional Analysis 


Condition 


Play (control) 


Contingent attention 


Contingent escape 


Contingent tangible 


Alone 


Antecedent conditions (motivating operation) 


Preferred activities are continuously available, social atten- 
tion is provided, and no demands are placed on the person. 


Attention is diverted or withheld from the person. 


Task demands are delivered continuously using a three- 
step prompting procedure (e.g., [1] “You need to fold the 
towel.” [2] Model folding the towel. [3] Provide 
hand-over-hand assistance to fold the towel). 


After having access to a preferred activity/toy, the preferred 
activity/toy is removed (e.g., “My turn to play with that.”). 
Moderately preferred or neutral toys are still continuously 
available, as is adult attention. 


Low level of environmental stimulation is present (i.e., 
therapist, task materials, and play materials are absent). 


Motivating Operations and Reinforcement Contingencies for Typical Control and Test Conditions of a 


Consequences for problem behavior 


Problem behavior is ignored or neutrally 
redirected. 


Attention is provided in the form of mild 
reprimands or soothing statements (e.g., 
“Don't do that. You'll hurt someone.”). 


Break from the task is provided by remov- 
ing task materials and stopping prompts to 
complete the task. 


The preferred activity/toy is returned to the 
individual. 


Problem behavior is ignored or neutrally 
redirected. 


specific hypotheses. For example, some FAs also include a 
tangible test condition (Day, Rea, Schussler, Larsen, & John- 
son, 1988). Each test condition contains a motivating opera- 
tion (MO) and a potential source of reinforcement for problem 
behavior. The conditions are presented systematically one at a 
time and in an alternating sequence, and occurrences of prob- 
lem behavior are recorded during each session. Sessions are 
repeated to determine the extent to which problem behavior 
consistently occurs more often under one or more conditions 
relative to another. 


Interpreting Functional Analyses 


The function problem behavior serves for a person can be deter- 
mined by visually inspecting a graph of the results of an anal- 
ysis to identify the condition(s) under which high rates of the 
behavior occurred. A graph for each potential behavioral func- 
tion is shown in Figure 27.2. Problem behavior is expected to be 
low in the play condition because no motivating operations for 
problem behavior are present. Elevated problem behavior in the 
contingent attention condition suggests that problem behavior 
is maintained by social positive reinforcement (see graph in top 
left of Figure 27.2). Elevated problem behavior in the contingent 
escape condition suggests that problem behavior is maintained 
by negative reinforcement (graph in top right of Figure 27.2). 
Elevated problem behavior in the alone condition suggests that 
problem behavior is maintained by automatic reinforcement 
(graph in bottom left of Figure 27.2). Further analysis is needed 
to determine if the source of the automatic reinforcement is 
positive or negative. Problem behavior may be maintained by 
multiple sources of reinforcement. For example, if problem 
behavior is elevated in the contingent attention and contingent 
escape conditions, it is most likely maintained by both positive 
and negative reinforcement. 

If problem behavior occurs frequently in all conditions 
(including the play condition), or is variable across condi- 
tions, responding is considered undifferentiated (see graph in 


bottom right of Figure 27.2). Such results are inconclusive, but 
can also occur with behavior that is maintained by automatic 
reinforcement. 

Functional analysis has been replicated and extended in 
hundreds of studies, thereby demonstrating its generality as an 
approach to the assessment and treatment of a wide range of 
behavior difficulties. (See the 1994 and 2013 special issues of 
the Journal of Applied Behavior Analysis for a sample of such 
applications and Hanley, Iwata, & McCord, 2003, and Beavers, 
Iwata, & Lerman, 2013, for reviews.) 


Advantages of Functional Analysis 


The primary advantage of functional analysis is its ability to 
yield a clear demonstration of the variable or variables that 
influence the occurrence of a problem behavior. In fact, func- 
tional (experimental) analyses are the standard of scientific 
evidence by which other assessment alternatives are evaluated, 
and represent the method most often used in research on the 
assessment and treatment of problem behavior (Arndorfer & 
Miltenberger, 1993). Because functional analyses allow valid 
conclusions concerning the variables that maintain problem 
behavior, they have enabled the development of effective 
reinforcement-based treatments and less reliance on punish- 
ment procedures (Brosnan & Healy, 2011; Ervin et al., 2001; 
Horner, Carr, Strain, Todd, & Reed, 2002; Iwata et al., 1994b; 
Pelios et al., 1999). 


Limitations of Functional Analysis 


A risk of functional analysis, however, is that the assessment 
process may temporarily strengthen or increase the undesir- 
able behavior to unacceptable levels, or possibly result in the 
behavior acquiring new functions. Second, relatively little 
is known about the acceptability of functional analysis pro- 
cedures to practitioners (Ervin et al., 2001; Reid & Nelson, 
2002). The deliberate arrangement of conditions that set the 
occasion for, or potentially reinforce, problem behavior can 
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be counterintuitive to persons who do not understand its pur- 
pose (or that such conditions are analogs for what occurs in 
the natural routine). O’ Neill, Bundock, Kladis, and Hawken 
(2015) found that teachers reported positive perceptions of, and 
willingness to engage in, systematic direct observations and 
functional analysis manipulations. Interestingly, their positive 
perceptions were reportedly higher than those of school psy- 
chologists. School psychologists also expressed more concern 
about the time and effort required to conduct FBAs. Across 
both teachers and school psychologists, the most concerns were 
expressed about FA procedures. O’N eill and colleagues suggest 
that some of the concerns about these issues emanated from 
a lack of training and a lack of time to conduct the analyses. 
A third limitation is that some behaviors (e.g., those that, albeit 
serious, occur infrequently) may not be amenable to functional 
analyses due to setting and other factors. For example, if proper 
safety equipment or individuals with enough training to keep 
the client safe are not available, the setting may not be safe 
for conducting the FA. Fourth, functional analyses that are 
conducted in contrived settings might not detect the variables 
that account for the occurrence of the problem behavior in the 
natural environment, particularly if the behavior is controlled 
by idiosyncratic variables that are not represented in the func- 
tional analysis conditions (e.g., Noell et al., 2001). Finally, the 


time, effort, and professional expertise required to conduct 
and interpret functional analyses have been frequently cited 
as obstacles to its widespread use in practice (e.g., Spreat & 
Connelly, 1996; 2001, Volume 2 issue of School Psychology 
Review), although Hanley (2012) argues that these perceptions 
are not warranted. 

Of course, untreated or ineffectively treated problem 
behaviors also consume a great deal of time and effort (with- 
out a constructive long-term outcome), and implementation 
of effective treatments (based on an understanding of the 
variables that maintain them) is likely to require skills simi- 
lar to those involved in conducting functional analyses. These 
concerns have led to research on ways to enhance the prac- 
tical use of functional analyses, including variations and the 
development of alternative methods of FBA described in the 
following sections. 


Variations of Functional Analysis Procedures 


Rather than dictating particular settings or conditions, functional 
analysis procedures achieve relevance through flexible arrange- 
ments tailored to individual circumstances. Some procedural 
variations are described here; for a review, see Lydon, Healy, 
O’Reilly, and Lang (2012). 
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Brief Functional Analysis. Functional analyses are typically 
conducted across multiple sessions. Sometimes time constraints 
or the severity of the problem behavior may restrict the number 
of sessions that can be conducted. In such circumstances, a brief 
functional analysis (Derby et al., 1992; Northup et al., 1991) 
may be appropriate. In a brief functional analysis, only one or 
two 5- to 10-min sessions are conducted for each condition. 
A convincing demonstration of function can be achieved by 
either alternating a condition that produces problem behavior 
with one that does not or conducting a contingency reversal. 
A contingency reversal is when the practitioner first makes the 
putative reinforcer contingent on one target behavior, such as 
tantrums, and then on an appropriate replacement behavior, such 
as making requests. (See the case example with Marie later in 
this chapter for an illustration.) 

A brief functional analysis can be a robust procedure 
(Tincani, Castrogiavanni, & Axelrod, 1999) that preserves 
many advantages of an FA; however, it reveals a function 
clearly in fewer cases than a full functional analysis (Kahng & 
Iwata, 1999; Wallace & Knights, 2003). Some researchers (e.g., 
Vollmer, Marcus, Ringdahl, & Roane, 1995) have suggested that 
practitioners may want to consider beginning with a brief func- 
tional analysis and proceeding to a full functional analysis only 
if clear and replicable patterns do not emerge. 


Conducting Functional Analyses in Natural Settings. Although 
not a requirement of functional analyses, they are often con- 
ducted in clinical settings. However, it can be advantageous 
to conduct functional analyses in the setting where the prob- 
lem behaviors typically occur because the relevant stimulus 
cues are present in those settings. Sasso and colleagues (1992) 
provided one of the first demonstrations of functional analysis 
in classroom settings. Since then, many others have demon- 
strated the utility of functional analysis in the classroom (e.g., 
Wright-Gallo, Higbee, Reagon, & Davey, 2006), vocational 
(e.g., Wallace & Knights, 2003), and home settings (e.g., 
Wacker et al., 2013). One challenge when implementing func- 
tional analyses in natural settings is that the procedures can 
be disruptive to ongoing routines and may require additional 
supervision so that the other students, children, or clients are 
appropriately supervised while the teacher or caregiver devotes 
his or her attention to the analysis. An adaptation that can be 
made in natural settings is to use a trial-based approach, which 
may be more easily embedded into the ongoing activities of the 
classroom (Sigafoos & Saggers, 1995). 


Trial-based Functional Analysis. A trial-based functional 
analysis (Bloom, Iwata, Fritz, Roscoe, & Carreau, 2011; 
Lambert, Bloom, & Irvin, 2012; Sigafoos & Sagers, 1995) 
consists of a series of trials interspersed among classroom 
activities. (See the case example with Carson later in this chap- 
ter for an illustration.) Each trial consists of two components, 
each lasting | minute. The first component consists of pre- 
senting the establishing operation and contingency for problem 
behavior (test condition), and the second component consists 
of continuous access to the reinforcer (control condition). For 
example, during the first component, a demand may be placed 


on the child and as soon as a tantrum occurs, the demand is 
terminated. During the second component, a task break is con- 
tinuously provided. These conditions are implemented at times 
when they naturally occur (e.g., during free play periods for 
tangible conditions and during classroom instruction for escape 
conditions). Teachers may be able to perform the procedure 
with fidelity in their classrooms (Rispoli et al., 2015) and find 
it more manageable than session-based functional analyses. 
However, Bloom et al. describe a number of considerations 
that may result in discrepant results using trial-based functional 
analyses. For example, exposure to the relevant establishing 
operations and consequences may be too brief to evoke or 
maintain problem behavior. 


Synthesized Functional Analyses. Hanley, Jin, Vanselow, and 
Hanratty (2014) and Jessel, Hanley, and Ghaemmaghami (2016) 
describe a variation of functional analysis that is designed to 
increase efficiency by using an interview-informed synthesized 
contingency analysis (IISCA). In the test condition, multiple 
contingencies are implemented simultaneously (e.g., attention 
and escape) when the problem behavior is demonstrated. In the 
control condition, those same reinforcers are presented noncon- 
tingently and continuously. For example, for one participant, 
Gail, in Hanley et al. (2014), problem behavior was not observed 
in the typical attention and tangible conditions, even though 
the behavior analysts hypothesized these were the variables 
maintaining her problem behavior. When these two contingen- 
cies were presented together (i.e., both attention and tangibles 
presented simultaneously contingent on problem behavior), a 
clear pattern of problem behavior emerged, as compared to a 
condition in which attention and tangibles were available con- 
tinuously. Furthermore, these effects were observed only when 
Gail’s mother presented the contingencies and not when the 
behavior analyst presented them. 

This technology may have particular utility when problem 
behavior is not observed in typical FA conditions and when the 
behavior analyst suspects multiple contingencies may be main- 
taining problem behavior. At this time, the limited research that 
has compared IISCA with functional analysis is conflicting 
(Fisher, Greer, Romani, Zangrillo, & Owen, 2016; Hanley et al., 
2014). Thus, the utility of IISCA is as yet unclear. We suggest 
that behavior analysts closely monitor additional research on 
this topic as it emerges. 


Latency-based Functional Analysis. Sometimes, problem 
behavior is so severe that it is not possible to conduct an 
assessment that requires repeated occurrences of the problem 
behavior. In these cases, a latency-based functional analysis 
may be helpful (Thomasen-Sassi, Iwata, Neidert, & Roscoe, 
2011). (See the case example with Elija later in this chapter 
for an illustration.) In a latency-based functional analysis, like 
a trial-based assessment, each session is terminated as soon 
as a problem behavior occurs. However, in this case, the ses- 
sion may last longer than 1 minute, because the establishing 
operation is present as long as necessary to evoke the problem 
behavior (or until a pre-established time limit has expired). 
The index of problem behavior is the latency from onset of the 


establishing operation to the first occurrence of the problem 
behavior. Using a latency-based measure can reduce the num- 
ber of occurrences of problem behavior necessary for analysis 
and shorten the time required to conduct the analysis. As with 
trial-based assessments, a limitation to latency-based assess- 
ment is that there is only one exposure to the contingency per 


session. Analyses that rely on response-repetition measures are 
generally preferable. 


Functional Analysis of Precursors. Sometimes, even one 
occurrence of problem behavior can be so severe that evok- 
ing it poses unacceptable risk to the client and/or others. In 
these cases, it can be helpful to conduct the functional analy- 
sis on precursor behavior—a behavior that reliably precedes 
the target behavior (Borrero & Borrero, 2008; Herscovitch, 
Roscoe, Libby, Bourret, & Ahearn, 2009; Najdowski, 
Wallace, Ellsworth, MacAleese, & Cleveland, 2008; Smith 
& Churchill, 2002). (See the case example with Will later in 
this chapter for an illustration.) For example, growling may 
reliably signal aggression, in which case the analysis can be 
conducted using growling. 

In addition to these variations of functional analysis, 
other methods of FBA can be used to inform the functional 
analysis process. Although these methods have sometimes 
been used as alternatives to functional analyses, we view them 
as valuable steps in the process, not as substitutes for a func- 
tional analysis. 


Safety Considerations for FAs 


When conducting FAs, the behavior analytic practitioner must 
consider ethical issues surrounding the client’s safety. Kahng 
et al. (2015) noted that, in a retrospective study of 99 cli- 
ents’ functional analyses, the injury rate was relatively low 
and any injuries that did occur were mild. However, because 
target behaviors are expected to be evoked during FAs, it is 
important to implement precautionary measures to ensure the 
safety of all individuals involved. In particular, steps should be 
taken when analyzing potentially harmful behaviors, such as 
aggression, self-injurious behavior, and pica. Some procedures 
that have been used to mitigate risk of injury when assessing 
potentially dangerous behaviors are as follows: medical exam- 
inations to rule out health and physical concerns, presence 
of medical staff for especially severe behavior (Iwata 1982, 
1994a), ensuring that trained practitioners plan and run anal- 
yses, having additional staff present to block behavior, using 
protective equipment, establishing a behavioral and/or medical 
criterion for terminating sessions (Kahng et al., 2015), using 
soft toys to conduct the analysis (Hanley, 2012), and using 
brief FA or assessing precursor behaviors (Najdowski et al., 
2008; Smith & Churchill, 2002). Protective equipment can 
be utilized during the analysis as well; however, researchers 
have found the use of protective equipment can invalidate FA 
results (Borrero, Vollmer, Wright, Lerman, & Kelley, 2002; 
Le & Smith, 2002). (For a list of some other protections that 
can be taken, see Hanley, 2012, and Weeden, Mahoney, & 
Poling, 2010). Thus, although these precautions are often 
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cited as recommendations, additional research is warranted to 


develop widely accepted evidence-based safety standards and 
best-practice guidelines. 


Descriptive Functional Behavior Assessment 


As with functional analyses, descriptive functional behavior 
assessment encompasses direct observation of behavior; 
unlike functional analyses, however, observations are made 
under naturally occurring conditions. Thus, descriptive assess- 
ments involve observation of the problem behavior in relation 
to events that are not arranged in a systematic manner. Descrip- 
tive assessments have roots in the early stages of applied 
behavior analysis; Bijou, Peterson, and Ault (1968) initially 
described a method for objectively defining, observing, and 
coding behavior and contiguous environmental events. This 
method has been used subsequently to identify events that may 
be correlated with the target behavior. Events that are shown to 
have a high degree of correlation with the target behavior may 
suggest hypotheses about behavioral function. We describe 
three variations of descriptive analysis: ABC (antecedent-be- 
havior-consequence) continuous recording, ABC narrative 
recording, and scatterplots. 

Before we describe the forms of descriptive assessment, 
we think it is important for the reader to consider some caveats. 
All of the forms of descriptive assessments explained below are 
generally considered invalid for detecting behavioral function. 
This is because they tend to yield false positives for an attention 
function. When an individual displays severe problem behavior, 
attention is often ubiquitous in the environment—attention is 
often delivered both independent of and contingent on problem 
behavior. In addition, false negatives can occur for escape-main- 
tained problem behavior, because others in the environment 
may deliberately avoid making demands on an individual or 
placing the individual in any situations that trigger the problem 
behavior. Instead, the reader is cautioned to restrict conclusions 
based on descriptive assessments to only the prevalence of envi- 
ronmental factors that precede and follow the target behavior 
(Hanley, 2012). 


ABC Continuous Recording 


With ABC continuous recording, an observer records occur- 
rences of the targeted problem behaviors and selected environ- 
mental events in the natural routine during a period of time. 
Codes for recording specific antecedents, problem behaviors, 
and consequences can be developed based on information 
obtained from a Functional Assessment Interview or ABC 
narrative recording (described later). For example, following 
an interview and observations using narrative recording, Lalli, 
Browder, Mace, and Brown (1993) developed stimulus and 
response codes to record the occurrence or nonoccurrence of 
antecedent (e.g., one-to-one instruction, group instruction) and 
subsequent events (attention, tangible reinforcement, escape) for 
problem behavior during classroom activities. 

With ABC continuous recording, the occurrence of 
a specified event is marked on the data sheet (using partial 
interval, momentary time sampling, or frequency recording) 
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(see Figure 27.3). The targeted environmental events (anteced- 
ents and consequences) are recorded whenever they occur, 
regardless of whether problem behavior occurred with it. 
Recording data in this manner may reveal events that occur 
in close temporal proximity to the target behavior. For exam- 
ple, descriptive data may show that tantrums (behavior) often 
occur when a student is given an instruction to wash her hands 
(antecedent); the data may also show that tantrums are typically 
followed by the removal of task demands. A possible hypothe- 
sis in this situation is that tantrums are motivated by academic 


demands and are maintained by escape from those demands 
(negative reinforcement). (See the case example with Chris later 
in this chapter for an illustration.) 


Advantages of ABC Continuous Recording. Descriptive 
assessments based on continuous recording use precise mea- 
sures (similar to functional analyses), and in some cases the 
correlations may reflect causal relations (e.g., Alter, Conroy, 
Mancil, & Haydon, 2008; Sasso et al., 1992). Because the 
assessments are conducted in the context in which the problem 


Figure 27.3 Sample data collection form for ABC continuous recording. 


ABC Recording Form 
observer: R. Van Norman 
Time begin: 9.30 A.M 
Date: January 25, 2006 


Time end: 10:15 A.M. 


Antecedent Behavior Consequence 

Q) Task prompt/instruction QO) Social attention 

® Attention diverted ® Reprimand 

Q) Social interaction @ Tantrum UO Task demand 

Q) Engaged in preferred activity L) Aggression Q) Access to preferred item 
Q) Preferred activity removed Q) Task removed 

) Alone (no attention/no activities) QO) Attention diverted 

@ Task prompt/instruction Q) Social attention 

Q) Attention diverted U Reprimand 

Q) Social interaction Tantrum Q) Task demand 

UL Engaged in preferred activity Q) Aggression UO Access to preferred item 
WU Preferred activity removed Task removed 

Q) Alone (no attention/no activities) U) Attention diverted 

@ Task prompt/instruction OQ) Social attention 

Q) Attention diverted Q) Reprimand 

Q) Social interaction Tantrum UU) Task demand 

Q) Engaged in preferred activity QO) Aggression Q) Access to preferred item 
Q) Preferred activity removed &) Task removed 

QO) Alone (no attention/no activities) UO Attention diverted 

Q) Task prompt/instruction Q) Social attention 
Attention diverted Q) Reprimand 

Q) Social interaction Tantrum QU) Task demand 

Q) Engaged in preferred activity Q) Aggression ) Access to preferred item 
Q) Preferred activity removed Q) Task removed 

Q) Alone (no attention/no activities) 4 Attention diverted 

4 Task prompt/instruction Q) Social attention 

Q) Attention diverted Q) Reprimand 

UO) Social interaction Q) Tantrum Q) Task demand 

Q) Engaged in preferred activity Q) Aggression U) Access to preferred item 
Q) Preferred activity removed QO) Task removed 

UW) Alone (no attention/no activities) Q) Attention diverted 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


OcoOUCCO®B 


@ Tantrum 
) Aggression 


Q) Social attention 

Q) Reprimand 

QO) Task demand 

Q) Access to preferred item 
& Task removed 

QO) Attention diverted 


(continued) 


Figure 27.3 (continued) 


Chapter 27 * Functional Behavior Assessment 687 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


OOOCOBL 


Tantrum 
Aggression 


Social attention 
Reprimand 

Task demand 

Access to preferred item 
Task removed 

Attention diverted 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


Tantrum 
Aggression 


Social attention 
Reprimand 

Task demand 

Access to preferred item 
Task removed 

Attention diverted 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


L) 
Q) 
Q) 
a) 
Q) 
Q) 
Q 
bs) 
a 
L) 
Q 


Tantrum 
Aggression 


Social attention 
Reprimand 

Task demand 

Access to preferred item 
Task removed 

Attention diverted 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


Tantrum 
Aggression 


Social attention 
Reprimand 

Task demand 

Access to preferred item 
Task removed 

Attention diverted 


Task prompt/instruction 
Attention diverted 

Social interaction 

Engaged in preferred activity 
Preferred activity removed 
Alone (no attention/no activities) 


OOCBOO|;COUDLUB 


Tantrum 
Aggression 


Social attention 
Reprimand 

Task demand 

Access to preferred item 
Task removed 

Attention diverted 


BOOOCOOC|OBOCCO/OOOCOBO|OBOOOO;OOOOUCR 


Source: Recording form developed by Renée Van Norman. Used by permission. 


behaviors occur, they are likely to provide useful information 
for designing a subsequent functional analysis if that proves 
necessary. In addition, they do not require disruption to the 
person’s routine. 


Considerations for ABC Continuous Recording. Although 
descriptive analyses of this type may show a correlation between 
particular events and the problem behavior, such correlations 
can be difficult to detect in many situations. This is especially 
likely if the influential antecedents and consequences do not 
reliably precede and follow the behavior. In such cases, it may be 
necessary to analyze descriptive data by calculating conditional 
probability (the likelihood that a target problem behavior will 
occur in a given circumstance) or conducting a contingency 
space analysis (CSA, see Box2del): 

However, conditional probabilities can be misleading. If a 
behavior is maintained by intermittent reinforcement, the behavior 
might occur often even though it is not followed consistently by a 
particular consequence. For example, the teacher might send the 
student to time-out only when tantrums are so frequent or severe 
that they are intolerable. In this case, only a small proportion of 


tantrums would be followed by that consequence, and the condi- 
tional probability would be low. One possibility, therefore, is that a 
functional relation that exists (e.g., tantrums negatively reinforced 
by escape) will not be detected. Furthermore, the child’s current 
behavior intervention plan might require three repetitions of the 
instruction and an attempt to provide physical assistance before 
time-out is implemented. In that situation, the conditional prob- 
ability of tantrums being followed by attention would be high. A 
descriptive analysis might, therefore, suggest a functional relation 
(e.g., tantrums positively reinforced by attention) that does not 
exist. Perhaps for these reasons, studies that have used conditional 
probability calculations to examine the extent to which descrip- 
tive methods lead to the same hypotheses as functional analyses 
have generally found low agreement (e.g., Lerman & Iwata, 1993; 
Noell et al., 2001). 


ABC Narrative Recording 

ABC narrative recording is a form of descriptive assessment 
that differs from continuous recording in that (a) data are col- 
lected only when behaviors of interest are observed, and (b) the 
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What Is Contingency Space Analysis? 
by Nathan VanderWeele and Corinne Gist 


Contingency spaces display the relationship between events 
in terms of the probability of one event (e.g., a particular 
consequence) given or not given another event (e.g., the 
presence or absence of a particular behavior) (Martens, 
DiGennaro, Reed, Szczech, & Rosenthal, 2008; Matthews, 
Shimoff, & Catania, 1987; Schwartz, 1989). For example, if 
Johnny yells out 13 times during 20 observation intervals 
and, as aresult, the teacher removes him from the classroom 
following 2 of those occurrences, we would say that the 
probability of the teacher removing Johnny from the class- 
room was 2/13, or approximately 15% of occurrences (see 
raw data below in row, Escape/Present; columns B, B+C, 
and B+C/B). 


Attention 
Present 
Absent 


Escape 
Present 
Absent 


However, identifying the probability of a behavior— 
consequence relation in isolation represents an incomplete 
analysis of the event. In addition to determining the probability 
of the occurrence of the consequence following the behavior, 
the probability of the consequence following the absence of the 
behavior should also be determined, (e.g., 43% per the raw 
data shown above in row, Escape/Absent; columns B, B+C, 
and B+C/B) with the resulting values compared in what is 
called a contingency space analysis (CSA). 

Graphing conditional probabilities in the general oper- 
ant contingency space reveals the degree of contingency 


Figure A Hypothetical CSA data with graph 
illustrating potential positive (attention), negative 
(escape), and neutral (other) contingencies provides 
an example of a contingency space graph. The y-axis 
value equals the probability of a consequence given 
the occurrence of a behavior, and the x-axis value 
equals the probability of a consequence given the 
absence of a behavior. 


between events and behavior (see Figure A below), which 
may address some of the limitations associated with ABC 
recordings. A CSA is conducted in the natural environ- 
ment and does not require experimental manipulation of 
the consequences. Direct observation is used to identify 
the presence of a positive, negative, or neutral contingency 
between behavior and its consequences. The contingency is 
said to be “positive” if a specific consequence is more likely 
to occur after the behavior. The contingency is said to be 
“negative” if the consequence is less likely to occur follow- 
ing the behavior. For example, if Johnny yells out 13 times 
during an observation period and receives teacher attention 
following 9 of those instances, we would say that the con- 
sequence of teacher attention (c) was more likely to occur 
following yelling out (b), indicating a positive contingency 
for yelling out and teacher attention. Alternatively, if the 
teacher more frequently withholds attention (c) following 
yelling out (b), we would say that teacher attention is Jess 
likely to occur following the behavior. Thus, a negative con- 
tingency may exist, since the consequence is not likely to 
follow the behavior. 

As previously mentioned, to determine the operant 
strength of a behavior—consequence relation, the proba- 
bility of the consequence following the behavior and the 


aR aE NET LT ais TREN see eM ater ae oT 
Contingency Space Analysis 


@ Attention 


Other @ Escape 


Probability (behavior+consequence/behavior) 


0. 01402503, 0:45,05; 0.6 0.7 


Probability (no behavior+consequence/no behavior) 


0:8.170:9 31 


probability of the consequence following the absence of 
behavior are calculated. To calculate the probability, the 
number of times the consequence followed the behavior is 
divided by the total number of behaviors emitted during an 
observation period, and the number of times the consequence 
occurred following the absence of the behavior is divided by 
the total number of intervals in which the behavior did not 
occur. These values are then plotted along the x-axis of the 
graph (see Figure A). The location of the resulting data point 
on the graph indicates the type and strength of the behavior— 
consequence relation. If the data point falls above and to the 
left of the line, a positive contingency may exist, suggesting 


recording is open-ended (any events that immediately precede and 
follow the target behavior are noted) (see Figure 3.3 in this text). 
Because data are recorded only when the target behavior occurs, 
narrative recording may be less time-consuming than continuous 
recording. However, narrative recording has several disadvan- 
tages in addition to those described earlier. 


Considerations for Narrative Recording. Because narra- 
tive recording data are seldom reported in published research, 
their utility in identifying behavioral function has not been 
established. However, ABC narrative recording might identify 
functional relations that do not exist because antecedent and 
consequent events are recorded only in relation to the target 
behavior; it would not be evident from ABC data if particu- 
lar events occurred just as often in the absence of the target 
behavior. For example, ABC data might erroneously indicate a 
correlation between peer attention and disruption even though 
peer attention also occurred frequently when the student was 
not disruptive. 

Another potential limitation of ABC narrative recording 
concerns its accuracy. Unless observers receive adequate train- 
ing, they may report inferred states or subjective impressions 
(e.g., “felt embarrassed,” “was frustrated”) instead of describ- 
ing observable events in objective terms. In addition, given 
the likelihood that a number of environmental events occur 
in close temporal proximity to one another, discriminating 
the events that occasion a behavior can be difficult. Lerman, 
Hovanetz, Strobel, and Tetreault (2009) found that teachers 
collected ABC data more accurately when using a more struc- 
tured format than with narrative recording. Thus, ABC nar- 
rative recording may be best suited as a means of gathering 
preliminary information to inform continuous recording or 
functional analyses. 


Scatterplot Recording 


Scatterplot recording is a procedure for recording the 
extent to which a target behavior occurs more often at par- 
ticular times than others (Symons, McDonald, & Wehby, 
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that the consequence could be maintaining the behavior. If 
the data point falls below and to the right of the line, a nega- 
tive contingency may exist, suggesting that the consequence 
is likely not maintaining the behavior. If data points fall on 
or near the neutrality line, no contingency is likely to exist, 
indicating that the consequence is equally likely to occur 
independent of the occurrence of behavior. Additionally, the 
greater the distance between the data point and the neutrality 
line, the stronger (positive) or weaker (negative) the hypoth- 
esized behavior—consequence relationship. It is important 
to note, however, that the efficacy of CSA results will rely 
heavily on the fidelity of the descriptive data collected. 


1998; Touchette, MacDonald, & Langer, 1985). Specifically, 
scatterplots involve dividing the day into blocks of time 
(e.g., a series of 30-min segments). For every time segment, 
an observer uses different symbols on an observation form to 
indicate whether the target problem behavior occurred a lot, 
some, or not at all (see Figure 27.4). After data have been 
collected over a series of days, they are analyzed for patterns 
(specific time periods that are typically associated with prob- 
lem behavior). If a recurring response pattern is identified, 
the temporal distributions in the behavior can be examined 
for a relation to particular environmental events. For exam- 
ple, a time period in which the behavior occurs often might 
be correlated with increased demands, low attention, certain 
activities, or the presence of a particular person. If so, changes 
can be made on that basis. 


Considerations for Scatterplots. The primary advantage of 
scatterplots is that they identify time periods during which the 
problem behavior occurs. Such information can be useful in 
pinpointing periods of the day when more focused ABC assess- 
ments might be conducted to obtain additional information 
regarding the function of the problem behavior. Although scat- 
terplots are often used in practice, little is known about their util- 
ity. It is unclear whether temporal patterns are routinely evident 
(Kahng et al., 1998). Another problem is that obtaining accurate 
data with scatterplots may be difficult (Kahng et al., 1998). In 
addition, the subjective nature of the ratings of how often the 
behavior occurs (e.g., “a lot” versus “some’’) can contribute to 
difficulties with interpretation (standards for these values might 
differ across teachers or raters). 


Indirect Functional Behavior Assessment 


Indirect functional assessment methods use structured inter- 
views, checklists, rating scales, or questionnaires to obtain infor- 
mation from those who are familiar with the person exhibiting 
the problem behavior (e.g., teachers, parents, caregivers, and/ 
or the individual himself or herself) to identify possible condi- 
tions or events in the natural environment that correlate with the 
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Baus 27. n Example “6 scatterplot recording form. Different shapes represent different observed rates of 
talk-outs per daily observation intervals. Open squares, 2 or fewer talk-outs; shaded circles, 4-10 talk-outs; 


filled squares, 11 or more occurrences. 
From “Functional Assessment and Teacher Collected Data,” 


by F. J. Symons, L. M. McDonald, and J. H. Wehby, 1998, Education and 


Treatment of Children, 21, p. 145. Copyright 1998 Used by permission. 


problem behavior. Such procedures are referred to as “indirect” 
because they do not involve direct observation of the behavior, 
but rather solicit information based on others’ recollections of 
the behavior. 


Behavioral Interviews 


Interviews are used routinely in assessment. The goal of a 
behavioral interview is to obtain clear and objective information 
about the problem behaviors, antecedents, and consequences. 
This might include clarifying descriptions of the behavior (con- 
sequences); when (times), where (settings, activities, events), 
with whom, and how often it occurs; what typically precedes the 
behavior (antecedents); what the child and others typically do 
immediately following the behavior (consequences); and what 
steps have previously been taken to address the problem, and 
with what result. Similar information might be solicited about 
desirable behavior (or the conditions under which undesirable 
behavior does not occur) to identify patterns or conditions that 
predict appropriate versus problem behavior. Information can 
also be obtained about the child’s apparent preferences (e.g., 
favorite items or activities), skills, and means of communicat- 
ing. A skillful interviewer poses questions in a way that evokes 
specific, complete, and factual responses about events, with 
minimal interpretation or inferences. 

Lists of interview questions have been published, and they 
provide a consistent, structured format for obtaining informa- 
tion through either an interview or a questionnaire format. For 


example, the Functional Assessment Interview (O’ Neill et al., 
1997) has 11 sections, which include description of the form 
(topography) of the behavior, general factors that might affect 
the behavior (medications, staffing patterns, daily schedule), 
antecedents and outcomes of the behavior, functional behav- 
ior repertoires, communication skills, potential reinforcers, and 
treatment history. 

A form of the Functional Assessment Interview for stu- 
dents who can act as their own informants is also available (Kern, 
Dunlap, Clarke, & Childs, 1995; O’ Neill et al., 1997). Ques- 
tions include the behavior or behaviors that cause trouble for the 
students at school, a description of their class schedule and its 
relation to problem behavior, rating of intensity of behaviors on 
a scale of | to 6 across class periods and times of day, aspects 
of the situation related to the behavior (e.g., difficult, boring, or 
unclear material; peer teasing; teacher reprimands), other events 
that might affect the behavior (e.g., lack of sleep, conflicts), con- 
sequences (what occurs when the individuals engage in the behay- 
ior), possible behavior alternatives, and potential strategies for a 
support plan. 

Two other questionnaires are the Behavioral Diagnosis 
and Treatment Information Form (Bailey & Pyles, 1989) and 
the Stimulus Control Checklist (Rolider & Van Houten, 1993), 
which also address questions about the conditions under which 
the behavior occurs or does not occur and how often. In addition, 
they include questions about physiological factors that might 
affect the behavior. 


Behavior Rating Scales 


Behavior rating scales designed for functional assessment 
ask informants to estimate the extent to which behavior 
occurs under specified conditions, using a Likert scale (e.g., 
never, seldom, usually, always). Hypotheses about the func- 
tion of a behavior are based on the scores associated with 
each condition. Those conditions assigned the highest cumu- 
lative or average rating are hypothesized to be related to the 
problem behavior. For example, if an informant states that 
problem behavior always occurs when demands are placed 
on a child, a negative reinforcement hypothesis might be 
made. Table 27.2 summarizes features of several behavior 
rating scales. 


Considerations for Indirect FBA 


Some indirect assessment methods can provide a useful source 
of information in guiding subsequent, more objective assess- 
ments, and contribute to the development of hypotheses about 
variables that might occasion or maintain the behaviors of 
concern. Because indirect forms of FBA do not require direct 
observation of problem behavior, most behavior analytic prac- 
titioners view them as convenient and consider them useful in 
conducting an FBA. However, we caution readers that closed- 
ended indirect assessment instruments such as the MAS and 
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QABF have been repeatedly demonstrated to be unreliable 
and therefore of questionable validity in identifying function 
(see Hanley, 2012). 


CONDUCTING A FUNCTIONAL BEHAVIOR 
ASSESSMENT 


FBA can best be viewed as a four-step process: 


1. Gather information with indirect and descriptive assessment. 


2. Interpret information from indirect and descriptive assess- 
ment and formulate hypotheses about the purpose of prob- 
lem behavior. 


3. Test hypotheses using functional analysis. 


4. Develop intervention options based on the function of 
problem behavior. 


Gathering Information 


It is often helpful to begin the FBA process by conducting 
Functional Assessment Interviews with the person’s teacher, 
parent, caregiver, and/or others who work closely with the per- 
son. The interview can be helpful in preparing the evaluator 
to conduct direct observations by identifying and defining the 
target problem behaviors, identifying and defining potential 


TABLE 27.2 Behavior Rating Scales Used to Assess Possible Functions of Problem Behavior 


Behavior rating scale Functions assessed 


Motivation Assessment 
Scale (MAS) (Durand & 
Crimmins, 1992) 


Sensory reinforcement, 


reinforcement 


Motivation Analysis Rating 
Scale (MARS) (Wieseler, 
Hanson, Chamberlain, & 
Thompson, 1985) 


and attention 


escape, attention, and tangible 


Sensory reinforcement, escape, 


Format and number of items Example item and possible 


function 


Does the behavior seem to 
occur in response to your 
talking to other persons in the 
room? (attention) 


16 questions (4 for each of 
4 functions), 7-point scale from 
always to never 


6 statements (2 for each of 
3 functions), 4-point scale from 
always to never 


The behavior stops occurring 
shortly after you stop working 
or making demands on this 
person. (escape) 


Problem Behavior 
Questionnaire (PBQ) (Lewis, 
Scott, & Sugai, 1994) 


Functional Analysis 
Screening Tool (FAST) 
(Iwata & DeLeon, 1996) 


Questions About Behavioral 
Function (QABF) (Paclawsky), 
Matson, Rush, Smalls, & 
Vollmer, 2000) 


Peer attention, teacher 
attention, escape/avoid 

peer attention, escape/avoid 
teacher attention, and assess- 
ment of setting events 


Social reinforcement 
(attention, preferred items), 
social reinforcement (escape), 
automatic reinforcement by 
sensory stimulation, auto- 
matic reinforcement by pain 
attenuation 


Attention, escape, nonsocial, 
physical, tangible 


Questions, 7-point range 


Yes or no as to whether 
statements are descriptive 


Statements, 4-point range 


When the problem behavior 
occurs, do peers verbally 
respond to or laugh at the 
student? (peer attention) 


When the behavior occurs, 
do you usually try to 

calm the person down or 
distract the person with 
preferred activities (leisure 
items, snacks, etc.)? (social 
reinforcement, attention, 
preferred items) 


Participant engages in the 
behavior to try to get a reaction 
from you. (attention) 
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antecedents and consequences that may be observed, and gain- 
ing an overall picture of the problem behavior as well as the 
strengths of the person. The interview can also help deter- 
mine if other assessments are warranted before a more exten- 
sive FBA is conducted. For example, if the interview reveals 
that the person has chronic ear infections that are currently 
untreated, a medical evaluation should be conducted before 
further behavioral assessment takes place. 

In many cases, conducting an interview with the person 
who has problem behavior can be helpful if he has the language 
skills to understand and respond to interview questions. Some- 
times the person has useful insights regarding why he displays 
problem behavior in specific contexts. 

At this point, conducting direct observations of the 
problem behavior within the natural routine is useful. Such 
observations help confirm or disconfirm the information 
obtained through the interviews. If it is not clear when the 
problem behavior occurs most often, a scatterplot analysis 
may be useful to determine when further behavioral observa- 
tions should be conducted. When the problematic time peri- 
ods have been determined, the behavior analyst can conduct 
ABC assessments. Information obtained from the interviews is 
helpful in guiding the ABC assessment because the evaluator 
should already have clear definitions of the target behavior(s), 
antecedents, and consequences. However, the behavior ana- 
lyst must also watch for additional, unexpected antecedents 
and consequences that might present themselves in the natural 
environment. Teachers or caregivers sometimes overlook or 
are unaware of specific stimuli triggering or following prob- 
lem behavior. 


Interpreting Information 
and Formulating Hypotheses 


Results from indirect assessments should be analyzed for patterns 
of behavior and environmental events so that hypotheses regard- 
ing the function of the problem behavior can be made. If problem 
behavior occurs most frequently when low levels of attention are 
available and problem behavior frequently produces attention, 
a hypothesis that attention maintains the problem behavior is 
appropriate. If the problem behavior occurs most frequently 
in high-demand situations and often produces a reprieve from 
the task (e.g., through time-out, suspension, or another form of 
task delay), then a hypothesis that escape maintains the prob- 
lem behavior is appropriate. If problem behavior occurs in an 
unpredictable pattern or at high rates across the school day, a 
hypothesis that the behavior is maintained by automatic rein- 
forcement may be appropriate. In reviewing assessment results 
and considering possible hypotheses, behavior analysts should 
remember that behaviors may serve multiple functions and dif- 
ferent topographies of problem behavior may serve different 
functions. 

Hypothesis statements should be written in ABC format. 
Specifically, the hypothesis statement should state the anteced- 
ent(s) hypothesized to trigger the problem behavior, the topog- 
raphy of problem behavior, and the maintaining consequence. 
An example is presented below: 


i 


Hypothesized Antecedent Behavior Consequence 

Function 

Escape from When Flo is she screams termination of 

hand washing prompted and falls to hand washing 

and/or lunch. — to wash her the ground, and lunch by 
hands in which is being sent to 
preparation for followed time-out. 
lunch, fae year 


Writing hypothesis statements in this manner is useful 
because it requires the behavior analyst to focus on potential 
avenues for intervention: modifying the antecedent and/or 
modifying the reinforcement contingencies (which may involve 
teaching a new behavior and altering what behaviors are rein- 
forced or placed on extinction). 


Testing Hypotheses 


After hypotheses have been developed, a functional analysis can 
be conducted to test them. The functional analysis should always 
contain a control condition that promotes the lowest frequency 
of problem behavior. Often this is the play condition, which con- 
sists of (a) continuous availability of preferred toys and/or activ- 
ities, (b) no demands, and (c) continuously available attention. 
Then, conditions are selected to test specific hypotheses. For 
example, if the primary hypothesis is that the problem behav- 
ior is maintained by escape, then a contingent escape condition 
should be implemented. Other test conditions may not need to 
be implemented. 

Being selective about the test conditions implemented 
will help keep the functional analysis as brief as possible; 
however, no conclusions can be made regarding additional 
functions of problem behavior if additional test conditions are 
not implemented. For example, if play and contingent escape 
are the only conditions tested, and problem behavior occurs 
most frequently in the contingent escape condition and sel- 
dom or never occurs in the play condition, the conclusion that 
escape maintains the problem behavior is supported. However, 
because a contingent attention condition was not implemented, 
one could not rule out the possibility that attention also main- 
tains the problem behavior. 


Developing Interventions 


When an FBA has been completed, an intervention that matches 
the function of problem behavior can be developed. Interventions 
can take many forms. Although FBA does not identify which 
interventions will be effective in treating problem behavior, it does 
identify antecedents that may trigger problem behavior, poten- 
tial behavioral deficits that should be remedied, and reinforce- 
ment contingencies that can be altered, as described earlier in this 
chapter. In addition, FBA does identify powerful reinforcers that 
can be used as part of the intervention package. The intervention 
should be functionally equivalent to problem behavior. That is, 
if problem behavior serves an escape function, then the interven- 
tion should provide escape (e.g., in the form of breaks from task 
demands) for a more appropriate response or involve altering task 
demands in a fashion that makes escape less reinforcing. 


One effective way to design interventions is to review 
confirmed hypotheses to determine how the ABC contingency 
can be altered to promote more positive behavior. For example, 
consider the hypothesis developed for Flo, and assume the func- 


tional analysis revealed a pattern similar to that seen in the upper 
right-hand corner of Figure 27.2: 


Hypothesized Antecedent Behavior Consequence 
Function 
Escape from When Flo is she screams termination of 
hand washing prompted and falls to hand washing 
and/or lunch. — to wash her the floor, and lunch by 
hands in which is being sent to 
preparation for followed time-out. 
WAC Aes see Oy eee 


The antecedent could be altered by changing the time of 
day when Flo is asked to wash her hands (so that it does not 
precede lunch, thereby decreasing the motivation for escape- 
motivated tantrums). 


Hypothesized Antecedent Behavior Consequence 
Function 
Escape from Flo is N/A N/A (The 
hand washing prompted (Problem consequence 
and/or lunch. — to wash her behavior is _ is irrelevant 
hands before avoided.) because 
recess. problem 
behavior did 
not occur.) 


i  ————— 


The behavior could be altered by teaching Flo a new 
behavior (e.g., signing “break”’) that results in the same outcome 
(escape from lunch). 


Hypothesized Antecedent Behavior Consequence 
Function 
Escape from WhenFlois _ Flols termination of 
hand washing prompted prompted to hand washing 
and/or lunch. towashher sign “break,” and lunch. 
hands In which is fol- 
preparation lowed by... 
for UNChiA ee 


Or, the consequences could be altered. For example, the 
reinforcer for problem behavior could be withheld so that prob- 
lem behavior is extinguished. 


Hypothesized Antecedent Behavior Consequence 
Function 
Escape from When Flo is she screams continued pre- 
hand washing prompted and tan- sentation of 
and/or lunch. towashher — trums, hand washing 
hands in which is and lunch 
preparation — followed activities. 
Fou (UNC ae a DV anu 


Neen ee ee eee ee EE 
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An intervention can also consist of several different com- 
ponents. For example, Flo could be taught a replacement behav- 
ior (signing “break”), which results in breaks from lunch, while 
tantrums are simultaneously placed on escape extinction. 

FBA can also help identify interventions that are likely 
to be ineffective or that may worsen the problem behavior. 
Interventions involving time-out, in-school or out-of-school 
suspension, or planned ignoring are contraindicated for prob- 
lem behaviors maintained by escape. Interventions involving 
reprimands, discussion, or counseling are contraindicated for 
problem behaviors maintained by attention. 

We offer a final word about intervention: When an inter- 
vention has been developed, FBA is not “done.” Assessment is 
ongoing once intervention is implemented, as it is important 
for the continued monitoring of intervention effectiveness. The 
functions of behavior are not static. Rather, they are dynamic 
and change over time. Intervention may lose its effective- 
ness because the function of problem behavior may change 
(Lerman, Iwata, Smith, Zarcone, & Vollmer, 1994). In such 
cases, additional functional analyses may be needed to revise 
the intervention. 


CASE EXAMPLES ILLUSTRATING 
THE FBA PROCESS 


FBA is a highly idiosyncratic process. It is unusual for any 
two FBAs to be exactly the same because each person pres- 
ents with a unique set of skills and behaviors, as well as a 
unique history of reinforcement. FBA requires a thorough 
understanding of behavioral principles to parcel out the rel- 
evant information from interviews and ABC assessments, 
to form relevant hypotheses, and to test those hypotheses. 
Beyond these skills, a solid understanding of behavioral 
interventions (e.g., differential reinforcement procedures, 
schedules of reinforcement, and tactics for promoting main- 
tenance and generalization) is needed to match effective 
treatments to the function of challenging behavior. Further- 
more, it is important for the practitioner to keep abreast of 
the current literature in functional analysis and treatment of 
problem behavior, because the evidence base is constantly 
evolving. This can seem like a daunting process. In an 
attempt to demonstrate the application of FBA across the 
idiosyncratic differences in people, we present an array of 
case examples that illustrate its utility in developing inter- 
ventions that render the problem behavior ineffective, irrel- 
evant, or inefficient. 


Marie—Brief Functional Analysis 


Gathering Information, Interpreting Information, and 
Forming Hypotheses 


Marie was an 8-year-old girl with autism who had been referred 
to a behavior analyst for an evaluation of severe problem behav- 
ior, specifically aggression toward her family members. Her 
insurance covered 3 hours of “ABA therapy” per week, which 
she was receiving from a community mental health agency 
social worker. Her mother and sister attended every session. 
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The only behavior analyst with the skills necessary to consult 
on this case was 150 miles away at a local university, so the 
behavior analyst helped the agency staff conduct the functional 
analysis via teleconsultation. 

The behavior analyst began by conducting a Functional 
Assessment Interview with Marie’s mother, who reported that 
aggression occurred in a variety of settings, toward children 
and adults, and typically occurred when Marie did not get what 
she wanted, usually a preferred toy. Aggression also took place 
during her therapy sessions when the therapist was talking to 
Marie’s mother, updating her on the session. However, aggres- 
sion toward her on-site therapist had recently decreased. The 
behavior analyst then collected some ABC data during a play 
session with Marie’s mother and younger sister, as well as 
during her ABA therapy session at the clinic. The ABC data 
showed a pattern of aggression when Marie’s preferred items 
were removed. In addition, the ABC data indicated that the 
consequence that most often followed aggression was Marie’s 
mother or sister relinquishing the item back to Marie. Based on 
the descriptive data, it was hypothesized that Marie’s aggression 
could be maintained by access to tangibles. 


Testing Hypotheses 


Due to the limited number of hours per week Marie received 
services, a brief functional analysis was conducted to deter- 
mine the function of Marie’s aggression. The consulting 
behavior analyst trained Marie’s therapist to conduct the 
assessment via teleconsultation, and was available during the 
assessment to provide coaching and feedback. Four 5-min 


functional analysis conditions (free play, attention, demand, 
and tangible) were conducted, and aggression occurred only 
in the tangible condition. During each session, the behavior 
analyst scored the occurrence of problem behavior using a 
10-sec partial interval recording procedure. During the con- 
tingency reversal, the number of requests was also recorded. 
The functional analysis results are shown in Figure 27.5. 
After the first four conditions, the behavior analyst coached 
the on-site therapist on how to implement functional commu- 
nication training (FCT) to teach Marie to ask for the tangible 
item. This was selected as a replacement behavior because it 
matched the function of the problem behavior and could effi- 
ciently compete for the reinforcer. During three 5-min training 
sessions, requests increased and aggression dropped to zero. 
The behavior analyst instructed the on-site therapist to con- 
duct a contingency reversal by implementing a second tangible 
functional analysis condition. During this reversal, aggression 
increased to 50% of intervals, and requesting decreased from 
11 to 2 occurrences. Upon reinstatement of FCT, aggression 
returned to zero and requesting returned to the previous treat- 
ment level. All functional analysis and treatment conditions 
were completed in a 60-min appointment with Marie’s thera- 
pist, mother, and sister present. 


Developing an Intervention 


During the brief FA, functional communication training (FCT; 
teaching appropriate requesting) was shown to be an effective 
treatment for decreasing aggression. It matched the function and 
was more efficient than the problem behavior in obtaining access 
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Figure 27.5 Results from the brief functional analysis for Marie. The graph depicts the percentage of 10-sec intervals in 
which problem behavior occurred (left y-axis) and the number of independent mands (right y-axis) in each condition. 
Marie’s case example contributed by Rebecca Eldrige and Nathan VanderWeele. 


to the tangible item. During a follow-up meeting, the behavior 
analyst, the on-site therapist, and Marie’s mother met to develop 
the treatment plan. Marie’s mother was concerned about the 
times that she would need to say “no” to Marie when Marie was 
asking for preferred items and was worried that aggression would 
result. The behavior analyst worked with the team to develop a 
treatment plan that included initially delivering the item each 
time Marie requested it until aggression had decreased to zero for 
three consecutive 30-min sessions. When Marie was regularly 
making requests without aggression, a program to teach waiting 
for the preferred item was introduced. To gain access to the pre- 
ferred item, Marie had to wait (without aggression) for 1 to 60 
seconds after requesting it. Once waiting was mastered, Marie 
was taught to complete a task while while waiting. She could 
then earn the preferred item only after the task was complete. 
The behavior analyst and on-site therapist then taught Marie’s 
mother to implement the procedure, which she did successfully. 


Carson—tTrial-based Functional Analysis 


Gathering Information, Interpreting Information, 
and Forming Hypotheses 


Carson, a 10-year-old boy with a chromosomal deletion dis- 
order, was receiving special education services in the general 
education classroom. He displayed several problem behav- 
iors—most notably, talking to peers during group instruction, 
yelling out answers, and running around the room. These behav- 
iors occurred only in the general education classroom. They 
had increased in intensity, resulting in more time spent in the 
resource room, away from the general education setting. Con- 
sequently, Carson was missing valuable instruction time and 
falling behind in the curriculum. 

The behavior analyst assigned to Carson’s case was asked 
to complete an FBA in order to design an intervention plan 
that would increase the time he spent in the general education 
classroom and decrease his disruptive behavior. After obtaining 
consent from Carson’s parents to conduct an FBA, the behavy- 
ior analyst conducted a Functional Assessment Interview with 
Carson’s teachers and paraprofessionals. From the interview, it 
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appeared that Carson’s problem behaviors occurred most often 
during group lessons on the carpet and individual work at his 
desk. Carson’s teachers and paraprofessionals could not identify 
a clear pattern of consequences for his behavior because they 
had tried so many different strategies without success. Following 
the interview, the behavior analyst used a structured data form to 
begin collecting ABC data during those times reported as most 
problematic by the teachers. The ABC assessment showed that 
escape from demands or access to one-on-one attention could 
be maintaining Carson’s disruptive behavior. 


Testing Hypotheses 


At this point, the behavior analyst needed to test her hypoth- 
eses, but she had several reservations about conducting a 
traditional FA in the classroom setting. She was concerned 
about the time that a traditional FA would take as well the 
teacher’s ability to implement it with a high degree of fidelity 
while also teaching. For these reasons, the behavior analyst 
decided to conduct a trial-based functional analysis (TBFA) 
on disruptive behavior, with the teacher implementing the 
conditions. Four conditions were conducted: attention test, 
attention control, escape test, and escape control. During 
the attention test condition, teacher attention was diverted 
unless Carson engaged in disruption. If he engaged in dis- 
ruption, the teacher provided attention for 10 seconds and 
ended the trial. During the attention control condition, atten- 
tion was delivered every 5 seconds for 60 seconds, with no 
programmed consequences for disruptive behavior. During 
the escape test condition, the teacher worked one-on-one with 
Carson, repeatedly prompting him to complete his work. If he 
engaged in disruption, the teacher walked away and removed 
task demands for 10 seconds. During the escape control con- 
dition, the teacher provided a 1-min break with no attention 
or task demands and no programmed consequences for dis- 
ruption. Each condition was implemented 8 times (trials) for 
| min during regular instruction throughout the school day. 
The percentage of trials with disruption in each condition was 
then calculated and graphed. The results (see Figure 27.6) 
showed that disruption occurred most often in the attention 


Figure 27.6 Results from the trial-based 
functional analysis for Carson. The graph shows 
the percentage of trials, for both the control and 
the test conditions, in which disruptive behavior 
occurred. 


Carson's case example contributed by Rebecca Eldrige and 


Attention Escape 


Nathan VanderWeele. 


696 Part 10 ¢ Functional Assessment 


test condition, and thus supported the hypothesis that atten- 
tion was maintaining disruptive behavior. In addition, the 
escape test condition evoked no problem behavior, which 
suggested that escape from demands was not a reinforcing 
variable for disruptive behavior. 


Developing an Intervention 


Based on the results of the trial-based functional analysis, a 
multicomponent treatment plan was developed that included 
discontinuing attention when Carson engaged in disruptive 
behavior, teaching him to raise his hand to gain attention, and 
providing contingent breaks with teacher attention for sitting 
quietly at his desk for increasing amounts of time. Thus, prob- 
lem behavior was placed on extinction, and more appropriate 
behaviors (hand raising and sitting quietly engaged in his work) 
produced teacher attention. 


Elija—Latency-based Functional Analysis 


Gathering Information, Interpreting Information, and 
Forming Hypotheses 


Elija was a 9-year-old boy diagnosed with autism spectrum dis- 
order. He engaged in elopement and noncompliance and was 
referred to a behavior analyst for a functional analysis. 

The behavior analyst began with a Functional Assess- 
ment Interview (O'Neill et al., 1997) with Elija’s mother. 
Elija’s mother reported that he typically engaged in elopement 
in the community as well as in the home; he would run out of 
the home or store and into the street when he wanted some- 
thing he couldn’t have (while in the community) or when he was 
denied access to his toys (at home). Consequently, his mother 
did not take him out much, which limited the number of activ- 
ities Elija could engage in throughout the day. His mother kept 
all the doors locked in the house. 

Next, the behavior analyst conducted some direct obser- 
vation of Elija’s behavior at home using remote technology. Eli- 
ja’s mother was observed working with him under a variety of 
naturally occurring conditions. The behavior analyst took data 
on elopement attempts as well as the immediate antecedents 
and consequences. Attempts to elope were blocked to keep Elija 
safe. During the ABC assessment, the behavior analyst observed 
elopement attempts when Elija was asked to complete academic 
tasks. Additionally, when he eloped, Elija’s mother provided 
extensive attention in an attempt to get him to sit back down 
at the table and do his work. Based on the ABC observations, 


Figure 27.7 The latency to elopement for each 
condition in the FA for Elija. Note that elopement 
never occurred in the Play/Control condition; thus, 
the data points are plotted at 300 s, which represents 
the entire session duration (5 min). The latency 

was shortest during the attention and demand 
conditions, indicating escape and attention functions. 


Elija’s case example contributed by Denice Rios and Nicole Hollins. 
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the behavior analyst hypothesized that Elija’s elopement was 
maintained by escape from task demands as well as access to 
tangible items, and attention. 


Testing Hypotheses 


The functional analysis was conducted at the center where Elija 
received services. The behavior analyst decided to conduct a 
latency-based functional analysis. She chose this assessment 
given the dangerous nature of the problem behavior. That is, 
she wanted to minimize the number of occurrences of elope- 
ment, and staff had to intervene when elopement did occur. The 
behavior analyst immediately reinforced elopement, which pre- 
vented Elija from eloping too far from the setting. Elopement 
was defined as any instance of Elija moving more than 3 feet 
away from the instructional setting and/or from an adult without 
permission. In addition to conducting the latency FA, the behav- 
ior analyst also ensured that additional therapists were assigned 
to all exits in the building to prevent Elija from going out of the 
building. The FA consisted of the following conditions: tangi- 
ble, play (control), attention, and demand. The behavior analyst 
started a stopwatch when the condition began and let it run until 
elopement occurred. When elopement occurred, the behavior 
analyst stopped the stopwatch and recorded the time that had 
elapsed. This was the latency to elopement. The results of the 
FA are shown in Figure 27.7. The graph depicts that elopement 
was maintained by escape from demands and access to attention, 
because Elija’s latency to elopement was shortest during the 
escape and attention conditions. 


Developing an Intervention 


It can be challenging when elopement or any other dangerous 
problem behavior serves an attention function, because the prob- 
lem behavior cannot be placed on extinction (i.e., ignored) safely. 
The problem behavior will likely always result in at least some 
attention. Such was the case for Elija. As a result, the behavior 
analyst designed an intervention based on concurrently available 
reinforcement contingencies. When Elija was presented with a 
task demand, he was asked to make a choice from among three 
concurrently available options, which were signaled by three 
different colors (green, yellow, and red). Green was associated 
with Elija completing the assigned task demand. If he appro- 
priately completed the task, he was given a high-quality break 
for 60 seconds. During his break, he received attention from his 
mother, as well as preferred toys. If he asked for a break from the 
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task (the yellow option), he was given a task break. However, 
the break lasted only 30 seconds, he had access to less preferred 
toys, and his mother stayed with him for only a few seconds. If 
he engaged in elopement (the red option), he escaped the task 
for 10 seconds, no toys were available, and his mother neutrally 
blocked his egress from the room. Over time, the duration of the 
task demand to earn the high-quality break increased until Elija 
reliably engaged in the task demand rather than eloping. 


Will—Evaluating Precursor Behavior 


Gathering Information, Interpreting Information, 
and Forming Hypotheses 


Will was 14 years old with a primary diagnosis of autism. He 
was quite large for his age and displayed high levels of physi- 
cal aggression, verbal outbursts, and urinating on people. As a 
result of the severity and frequency of his problem behaviors, his 
size, a lack of informed treatment, and limited resources, Will 
was sent to a high-security juvenile detention center, where he 
was exposed to individuals without diagnosed disabilities who 
had committed violent, serious, and/or gang-related crimes. The 
county serving Will recognized that the juvenile detention cen- 
ter was not an appropriate placement for him but had no other 
placement options. As a last resort, the service coordinator on 
the case sought a behavior analytic consultation. 

Upon getting involved with the case, the behavior ana- 
lyst realized that everyone working with Will was skeptical that 
anything would help. The behavior analyst conducted an FBA 
interview with individuals who had worked with Will in the deten- 
tion center. Because the center was a lock-down facility, the staff 
could not give much information about how Will performed in 
a variety of conditions. The interview yielded little useful infor- 
mation. Also, observations of natural routines were limited due 
to the nature of the placement. As a result, no formal descriptive 
assessments were completed. The behavior analyst proposed a 
functional analysis, but one of Will’s caseworkers was concerned 
that once Will got upset, he would engage in aggression for hours. 
She was also concerned that Will’s behaviors were so severe that 
allowing them to occur even once could be dangerous. 

Based on a review of Will’s records and the limited 
information available, it was hypothesized that Will’s aggres- 
sive behaviors were maintained by escape from demands or by 
attention. In addition, it was determined that verbal outbursts 
describing the aggressive behavior he was about to engage in 
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(e.g., “I’m going to piss on you!”) reliably preceded Will’s 
aggression. As a result of all these factors, it was decided that 
the functional analysis would be conducted on Will’s verbal out- 
bursts rather than on his aggression. 


Testing Hypotheses 


The functional analysis consisted of four conditions: tangible, 
free play, escape, and attention. In each of these conditions, 
occurrences of verbal outbursts were recorded using a 10-sec 
partial interval measurement system and plotted as the percent- 
age of intervals with problem behavior (see Figure 27.8). Will 
displayed high levels of verbal outbursts during the tangible 
condition. The behavior analyst noted that Will’s verbal out- 
bursts could be interrupted if the therapist provided attention 
or task demands (i.e., the attention and demand conditions that 
followed the tangible condition) that redirected Will’s behavior 
to some form of independent work. 


Developing an Intervention 


Based on the results of the functional analysis, the behavior 
analyst recommended that Will receive training in effectively 
and appropriately asking for tangible items. In addition, it was 
recommended that the training focus on teaching Will to tolerate 
delays in getting access to those tangible items. In his typical 
environment (e.g., school and home setting), gaining immediate 
access to tangible items was not always going to be feasible, so 
tolerating delays would be imperative for Will’s success. Thus, 
a structured treatment that involved providing access to tangible 
items following appropriate requests, but not verbal outbursts, 
was recommended. After Will reliably requested items in an 
appropriate way, the behavior analyst slowly built in delays to 
the preferred item and asked Will to work on an independent 
task while waiting. Following appropriate waiting, Will received 
access to the preferred item. After observing the process, Will’s 
caseworker—albeit initially resistant—considered that Will 
could live in a less restrictive environment. 


Chris—Evaluating ABC Data 


Gathering Information, Interpreting Information, 
and Forming Hypotheses 


Chris was a 37-year-old man diagnosed with intellectual disabil- 
ities who lived in a group home. He had a limited vocal verbal 
repertoire and he used a wheelchair for mobility. Chris displayed 
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Figure 27.8 Results of Will’s functional analysis 
of precursor behavior (verbal outbursts). 


Will's case example contributed by Cody Morris, Denice Rios, 
and Lloyd Peterson. 
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frequent screaming toward caregivers and peers across many 
locations, including his home, various day programs, and out- 
ings. Interviews with staff members indicated that the scream- 
ing occurred throughout the day, but most frequently at his day 
programs. The staff hypothesized that the screaming occurred 
when Chris observed other roommates getting attention from 
staff. The behavior analyst also noticed that when staff mem- 
bers moved Chris’s wheelchair, they often did so by walking up 
behind him and moving his wheelchair without saying anything 
to Chris about what was going to happen. This seemed to star- 
tle Chris, and the behavior analyst wondered if his screaming 
might also be maintained by avoidance of being removed from 
a specific area in the day program. 

Structured ABC observations (similar to that in Figure 27.3) 
and narrative recordings (providing descriptors of each event) 
were conducted across various times in Chris’s home and 
day programs. Both inappropriate vocalizations (screaming) 
and appropriate vocalizations were tracked. After data were 
collected, each antecedent and consequence entry was coded 
into categories, such as attention, presentation or removal of 
a demand, being relocated, and alone. Each occurrence of 
appropriate and inappropriate vocalization was then analyzed 
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to determine what category of antecedent immediately pre- 
ceded it and what category of consequence followed it. The 
percentage of both appropriate and inappropriate vocalizations 
preceded or followed by each category of variables was then 
graphed and analyzed using methods described in Chapter 6. 
The results are shown in Figure 27.9. 

Based on the ABC observations and graphed ABC data, 
it was hypothesized that Chris’s screaming served an attention 
function. The graphed data clearly showed that the most com- 
mon antecedent variable that preceded both appropriate and 
inappropriate vocalizations was Chris being alone. However, 
Chris rarely received attention following appropriate attempts 
to interact with staff. In contrast, he reliably received staff 
attention following screaming. In fact, Chris was about twice 
as likely to receive attention for inappropriate vocalizations as 
for appropriate ones. Moreover, the narrative aspect of the ABC 
recording (see Table 27.3) indicated that staff members reliably 
ignored Chris while he scrolled through a wide variety of appro- 
priate vocalizations (e.g., “hi,” “how are you”) that escalated 
to grunting and eventually screaming. Typically, it wasn’t until 
Chris began screaming that he received high-quality attention 
from staff. 
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Figure 27.9 Results of Chris's ABC assessment. The left-hand panel shows the percentage of appropriate and inappropriate 
vocalizations that were preceded by the antecedent variables of one-on-one attention, access to edibles/tangibles having 
someone move his wheelchair (relocated), having a demand placed on him, or being left alone. The right-hand panel 
shows the percentage of appropriate and inappropriate vocalizations that were followed by the consequence variables 

of one-on-one attention, access to edibles/tangibles, having access to a new area, having a demand removed, or being 


left alone. 
Case example contributed by Cody Morris and Becky Kolb. 
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TABLE 27.3 Results of ABC Narrative Assessment for Chris’s Appropriate and Inappropriate Behavior 


Antecedent Behavior 

Chris alone Said “hi you” 

Chris alone Said “a ba” while pointing to a book 
Chris alone Screamed 

Chris transitioned to a new activity Screamed 


(sorting) 


Chris transitioned to a different room 


(because of an event occurring) while walking 


Developing an Intervention 


Because the ABC analysis identified a very apparent issue with 
staff interaction with Chris, it was not clear that a functional 
analysis was necessary. The behavior analyst wanted to first 
try having staff attend to appropriate behavior that Chris was 
already displaying. The intervention consisted of reversing 
staff’s differential reinforcement of the inappropriate vocaliza- 
tions. This was done by having staff reinforce the appropriate 
vocalizations (i.e., responding to the appropriate bids for atten- 
tion) while placing screaming on extinction. In addition, because 
Chris engaged in high levels of appropriate vocalizations 
already, he was taught to tolerate a delay to reinforcement. 
Staff members were asked to acknowledge his requests and to 
provide reinforcement (attention) immediately, whenever pos- 
sible. However, staff members were also taught to work directly 
with Chris to teach him to “wait” when necessary by slowly 
increasing delays to attention following their acknowledgement 
of his request. Finally, because Chris’s vocalizations were some- 
what difficult to understand and limited in variety, interventions 
to expand and improve the quality of Chris’s mand repertoire 
were introduced. 


Brian—Multiple Functions of Problem Behavior 


Gathering Information 


Brian was 13 years old and diagnosed with pervasive develop- 
mental delay, oppositional defiant disorder, and attention-deficit 
hyperactivity disorder. He had moderate delays in cognitive 
and adaptive skills. Brian displayed several problem behavy- 
iors, including aggression, property destruction, and tantrums. 
Brian’s aggression had resulted in several of his teachers having 
bruises, and his property destruction and tantrums frequently 
disrupted the daily activities of the classroom. 

A Functional Assessment Interview (O’ Neill et al., 1997) 
was conducted with Brian’s teacher, Ms. Baker, who reported 
that Brian’s problem behavior occurred most frequently when he 
was asked to perform a task that required any kind of physical 
labor (e.g., shredding papers) and occurred least during leisure 
activities. However, Ms. Baker reported that Brian often engaged 
in problem behavior when he was asked to leave a preferred 
activity. She noted that Brian used complex speech (sentences), 
although he often used verbal threats, curse words, and/or 
aggression, property destruction, and tantrums to communicate 
his wants and needs. 


Said “are you my boy” to staff 


Consequence 
Ignored by staff 
Ignored by staff 


Approached and told “Please stop screaming” and 
asked “What's wrong, Chris?” 


Staff asked Chris why he was upset—" Chris, what 
do you need?” 


Ignored by staff 


Because Brian had a spoken repertoire, a Student-Assisted 
Functional Assessment Interview (Kern et al., 1995) was also 
conducted. In this interview, Brian reported that he found his 
math work too difficult but that writing and using a calculator 
were too easy. He reported that he sometimes received help from 
his teachers when he asked for it, that sometimes teachers and 
staff noticed when he was doing a good job, and that he some- 
times received rewards for doing good work. Brian indicated 
that his work periods were always too long, especially those that 
consisted of shredding papers. Brian reported that he had the 
fewest problems in school when he was allowed to answer the 
phone (his classroom job), when he was completing math prob- 
lems, and when he was playing with his Gameboy. He stated that 
he had the most problems at school when he was outside playing 
with the other students because they often teased him, called him 
names, and cursed at him. 

An ABC assessment was conducted on two separate occa- 
sions. The results of the ABC assessment are shown in Table 27.4. 


Interpreting Information and Formulating Hypotheses 


Based on the interviews and ABC assessments, the function of 
Brian’s problem behavior was unclear. It was hypothesized that 
some of Brian’s problem behaviors were maintained by access 
to adult attention and preferred items. This hypothesis was a 
result of the ABC assessment, which indicated that many of Bri- 
an’s problem behaviors occurred when adult attention was low 
or when access to preferred items was restricted. Brian’s prob- 
lem behavior often resulted in access to adult attention or pre- 
ferred activities. It was also hypothesized that Brian’s problem 
behavior was maintained by escape because his teacher reported 
that Brian frequently engaged in problem behavior in the pres- 
ence of task demands and because Brian reported that some of 
his work was too hard and work periods lasted too long. Thus, 
a functional analysis was conducted to test these hypotheses. 


Testing Hypotheses 


The functional analysis consisted of the same conditions as 
described previously, with two exceptions. First, an alone 
condition was not conducted because there was no reason to 
believe that Brian’s problem behavior served an automatic 
function. Second, a contingent tangible condition was added 
because there was reason to believe Brian engaged in problem 
behavior to gain access to preferred tangibles and activities. 
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TABLE 27.4 Results of ABC Assessments for Brian’s Aggression, Property Destruction, and Tantrums e 


Antecedent Behavior 


Adult attention diverted to another student; 
denied access to Nintendo by teacher (i.e., told 
no when he asked if he could play it) 


Teacher attending to another student 
Teacher attention diverted to another student 


Story time, teacher attending to other students Laughed loudly 


Story time, teacher listening to other students 


Yelled at teacher, “That's not fair! 
Why do you hate me?!” 


Yelled “Stop!” at another student 


Interrupted other students while they 
were talking: “Hey, it’s my turn. | 
know what happens next 


Consequence 
Told to “calm down” 


Hit sofa, attempted to leave classroom Given choice of activity and verbal 


warning to stay in classroom 


Reprimand from teacher: “Don’t worry, 
Brian. | will take care of it.” 


Reprimand from teacher: “Stop it!” 


Reprimand from teacher: “You need to 
listen.” 


|" 


This condition was just like the play condition (i.e., Brian had 
access to adult attention and preferred toys at the beginning of 
the session), except that intermittently throughout the session, 
he was told it was time to give his toy to the teacher and to 
play with something else (which was less desirable). If Brian 
complied with the request to give the toy to the teacher, he was 
given a less preferred toy. If he engaged in problem behavior, 
he was allowed to continue playing with his preferred toys for 
a brief period. 

The results of the functional analysis are shown in 
Figure 27.10. Notice that problem behavior never occurred in 
the play condition when continuous attention and preferred items 
were available and no demands were placed on Brian. However, 
it occurred at high rates in all three of the test conditions (contin- 
gent attention, escape, and tangible). These results indicated that 
Brian’s problem behavior was maintained by escape, attention, 
and access to preferred items. 


Developing an Intervention 

Based on the results of the functional analysis, a multicom- 
ponent intervention was implemented. The intervention com- 
ponents changed at different points in time, depending on the 


context. For example, when Brian was engaged in a work task, 
it was recommended that he be given frequent opportunities to 
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Figure 27.10 Results of Brian’s functional analysis. 
Inappropriate behavior consists of aggression, 
property destruction, and tantrums. 0 


Based on Brian’s functional analysis conducted by Renee Van 
Norman and Amanda Flaute. 


request breaks. In addition, the time-out intervention that the 
teacher had been using was discontinued in the work context. 
During leisure times, when Brian had previously been expected 
to play alone, the classroom schedule was rearranged so that 
Brian could play and interact with peers. Brian was also taught to 
request toys appropriately while playing with peers. In addition, 
several interventions aimed at increasing teacher attention for 
appropriate behavior were implemented. Brian was taught how 
to request teacher attention appropriately, and teachers began to 
respond to these requests rather than ignoring them, as they had 
been doing. In addition, a self-monitoring plan was established, 
in which Brian was taught to monitor his own behavior and 
to match his self-recordings to the recordings of his teachers. 
Accurate self-recording resulted in teacher praise and access to 
preferred activities with the teacher. Brian’s teachers also imple- 
mented their own plan to increase attention and praise every 5 
minutes as long as Brian was not engaged in problem behavior 
during independent work. 


Lorraine—Multiple Topographies That Serve 
Multiple Functions 


Lorraine was 32 years old and functioned in the moderate range 
of intellectual disability. She had a diagnosis of Down syndrome 
and bipolar disorder with psychotic symptoms, for which Zoloft 
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(sertraline) and Risperdal (risperidone) were prescribed. She 
also took Tegretol (carbamazepine) for seizure control. Her 
verbal skills were low and her articulation was poor. She com- 
municated through some signs, a simple communication device, 
gestures, and some words. 

Lorraine had resided in a group home for 9 years and 
attended a sheltered workshop during the day. Lorraine dis- 
played noncompliance, aggression, and self-injurious behavior 
(SIB) in both settings, but the FBA focused on her problem 
behavior in the group home, where it was more severe and 
frequent. Noncompliance consisted of Lorraine putting her head 
down on the table, pulling away from people, or leaving the 
room when requests were made of her; aggression consisted 
of kicking others, throwing objects at others, biting others, and 
squeezing others’ arms very hard; SIB consisted of biting her 
arm, pulling her hair, or pinching her skin. 


Gathering Information 


Interviews were conducted with Lorraine, her parents, and 
workshop and group home staff. Lorraine’s parents noted that 
some of her behavior problems had increased when changes in 
her medication had been made. Workshop staff noted that Lor- 
raine was more likely to have problem behavior at work if many 
people were around her. Workshop staff had also noted that 
noncompliance had increased shortly after a dosage change in 
medication 2 months previously. The group home staff members 
noted that they were most concerned about Lorraine’s leaving 
the group home when she was asked to perform daily chores. 
Lorraine would often leave the group home and not return until 
the police had picked her up. Many neighbors had complained 
because Lorraine would sit on their porches for hours until the 
police came and removed her. 

An ABC assessment was conducted at the workshop and 
group home to determine whether environmental variables 
differed across the two settings (e.g., the manner in which tasks 
were presented, the overall level of attention). At the work- 
shop, Lorraine was engaged in a jewelry assembly task (one 
she reportedly enjoyed), and she worked well for 25 hours. 
She appeared to work better when others paid attention to her 
and often became off task when she was ignored; however, no 
problem behavior was observed at work. At the group home, 
aggression was observed when staff ignored Lorraine. No other 
problem behavior occurred. No demands were placed on Lor- 
raine in the group home during the ABC observation. Group 
home staff rarely placed any demands on Lorraine in an attempt 
to avoid her problem behavior. 


Interpreting Information and Formulating Hypotheses 


Some of Lorraine’s problem behaviors seemed to be related to a 
dosage change in her medication. Because Lorraine’s physician 
judged her medication to be at therapeutic levels, a decision 
was made to analyze the environmental events related to her 
problem behavior. Observations of problem behavior during the 
ABC assessment were limited because workshop staff placed 
minimal demands on Lorraine to avoid problem behaviors. 
However, Lorraine’s noncompliance reportedly occurred when 
demands were placed on her. Therefore, it was hypothesized 
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that these problem behaviors were maintained by escape from 
task demands. Aggression occurred during the ABC assessment 
when Lorraine was ignored. Although SIB was not observed 
during the ABC assessment, group home staff reported that 
Lorraine often engaged in SIB during the same situations that 
evoked aggression. Therefore, it was hypothesized that both 
aggression and SIB were maintained by attention. 


Testing Hypotheses 


The functional analysis consisted of free play, contingent 
attention, and contingent escape conditions (see Figure 27.11). 
Because the problem behaviors may have served different 
functions, each problem behavior was coded and graphed 
separately. Noncompliance occurred most frequently during 
the contingent escape condition and rarely occurred during 
the free play or contingent attention condition. SIB occurred 
most frequently during the contingent attention condition and 
rarely occurred during the free play or contingent escape con- 
dition. These data suggested that noncompliance served an 
escape function and SIB served an attention function. As is 
often the case for low-frequency, high-intensity behaviors, it 
was difficult to form hypotheses about the function of aggres- 
sion because the behavior occurred rarely in any of the FBA 
conditions. 


Developing an Intervention 


Different interventions were developed for the problem behav- 
iors because results of the FBA suggested that the behaviors 
served different functions. To address noncompliance, Lorraine 
was taught to request breaks from difficult tasks. Tasks were 
broken down into very small steps. Lorraine was presented with 
only one step of a task at a time. Each time a task request was 
made, Lorraine was reminded that she could request a break 
(either by saying “Break, please” or by touching a break card). 
If she requested a break, the task materials were removed for a 
brief period. Then they were presented again. Also, if Lorraine 
engaged in noncompliance, she was not allowed to escape the 
task. Instead, she was prompted through one step of the task, and 
then another step of the task was presented. Initially, Lorraine 
was allowed to completely escape the task if she appropriately 
requested a break each time the task was presented. Over time, 
however, she was required to complete increasing amounts of 
work before a break was allowed. 

Intervention for aggression consisted of teaching Lor- 
raine appropriate ways to gain attention (e.g., tapping some- 
one on the arm and saying, “Excuse me”) and teaching group 
home staff to regularly attend to Lorraine when she made such 
requests. In addition, because her articulation was so poor, a 
picture communication book was created to assist Lorraine in 
having conversations with others. This communication book 
could be used to clarify words that staff could not understand. 
Finally, staff members were encouraged to ignore Lorraine’s 
SIB when it did occur. In the past, staff had approached Lor- 
raine and stopped her from engaging in SIB when it occurred. 
The functional analysis demonstrated that this intervention 
may have increased the occurrence of SIB, so this practice 
was discontinued. 
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Percentage of 10-Second Intervals with Problem Behavior 


Figure 27.11 Results of Lorraine’s functional 
analysis. 0 


Based on Lorraine’s functional analysis conducted by 
Corrine M. Murphy and Tabitha Kirby. 
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SUMMARY 


Functions of Behavior 


1. Many problem behaviors are learned and maintained by 
positive, negative, and/or automatic reinforcement. In this 
respect, problem behavior can be said to have a “function” 
(e.g., to “get” something or to “get out of’ something). 


2. The topography, or form, of a behavior often reveals little 
useful information about the conditions that account for it. 
Identifying the conditions that account for a behavior (its 
function) suggests what conditions need to be altered to 
change the behavior. Assessment of the function of a behav- 
ior can therefore yield useful information with respect to 
intervention strategies that are likely to be effective. 


Role of Functional Behavior Assessment in 
Intervention and Prevention 


3. FBA can lead to effective interventions in at least three ways: 
(a) It can identify antecedent variables that can be altered to 
prevent problem behavior, (b) it can identify reinforcement 
contingencies that can be altered so that problem behavior 
no longer receives reinforcement, and (c) it can help identify 
reinforcers for alternative replacement behaviors. 


4. FBA can decrease reliance on default technologies 
(increasingly intrusive, coercive, and punishment-based 
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interventions) and contribute to more effective interven- 
tions. When FBAs are conducted, reinforcement-based 
interventions are more likely to be implemented than are 
interventions that include a punishment component. 


Overview of FBA Methods 


5. FBA methods can be classified into three types: (a) func- 
tional (experimental) analysis, (b) descriptive assessment, 
and (c) indirect assessment. The methods can be arranged 
on a continuum with respect to considerations such as ease 
of use and the type and precision of information they yield. 


6. Functional analysis involves systematically manipulating 
environmental events thought to maintain problem behavior 
within an experimental design. The primary advantage of 
functional analysis is its ability to yield a clear demonstra- 
tion of the variable or variables that relate to the occurrence 
of a problem behavior. However, this assessment method 
requires a certain amount of expertise to implement and 
interpret. Several variations of functional analysis proce- 
dures have been developed to adapt to a variety of situations 
and contexts, including brief functional analysis, conducting 
functional analyses in natural settings, trial-based functional 
analysis, synthesized functional analysis, latency-based 
functional analysis, and analyzing precursor behaviors. 


7. Descriptive assessment involves observation of the prob- 
lem behavior in relation to events that are not arranged in 
a systematic manner and includes ABC recording (both 
continuous and narrative) and scatterplot recording. The 
primary advantages to these assessment methodologies are 
that they are easier to do than functional analyses and they 
represent contingencies that occur within the individual’s 
natural routine. Caution must be exercised when interpret- 
ing information from descriptive assessments, however, 


because it can be very difficult to parse the contingencies 
via them. 


. Indirect functional assessment methods use structured 
interviews, checklists, rating scales, or questionnaires 
to obtain information from persons who are familiar 
with the individual exhibiting the problem behavior 
(e.g., teachers, parents, caregivers, and/or the individ- 
ual himself or herself) to identify possible conditions 
or events in the natural environment that correlate with 
the problem behavior. Again, these forms of FBA are 
easy to conduct, but they are limited in their accuracy. 
As such, they are probably best reserved for hypothesis 
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formulation. Further assessment of these hypotheses is 
almost always necessary. 


Conducting a Functional Behavior Assessment 


»: 


10. 


Given the strengths and limitations of the different FBA 
procedures, FBA can best be viewed as a four-step process: 


¢ First, gather information via indirect and descriptive 
assessment. 


¢ Second, interpret information from indirect and 
descriptive assessment and formulate hypotheses about 
the purpose of problem behavior. 


Third, test hypotheses using functional analysis. 


¢ Fourth, develop intervention options based on the 
function of problem behavior. 


When teaching an alternative behavior as a replacement 
for problem behavior, the replacement behavior should be 
functionally equivalent to the problem behavior (i.e., the 
replacement behavior should produce the same reinforcers 
that previously maintained the problem behavior). 


eR I TRE EPH COS SIE SOL RIE BOD SBE RIDDEN SPE EE SL SESE YEE A SOI 


KEY TERMS 


conditional probability functional analysis functionally equivalent 


contingency reversal functional behavior assessment (FBA) indirect functional assessment 


descriptive functional behavior 
assessment 
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MULTIPLE-CHOICE QUESTIONS 


1. Devonia hits her head with a closed fist when her one- 


playing on the computer, he rarely spits on his teacher. What 


on-one teaching assistant leaves her side to interact 
with another student. Usually, when Devonia does this, 
her teaching assistant returns to her side, asks her to 
stop hitting herself, and soothes her. She rarely engages 
in head hitting when her assistant works directly with 
her. What is the most likely function of Devonia's 
problem behavior? 

a. Escape 

b. Attention 

c. Automatic reinforcement 

d. Tangible 

e. Escape and/or tangible 

Hint: (See “Functions of Behavior’) 

. Charles spits on his teacher when he prompts him to complete 
a toothbrushing task. For obvious reasons, this behavior really 
grosses out the teacher, who refuses to work with him when 
he behaves like this. When Charles spits on him, the teacher 


walks away and allows Charles to leave the toothbrushing 
task and go to the computer to calm down. As long as he is 


is the most likely function of Charles’ behavior? 
a. Escape 

b. Automatic reinforcement 

c. Tangible 

d. Escape and/or tangible 

e. Escape and/or attention 


Hint: (See “Functions of Behavior.”) 


. On a practical level, FBA is important for prevention of 


and intervention for problem behavior because: 


a. When the cause-and-effect relation between environmental 
effects and behavior can be determined, that relation can 
be altered to improve behavior. 


b. It meets the federal guidelines for best practices in 
treating problem behavior. 


c. It will tell a teacher exactly what to do for intervention. 
d. None of these 


Hint: (See “Role of Functional Behavior Assessment in 
Intervention and Prevention’’) 
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4. Devonia’s one-on-one assistant (see Question 1) decides 


that he will no longer leave Devonia’s side to interact with 
other children as an intervention for her problem behavior. 
This is an example of what form of intervention? 


a. Altering antecedent variables 

b. Altering consequent variables 

c. Teaching alternative behaviors 

d. All of these 

Hint: (See “Role of Functional Behavior Assessment in 
Intervention and Prevention’) 

. There are at least three forms of FBA. They are: 


a. Functional analysis, standardized assessment, and 
descriptive assessment 


b. Indirect assessment, descriptive assessment, and 
behavioral observation 


c. Functional analysis, descriptive assessment, and 
indirect assessment 


d. Behavioral observation, standardized assessment, and 
curriculum-based assessment 


Hint: (See “Overview of FBA Methods’”’) 

. Which methods of FBA allow you to confirm hypotheses 
regarding the function of problem behavior? 

a. Functional analysis 

b. Descriptive assessment 

c. Indirect assessment 

d. All of these 

Hint: (See “Overview of FBA Methods’’) 

. Which of the following describes a descriptive functional 
behavior assessment? 


a. An analog analysis in which consequences representing 
those in the natural routine are systematically arranged. 


b. Direct observation of behavior made under naturally 
occurring conditions 


c. Structured checklists that caregivers fill out to identify 
events that correlate with problem behavior 


d. Behavior rating scales that caregivers fill out to identify 
events that correlate with problem behavior 


Hint: (See “Descriptive Functional Behavior Assessment”) 


. A limitation of descriptive assessment is: 


a. It may be misleading in that it can identify environmental 
variables that occur in close proximity to the problem 
behavior but that are not causally related to the problem 
behavior. 


b. It may not be a very reliable measure of problem 
behavior and environmental events. 


c. Itis extremely difficult and time-consuming to perform. 
d. The first two answer choices are correct. 


Hint: (See “Descriptive Functional Behavior Assessment”) 


. Ms. Frieder, who teaches fifth grade at Franklin Elementary 
School, decides to conduct a descriptive FBA for Amelia. 


10. 


11. 


Amelia has been refusing to do her schoolwork lately and has 
been ripping up her worksheets on a daily basis. Ms. Frieder 
creates a form on which she will mark specific antecedents 
and consequences that precede and follow Amelia’s work 
refusals. For antecedents, she will mark one of the following: 
Math work given, reading work given, spelling work given, 
error correction given, work that requires writing given. For 
consequences, she will mark one of the following: verbal 
reprimand, another worksheet (same worksheet) provided, 
another worksheet (different worksheet) provided, time out, 
ignore. She will record these environmental events, whether 
or not a problem behavior occurs. What kind of descriptive 
assessment is Ms. Frieder conducting? 


a. Scatterplot recording 

b. ABC continuous recording 

c. ABC narrative recording 

d. Conditional probability recording 

Hint: (See “Descriptive Functional Behavior Assessment’) 


Mr. Peterson has been struggling with Arnold, a boy with 
severe disabilities who hums and rocks back and forth 
intermittently throughout the day. Mr. Peterson wants to 
see if these behaviors are associated with any specific 
activities during the school day so that he can then more 
closely analyze what occurs during those time periods. 
Which descriptive assessment method would be the best 
choice for what he wants to accomplish? 


a. Scatterplot recording 

b. ABC continuous recording 
c. ABC narrative recording 
d. Functional analysis 


Hint: (See “Descriptive Functional Behavior Assessment”) 


Ms. Carmichael is conducting an FBA for Jamal, a boy 

in her class who runs away from activities and teachers. 
Ms. Carmichael began by interviewing her teaching 
assistants and by participating in the conversation with 
them to define the target behavior and to determine what 
antecedents and consequences she would watch for during 
her ABC assessment. Ms. Carmichael completes her ABC 
assessment and finds that running away from tasks and 
materials occurs most frequently when difficult tasks are 
presented and that the most common consequence for 

this behavior is a teacher chasing after him and returning 
him to the classroom. Sometimes he is required to return 
to the task and sometimes he is allowed to do a different 
activity when he comes back to the classroom. What can 
be concluded from this descriptive assessment? 


a. The problem behavior most likely occurs to get 
attention from teachers 


b. The problem behavior most likely occurs to escape 
from nonpreferred tasks. 


c. The problem behavior most likely occurs to get access 
to more preferred activities. 


d. The function of the problem behavior remains unclear. 


Hint: (See “Conducting a Functional Behavior Assessment’’) 


12. 


13. 


14. 


15. 


16. 


After you complete a descriptive assessment, you should 
write hypothesis statements that reflect your interpretation 
of the data. Which of the following is a hypothesis 
statement that contains all of the important elements? 


a. Gain peer attention: When Valerie is playing alone on 
the playground, she bear hugs her peers. 


b. Gain peer attention: Valerie bear hugs her peers, which is 
followed by attention in the form of teasing from her peers. 


c. When Valerie is playing alone on the playground, she 
bear hugs her peers, which is followed by attention in 
the form of teasing from her peers. 


= 


Gain peer attention: When Valerie is playing alone 
on the playground, she bear hugs her peers, which is 
followed by attention in the form of teasing from her 
peers. 


Hint: (See “Conducting a Functional Behavior Assessment’) 

The primary reason for conducting a functional 

analysis is: 

a. To test hypotheses generated via indirect and 
descriptive assessments. 


b. To generate hypotheses that can be further evaluated 
via indirect and descriptive assessments. 


c. To observe problem behavior within the naturally 
occurring routine. 


d. To identify any temporal patterns in problem behavior. 
Hint: (See “Overview of FBA Methods”) 


Characteristics of functional analysis include: 


a. They are conducted within naturally occurring 
routines. 


b. They are conducted within analog conditions that 
represent naturally occurring routines. 


c. They utilize interviews and rating scales rather than 
direct observation of problem behavior. 


d. They are conducted in clinical settings only. 


Hint: (See “Functional (Experimental) Analysis’) 


In addition to a control condition, a functional analysis 
typically consists of what test conditions? 


a. Contingent attention 

b. Contingent escape 

c. Alone 

d. All of these 

Hint: (See “Functional (Experimental) Analysis’) 


Mr. Moon is conducting a functional analysis with 

Ira, who kicks his teachers frequently. Mr. Moon iS 

implementing a contingent escape condition. He prompts 

Ira to engage in a difficult task. Ira kicks Mr. Moon in the 

shin. How should Mr. Moon respond if he is to implement 

the contingent escape condition correctly? 

a. He should continue prompting Ira to do the task, but he 
should provide a mild reprimand, such as, “Ira, don’t 
kick me. That hurts. Come on, it’s time to work.” 


17. 


Percentage of Intervals with Tantrums 


18. 


Rate of Skinpicking 
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. He should remove the task materials and turn away 


from Ira for a short period of time. 


- He should ignore Ira’s kicking and continue with the task. 


. He should provide Ira with a more preferred activity for 


a short period of time. 


Hint: (See “Table 27.1, page 682”) 


Look at the following graph from a functional analysis of 
Elsa’s tantrums. What would you conclude is the function 
of Elsa’s tantrums? 


+> Alone 
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—+— Control 
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. Escape 


Attention 


. Both escape and attention 


. The pattern of behavior is undifferentiated; therefore 


the analysis is inconclusive. 


Hint: (See “Overview of FBA Methods”) 


Look at the following graph from a functional analysis of 
Walter’s skin picking. What would you conclude is the 
function of Walter’s skin picking? 


-- Alone 

—=— Contingent Escape 
“&~ Contingent Attention 
—<— Control 
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a. Escape 
b. Attention 
c. Automatic 
d. The pattern of behavior is undifferentiated; therefore 


the analysis is inconclusive. 


Hint: (See “Overview of FBA Methods’’) 
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ESSAY-TYPE QUESTIONS 
iE 


Provide an example of a problem behavior that is 

maintained by social positive reinforcement, social negative 
reinforcement, and automatic reinforcement. In your answer, 
state what happens prior to the problem behavior, what the 
problem behavior is, and what follows the problem behavior. 


Hint: (See “Functions of Behavior’) 


. Discuss three different ways in which the results of a 


functional behavior assessment can be used to formulate 
interventions that prevent future occurrences of problem 
behavior. 


Hint: (See “Role of Functional Behavior Assessment in 
Intervention and Prevention’’) 


. Explain the three types of functional behavior assessment. 


Discuss them in the order in which they might best be 
implemented and explain this sequence. Your explanation 
of the sequence should include the strengths and 
limitations of each type. 


Hint: (See “Overview of FBA Methods and Case Examples”’) 


. Compare and contrast descriptive and indirect 


assessments. What is the primary variable that sets these 
two forms of gathering descriptive information apart? 


Hint: (See “Overview of FBA Methods: Descriptive 
Functional Behavior Assessment and Indirect Functional 
Behavior Assessment’) 


. Give an example of a descriptive assessment and state 


what specific information this assessment would yield. 


Hint: (See “Overview of FBA Methods: Descriptive 
Functional Behavior Assessment’) 


. Assume you are a behavioral consultant for a local 


developmental disabilities agency. You have been asked 
to consult on young adult, Melissa, who is 22 years old 
and frequently disrobes while at work. You conduct 
indirect and descriptive assessments with her staff and 
learn the following: Melissa rarely disrobes when she is 
in the break room having a snack and interacting with her 
co-workers and staff. She disrobes most frequently when 
she is working on her assigned job task. It doesn’t seem 
to matter what task she is working on. You conduct an 


sl 


ABC assessment and note that she works well when a staff 
member comes by to check her work, and this seems to 
continue for a few minutes after the staff member leaves. 
About 4 minutes after the staff member leaves, however, 
the clothes start coming off. Melissa is happy to put her 
clothes back on when directed to by a staff member. 
Because of these behaviors, Melissa’s work station has 
been moved to behind a curtain in the work room. Much 
to the staff's chagrin, Melissa's disrobing has actually 

been worsening since they did this. What is your working 
hypothesis for Melissa’s problem behavior? Write your 
hypothesis in the form recommended by the text, and 
provide a rationale for your hypothesis. 

Hint: (See “Conducting a Functional Behavior Assessment: 
Gathering Information and Interpreting Information and 
Forming Hypotheses’) 


. Explain the difference between a functional analysis 


and other forms of functional behavior assessment (e.g., 
descriptive and indirect assessments). Specifically, why is 
it that a functional analysis allows you to test hypotheses, 
while other forms of functional behavior assessment are 
used for hypothesis formulation? 


Hint: (See “Overview of FBA Methods: Basic Procedure’”’) 


. Describe the conditions that might be implemented 


in a functional analysis, as well as how and why each 
condition is implemented. 
Hint: (See “Overview of FBA Methods: Basic Procedure” 


and “Conducting a Functional Behavior Assessment: 
Testing Hypotheses”) 


. Assume that in your role as a consultant, you decide 


to conduct a functional analysis of Melissa's disrobing 
(see Question 6). Assume that your functional analysis 
indicates her disrobing is maintained by both attention 
from others and escape from task demands. Assume that 
the functional analysis also confirmed your hypothesis 
that she rarely engages in problem behavior when she has 
attention and no task demands. Draw a graph that would 
illustrate this function appropriately. 


Hint: (See “Case Examples Illustrating the FBA Process’’) 


PART 11 


Special Applications 


Parts 4 through 10 described basic principles of behavior and behavior change tactics derived 
from those principles. Part 11 describes several applications of behavior change technology. 
Each application represents a strategic approach to change behavior using multiple principles 
and procedures. Chapter 28 addresses the token economy, group contingencies, and contingency 
contracting. Chapter 29 details tactics individuals can implement to change their own behavior. 
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Token Economy, Group Contingencies, 
and Contingency Contracting 


LEARNING OBJECTIVES 


¢ Define and identify the components of a contingency contract. 


¢ Define and identify the components of a token economy. 


¢ Explain the considerations that must be made when implementing a token economy. 
¢ Define and identify the different types of group contingencies. 
e Explain the considerations that must be made when implementing a group contingency. 


tingencies, and contingency contracting as behavior 

change strategies. Each strategy will be defined and its 
relationship to behavioral principles explained; requisite com- 
ponents of each strategy will be addressed and guidelines for 
designing, implementing, and evaluating each will be presented. 
These three strategies are grouped because they have several 
features in common. A well-established research base supports 
their effectiveness. Further, they can be combined with other 
approaches in package programs to provide an additive effect. 
Also, each strategy can be used with individual and multiple 
group arrangements. Finally, the flexibility of these three special 
applications makes them attractive options for practitioners in 
various settings. 


sk chapter addresses the token economy, group con- 


TOKEN ECONOMY 


The token economy is a highly developed and broadly 
researched behavior change system. It has been applied suc- 
cessfully in virtually every educational, residential, employ- 
ment, and therapeutic setting possible (Bullock & Hackenberg, 
2006; Doll, McLaughlin, & Barretto, 2013; Glynn, 1990; Hack- 
enberg, 2009; Musser, Bray, Kehle, & Jenson, 2001; Pietras 
& Hackenberg, 2005; Stilitz, 2009; Wolfe, 1936). The utility 
of a token economy to change behaviors that previously have 
been resistant to modification is well established in the exper- 
imental and applied literatures. Further, token economies have 
proved successful whether administered by practitioners (Jowett 
Hirst, Dozier, & Payne, 2016) or even peer groups of children 
in elementary schools (Alstot, 2015). This section defines and 
describes the token economy and outlines effective procedures 
for using it in applied settings. 


Token Economy Defined 


A token economy (or token reinforcement system) consists of 
three major components: (a) a list of target behaviors; (b) tokens, 


708 


points, or marks participants earn for emitting the target behav- 
iors; and (c) a menu of backup reinforcers—preferred items, 
activities, or privileges—that participants obtain by exchang- 
ing earned tokens. Hackenberg (2009) described a token econ- 
omy as “an interconnected set of contingencies that specifies 
the relations between token production, accumulation, and 
exchange” (p. 259). Operationally, these interconnections 
work in three stages. First, the behavior or behaviors to be 
reinforced are identified and defined. Second, a medium of 
exchange—the token—is selected. Third, participants exchange 
earned tokens for backup reinforcers (i.e., objects, privileges, 
or activities known or presumed to function as reinforcers for 
participants). 


In addition to having three stages, A token economy 
derives its complexity from the three interconnected 
schedules of reinforcement that govern when tokens will 
be delivered (token production), when tokens will be 
exchanged (exchange production), and the cost in tokens 
of the various goods or services available for exchange 
(token exchange; Hackenberg, 2009). These schedules 
of reinforcement represent internal mechanisms of a 
token economy that can be arranged using a number 

of basic reinforcement schedules (e.g., fixed ratio and 
variable interval) or complex schedules (e.g., differential 
reinforcement of other behavior). (Ivy, Meindl, Overley, 
& Robson, 2017, p. 3) 


The Functions of Tokens 


Store-based or manufacturer coupons function as tokens. When 
a customer purchases an item from a participating store, the 
cashier provides the purchaser with a paper coupon directly or 
electronically through a smartphone or tablet device. That cou- 
pon is traded later for another item at a reduced price, or it is 
redeemed immediately for the backup conditioned reinforcer. 
Money, in the form of currency, is an example of a token 
that can be exchanged at a later time for backup objects and 
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activities (e.g., food, clothing, transportation, entertainment). As 
Bell and McDevitt (2014) stated: 


To see the enormous power of conditioned reinforce- 
ment, one must only consider the potent effects of 
money on human behavior. A “token” is an object that 
can be exchanged for other reinforcers. It is also called 

a “generalized reinforcer,” as it does not need to rely 

on a single reinforcer in order to be effective. Money is 
one such token and maintains much of our behavior, but 
other kinds of tokens can be used to establish or maintain 
specific behaviors. (p. 242) 


Hackenberg (2009) noted that, “A token, as the name 
implies, is an object of no intrinsic value; whatever function(s) 
a token has is established through relations to other reinforc- 
ers, both unconditioned (e.g., food or water) or conditioned 
(e.g., money or credit). As such, a token may serve multiple 
functions—reinforcing, punishing, discriminative, eliciting— 
depending on its relationship to these other events” (p. 259). 

The multiple functions of tokens figure importantly in 
a token economy, which is often effective in applied settings, 
for three reasons. First, tokens bridge the time gap between the 
occurrence of a behavior and delivery of a backup reinforcer. 
For example, a token may be earned during the afternoon, but 
the backup reinforcer is not awarded until the next morning. 
Second, tokens bridge the setting gap between the behavior and 
the delivery of the backup reinforcer. For instance, tokens earned 
at school could be exchanged for reinforcers at home, or tokens 
eared in a general education classroom in the morning could 
be exchanged in a special education classroom in the afternoon. 
Finally, as generalized conditioned reinforcers, tokens make the 
management of motivation less critical for the behavior analyst. 

Tokens are generalized conditioned reinforcers, and as 
such are independent of specific states of motivation because 
they are associated with a wide array of backup reinforcers. 
However, a generalized conditioned reinforcer is a relative 
concept: Effectiveness depends to a large extent on the exten- 
siveness of the backup reinforcers and their ability to affect 
the future occurrence of behaviors (Moher, Gould, Hegg, & 
Mahoney, 2008). Tokens exchangeable for a wide variety of 
backup reinforcers have considerable utility in schools, clinics, 
centers, and hospitals where it is difficult to manage the delivery 
of powerful reinforcers, or deprivation states, which may change 
with time or location. 

Higgins, Williams, and McLaughlin (2001), for example, 
used a token economy to decrease the disruptive behaviors of 
an elementary-age student with learning disabilities. The student 
exhibited high levels of out-of-seat, talking out, and poor sitting 
behaviors. After collecting baseline data on these behaviors, a 
token economy was implemented. If an alternative behavior was 
being emitted instead of the behaviors targeted for reduction, 
the student earned a checkmark exchangeable for free time. 
Maintenance checks were taken on two subsequent occasions 
to determine duration effects. 

Figure 28.1 displays the results of the study across the 
three dependent variables. A functional relation was established 
between the onset of the token economy and the reduction 


of the behaviors. Further, maintenance checks indicated that 
the behavior remained at low levels after the token economy 
phase ended. 


Level Systems 


A level system is a type of token economy in which participants 
move up (and sometimes down) a hierarchy of tiers contingent 
on meeting specific performance criteria with respect to the tar- 
get behaviors. As participants move “up” from one level to the 
next, they have access to more privileges and are expected to 
demonstrate more independence (see example in Table 28.1). 
The schedule of token reinforcement is gradually thinned so that 
participants at the highest levels are functioning on schedules of 
reinforcement similar to those in natural settings. 

According to Smith and Farrell (1993), level systems are 
an outgrowth of two major educational advances that began in 
the late 1960s and 1970s: (a) Hewett’s Engineered Classroom 
(1968) and (b) Phillips, Phillips, Fixen, and Wolf’s Achievement 
Place (1971). In both cases, systematic academic and social 
programming combined token reinforcement, tutoring systems, 
student self-regulation, and managerial arrangements. Smith and 
Farrell stated that level systems are designed to 


foster a student’s improvement through self-management, 
to develop personal responsibility for social, emotional, 
and academic performance . . . and to provide student tran- 
sition to a less restrictive mainstream setting. . . . Students 
advance through the various levels as they show evidence 
of their achievement. (p. 252) 


In a level system, students must acquire and achieve 
increasingly more refined repertoires while tokens, social 
praise, or other reinforcers are simultaneously decreased. Level 
systems have built-in mechanisms for participants to advance 
through a series of privileges, and they are based on at least three 
assumptions: (a) Combined techniques—so-called “package 
programs”—are more effective than individual contingencies 
introduced alone, (b) student behaviors and expectations must 
be stated explicitly, and (c) differential reinforcement 1s neces- 
sary to reinforce closer and closer approximations to the next 
level (Smith & Farrell, 1993). 

Lyon and Lagarde (1997) proposed a three-level group 
of reinforcers that placed useful, but less desirable, reinforcers 
at Level 1. At this level, students had to earn 148 points, or 
80%, of the 185 maximum points that could be earned during 
the week to purchase certain items. At Level 3, the powerful, 
and highly desirable, reinforcers could be purchased only if the 
students had accumulated at least 167 points, or 90%, of the 
total points possible. As the levels progress, the expectations for 
performance increase. 

Cavalier, Ferretti, and Hodges (1997) incorporated a 
self-management approach with an existing level system to 
improve the academic and social behavior of two adolescent 
students with learning disabilities for whom increased partici- 
pation in a general education classroom was an individualized 
education program (IEP) goal. Basically, other students in the 
classroom were making adequate progress through the six-level 
point system that the teacher had devised, but the inappropriate 
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Figure 28.1 Number of talk-outs, out-of-seat 
behavior, and poor posture during baseline, token 
economy, and maintenance conditions. 


From “The Effects of a Token Economy Employing Instruc- 
tional Consequences for a Third-Grade Student with Learning 
Disabilities: A Data-Based Case Study,” by J. W. Higgins, 

R. L. Williams, and T. F. McLaughlin, 2001, Education and 
Treatment of Children, 24(1), p. 103. Copyright 2001 by 

The H. W. Wilson Company. Reprinted by permission. 


Poor Posture 


verbalizations of the two target students during class stalled their 
progress at Level 1. After collecting baseline data on the number 
of inappropriate verbalizations, the researchers trained the stu- 
dents to self-record occurrences of these behaviors during two 
50-min intervals in the day. Inappropriate verbalizations were 
defined explicitly, and during mock trials the students practiced 
self-recording and received feedback based on teacher observa- 
tions during the same interval. An emphasis was placed on the 
accuracy of the students’ recording, and desirable reinforcers 
were awarded for accurate recording. During the intervention 
(level system plus self-recording), students were told that they 
would be observed during the two 50-min intervals for accu- 
racy. If they met the criterion for each level (e.g., a decrease of 
five inappropriate verbalizations from the previous session), a 
reinforcer was delivered. As the students progressed through the 
levels, the reinforcer changed to more highly desirable items. 
Results showed a high number of verbalizations during 
baseline; however, when the packaged intervention was initiated 


Baseline Token Economy Maintenance 
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for Student 1, inappropriate verbalizations decreased. This out- 
come was replicated for Student 2, confirming a functional rela- 
tion between the intervention and the decrease in inappropriate 
verbalizations. 


Designing a Token Economy 


The basic steps in designing and preparing to implement a token 
economy are as follows: 


1. Identify target behaviors and rules. 

2. Select tokens. 

3. Develop a menu of backup reinforcers. 
4. Establish a token exchange ratio. 
5 


- Determine when and how tokens will be dispensed and 
exchanged and what will happen if the requirements to 
earn a token are not met. 


6. Field-test the system before full-scale implementation. 
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TABLE 28.1 Students’ Increased Access to and Independent Use of Internet and Internet-enabled Devices as Rewards 


for Achieving Higher Levels of Performance 


Orientation phase Growth phase 


Device Level 1 


Level 1+ 


Class computer Internet access with 


staff supervision staff supervision 


Own games console No No 
Own handheld games No No 
console 


Houseroom computer No 
staff supervision 


Community (café, No No 
library, etc.) 

Own laptop No No 
Own tablet computer No No 
Own basic mobile No No 


(calls and texts only) 
Own smartphone No No 


Internet access with 


Internet access with 


Independence phase 
Level 2 Level 2+ 


Internet access with 
staff supervision 


Level 3 


Internet access with 
staff supervision 


No No 


Internet access with 
staff supervision 
Internet access 


Internet access with 
staff supervision 


Internet access with 
Staff supervision 


Internet access 


Internet access with 
Staff supervision 


Internet access Internet access 


Internet access Internet access Internet access 


Internet access with 
staff supervision 


Internet access Internet access 


Internet access 
No Yes Yes 


Internet access Internet access 


No No Internet access 


From “The ACHIEVE! Program: A Point and Level System for Reducing Severe Problem Behavior,” by D. Pritchard, H. Penney, and R. C. Mace, © 2018. 


Reproduce with permission of John Wiley & Sons Inc. 


Identify Target Behaviors and Rules 


Chapter 3 addressed the selection and definition of behavior 
change targets. The criteria presented in that chapter also apply 
to the selection and definition of rules and target behaviors 
for a token economy. Generally, the guidelines for selecting 
behaviors for a token economy include: (a) selecting only mea- 
surable and observable behaviors; (b) specifying criteria for 
successful task completion; (c) starting with a small number 
of behaviors, including some that are easy for the individual to 
accomplish; and (d) being sure the participants have the prereq- 
uisite skills for any targeted behaviors (Myles, Moran, Ormsbee, 
& Downing, 1992). 

After rules and behaviors that apply to everyone are 
defined, then criteria and behaviors specific to individual learn- 
ers should be established. Many token economy failures can 
be traced to requiring the same behaviors and setting the same 
criteria for all learners. Token economies usually need to be 
individualized. For example, in a classroom setting, the teacher 
may want to select different behaviors for each student, or per- 
haps the token economy should not be applied to all students 
in the classroom. Instead, only the students with the greatest 
challenges should be included. However, students who do not 
need a token system should still continue to receive other forms 


of reinforcement. 


Select Tokens 

Frequently used tokens include points, tally marks, plastic strips, 
checkers, coupons, poker chips, holes punched in a card, or 
teacher initials. Criteria to consider in selecting the token itself 
are important. First, tokens should be safe. If a young child or 


participant with severe learning or behavioral problems is to 
receive the token, it should not be an item that can be swal- 
lowed or otherwise used to cause injury. Also, tokens should 
not be large, cumbersome, or unwieldy. Second, the analyst 
should control token physical configuration; learners should not 
be able to bootleg the acquisition of tokens by surreptitiously 
manufacturing their own. If tally marks are used, they should 
be on a special card, or made with a special marking pen that is 
available only to the analyst. Likewise, if holes are punched in 
a card, the paper punch should be available only to the analyst 
to avoid counterfeiting. 

Tokens should be durable because they may have to be used 
for an extended period, and they should be easy to carry, handle, 
bank, store, or accumulate. Tokens should also be readily acces- 
sible to the practitioner at the moment they are to be dispensed. 
It is important that they be provided immediately after the target 
behavior. Tokens should be inexpensive; there is no need to spend 
a large sum of money to purchase tokens. Injectable stamps, stars, 
checkmarks, and buttons are all inexpensive items that can be 
used repeatedly as tokens. 

Although using desirable objects as tokens can backfire—a 
teacher who used baseball cards as tokens found students spent 
so much time interacting with the tokens (e.g., reading about 
players) that the tokens distracted students from the purpose of 
the system—for some students, the “objects of obsession” can 
serve as effective tokens. In a study by Charlop-Christy and 
Haymes (1998), three children with autism who attended an 
after-school program served as participants. All three children 
were consistently off task during activities, preoccupied with 
some objects, and engaged in self-stimulatory behaviors. During 
baseline, the students earned stars for appropriate behavior. 
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Inappropriate behaviors or incorrect responses were addressed 
by saying, “try again” or “no.” When the students earned five 
stars, the tokens could be exchanged for backup reinforcers 
(e.g., food, pencils, tablets). During the token condition, an 
“obsession” object (one the children had been previously pre- 
occupied with) was used as the token. Once they had earned 
five obsession objects, children could exchange them for food 
items or other known reinforcers. Charlop-Christy and Haymes 
reported that the overall pattern of responding demonstrated 
that when the token was an obsession object, student perfor- 
mance improved. 


Develop a Menu of Backup Reinforcers 


Most token economies can use naturally occurring activities 
and events as backup reinforcers. For example, in a classroom, 
center, or school setting, tokens can be exchanged for popular 
games or materials, or they can be traded for favorite classroom 
jobs, such as office messenger, materials distributor, teacher 
assistant, or media operator. Tokens can also be used for school- 
wide privileges, such as a library or study hall pass; a special 
period (e.g., music, physical education) with another class; or 
special responsibilities, such as school patrol, cafeteria monitor, 
or tutor. Higgins and colleagues (2001) used naturally occurring 
activities and events as backup reinforcers for their token econ- 
omy (i.e., the students had access to computer games and leisure 
books). However, play materials, hobby-type games, snacks, 
television time, allowance, permission to go home or downtown, 
sports events, and coupons for gifts or special clothing could 
also be used as backup reinforcers because these objects or items 
tend to occur in many settings. 

If naturally occurring routine activities and events have 
not served as reinforcers in the past, then conducting a rein- 
forcer assessment would be advisable (Sran & Borrero, 2010). 
Identifying backup reinforcers using a formal assessment pro- 
cedure increases the chances that those stimulus items will 
serve as reinforcers. Further, an assessment would identify 
known reinforcers specific to the individual, and not suspected 
generic reinforcers for the group. Depending on the population 
of students to be served, analysts should be prepared to conduct 
multiple assessments over time as reinforcer preference can shift 
and change (DeLeon et al., 2001). All practitioners should use 
the most powerful, but least intrusive, and naturally occurring 
reinforcers possible. 

Finally, selection of backup reinforcers should follow con- 
sideration of ethical and legal issues, as well as state and local 
education agency policies. Token reinforcement contingencies 
that would deny the learner basic needs (e.g., food) or access 
to personal or privileged information or events (e.g., access to 
mail, telephone privileges, attending religious services, medical 
care, scheduled recess) should not be used. Furthermore, general 
comforts that are associated with basic rights afforded to all 
citizens (e.g., clean clothing, adequate heating, ventilation, hot 
water) should not be used in a token program. 


Establish a Token Exchange Ratio 


Initially, the ratio between the number of tokens earned and the 
price of backup items should be small to provide immediate 


success for learners. Thereafter, the ratio of exchange should be 
adjusted to maintain the responsiveness of the participants. Fol- 
lowing are general guidelines for establishing the ratio between 
earned tokens and the price of backup items: 


* Keep initial ratios low. 

¢ As token-earning behaviors and income increase, increase 
the cost of backup items, devalue tokens, and increase the 
number of backup items. 

¢ With increased earnings, increase the number of luxury 
backup items. 

¢ Increase the prices of necessary backup items more than 
those of luxury items. 


Determine How the Tokens Will Be Dispensed 


If objects such as tally marks or holes punched in a card are 
selected as the tokens, how the learner will receive them is obvi- 
ous. If objects such as coupons or poker chips are used, some 
container should be made available for storing the accumulated 
tokens before they are exchanged for the backup items. Some 
practitioners have learners construct individual folders or con- 
tainers for storing their tokens. Another suggestion is to deposit 
the tokens through cut slots in the plastic tops of coffee cans. 
With younger learners, tokens can be chained to form a necklace 
or bracelet. 


Determine How the Tokens Will Be Exchanged 


A menu of the backup items should be provided with a given 
cost for each item. Learners can then select from the menu. 
Some teachers have a table store with all the items displayed 
(e.g., games, balloons, toys, certificates for privileges). To avoid 
noise and confusion at shopping time, individual orders can be 
filled by writing in or checking off the items to be purchased. 
Those items are then placed in a bag with the order form stapled 
to the top and are returned to the purchaser. Initially, the store 
should be open frequently, perhaps twice per day. Lower func- 
tioning learners may need more frequent exchange periods. 
Later, exchange periods might be available only on Wednesdays 
and Fridays, or only on Fridays. As quickly as possible, token 
exchange should occur on an intermittent basis. 


Specify What Happens if Token 
Requirements Are Not Met 


Occasionally, token requirements will not be met for one reason 
or another. One approach is to nag the individual: “You didn’t 
do your homework. You know your homework must be com- 
pleted to earn tokens. Why didn’t you do it?” A better approach 
is a matter-of-fact restatement of the contingency: “I’m sorry. 
You haven’t enough tokens to exchange at this time. Try again.” 
It is important to know whether the individual has the skills 
required to earn tokens. A learner should always be able to meet 
the response requirements. 


What Should Be Done When a Learner Tests the System? How 
should a practitioner respond when a learner says she does not 
want any tokens or backup items? A bad approach would be 
to argue, debate, or cajole the learner. A better approach is to 
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say something neutral (e.g., “That is your decision.’) and then 
walk away, precluding any argument or debate. In this way a 
confrontation is avoided, and the occasion remains set for token 
delivery for the learner. Most learners can and should have input 
in selecting the backup items, generating the rules for the econ- 
omy, establishing the price for the backup items, and perform- 
ing general duties in managing the system. A learner can be a 
salesperson for the store or a bookkeeper to record who has how 
many tokens and what items are purchased. When learners are 
involved and their responsibilities for the economy are empha- 
sized, they are less likely to test the system. 


Will the Token Economy Include a Response Cost Contingency? 
Procedures for including response cost—a negative punishment 
procedure—with a token economy were presented in Chapter 15. 
Token economies can include a token loss contingency for inap- 
propriate behaviors and rule infractions (Musser et al., 2001). 
Any behaviors subject to response cost should be defined and 
stated clearly in the rules. Learners need to be aware of what 
actions will result in token loss and how much the behavior will 
cost. The more serious the inappropriate behavior is, the greater 
the token loss should be. Clearly, fighting, acting out, or cheat- 
ing should result in greater token loss than minor infractions 
(e.g., out-of-seat behavior or talk-outs). Token loss should never 
be applied to a behavior if the learner does not have tokens. 
Students should not be allowed to go into debt, which would 
likely decrease the reinforcement value of the tokens. A learner 
should always earn more tokens than she loses. 

Several studies have compared the effectiveness of token- 
loss (response cost) or token-earn contingencies. In response 
cost procedures, each participant starts with a number of tokens 
and loses a token on each instance of the problem behavior 


(e.g., Conyers et al., 2004; Donaldson, DeLeon, Fisher, & 
Kahng, 2014; McGoey & DuPaul, 2000). With a token-earn 
procedure, children earn tokens contingent on emitting speci- 
fied desired behaviors or for the absence of problem behavior 
during specified intervals (DRO). Jowett Hirst et al. (2016) com- 
pared the effects of token-earn and token-loss contingencies on 
the on-task performance of students working in individual and 
group contexts. Results showed that both contingencies were 
effective in increasing on-task behavior (see Figure 8.17). When 
given a choice of which contingency would operate during 
each session, five of six children chose the response cost over 
the token earn. 

Because the token-loss contingency required less time 
to implement, teachers who use a token system as a behavior 
management strategy might find arranging loss contingencies to 
be a better option than arranging earn contingencies. Box 28.1: 
“Coupons for Cursing,” describes how a teacher incorporated 
response cost procedure into a classwide token economy. 


Field-Test the System 


The final step before actually implementing a token system is to 
field-test it. For 3 to 5 days, token delivery is tallied exactly as if 
tokens were being earned, but no tokens are awarded during the 
field test. Data from the field test are used for assessment. Are 
learners actually deficient in the targeted skills? Are some learn- 
ers demonstrating mastery of behaviors targeted for interven- 
tion? Are some learners not receiving tokens? Based on answers 
to questions such as these, final adjustments in the system can 
be made. For some learners, more difficult behaviors may need 
to be defined; others may need less demanding target behaviors. 
Perhaps more or fewer tokens need to be delivered relative to 
the price of the backup reinforcers. 


BOX 28.1 


Coupons for Cursing 


During my second year as a middle school teacher for students 
with learning disabilities and behavior disorders, I was plagued 
with a number of students for whom refraining from blurting 
out curse words and expletives was nearly impossible. At the 
first sign of frustration with a seatwork assignment or a nega- 
tive encounter with a classmate, the expletives would roll out 
of their mouths like an “Old Faithful” gusher. Such outbursts 
were contagious and extremely problematic. 

To address this issue, after collecting baseline data on 
the number of curses per morning, I devised a program that 
provided each target student with a stapled pack of 10 index 
cards on which I had written: Coupon: Good for One Free 
Curse. Turn in After Cursing. 

I told the students in individual meetings with each of 
them that since our morning workload kept me busy, I would 
not have time to personally redirect them when they cursed. 


Instead, I told them that when I heard them curse, I would 
look at them and give them a personal signal (i.e., a tug on 
my ear, a tap to my nose, a touch to my shoulder, etc.). Tugs, 
taps, and touches were specific to each student. When the 
student saw that signal, he was to tear off the top coupon 
from the pack of 10 and put it on my desk. Also, he was 
told that all remaining coupons could be added to the pool 
of points that he otherwise would earn for completing aca- 
demic work. In effect, the remaining coupons became “bonus 
tokens” exchangeable for inexpensive, but desirable, items 
that were available on Friday at the “Classroom Store.” The 
program quickly reduced the number of expletives emitted by 
the target students. In effect, they learned that the extra cou- 
pons provided the opportunity to buy more highly desirable 
items that they otherwise would not have been able to afford. 
—Timothy E. Heron 
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Implementing a Token Economy 


Training Staff 


To launch a token economy successfully, practitioners must 
learn all implementation elements. The behavior analyst can 
assist in this process by developing checklists of key operational 
components and teaching these components systematically 
using a direct instruction ““model-lead-test’”” (MLT) procedure. 
The MLT procedure emphasizes the behavior analyst’s role in 
demonstrating, role playing with trainees, conducting supervised 
and independent practice, and providing feedback and rein- 
forcement. Further, and more importantly, the behavior analyst 
must recognize that during initial training, the trainees are on 
a learning curve and will make mistakes. The behavior analyst 
should be prepared to conduct booster training, review sessions, 
or monitoring protocols to ensure that the token system 1s being 
implemented as designed when the trainee is on her own. 

Plavnick, Ferreri, and Maupin (2010) found that two 1-hr 
training sessions on how to implement the steps of a token econ- 
omy did not produce high levels of implementation by three 
teachers in an early childhood special education class. Prior to 
training sessions, teachers implemented virtually none of the 
key elements of the token program. During training, the num- 
ber of token steps executed correctly increased substantially for 
all three teachers. However, during the implementation phase, 
when teachers were conducting the program alone, the number 
of steps decreased markedly. The data show that the teachers 
failed to execute the checklist items they “learned” during train- 
ing. When self-monitoring was introduced, the number of check- 
list items completed accurately increased and remained high. 
Plavnick et al. remind practitioners that additional support and 
monitoring may be needed after initial training to maintain high 
levels of performance. 


Initial Token Training 


The manner in which initial training is conducted to launch a 
token economy depends on the functioning level of the learners. 
For high-functioning learners, or those children with mild dis- 
abilities, initial training might require minimal time and effort 
and consist primarily of verbal instructions or modeling. Usually 
the initial token training for these individuals can be accom- 
plished in one 30- to 60-min session. Three steps are normally 
sufficient. First, an example of the system should be given. The 
practitioner might describe the system as follows: 


This is a token and you can earn it by [specify behavior]. 
I will watch your behavior; and when you [specify 
behavior], you will earn a token. Also, as you continue 
[specify behavior], you will earn more tokens. At [spec- 
ify time period] you will be able to exchange the tokens 
you have earned for whatever you want and can afford on 
this table. Each item is marked with the number of tokens 
needed for purchase. You can spend only the tokens you 
have earned. If you want an item that requires more tokens 
than you have earned, you will have to save your tokens 
over several [specify time period]. 


The second step is to model the procedure for token deliv- 
ery. For instance, each learner, or all group members, might be 
directed to emit the specified behavior. Immediately following 


the occurrence of the behavior, the learner(s) should be praised 
(e.g., “Enrique, I’m pleased to see how well you are working by 
yourself!”) and the token delivered. 

The third step is to model the procedure for token 
exchange. Learners should be taken to the store and shown the 
items for purchase. All learners should already have one token, 
which was acquired during the modeling of token delivery. At 
this time, several items should be able to be purchased for one 
token (the cost may go up later)—a game, 5 minutes of free 
time, a sticker or teacher helper privilege. Students should actu- 
ally use their tokens in this exchange. Lower functioning learn- 
ers may require several sessions of initial token training before 
the system is functional for them. Further response prompts may 
be needed. 


Ongoing Token Training 

During token reinforcement training, the practitioner and the 
students should follow the guidelines for effective use of rein- 
forcement (see Chapter 11). For example, tokens should be dis- 
pensed contingently and immediately after the occurrence of the 
desired behavior. Procedures for delivery and exchange should 
be clear and should be followed consistently. If a booster session 
is needed to improve teacher, parent, or student performance of 
how tokens are dispensed, earned, or exchanged, practitioners 
should deliver it. Finally, the focus should be on building and 
increasing desirable behaviors through token delivery rather 
than decreasing undesirable behaviors through response cost. 


Management Issues During Implementation 


Students must be taught how to manage the tokens they earn. 
For instance, once received, tokens should be placed in a safe, 
but accessible, container so that they are out of the way, but 
readily available when needed. If the tokens are in clear view 
and easily accessible, some students might play with them at the 
expense of performing academic tasks assigned by the teacher. 
Also, placing the tokens in a secure location reduces the risk of 
other students counterfeiting or confiscating them. Preemptive 
measures should be taken to ensure that tokens are not easily 
fabricated or reachable by anyone other than the recipient. If 
counterfeiting or stealing occurs, switching to different tokens 
will help reduce the likelihood of these tokens being exchanged 
under false pretenses. 

Another management issue, however, relates to students’ 
token inventories. Some students may hoard their tokens and not 
exchange them for backup reinforcers. Other students may try to 
exchange their tokens for a backup reinforcer, but they lack the 
requisite number of tokens to do so. Both extremes should be 
discouraged. That is, the program expectation should be that all 
students exchange at least some of their earned tokens periodi- 
cally. Obviously, students without the requisite number of tokens 
should not be permitted to “borrow” against future projected 
earnings so as to participate in an exchange. Stated otherwise, 
they should not be permitted to buy backup reinforcers on credit. 

A final management issue relates to chronic rule break- 
ers, saboteurs, or students who test the system at every turn. 
Practitioners can minimize this situation by (a) ensuring that 
the tokens function as a generalized conditioned reinforcers, 
(b) conducting a reinforcer assessment to determine that the 
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backup items function as reinforcers and are preferred by each of 
the students, and (c) applying alternative procedures for chronic 
rule breakers. Specifically, if an individual student tries to sab- 
otage the system, that student’s program might be modified so 
that tokens are combined with a differential reinforcement of 
alternative behavior procedure, extinction, or brief time-out. 


Withdrawing a Token Economy 


Practitioners may be so encouraged by the effectiveness of the 
token economy they have worked so hard to design and imple- 
ment that they do not want to remove the system. However, 
in most cases, practitioners and analysts should consider how 
they will gradually scale down, and ultimately remove, the pro- 
gram. One goal of the token program should be to have the 
descriptive verbal praise that is delivered simultaneously with 
the token acquire the reinforcing capability of the token. From 
the beginning, a systematic aim of the token economy should be 
to withdraw the program. Such an approach, aside from having 
functional utility for practitioners (i.e., they will not have to 
issue tokens forever), also has advantages for the learner. For 
example, if a special education teacher is using a token econ- 
omy with a student scheduled for full-time placement in a reg- 
ular fourth-grade classroom, the teacher wants to be certain that 
the student’s responses can be maintained in the absence of the 
token economy. 

Various methods have been used to withdraw token rein- 
forcers gradually after criterion levels of behavior have been 
reached. The following six guidelines allow the practitioner to 
develop, and later withdraw, token reinforcers effectively. 

First, the token presentation should always be paired with 
social approval and verbal praise. This should increase the rein- 
forcing effect of the social approval and serve to maintain behav- 
iors after token withdrawal. 

Second, the number of responses required to earn a 
token should be gradually increased. For instance, if a student 
receives a token initially after reading only one page, he should 
be required to read more pages later for token delivery. 

Third, the duration the token economy is in effect should be 
gradually decreased. For example, during September the system 
might be in effect all day; in October, the time might be 8:30 AM 
to 12:00 PM and 2:00 PM to 3:00 PM; and in November, the 
time could be reduced to 8:30 AM to 10:00 AM and 2:00 PM to 
3:00 PM. In December, the times might be the same as those in 
November, but on only 4 days a week, and so on. 

Fourth, the number of activities and privileges that serve 
as backup items and are likely to be found in the untrained set- 
ting should be increased gradually. For example, the analyst 
should start taking away tangible items in the store that might 
not be present in the regular classroom. Are edibles available at 
the store? They are usually not available for reinforcement in 
regular classrooms. Gradually, items should be introduced that 
would be common in a regular class (e.g., special award sheets, 
gold stars, positive notes sent home). 

Fifth, the price of more desirable items should be 
increased systematically while keeping a very low price on less 
desirable items for exchange. For example, in a token system 
with adolescent girls who have moderate to severe intellectual 
disabilities, the prices of candy bars and trips to the canteen and 


grooming aids (e.g., comb, deodorant) were initially about the 
same. Slowly, the cost of items such as candy bars was increased 
to such a high level that the girls no longer saved tokens to pur- 
chase them. More girls used their tokens to purchase grooming 
aids, which cost substantially less than candy. 

Sixth, the physical evidence of the token should be faded 
over time. The following sequence illustrates how the physical 
evidence of the token can be faded. 


¢ Learners earn physical tokens such as poker chips or 
washers. 

¢ The physical tokens are replaced with slips of paper. 

¢ The slips of paper are replaced with tally marks on an 
index card that is kept by the learners. 

¢ In aschool setting, the index card can now be taped to the 
learner’s desk. 

¢ The index card is removed from the learner and is kept 
by the analyst, but participants can check their balance 
at any time. 

¢ The analyst keeps tallies, but no checking is allowed 
during the day. Totals are announced at the end of the 
day, and then every other day. 

* The token system is no longer operative. The behavior 
analyst does not announce point totals even though they 
are still kept. 


Evaluating a Token Economy 


Token economies can be evaluated using any number of reliable, 
valid, field-tested, best-practice designs (e.g., reversal, multiele- 
ment). Given that most token economy programs are conducted 
with small groups, single-subject evaluation designs allow for 
participants to serve as their own control. Applied behavior ana- 
lysts should always strive to evaluate effectiveness at the indi- 
vidual level because despite aggregate group data demonstrating 
effectiveness, individual data may not be so robust (Davis & 
Chittum, 1994). 

The Jowett Hirst et al. (2016) study described earlier is 
informative here. When group data were aggregated, both pro- 
cedures were shown to be equally effective. However, when 
individual data served as the basis of comparison, many stu- 
dents performed better under the response cost contingency (see 
Figure 8.17). Had an individual analysis not been conducted, 
these results would have been masked by the overall group data. 
The authors concluded 


_.. the procedures were differentially effective for some 
individuals in the group, which suggests that analyzing 
individual data is important because these differences may 
not have been observed if we reported only group averages. 
The importance of analyzing individual data is further 
supported by the results of Study 2, which showed dif- 
ferential effects for three participants . . . , whereas the 
overall results suggest that the two procedures are equally 
effective. (pp. 340-341) 


Collecting social validity data before, during, and/or after 
token economy with the target participants and the significant 
others helps to answer the question “Did the result(s) of token 
implementation make a difference for the participant(s)?” Jowett 
Hirst et al.’s (2016) post-study preference assessment showed 
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that the majority of students preferred the response cost con- 
dition, an interesting finding given the negative punishment 
protocol associated with this procedure. Importantly, such a 
finding gives at least slight pause to the generally recommended 
guidance to use “positive” approaches initially to effect behav- 
ior change, when evidence to the contrary shows that for some 
students, a negative punishment procedure may be effective and 
preferred. 


Further Considerations 


Its effectiveness notwithstanding, the token economy does pres- 
ent some challenges to learners and practitioners. 


Intrusive. Token systems can be intrusive. It takes time, energy, 
and resources to establish, implement, and evaluate token pro- 
grams. Also, because most natural environments do not rein- 
force a person’s behavior with tokens, careful thought must be 
given to how to thin the token schedule while simultaneously 
maintaining performance. In any case, token economy programs 
can have lots of “moving parts,” and practitioners must be pre- 
pared to deal with them. 


Self-perpetuating. A token economy can be an effective pro- 
cedure for managing behavior, and analysts can be so encour- 
aged by the results that they do not want to remove the system. 
Learners then continue working for backup reinforcers that may 
not normally be available in the natural environment. 


Cumbersome. Token economies can be cumbersome to imple- 
ment, especially if there are multiple participants with multiple 
schedules of reinforcement. The system may require additional 
time and effort from the learner and the behavior analyst. 

However, even though the effectiveness of the token econ- 
omy has been shown repeatedly, Hackenberg (2018) states there 
is much still to learn: 


Work in the area of token systems is best served by a bi- 
directional interplay between laboratory and applied 
research, where applied questions inspire research on 
basic mechanisms. When based on and contributing to an 
analysis, applied research on token economies can be on 
the leading edge of theoretical advances, helping set the 
scientific research agenda. (393) 


GROUP CONTINGENCIES 


Thus far in this text we have focused primarily on how contin- 
gencies of reinforcement can be applied to individuals. However, 
applied research has also demonstrated how contingencies can 
be applied to groups. Behavior analysts have increasingly turned 
their attention toward group contingencies as interventions in 
such areas as decreasing disruptive behavior in public school 
settings (Stage & Quiroz, 1997), increasing leisure activities of 
adults with traumatic brain injury (Davis & Chittum, 1994), pro- 
moting abstinence from cigarette smoking (Meredith, Grabinski, 
& Dallery, 2011), improving behavior and academic perfor- 
mance of children in schools and classrooms (Skinner, Skinner, 
Skinner, & Cashwell, 1999; Popkin & Skinner, 2003; Skinner, 
Cashwell, & Skinner, 2000; Skinner, Skinner & Burton, 2009), 


and reducing problem behaviors on playgrounds (Lewis, Powers, 
Kelk, & Newcomer, 2002), to name just a few. Each of these 
applications has shown that group contingencies, properly man- 
aged, can be an effective and practical approach to changing the 
behavior of many people simultaneously (Stage & Quiroz, 1997). 
Longitudinal data collected over a 30-year period support the 
proposition that group contingencies with general education and 
exceptional children populations, applied across a myriad of set- 
tings, have been effective (Little, Akin-Little, & O’Neill, 2015). 


Group Contingency Defined 


In a group contingency a common consequence (usually, but 
not necessarily, a reward intended to function as reinforcement) 
is contingent on the behavior of one member of the group, the 
behavior of part of the group, or the behavior of everyone in 
the group. Group contingencies can be classified as dependent, 
independent, or interdependent (Litow & Pumroy, 1975). 


Rationale for and Advantages of 
Group Contingencies 


There are a number of reasons for using a group contingency in 
applied settings. First, it can save time during administration. 
Instead of repeatedly administering a consequence to each mem- 
ber of a group, the practitioner can apply one consequence to all 
members of the group. From a logistical perspective, a practi- 
tioner’s workload may be reduced. A group contingency can be 
effective and economical, requiring fewer practitioners or less 
time to implement. 

Another advantage is that a practitioner can use a group 
contingency in a situation in which an individual contingency 
is impractical (Hanley & Tiger, 2011). For example, a teacher 
attempting to reduce disruptive behaviors of several students 
might have difficulty administering an individual program for 
each pupil in the classroom. A substitute teacher, in particular, 
might find the use of a group contingency a practical alternative 
because her knowledge of the students’ previous histories of 
reinforcement would be limited and the group contingency could 
be applied across a variety of behaviors, settings, or students. 

A group contingency can also be used in cases in which 
the practitioner must resolve a problem quickly, as when serious 
disruptive behavior occurs. The practitioner might be interested 
not only in decreasing the disruptive behavior rapidly but also 
in building improved levels of appropriate behavior. 

Furthermore, a group contingency can capitalize on peer 
influence or peer monitoring because this type of contingency 
sets the occasion for peers to act as change agents (Gable, Arllen, 
& Hendrickson, 1994; Skinner et al., 1999). Admittedly, peer 
pressure can have a detrimental effect on some people; they may 
become scapegoats, and negative effects may surface (Romeo, 
1998). However, potentially harmful or negative outcomes can 
be minimized by structuring the contingency elements randomly 
(Kelshaw-Levering, Sterling-Turner, Henry, & Skinner, 2000: 
Popkin & Skinner, 2003). 

Finally, practitioners can establish a group contingency to 
facilitate positive social interactions and positive behavioral sup- 
ports within the group (Kohler, Strain, Maretsky, & DeCesare, 
1990). For example, a teacher might establish a group contingency 
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for a student or a group of students with disabilities. The students 
with disabilities might be integrated into the general education 
classroom, and a contingency could be arranged in such a way 
that the class would be awarded free time contingent on the per- 
formance of one or more of the students with disabilities. 


Independent Group Contingencies 


An independent group contingency is an arrangement in which 
a contingency is presented to all members of a group, but rein- 
forcement is delivered only to those group members who meet 
the criterion outlined in the contingency (see Figure 28.2). Inde- 
pendent group contingencies are frequently combined with con- 
tingency contracting and token reinforcement programs because 
these programs usually establish reinforcement schedules inde- 
pendent of the performance of other members of the group. 

Brantley and Webster (1993) used an independent group 
contingency in a general education classroom to decrease the 
disruptive behavior of 25 fourth-grade students. After collecting 
data on off-task behavior, call-outs, and out-of-seat behavior, 
the teachers posted rules related to paying attention, seeking the 
teachers’ permission before talking, and remaining in their seats. 
An independent group contingency was established whereby 
each student could earn a checkmark next to his or her name 
on a list that was posted publicly in the room during any of the 
intervals that marked the observation periods during the day. 
When a student emitted an appropriate or prosocial behavior, 
a checkmark was registered. The criterion for earning a reward 
was increased from four to six checkmarks over 4 out of 5 days 
per week. 

Results showed that after 8 weeks, the total number of 
combined disruptions (e.g., off-task behavior, call-outs, and 
out-of-seat behavior) decreased by over 70%, and some off- 
task behaviors (e.g., not keeping hands to self) were eliminated 
completely. The teacher’s satisfaction with the approach was 
positive, and parents reported that they were able to understand 
the procedures that were in place for their children at school. 
Brantley and Webster (1993) concluded: 


The independent contingency added structure for students 
by using clear time intervals and clarified teacher expec- 
tations by limiting and operationally defining rules to be 
followed, monitoring behavior consistently, and setting 
attainable criteria for students. (p. 65) 


A —————> B —_- C 


SR+ for each member 
achieving criterion 


Criterion stated Criterion met 
for all 


group members* 


*(E.g., “Each student who spells 9 out of 10 words 
correctly on the test Friday will earn 10 bonus 
points.”) 


Figure 28.2 An independent group contingency. 


From Hirst, E. S. J., Dozier, C. L., Payne, S. W. (2016). Efficacy of and 
preference for reinforcement and response cost in token economies. 
Journal of Applied Behavior Analysis, 49, 329-345. Reproduced with 


permission of John Wiley & Sons,Inc. 


—a eee 


Criterion stated 
for one 
individual 
or small group* 


Criterion met $+ for whole group 


AEG: “When all students at Table 2 finish their math 
assignments, the class will have 5 minutes of free time.”) 


Figure 28.3 A dependent group contingency. 


From “The Effects of Self-Monitoring on the Procedural Integrity of 

a Behavioral Intervention for Young Children with Developmental 
Disabilities” by J. B., Plavnick, S.J. Ferreri, S. J., and A. N. Maupin, 
2010, Journal of Applied Behavior Analysis, 43, 318. Reproduced with 
permission of John Wiley & Sons,|nc. 


Dependent Group Contingencies 


Under a dependent group contingency, the reward for the 
whole group is dependent on the performance of an individual 
student or small group. Figure 28.3 illustrates the dependent 
group contingency as a three-term contingency. The contin- 
gency operates like this: If an individual (or small group within 
the total group) performs a behavior to a specific criterion, the 
group shares the reinforcer. The group’s access to the reward 
depends on the individual’s (or small group’s) performance. If 
the individual performs below the criterion, the reward is not 
delivered. When an individual, or small group, earns a reward 
for a class, the contingency is sometimes referred to as the hero 
procedure. According to Kerr and Nelson (2002), the hero 
procedure can facilitate positive interactions among students 
because the class as a whole benefits from the improved behay- 
ior of the student targeted for the group contingency. 

Gresham (1983) used an interesting variation of the hero 
procedure within a dependent group contingency whereby the 
contingency was applied at home, but the reward was deliv- 
ered at school. In this study, an 8-year-old boy who was highly 
destructive at home (e.g., set fires, destroyed furniture) earned 
good notes for nondestructive behavior at home. Billy received 
a good note—a daily report card—each day that no destructive 
acts took place. Each note was exchangeable for juice, recess, 
and five tokens at school the next day. After Billy received five 
good notes, the whole class received a party, and Billy served 
as the host. Gresham reported that the dependent group contin- 
gency reduced the number of destructive behaviors at home, 
and represented the first application of a dependent group con- 
tingency in a combined home-school setting. 


Interdependent Group Contingencies 


An interdependent group contingency is one in which all mem- 
bers of a group must meet the criterion of the contingency (indi- 
vidually and as a group) before any member earns the reward 
(Elliot, Busse, & Shapiro, 1999; Kelshaw-Levering et al., 2000; 
Lewis et al., 2002; Skinner et al., 1999; Skinner et al., 2000) (see 
Figure 28.4). Theoretically, interdependent group contingencies 
have a value-added advantage over dependent and independent 
group contingencies insofar as they yoke students to achieve a 
common goal, thereby capitalizing on peer pressure and group 
cohesiveness. 
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A 


Cee en 


Criterion stated Criterion met $+ for whole group 


for all if all 
group members* members of 
the group 


achieve criterion 


*(E.g., “Each student must complete at least four 
science projects by the 6th week in the term in 
order for the class to go on the field trip.) 


Figure 28.4 An interdependent group contingency. 


The effectiveness of dependent and interdependent group 
contingencies may be enhanced by randomly arranging some or 
all components of the contingency (Popkin & Skinner, 2003). 
That is, randomly selected students, behaviors, or reinforcers 
are targeted for the contingency (Kelshaw-Levering et al., 2000; 
Skinner et al., 1999). Kelshaw-Levering and colleagues (2000) 
demonstrated that randomizing either the reward alone or mul- 
tiple components of the contingency (e.g., students, behaviors, 
or reinforcers) was effective in reducing disruptive behavior. 

Procedurally, an interdependent group contingency can be 
delivered (a) when the group as a whole meets the criterion, (b) 
when the group achieves a mean group score (Kuhl, Rudrud, 
Witts, & Schulze, 2015), or (c) based on the results of the Good 
Behavior Game or the Good Student Game. In any case, interde- 
pendent group contingencies represent an “all or none” arrange- 
ment. That is, all students earn the reward or none of them do 
(Popkin & Skinner, 2003). 


Figure 28.5 Frequency of problem behaviors across 
recess periods. Recess 1 was composed of second- and 
fourth-grade students, Recess 2 was composed of first- 
and third-grade students, and Recess 3 was composed of 
fifth- and sixth-grade students. Kindergarten students 
were on the playground across Recess 1 and 2. 

From “Reducing Problem Behaviors on the Playground: An Investigation 
of the Application of School-Wide Positive Behavior Supports” by T. J. 
Lewis, L. J. Powers, M. J. Kelk, and L. L. Newcomer, © 2002. Reproduce 
with permission of John Wiley & Sons Inc. 


Frequency of Problem Behavior 


Total Group Meets Criterion 


Lewis and colleagues (2002) used the total group meets criterion 
variation to reduce the problem playground behaviors of students 
enrolled in a suburban elementary school. After a faculty team 
conducted an assessment of problematic playground behaviors, 
social skill instruction in the classroom and on the playground was 
coupled with a group contingency. During social skills instruction, 
students learned how to get along with friends, cooperate with 
each other, and be kind. During the group contingency, students 
earned elastic loops they could affix to their wrists. After recess, 
students placed the loops in a can on the teacher’s desk. When 
the can was full, the group earned a reinforcer. Figure 28.5 shows 
the results of the social skills plus group contingency intervention 
across three recess periods during the day. 


Group Averaging 

Baer and Richards (1980) used a group averaging interdepen- 
dent group contingency to improve the math and English perfor- 
mance of 5 elementary-aged students. In their study, all of the 
students in a class of 10, including the 5 target students, were 
told they would earn | extra minute of recess for each point of 
class improvement beyond the previous weekly average. Also, 
all students were given a contract stating this same contingency. 
The extra recess was awarded every day of the following week. 
For example, if the students’ weekly averages exceeded their 
previous weekly averages by 3 points, they would receive 
3 minutes of extra recess every day of the following week. The 
results of this 22-week study showed that all students improved 
when the group contingency was in effect. 
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Kuhl et al. (2015) used an interdependent group contin- 
gency as part of a comparison study on the effects of group 
versus individual goal setting on the increased level of physical 
activity of 4 selected students (2 males and 2 females) across 
two classrooms with total class populations of 25 to 30 students, 
respectively. In their study, students received a pedometer that 
registered their total number of steps during a 24-hour period, 
4 days per week (Monday through Thursday). Prior to launching 
the study, teachers selected five activities from a list of 20 potential 
items thought to be reinforcing to the students. Ultimately, from 
that list of five activities, students voted for 15 minutes of extra 
free time as their reinforcer. Recess was awarded on Fridays. 

A counterbalanced reversal design (ABACX) was used 
where X served as the best phase. During baseline, students 
received a calibrated pedometer, set to zero steps, which the stu- 
dents wore for a 24-hour period, excluding time spent bathing/ 
swimming, or sleeping, over the course of the study. During 
baseline, students could not see the number of steps taken, as 
the pedometer was masked. Data were recorded daily, 4 days 
per week. During the Classroom Cumulative Total Condition, 
a cumulative total goal was determined by averaging each 
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participant’s daily step count, and then adding 1500 steps per 
student per day to that average.' Fifteen minutes of extra free 
time was awarded on Friday if the cumulative total of steps 
exceeded the goal. Pedometers were unmasked in this condition. 
Each day, the teachers publically posted cumulative daily step 
counts and provided verbal feedback to show progress toward 
their goal. When the Individual Goals Condition was introduced, 
the children were given their unmasked pedometers, but they 
met with the teacher, who informed them of their individual 
goal, which was determined by averaging the previous 4 days 
of data and adding 1500 steps to that figure. If a student met his 
goal, he was praised. If the goal was not met, the teacher encour- 
aged the student to keep trying to meet the goal. Extra free time 
was administered on Friday if 80% of the students met their 
individual goal each day. Finally, a feedback schedule-thinning 
phase was introduced. During the 2-day condition, step count 
and recording occurred on only 2 days. During the 4-day con- 
dition, counts and recordings occurred on only | day per week. 
Reinforcement conditions remained the same. 

The results of the study (see Figure 28.6) showed that 
step counts increased under both the cumulative and individual 
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contingencies; however, higher step counts were registered 
under the individual contingency regardless of its order during 
counterbalancing. Anecdotal evidence, collected after the 
study, suggested that under the individual contingency students 
engaged in extra walking practices that otherwise may not have 
occurred. That is, they walked in the gym before school, played 
action games with peers after school, and had parents drop them 
off farther from school each day to increase step counts. The fact 
that step counts during the thinning phase were still higher than 
baseline, but lower than during the individual phase, suggested 
the important role of daily feedback in establishing and main- 
taining increased step counts. 


Good Behavior Game 


Barrish, Saunders, and Wolf (1969) first investigated the Good 
Behavior Game (GBG) as an interdependent group contin- 
gency to address a variety of student classroom disruptive 
behaviors. Procedurally, to play the GBG a class (or group) 


Figure 28.7 Percentage of 1-min intervals containing 
talking-out and out-of-seat behaviors in a classroom 

of 24 fourth-grade children during math and reading 
periods. 

From “Good Behavior Game: Effects of Individual Contingencies 

for Group Consequences on Disruptive Behavior in a Classroom” by 
H. H. Barrish, M. Saunders, and M. M. Wolf, 1969, Journal of Applied 
Behavior Analysis, 2, p. 122. Reproduced with permission of John 
Wiley & Sons,Inc. 


Percentage of Intervals in which Behavior Occurred 


Math Period 


Reading Period 


is divided into two or more teams. Prior to the game being 
played, the teams are told that whichever team has the fewest 
marks against it at the end of the game will earn a reward. Each 
team is also told that it can win a reward if it has fewer than a 
specified number of marks (a differential reinforcement of low 
rates [DRL] schedule). Data reported by Barrish et al. show 
that this strategy can be an effective method of reducing dis- 
ruptive behavior in the classroom. When game conditions were 
in effect during math or reading, talking-out and out-of-seat 
behaviors occurred at low levels. When game conditions were 
not in effect, disruptive behaviors occurred at much higher lev- 
els (see Figure 28.7). 

In the Good Behavior Game, teacher attention is directed 
toward observing and recording occurrences of misbehavior, 
with the incentive that if one or more teams have fewer than 
the criterion number of infractions, a reinforcer is delivered. 
The advantage of the Good Behavior Game is that competition 
can occur within teams, across teams, or against a criterion. 
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Since Barrish et al.'s groundbreaking study, a number of 
researchers have investigated the GBG across a broad spectrum of 
settings and age groups, and in combination with other procedures 
(for reviews, see Little, Akin-Little, & O’Neill, 2015; Tingstrom, 
Sterling-Tuner, & Wilczynski, 2006). Further, several investigators 
have demonstrated its effectiveness as a preventative measure 
(Embry, 2002; Lannie & McCurdy, 2007; Van Lier, Van Der Sar, 
Muthen, & Crijen, 2004). By implementing a successful GBG 
program with children who are young—but disruptive and out-of- 
control—the goal is to “prevent” even worse outbursts in middle 
or high school by presumably socializing students at an early age 
to expectations, rules, and contingencies of reinforcement. 

Donaldson, Vollmer, Krous, Downs, and Berard (2011), 
for example, conducted a replication of the Good Behavior 
Game with five teachers across their respective kindergarten 
classes, each experiencing disruptive students during morning 
group work that interfered with the goals of the lesson. After 
collecting baseline data on the total rate of disruptive behaviors, 
the class was divided into two teams, each containing, based 
on the teacher’s judgment, a comparable number of disrup- 
tive students. During the GBG, teachers provided rules related 
to the children staying on their carpet patches, keeping their 
hands and feet to themselves, and speaking when called upon. 
If a disruptive behavior occurred, the experimenter—and later 
the teacher—placed a mark on the board and simultaneously 
announced aloud what rule was broken by a member of that 


18 Baseline Good Behavior Game 


Teacher 
Implementation 


Experimenter 
Implementation 


team. One team won the game by having fewer points than the 
other team, or both teams won if they both met a criterion set by 
the teacher of an 80% or better reduction from baseline levels 
of misbehavior. 

Figure 28.8 shows that under the GBG contingency, dis- 
ruptive behavior decreased across all classrooms. Further, low 
levels of disruptive behavior continued after the experimenter 
shifted game implementation to the teacher, despite treatment 
integrity data indicating that teachers did not implement all pro- 
cedures as trained. In short, even though implementation steps 
were likely compromised due to teachers simultaneously con- 
ducting an academic lesson, results were still robust. 

Finally, teachers reported anecdotally that the children, 
when offered a choice, “voted” to continue playing the game 
for the rest of the year, and four teachers continued the game 
the following year. Donaldson et al. (2011) suggest that the 
(a) rewards that the students earned for winning may have served 
as social reinforcers, (b) hash marks on the board for disruptive 
behavior may have served as punishers, or (c) children’s social 
interplay during the game (or later) may have affected the over- 
all outcome. In any case, this study showed the powerful effects 
of the GBG with a very young population of students. 

Kleinman and Saigh (2011) also conducted a successful 
replication study of the GBG with a class of extremely disruptive 
high school-level students attending a multiethnic public high 
school in a large urban area (Harlem, New York). Students were 
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kindergarten students as a function of the Good 
Behavior Game implemented by the experimenter 
and classroom teachers. 

From “An Evaluation of the Good Behavior Game in Kindergarten 
Classrooms,” by J. M. Donaldson, T. R. Vollmer, T. Krous, 

S. Downs, and K. P. Berard, 2011, Journal of Applied Behavior 
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Sessions 


50 Analysis, 44, pp. 605-609. Reproduced with permission of John 
Wiley & Sons,|nc. 
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observed to be so highly disorderly, with out-of-seat behavior, 
calling out, and physical altercations in class, that a previous 
teacher resigned her position. The new teacher’s reception was 
met with cursing, screaming, and throwing objects. To address 
this situation, the new teacher conducted a reinforcer prefer- 
ence phase that was used to determine rewards that could be 
delivered given the ages of the students and budget constraints 
imposed by circumstances. A reversal design was employed 
that involved a short adaptation period to acclimate the students 
to observers in the class, followed by a Baseline I, GBG, Base- 
line Il, and GBG 2. 

During Baseline I, students were divided into two teams 
with approximately equal numbers of disruptors on each 
team. Further, the teacher posted a list of “expectations” for 
student behavior on the front board and read them before each 
session. Finally, all disruptions were handled in the past man- 
ner (e.g., teacher reprimands, removal from the classroom). 
During GBG 1, the teacher announced to the students that the 
teams would now compete with each other for daily and weekly 
prizes. They were told that any occurrence of defined target 
behaviors would result in the disruptor being named aloud and 
a mark placed against that disruptor’s team. At the end of the 
period, the team with fewer marks would earn a prize (e.g., 
bite-sized candy). A weekly prize (i.e., a pizza party, cupcakes) 
would be awarded to the team with fewer marks for the week. 
Baseline II conditions replicated Baseline I. GBG 2 replicated 
GBG 1. A follow-up phase was conducted several weeks after 
the study’s completion. Results indicated that when baseline 
conditions were in effect, extremely high levels of disruptive 
behavior occurred. When GBG conditions were in effect, all 
levels of disruptive behavior decreased markedly, and remained 
so during a successive follow-up phase (see Figure 28.9). 

Kleinman and Saigh concluded that the GBG (a) reduced 
the disruptive target behaviors rapidly, (b) provided strong exter- 
nal validity of the efficacy of the GBG for older students, and 
(c) produced convincing social validity approval data from stu- 
dents in an inner-city, multiethnic setting. That is, the students 


Baseline | 


Figure 28.9 Disruptive behavior 
by general education high school 
students as a function of the 
Good Behavior Game. 

From “The Effects of the Good Behavior 
Game on the Conduct of Regular 
Education New York City High School 
Students,” by K. E. Kleinman and 
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themselves reported that they could observe the dramatic effects 
of improved behavior during GBG phases. 


Good Student Game 


The Good Student Game combines an interdependent group con- 
tingency (like the Good Behavior Game) with an added self- 
monitoring twist (Babyak, Luze, & Kamps, 2000). Basically, the 
Good Student Game is intended for implementation during inde- 
pendent seatwork periods when problematic behaviors surface. In 
the Good Student Game, the teacher (a) chooses target behaviors 
to modify, (b) determines goals and rewards, and (c) determines 
whether group or individual monitoring (or both) will occur. 

Students are trained in the Good Student Game using a 
model-lead-test instructional sequence: Students are arranged 
in clusters of four or five, the target behaviors are defined, 
examples and nonexamples are provided, practice occurs under 
teacher supervision, and one or more students record their own 
or the group’s performance. Table 28.2 shows how the Good 
Student Game compares to the Good Behavior Game. Note 
that two distinctions relate to target behaviors, the delivery of 
rewards, and feedback. 


Guidelines for Implementing 
a Group Contingency 


Implementing a group contingency requires as much preplan- 
ning as any other behavior change procedure. Presented here are 
six guidelines to follow before and during the application of a 
group contingency. 


Choose an Effective Reward 


One of the more important aspects of a group contingency is the 
strength of the consequence; it must be strong enough to serve 
as an effective reward. Practitioners are advised to use general- 
ized conditioned reinforcers or reinforcer menus at every oppor- 
tunity. Both of these strategies individualize the contingency, 
thereby increasing its power, flexibility, and applicability. 


Baseline II Re-introduction — Follow-up 


—-*—- Talk or verbal 
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TABLE 28.2 Components of the Good Behavior Game and the Good Student Game 


Component Good Behavior Game 


Organization Students play in teams. 
Management 
Target behaviors 

following. 
Recording 

behaviors as they occur. 
System of reinforcement Positive. 
Criterion for reinforcement 

rule-breaking behaviors. 
Delivery of reinforcement 
Feedback 


rule-breaking behaviors occur. 


Teacher monitors and records behavior. 


Behaviors are stated as rule breaking or rule 


Teachers record incidents of rule-breaking 


Teams must not exceed a set number of 


Dependent on group performance. 
Teacher provides feedback when 


Good Student Game 
Students play in teams or as individuals. 
Students self-monitor and record their behavior. 


Behaviors are stated as rule following. 


Students record rule-following behaviors on a 
variable-interval schedule. 


Positive. 


Groups or individuals achieve or exceed a set 
percentage of rule-following behaviors. 


Dependent on individual or group performance. 


Teacher provides feedback at intervals. Praise and 
encouragement are presented during the game to 
reinforce positive behaviors. 


Republished with permission of PRO ED, Inc, from The Good Student Game: Behavior Management for Diverse Classrooms, A. E. Babyak, G. J. Luze, and 
D. M. Kamps, 2000, Intervention in School and Clinic, 35(2), p. 217. Permission conveyed through Copyright Clearance Center, Inc. 


Determine the Behavior to Change and Any Collateral 
Behaviors That Might Be Affected 


Let us suppose a dependent group contingency is established 
in which a class receives 10 minutes of extra free time, contin- 
gent on the improved academic performance of a student with 
developmental disabilities. Obviously, the teacher will need to 
collect data on the student’s academic performance. However, 
data might also be collected on the number of positive inter- 
actions between the student and her classmates without dis- 
abilities within and outside the room. An additional benefit of 
using a group contingency might be the positive attention and 
encouragement that the student with developmental disabilities 
receives from her classmates. 


Set Appropriate Performance Criteria 


If a group contingency is used, the persons to whom the contin- 
gency is applied must have the prerequisite skills to perform the 
specified behaviors. Otherwise, they will not be able to meet the 
criterion and could be subject to ridicule or abuse (Stolz, 1978). 

According to Hamblin, Hathaway, and Wodarski (1971), 
the criteria for a group contingency can be established by using 
the average-, high-, or low-performance levels of the groups 
as the standard. In an average-performance group contingency, 
the mean performance of the group is averaged, and reinforce- 
ment is contingent on the achievement of that mean score or 
a higher score. If the average score for a math exercise was 
20 correct problems, a score of 20 or more would earn the 
reward. In a high-performance group contingency, the high 
score determines the level of performance needed to receive 
the reward. If the high score on a spelling test was 95%, only 
students achieving a score of 95% would receive the reward. In 
the low-performance group contingency, the low-performance 
score determines the reinforcer. If the low score on a social 
studies term paper was C, then students with a C or better would 


receive the reinforcer. 


Hamblin et al. (1971) indicated that differential effects 
can be noted with these performance contingencies. Their data 
show that academically challenged students performed the worst 
under a high-performance contingency, whereas gifted students 
performed best under this contingency. The data of Hamblin and 
colleagues suggest that a group contingency can be effective in 
improving behavior, but it should be applied with the understand- 
ing that effectiveness may vary for different members of the group. 


Combine with Other Procedures When Appropriate 


According to LaRowe, Tucker, and McGuire (1980), a group 
contingency can be used alone or in combination with other 
procedures to systematically change performance. The LaRowe 
and colleagues study was designed to reduce excessive noise 
levels in an elementary school lunchroom; their data suggest that 
differential reinforcement of low rates of behavior (DRL) can 
be incorporated easily into a group contingency. In situations in 
which higher levels of group performance are desired, differen- 
tial reinforcement of high rates of behavior (DRH) can be used. 
With either DRL or DRH, the use of a changing criterion design 
may facilitate the analysis of treatment effects. 


Select the Most Appropriate Group Contingency 


The selection of a specific group contingency should be based on 
the overall programmatic goals of the practitioner, the parents (if 
applicable), and the participants whenever possible. For instance, 
if the group contingency is designed to improve the behavior of 
one person or a small group of individuals, then perhaps a depen- 
dent group contingency should be employed. If the practitioner 
wants to differentially reinforce appropriate behavior, then an 
independent group contingency should be considered. But if the 
practitioner wants each individual within a group to perform at a 
certain level, then an interdependent group contingency should be 
chosen. Regardless of which group contingency is selected, the 
ethical issues discussed in Chapter 31 must be addressed. 
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Monitor Individual and Group Performance 


When a group contingency is employed, practitioners must 
observe group and individual performance. Sometimes a 
group’s performance improves, but members within the group 
do not improve, or at least do not improve as fast (Jowett Hirst 
et al., 2016). Some members of the group might even attempt 
to sabotage the group contingency, preventing the group from 
achieving the reinforcement. In these cases, individual contin- 
gencies should be arranged for the saboteurs that might combine 
the group contingency with other behavior reduction or behavior 
building procedures. 


Future Applications of Group-Oriented 
Contingencies 


Abundant evidence confirms that group-oriented contingen- 
cies have demonstrated effectiveness in classroom settings for 
individuals and groups, and across a range of instructional, 
therapeutic, and play-based settings (for a review, see Maggin, 
Johnson, Chafouleas, Ruberto, & Berggren, 2012). However, 
despite over 50 years of empirical demonstrations of their 
effectiveness, group contingencies, particularly the Good Behav- 
ior Game and its variations, have not been widely adopted, with 
an ever-wider range of populations yet to be explored (Vollmer, 
Joslyn, Rubow, Kronfli, & Donaldson, 2016). 

Researchers might also turn their attention to evaluating 
group-oriented contingencies on larger issues facing society 
(e.g., global warming, recycling, reducing armed conflict, drug 
abuse). Some initial efforts using Internet-based group contin- 
gences have led to promising results with behaviors known to 
be systemic and addictive (Dallery, Raiff, & Grabinski, 2013; 
Dallery, Meredith, Jarvis, & Nuzzo, 2015). 


CONTINGENCY CONTRACTING 


A contingency contract (also called a behavioral contract) is 
a document that specifies a contingent relationship between the 
completion of a target behavior and access to, or delivery of, a 
specified reward. Contingency contracts stipulate how two or 
more people will behave toward each other. Such quid pro quo 
agreements make one person’s behavior (e.g., preparing dinner) 
dependent on the other person’s behavior (e.g., washing and 
putting away the dishes by a prescribed time the night before). 
Although verbal agreements may be considered contracts in 
the legal sense, they are not contingency contracts because the 
degree of specificity in designing, implementing, and evaluating 
a contingency contract far exceeds what is likely to occur in a 
verbal arrangement between parties. In addition, the act of sign- 
ing a contingency contract and its prominent visibility during 
execution are integral parts of contingency contracts. 
Contingency contracts have been used to improve academic 
performance (Wiikinson, 2003), weight control (Solanto, Jacobson, 
Heller, Golden, & Hertz, 1994), adherence to medical regimens 
(Miller & Stark, 1994), and athletic skills (Simek, O’Brien, & 
Figlerski, 1994). A compelling advantage of contingency con- 
tracts is their ability to be implemented alone or in packaged pro- 
grams that incorporate two or more interventions concurrently 


(De Martini-Scully, Bray, & Kehle, 2000). In their meta-analysis 
of single-case research on behavior contracts, Bowman-Perrott, 
Burke, deMarin, Zhang, and Davis (2015) concluded that contin- 
gency contracts are beneficial for children and youth “regardless 
of grade level, gender, or disability status” (p. 247); are easily 
used by a wide range of practitioners; and can be combined with 
other behavior change interventions (e.g., De Martini-Scully et 
al., 2000; Navarro, Aguilar, Aguilar, Alcalde, & Marchena, 2007). 


Contingency Contract Components 


A contingency contract specifies the task and who is to perform 
it and the reward and who is to deliver it contingent on task com- 
pletion. Figure 28.10 shows a contingency contract between a 
10-year-old boy and his parents to help him get ready for school 
on time in the morning. This contract includes the two compo- 
nents required in all contingency contracts: task description and 
reward description—plus an optional task record. 


Task 


The task component consists of four parts. Who is the person 
who will perform the task and receive the reward—in this case, 
Lee. What is the task or behavior Lee must perform—getting 
ready for school. When identifies the time that the task must be 
completed—every school day. How Well is the most important 
part of the task component, and perhaps of the entire contract. 
It calls for the specifics of the task. Sometimes it is helpful to 
list a series of steps or subtasks so that the person can use the 
contract as a checklist of what must be done. Including a photo 
showing what the completed task should look like is helpful for 
some tasks (e.g., cleaning one’s bedroom, setting the table for 
dinner). Any exceptions should be written in this part. 


Reward 


The reward side of a contract must be as complete and spe- 
cific as the task side. Teachers and parents are usually good at 
specifying the task component of a contract; they know what 
they want their students or children to do. When it comes to the 
reward, however, specificity is lost and problems arise. Reward 
statements such as “Can watch some television” or “Will play 
catch when I get a chance” are not explicit, specific, or fair to 
the person completing the task. 

On the reward side, Who is the person who will judge task 
completion and deliver the reward. With Lee’s getting-ready-for- 
school contract, his mother is that person..What is the reward. 
When specifies the time that the reward can be received by the 
person earning it. With any contract it is crucial that the reward 
come after successful task completion. However, many rewards 
cannot be delivered immediately following task completion. In 
addition, some rewards have built-in, limited availability and can 
be delivered only at certain times (e.g., seeing the hometown 
baseball team play). Lee’s contract specifies that his reward, 
if earned, will be received on Friday nights. How Much is the 
amount of reward that can be earned by completing the task. 
Any bonus contingencies should be included; for example, “By 
meeting her contract Monday through Friday, Ellie will earn 
bonus rewards on Saturday and Sunday.” 
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= 


How Well: __Out of bed, dressed, cereal finished 
by 7:15. No more than 1 reminder from 
Mom or Dad. 


#8 
CONTRACT 
TASK REWARD 
Who: _Lee Who: _Mom 
What: __G¢t ready for school What: _Lee has a friend stay over 
When: __Each school day When: _Ftiday night after perfect week 


How Much: Friend can come home with 


Lee after school and stay overnight. 


Boys get pizza for late night treat. 


Sign Here: Lee Behrle 


Date: _O¢t. 15, 2019 


Sign Here: Audrey Behrle 


TASK RECORD 


nate Oct. 15, 2019 


M T W | TH F M | il W 
x \* KiKi k\|k\* 
oops! Grent wore, Lee 


TH F E T W TH F 
kl * “ne klki* 
E val ~ teal 


le 


Figure 28.10 Example of a contingency contract. 


Adapted from "Sign Here: A Contracting Book for Children and Their Parents” (2nd ed.) by J. C. Dardig and 
W. |. Heward, 2016. Copyright 1995 by J. C. Dardig and W. L. Heward. Used with Permission. 


Task Record 


Including a place on the contract to record task completion 
serves two purposes. First, recording task completion and reward 
delivery on the contract sets the occasion for all parties to review 
the contract regularly. Second, if a certain number of task com- 
pletions are required to earn the reward, a checkmark, smiley 
face, or star can be placed on the task record each time the task 
is completed successfully. Marking the contract in this manner 
helps the person remain focused until the assignment is com- 
pleted and the reward is earned. Lee’s mother used the top row 
of boxes in the task record to record the days of the school week. 
In the middle row of boxes she placed gummed stars each day 
Lee met the conditions of his contract. In the bottom row, Lee’s 
mother wrote comments about the progress of his contract. 


Applications of Contingency Contracting 


Contracting in the Classroom 


Teachers have employed contracting to address specific disci- 
pline, performance, and academic challenges (De Martini-Scully 
et al., 2000; Flood & Wilder, 2002; Kehle, Bray, Theodore, 
Jenson, & Clark, 2000; Mruzek, Cohen, & Smith, 2007; Ruth, 
1996). Newstrom and colleagues (1999), for instance, used a 
contingency contract with a middle school student with behav- 
ior disorders to improve the written mechanics associated with 
spelling and written language. After collecting baseline data on 
the percentage of capitalization and punctuation marks used 
correctly across spelling and sentence writing, a contingency 


contract was negotiated with and signed by the student that spec- 
ified that improved performance would yield free time on the 
classroom computer. The student was reminded of the terms of 
the contract before each language arts class that included spell- 
ing worksheets and journal writing (i.e., sentences). 

Figure 28.11 shows the results of the contingency con- 
tracting intervention. When baseline was in effect for spelling 
and written sentences respectively, mean scores for both vari- 
ables were in the 20% correct range. When contracting was 
initiated, the student’s performance for spelling and written sen- 
tences increased immediately to an average of approximately 
84% correct. The authors also reported positive anecdotal evi- 
dence related to spelling and written language from other teach- 
ers with whom this student interacted. 

Wilkinson (2003) used a contingency contract to reduce 
the disruptive behavior of a first-grade student. Disruptive 
behaviors included being off task, refusals to comply with work 
assignments and instructions, fighting with peers, and temper 
tantrums. A behavioral consultation effort was launched with 
the classroom teacher that included problem identification, 
analysis, intervention, and evaluation. Contingency contracting 
consisted of the student earning preferred rewards and social 
praise from the teacher for three behaviors: increased time on 
task, appropriate interactions with other children, and compli- 
ance with teacher requests. Observations of her behavior over 
13 sessions of baseline and contingency contracting showed a 
decrease in the percentage of intervals with disruptive behavior 
when the contingency contract was in effect. Wilkinson reported 
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Contingency Contracting 
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Figure 28.11 Percentage of capitalization and punctuation 
marks used correctly on spelling worksheets and journal 
writing during baseline and contingency contracting. 


From “The Effects of Contingency Contracting to Improve the Mechanics 
of Written Language with a Middle School Student with Behavior 
Disorders,” by J. Newstrom, T. F. McLaughlin, and W. J. Sweeney, 1999, 
Child & Family Behavior Therapy, 21(1), p. 44. Copyright 1999 by The 
Haworth Press, Inc. Reprinted by permission. 


that the student’s disruptive behavior decreased substantially and 
remained low during a 4-week follow-up period. 

Ruth (1996) conducted a 5-year longitudinal study that 
blended contingency contracting with goal setting with students 
who had emotional and behavioral problems. After students 
negotiated contracts with their teachers, a goal-setting compo- 
nent was added that included statements about their daily and 
weekly goals and the criterion levels for success. Results for 
the 37 out of 43 students who finished the program after 5 years 
showed that 75% of daily goals, 72% of weekly goals, and 86% 
of total goals were reached. Ruth summarized the beneficial 
effects of combining strategies: “When [goal-setting] methods 
are incorporated into a contract, the motivational aspects of 
behavior contracting and goal setting may combine to produce 
maximum effort and success” (p. 156). 


Contracting in the Home 


Miller and Kelley (1994) combined contingency contracting 
and goal setting to improve the homework performance of four 
preadolescent students with histories of poor homework com- 
pletion and who were at risk for other academic problems (e.g., 
procrastination, being off task, errors with submitted work). 
During baseline, parents recorded their children’s homework 
time on task, type and accuracy of problem completion, and 
number of problems completed correctly. Subsequently, parents 
and children entered into a goal setting and contingency con- 
tracting phase that was preceded by parent training on how to 
set and negotiate goals and write contracts. Each night, parents 
and children established their respective goals and negotiated 


a compromise goal based on that interaction. Each week, rene- 
gotiations occurred for tasks, rewards, and sanctions should the 
contract not be met. A recording sheet was used to measure 
progress. 

Figure 28.12 shows the results of the study. When goal 
setting was combined with contingency contracting, accuracy 
performance increased for all students. Miller and Kelley’s 
findings reaffirm the notion that contingency contracting can 
be combined successfully with other strategies to produce func- 
tional outcomes. 


Clinical Applications of Contracting 


Flood and Wilder (2002) combined contingency contracting 
with functional communication training to reduce the off-task 
behavior of an elementary-aged student diagnosed with attention- 
deficit hyperactivity disorder (ADHD) who had been referred 
to a clinic-based program because his off-task behavior had 
reached alarming levels. Antecedent assessment, functional 
communication training, and contingency contracting were con- 
ducted in a therapy room located in the clinical facility. Specif- 
ically, antecedent assessments were conducted to determine the 
level of off-task behavior when the difficulty of academic tasks 
varied from easy to difficult and therapist attention varied from 
low to high. A preference assessment was also conducted. Using 
discrete trial training, the student was taught to raise his hand 
for assistance with tasks (e.g., “Can you help me with this prob- 
lem?”). The therapist sat nearby and responded to appropriate 
requests for assistance and ignored other vocalizations. Once 
requesting assistance was mastered, the contingency contract 
was established whereby the student could earn preferred items, 
identified through the assessment, contingent on accurate task 
completion. The results showed that during baseline, off-task 
performance was high for math division and word problems. 
When the intervention was introduced, an immediate reduction 
in off-task behavior was noted for division and word prob- 
lems. Also, the student’s accuracy in completing the division 
and word problems improved as well. Whereas during baseline 
conditions he solved correctly 5% and 33% of the division and 
word problems, respectively, during intervention he solved 24% 
and 92% of the problems correctly. 


Using Contracting to Teach Self-Management 
Skills to Children 


Ideally, contingency contracting involves the active participa- 
tion of the child throughout the development, implementation, 
and evaluation of the contract. For many children, contracting is 
a first experience in identifying specific ways they would like to 
act and then arranging certain aspects of their environment to set 
the occasion for and reward those acts. If more of the decision 
making for all parts of the contracting process is turned over to 
children gradually and systematically, they can become skilled 
at self-contracting. A self-contract is a contingency contract 
that a person makes with herself, incorporating a self-selected 
task and reward as well as personal monitoring of task comple- 
tion and self-delivery of the reward. Self-contracting skills can 
be achieved by a multistep process of having an adult prescribe 
virtually all of the elements of the task and reward and gradually 
shifting the design of the elements to the child. 


5 f : 
Chapter 28 * Token Economy, Group Contingencies, and Contingency Contracting 727 


BL GS + CC BL GS + CC 


100 
20 


80-4 


' 


w 
oO 
{ba 


wy Cy sa 
eS: © <> 
Fo ie 


f 


Percentage of Accuracy 
ce) 
| 


> 
o oOo OS 
Se, —e) © 
| | J 

1 
‘t 


Fe tO I el aa 


50 


40 
x0 


oe te ee ee ee ee 


(] 
1 
' 
1 
' 
' 
f 
1 
n 
' 
' 
1 
' 
' 
' 
' 
' 
1 
' 
' 
' 
1 
' 
' 


Ann 


= 
0 
2 th 6 8 AQ A Aa ie is 


Sessions 


How Do Contracts Work? 


At first glance the principle of behavior behind contingency con- 
tracts seems deceptively simple: A behavior is followed by a con- 
tingent reward—surely a case of positive reinforcement. Yet, in 
most contracts the reward, although contingent, is too delayed to 
reinforce task completion directly; and many successful contracts 
specify a reward that would not function as a reinforcer for the 
task even if it was presented immediately after the task comple- 
tion. Further, behavioral contracting is not a single procedure with 
a single associated behavior and a single reinforcer. Contracting 
is more accurately conceptualized as an intervention package that 
combines several behavior principles and procedures. 


lc LU lps lca alc a il 
AQ. 22 ML PG 3 


Figure 28.12 Percentage of homework problems 
completed accurately during baseline and a treatment 
condition consisting of goal setting and contingency 
contracting. Sessions correspond to sequential school 
days (i.e., Mondays through Thursdays) on which 
subjects were assigned homework. Data were not 
collected on days on which homework was not 
assigned. 

Erom “The Use of Goal Setting and Contingency Contracting for 
Improving Children’s Homework Performance” by D. L. Miller & 
M. L. Kelley, 1994, Journal of Applied Behavior Analysis, 27, 

p. 80. Reproduced with permission of John Wiley & Sons,|nc. 


So, how do contracts work? Several principles, proce- 
dures, and factors are likely to apply. Certainly, reinforcement 
is involved, but not in as simple or direct a fashion as it might 
seem at first. Rule-governed behavior is probably involved 
(Malott & Shane, 2014; Skinner, 1969; see Chapter 11). A con- 
tract describes a rule: A specified behavior will be followed by a 
specified (and reasonably immediate) consequence. The contract 
itself serves as a response prompt to perform the target behay- 
ior and enables the effective use of a consequence (e.g., going 
to the movies Saturday night), too delayed to reinforce certain 
behaviors (e.g., practicing the trumpet on Tuesday). Delayed 
consequences can help exert control over behaviors performed 
hours and even days before if they are associated with and linked 
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by verbal behavior to the rule (e.g., “I’ve just finished my trum- 
pet practice—that’s another checkmark toward the movies on 
Saturday”) or to interim token reinforcers (e.g., the checkmark 
on the contract after practicing). The physical visibility of the 
contract may also function as a response prompt for escaping 
“ouilt” (Malott & Shane, 2014). 


Developing Contingency Contracts 


Although teachers, therapists, or parents can unilaterally deter- 
mine a contract for a child or client, contracting is usually more 
effective when all of the parties involved play an active role 
in developing the contract. Contract development involves the 
specification of tasks and rewards in a fashion agreeable and 
beneficial to each party. Dardig and Heward (2016) described 
a five-step procedure for identifying tasks and rewards that can 
be used by teachers and families.” 


Step 1: Hold a meeting. To get the entire group (family or 
class) involved in the contracting process, a meeting should be 
held. At this meeting, members can discuss how contracts work, 
how they can help the group cooperate and get along better, 
and how contracts can help individuals meet personal goals. 
Parents or teachers should emphasize that they will participate 
in all of the steps leading up to and including implementation 
of contracts. It is important that children view contracting as 


Figure 28.13 A form for self-identification of possible tasks for contingency contracts. 


List A Name: Charlotte 
THINGS | DO TO HELP MY FAMILY 
. Entertain us with skits 
. Clean up my bedroom 


. Practice my violin 
. Wash dishes 


. Follow morning routine 


a behavior-exchange process shared by all members of the 
group, not as something adults impose on them. The field-tested 
list-making procedures described in the following steps provide 
a simple and logical framework for the selection of tasks and 
rewards for family and classroom contracts. Most groups can 
complete the procedure within | to 2 hours. 


Step 2: Complete List A. Each member completes three 
lists prior to the actual writing of the contract. List A (see 
Figure 28.13) is designed to help each member identify not only 
those tasks he can perform within the context of a contract, but 
also those tasks he already does to help the group. In this way, 
positive attention can be focused on appropriate behaviors that 
individual members are currently completing satisfactorily. 

Each member should be given a copy of List A. Everyone 
should be careful to describe all tasks as specifically as possible. 
Then the completed lists can be put aside, and the group can 
proceed to the next step. If a member is unable to write, that 
person’s list can be completed orally. 


Step 3: Complete List B. List B (see Figure 28.14) is designed 
to help group members identify possible contract tasks for other 
group members and helpful behaviors currently being completed 
by those persons. List B can also identify areas where disagree- 
ment exists between group members as to whether certain tasks 
are actually being completed properly and regularly. 


OTHER WAYS | COULD HELP MY FAMILY AND MYSELF 
. Be on time for dinner 


. Turn off the lights when | leave a room 


. Hang up my coat when | get home from school 


. Listen to Mom and Dad the first time 


. Help Dad fold the laundry 


fs 


From Sign Here: A Contracting Book for Children and Their Parents (2nd ed., p. 111) by J. C. Dardig and W. L. Heward, 
1981. Bridgewater, NJ: Fournies and Associates. Copyright 1981 by Fournies and Associates. Reprinted by permission. 


List B Name: Bobby 


Figure 28.14 A form for identifying potential contracting tasks for other family members. 


THINGS BOBBY DOES TO HELP THE FAMILY OTHER WAYS BOBBY COULD HELP THE FAMILY 


1. Vacuums when asked 1. Put his dirty clothes in hamper 

2. Makes his bed 2. Do homework without being asked 

3. Reads stories to little sister 3. Make his own sandwiches for his school lunch 
4. Empties trash 4. Clean and sponge off table after dinner 

5. Rakes leaves 5. 

6. 6. 

7. 7 


From Sign Here: A Contracting Book for Children and Their Parents (2nd ed., p. 111) by J. C. Dardig and W. L. Heward, 
1981. Bridgewater, NJ: Fournies and Associates. Copyright 1981 by Fournies and Associates. Reprinted by permission. 
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Figure 28.15 A form for self-identification of possible rewards for 


contingency contracts. 


List C Name: Charlotte 


MY FAVORITE THINGS, ACTIVITIES, AND SPECIAL TREATS 


Listening to music 


Movies 


New book 


Sleepover with friend 


Swimming 


Special packed lunches for school 


Ice cream sundaes 


Toys for Kiley (pet dog) 


. Picnics 


1. 
2. 
3. 
4. 
5. 
6. 
7. 
8. 
9 
10 


. New shoes 


=—_ 
= 


ek 
ad a 


—_ 
= 


15. 


From Sign Here: A Contracting Book for Children and Their Parents (2nd ed., p. 111) by 
J. C. Dardig and W. L. Heward, 1981. Bridgewater, NJ: Fournies and Associates. Copyright 
1981 by Fournies and Associates. Reprinted by permission. 


Each member should be given a copy of List B and asked 
to write his or her name in all three blanks at the top. These 
lists can then be passed around the table so that everyone has a 
chance to write at least one behavior on each side of everyone 
else’s list. Everyone writes on every List B except his or her 
own, and each person should be required to write at least one 
positive behavior on everyone else’s List B. After completion, 
these lists should be set aside before moving to the next step. 


Step 4: Complete List C. List C (see Figure 28. 15) is simply a 
sheet with numbered lines on which each group member identi- 
fies potential rewards he would like to earn by completing con- 
tracted tasks. Participants should list not only everyday favorite 
things and activities but also special items and activities they 
may have wanted for a long time. It is all right if two or more 
people indicate the same reward. After List C is completed, each 
person should collect his two other lists and read them carefully, 
talking over any misunderstood items. 


Step 5: Write and Sign Contracts. The final step begins with 
choosing a task for each person’s first contract. Discussion 
should move around the group, with members helping each 
other decide which task is the most important to start doing first. 
Everyone should write who is going to perform the task, exactly 
what the task is, how well and when it has to be done, and any 
possible exceptions. Each person should also look at List C and 
choose a reward that is neither excessive nor insignificant but 
is fair for the selected task. Each member should write who will 
control the reward, what the reward is, when it 1s to be given, 
and how much is to be given. Everyone in the group should write 
and sign one contract during the first meeting. 


Contracting with Nonreaders 

Reading ability by the participant is not a prerequisite for con- 
tingency contracting; however, the individual must be able to 
come under the control of the visual or oral statements (rules) of 
the contract. Contracting with nonreaders involves three types 
of clients: (a) preschoolers with good verbal skills, (b) school- 
aged children with limited reading skills, and (c) adults with 
adequate language and conceptual skills but who lack reading 
and writing skills. Contracts for nonreaders specify tasks and 
rewards with nonprint elements such as symbols, pictures, 
photos, and audio recordings chosen to suit the individual 
skills and preferences of children and adults in all three non- 
reader groups (see Figure 28.16). 


Figure 28.16 A contingency contract for a nonreader. 
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Contracting with Nonsigners 


Although some children are eager, and most at least willing, to try 
a contract, some want nothing to do with the whole idea. Using 
contingency contracting in a collaborative approach (Lassman, 
Jolivette, & Wehby, 1999) may reduce the likelihood of noncom- 
pliance, but a parent or practitioner should never try to force the 
child to sign a contract. Coercion is contrary to the whole idea 
of behavioral contracting. Instead of giving up, however, Dardig 
and Heward (2016) suggest the following strategies for working 
with a child who refuses to enter into a contingency contract: 


¢ Shaping and ignoring. Remember the nonsigner’s 
anti-contracting behavior is just that—behavior. Per- 
haps the worst thing parents can do when their child says 
“Contracts are stupid!” or “You'll never get me to sign 
a contract” is to argue or try to convince the child that 
contracts are “‘cool.” Better instead to pay attention to the 
child when he makes any remotely positive or even neutral 
comments about contracting, such as, “I wonder if anyone 
has ever made a contract with his parents for : 
¢ Modeling. In families with other children, perhaps the 
nonsigner’s sibling will try contracting. In a two-parent 
family, the parents can do contracts with one another; a 
single parent might develop a contract with a friend as a 
way to demonstrate the idea. A nonsigner who observes 
his sibling(s) and/or parent(s) successfully using a con- 
tract may then be willing to give it a try. 
Parent contract. Past experience may lead children to be 
wary of what their parents have in store for them and to 
always believe they get the short end of the stick. The par- 
ent might say something like, “Okay, if you don't think 
contracts are fair (or whatever the nonsigner doesn’t like 
about contracts), let’s make the first contract be for my (our) 
behavior and you control the reward.” After controlling the 
reward for a contract specifying a task his parent(s) must do, 
the nonsigner may learn that contracts can work, and may 
be willing to enter into a contract for his behavior. 
Self-contract. Perhaps the nonsigner does not trust that the 
contract will be honest, that he will complete the task but the 
reward will not be delivered. Allowing the child to control 
both the task and the reward components of a contract is one 
tactic for dealing with this suspicion. Will the child cheat if 
given such freedom? Perhaps the child will, but most will 
not. Some research shows that self-determined standards 
of task and reward result in better performance by children 
than do criteria set by adults (Dickerson & Creedon, 1981; 
Olympia, Sheridan, Jenson, & Andrews, 1994). 


However, even after the best of positive approaches, the 
reality is that some nonsigners may not agree to participate in 
a contingency contract. In such cases, another behavior change 
strategy would be a better alternative for dealing with the target 
behavior. 


Guidelines for Implementing 
and Evaluating Contracts 


In determining whether contingency contracting is an appro- 
priate intervention for a given problem, the practitioner should 


consider the nature of the desired behavior change, the verbal 
and conceptual skills of the participant, the individual’ s relation- 
ship with the person(s) with whom the contract will be made, 
and the available resources. The target behavior to be changed 
by acontingency contract should already be in the person’s rep- 
ertoire and under proper stimulus control in the settings where 
the response is desired. If the behavior is not in the individ- 
ual’s repertoire, other behavior-building techniques should be 
attempted (e.g., modeling, shaping, chaining). Contracting is 
most effective with behaviors that produce permanent products 
(e.g., completed homework assignment, cleaned bedroom) or 
that occur in the presence of the person who is to deliver the 
reward (e.g., the teacher or parent). 

Numerous lists of rules and guidelines for effective contin- 
gency contracting have been published (e.g., Dardig & Heward, 
2016; Downing, 1990; Homme, Csanyi, Gonzales, & Rechs, 
1970). Table 28.3 provides a list of the frequently cited guide- 
lines and rules. 

The simplest way to evaluate a contract is to record the 
occurrence of task completion. Including a task record on the 
contract helps make evaluation a natural by-product of the con- 
tracting process. By comparing data from the task record with 
a precontract baseline of task completion, an objective determi- 
nation can be made as to whether improvement has occurred. 
A good outcome is one that results in the specified task being 
completed more often than it was before the contract. 

Sometimes the outcome data indicate that the task is 
being completed more often and more consistently than it was 
before the contract, but the parties involved are unsatisfied. In 
such cases, either the original problem or goal that prompted 
the development of the contract is not being attained, or one 
or more of the participants are unhappy with how the contract 
is being carried out. The first possibility results from selecting 
the wrong behavior for the task part of the contract. For exam- 
ple, suppose ninth-grader Lenwood wants to improve on the 
D’s and F’s he has been getting on his algebra assignments at 
school and writes a contract with his parents, specifying the 
task as “studying-algebra for | hour each school night.” After 
several weeks, Lenwood has studied algebra for the required 
| hour on all but two school nights, but his poor grades remain 
unchanged. Has Lenwood’s contract worked? The correct 
answer is both yes and no. Lenwood’s contract was successful 
in that he consistently completed the specified task—1 hour 
of study each school night. However, in terms of his original 
objective—earning better algebra grades—the contract was a 
failure. Lenwood’s contract helped him change the behavior 
he specified, but he specified the wrong behavior. Studying for 
1 hour, for Lenwood at least, was not directly related to his goal. 
By changing his contract to require that he solve correctly 10 
algebra equations each night (the behavior required to get good 
grades on algebra tests), his goal of obtaining better grades may 
become a reality. 

It is also important to consider the participant’s reactions 
to the contract. A contract that produces desired change in the 
specified target behavior but causes other maladaptive or emo- 
tional responses may be an unacceptable solution. Having the 
client participate in the development of the contract and jointly 
conducting regular progress checks help to avoid this situation. 
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TABLE 28.3 Contingency Contracting Rules and Guidelines 


Rules and Guidelines 


Write a fair contract. 


Write a clear contract. 


Write an honest contract. 


Build in several layers of rewards. 


Consider a response cost 
contingency. 


Post the contract in a visible place. 


Renegotiate and change a contract 
when either party is consistently 
unhappy with it. 


Terminate a contingency contract. 


SUMMARY el | 


Token Economy 


1. A token economy, also called a token reinforcement 


Comments 


There must be a fair relationship between the difficulty of the task and the amount of the 


reward. The goal is to achieve a win-win situation for both parties, not for one party to gain an 
advantage over the other. 


In many instances, a contract’s greatest advantage is that it specifies each person's expec- 
tations. When a teacher or parent's expectations are explicit, performance is more likely to 
improve. Contingency contracts must say what they mean and mean what they say. 


An honest contract exists if the reward is delivered at the time and in the amount specified 
when the task is completed as agreed. In an honest contract, the reward is not delivered if the 
task has not been completed as specified. 


Contracts can include bonus rewards for beating the best daily, weekly, or monthly 
performance. 


Occasionally, it may be necessary to incorporate a “fine” —the removal of rewards—'f the 
agreed-upon task is not completed. 


Public posting allows all parties to see progress toward achieving the goals of the contract. 


Contracting is designed to be a positive experience for all parties, not a tedious endurance 
contest to determine survivors. If the contract is not working, reconsider the task, the reward 
components, or both. 


A contingency contract is a means to an end, not the end product. Once independent and profi- 
cient performance is achieved, the contract can be terminated. Further, a contract can and should 
be terminated when one party or both parties consistently fail to meet the contract's terms. 


5. When establishing a token economy, decisions must be 
made regarding how to begin, conduct, maintain, remove, 
and evaluate the system. 


system, is a behavior change system consisting of three 


major components: (a) a specified list of target behaviors 
to be reinforced; (b) tokens or points that participants 
receive for emitting the target behaviors; and (c) a menu 
of items or activities, privileges, and backup reinforcers 
from which participants choose and obtain by exchanging 


tokens they have earned. 


Group Contingencies 


6. In a group contingency a common consequence (usually, 
but not necessarily, a reward intended to function as 
reinforcement) is contingent on the behavior of one 
member of the group, the behavior of part of the group, 
or the behavior of everyone in the group. 


. Tokens are objects, symbols, marks, or chips that function 
as generalized conditioned reinforcers because they have 
been paired with a wide variety of backup reinforcers. 


. As participants move “up” from one level to the next, 
they have access to more privileges and are expected to 
demonstrate more independence. A level system is a type 
of token economy in which participants move up or down 
a hierarchy of levels contingent on meeting specific per- 
formance criteria with respect to the target behaviors. 


_ There are six basic steps in designing a token econ- 

omy: (1) identifying the target behaviors and rules; 

(2) selecting the tokens that will serve as a medium of 
exchange; (3) selecting a menu of backup reinforcers; 

(4) establishing a token exchange ratio; (5) writing proce- 
dures to specify when and how tokens will be dispensed 
and exchanged and what will happen if the requirements to 
earn a token are not met; and (6) field-testing the system 
before full-scale implementation. 


. Group contingencies can be classified as independent, 


dependent, and interdependent, and they offer the 
behavior analyst multiple advantages for improving 
or reducing behaviors. 


. Six guidelines can assist the practitioner in implement- 


ing a group contingency: (a) choose a powerful reward, 
(b) determine the behavior to change and any collateral 
behaviors that might be affected, (c) set appropriate 
performance criteria, (d) combine with other proce- 
dures when appropriate, (e) select the most appropriate 
group contingency, and (f) monitor individual and group 
performance. 


. Future applications of group contingencies might 


explore their effectiveness with larger societal issues 
(e.g., global warming), or chronic, systemic, difficult- 
to-treat individual challenges (e.g., smoking cessation, 
addictive behaviors). 
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Contingency Contracting 


10. A contingency contract is a document that specifies a con- 
tingent relation between the completion of a target behavior 
and access to, or delivery of, a specified reward. 


11. Two components required in all contingency contracts 
are a description of the task and a description of the 
reward. Under task, who, what, when, and how well 
should be specified. Under reward, who, what, when, 
and how much should be specified. An optional task 
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KEY TERMS 


backup reinforcer group contingency 


behavioral contract hero procedure 
contingency contract 


dependent group contingency 


interdependent group contingency 


record provides a place for recording the progress of the 
contract and providing interim rewards. 


12. A self-contract is a contingency contract that an individual 


makes with herself, incorporating a self-selected task and 
reward as well as self-monitoring of task completion and 
self-delivery of the reward. 


13. Contracts. for nonreaders specify tasks and rewards with 
nonprint elements such as symbols, pictures, photos, and 
audio recordings. 


level system 


self-contract 


independent group contingency token 


token economy 
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MULTIPLE-CHOICE QUESTIONS 


1. A description of the task should include which of the 
following when writing a contingency contract? 


a. A description of the reward 

b. A description of when the reward will be delivered 
c. A method of tracking progress 

d. A description of when the task should be performed 
Hint: (See “Contingency Contracting”’) 


2. A contingency contract: 
a. Is a verbal agreement among two individuals 


b. Specifies how two people will behave toward 
each other 


c. Often causes scapegoating among learners and peers 


d. Depends largely on the use of punishment for its 
effectiveness 


Hint: (See “Contingency Contracting”) 


3. Reinforcers in a token economy might include: 


a. Naturally occurring activities such as classroom jobs, 
games, etc. 


b. Privileges 
c. Tangible items, such as candy, pencils, etc. 
d. All of these 
Hint: (See “Token Economy”) 
4, The primary reason one might consider using a level 
system with a token system is that: 
a. Doing so can foster self-management 


b. Doing so makes it easier to withdraw the token 
economy 


c. Level systems are more preferred for all learners 
d. Level systems are easier to manage 


Hint: (See “Token Economy’’) 


5. Token economies often need to be: 
a. Complex 
b. Individualized 
c. Maintained forever 
d. All of these 


Hint: (See “Implementing a Token Economy”’) 


6. When selecting backup reinforcers: 
a. Consider ethical! and legal issues 


b. It is a good idea to use basic need items (e.g., lunch) as 
a reinforcer 


c. Keep the pool of backup reinforcers small 
d. Never use edibles 


Hint: (See “Implementing a Token Economy”) 


7. A teacher says to his class: “Melissa is working on 


remembering to follow instructions when I give them. 
Melissa is going to keep track of whether or not she follows 
my instructions. If she follows 90% of my instructions today 
(or better), our whole class will watch a movie at the end of 
the day.” What type of group contingency is this? 

a. Independent 

b. Dependent 

c. Interdependent 

d. None of these 


Hint: (See “Group Contingencies”) 


8. Group contingencies can: 


a. Lighten the workload for the practitioner 
b. Save time 

c. Create problems with learners 

d. All of these 


Hint: (See “Group Contingencies”) 


9. 
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How could a learner sabotage a group contingency? 


a. By purposefully not meeting the performance criterion 
so that no one gets the reinforcer 


b. By not showing up for school on a given day 
c. By urging peers to participate in the program 


d. By congratulating peers who are meeting the 
criterion 


Hint: (See “Group Contingencies’’) 
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ESSAY-TYPE QUESTIONS 


1. 


1. A total of 1500 steps was added, as researchers determined that this number 


De 


When describing the reward in a contingency contract, 
what are the four important things to include in the 
description? 


Hint: (See “Contingency Contracting”’) 


. You are a consultant working with a family that wants 


to write a contingency contract with their children to 
have better behavior in the home. Describe the process 
you would ask the family to go through to develop 

the contract. 


Hint: (See “Developing Contingency Contracts”) 


. What are the main components of a token economy? 


Hint: (See “Token Economy’’) 


of steps represented 15 to 20 minutes of at least moderate exercise per day. 
Research and development of the Sign Here contracting method described 
here was informed by three doctoral dissertations at The Ohio State Uni- 
versity. Kabler (1976) used Sign Here to teach contracting skills to three 
classes of fourth-grade students. Norman (1977) found that eight of nine 


10. An interdependent group contingency is most useful if: 


a. One wants to change the behavior of one person or a 
small group of individuals 


b. One wants to differentially reinforce individuals for 
appropriate behavior 


c. One wants to change everyone’s performance 
d. All of these 


Hint: (See “Group Contingencies’’) 


4. If you were going to implement a token economy, what 
would you use as a token? Why would you select this as a 
token? Make sure in your answer you discuss the important 
considerations one must make when selecting the token. 


Hint: (See “Token Economy’) 


5. Identify and define the different types of group 
contingencies. 


Hint: (See “Group Contingencies”) 


6. What are the six guidelines for implementing a group 
contingency? 


Hint: (See “Guidelines for Implementing a Group 
Contingency”) 


parents of children exhibiting behavior problems at home wrote and imple- 
mented contracts that reduced disruptions in the home. Shrewsberry (1977) 
used Sign Here to teach contracting to parents from 59 families, who wrote 
and implemented 154 contracts during a 6-week period. The parents judged 
138 (90%) of those contracts as successful in increasing their children’s 
completion of the specified tasks. 
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Self-Management 


LEARNING OBJECTIVES 


° Define self-management, its advantages and benefits, and specific types of self-management strategies. 
¢ Define self-monitoring, list and discuss suggested guidelines or procedures for its use, and discuss various 


derivatives of self-monitoring. 


¢ Compare and contrast different types of self-administered consequences. 


e Define and discuss four additional types of self-management tactics. 


e List and discuss various suggestions for conducting an effective self-management program. 


Raylene often failed to do the things she needed and 
wanted to do. Her days were just so busy! But Raylene is 
beginning to get a handle on her hectic life. This morning 
a sticky-note on her closet door reminded her to wear her 
gray suit for her luncheon meeting. A note on the refrig- 
erator prompted her to take her completed sales report to 
work. When Raylene got into her car—sales report in hand 
and looking sharp in her gray suit—and sat on the library 
book she’d placed on the driver’s seat the night before, 
chances were good that she would return the book that day 
and avoid another overdue fine. 

Daryl! collected the last data point for his master’s 
thesis nearly a year ago. It’s a solid study on a topic he 
enjoys and believes important, but Daryl is struggling 
mightily to write his thesis. Daryl knows if he would sit 
down and write for an hour or two a day his thesis would 
get written. But the scope and difficulty of the task are 
daunting. Daryl wishes his ability to sit and write were half 
as well developed as his propensity for distraction. 


aylene recently discovered self-management—behaving 

in ways that alter the occurrence of other behaviors she 

wants to control—and feels on top of the world. Daryl 
sorely needs some self-management. This chapter defines 
self-management, identifies uses for self-management and 
the benefits of teaching and learning self-management skills, 
describes a variety of self-management strategies and tactics, 
and offers guidelines for designing and implementing successful 
self-management programs. We begin with a discussion of the 
role of the self as a controller of behavior. 


THE “SELF” AS BEHAVIOR CONTROLLER 


A fundamental precept of radical behaviorism is that the causes 
of behavior are found in the environment. Throughout the 
evolution of the human species, causal variables selected by 
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the contingencies of survival have been transmitted through 
genetic endowment. Other causes of behavior are found in the 
contingencies of reinforcement that operate during the lifetime of 
an individual. What role, then, if any, is left for the self to play? 


Locus of Control: internal or External 
Causes of Behavior 


Proximate causes of some behaviors are apparent by observing 
events as they unfold. A mother picks up and cuddles her crying 
infant, and the baby stops crying. A highway worker leaps off 
the road when a fast-approaching car nears. An angler casts his 
lure toward the spot where his previous cast produced a strike. 
A behavior analyst would likely suggest the involvement of 
escape, avoidance, and positive reinforcement contingencies, 
respectively, for these events. Although others might offer 
mentalistic explanations for why the person in each scenario 
responded as he or she did (e.g., the infant’s crying triggered the 
mother’s nurturing instinct), most people would identify the same 
antecedent events—a baby cried, a speeding car approached, a 
fish struck the lure—as functional variables in the three sce- 
narios. An analysis of the three episodes would reveal almost 
certainly that the immediately preceding events had functional 
roles: the crying baby and speeding car as motivating operations 
that evoked escape and avoidance responses; the biting fish as a 
reinforcer that led to repeating the cast that produced it. 

But much human behavior does not immediately follow 
such obviously related antecedent events. Nonetheless, we 
humans have a long history of assigning causal status to events 
that immediately precede behavior. As Skinner (1974) noted, 
“We tend to say, often rashly, that if one thing follows another, 
it was probably caused by it—following the ancient precept 
of post hoc, ergo propter hoc (after this, therefore because of 
this)” (p. 7). When causal variables are not readily apparent in 
the immediate environment, the tendency to point to internal 
causes of behavior is particularly strong. As Skinner explained: 


The person with whom we are most familiar is our self; 
many of the things we observe just before we behave occur 
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Figure 29.1 Examples of four target behaviors for self-management and the self-management behavior emitted to 


change each example. 


Target Behavior 


Self-Management Behavior 


e Save money instead of spending iton 
frivolous items. 


garage to the curb on Thursday nights 
for pickup early Friday morning. 

e Ride exercise bike for 30 minutes ° 
each evening. 


CDT Zs 


within our body, and it is easy to take them as the causes 
of our behavior. . . . Feelings occur at just the right time to 
serve as causes of behavior, and they have been cited as 
such for centuries. We assume that other people feel as we 
feel when they behave as we behave. (pp. 7, 8) 


Why does one college student maintain a regular study 
schedule from the first week of the semester while her room- 
mates party night after night? When a group of people joins a 
weight loss or a smoking cessation program in which each mem- 
ber is exposed to the same intervention, why do some people 
meet their self-determined goals but others do not? Why does a 
high school basketball player with limited physical abilities con- 
sistently outperform his more athletically gifted teammates? The 
hardworking student is said to have more willpower than her 
less studious roommates; the group members who lost weight 
or stopped smoking are thought to possess more desire than 
their peers who failed to meet their goals or dropped out; the 
athlete’s superior play is considered a result of his exceptional 
drive. Although some psychological theories grant causal status 
to hypothetical constructs such as willpower, desire, and drive, 
these explanatory fictions lead to circular reasoning, bring us no 
closer to understanding the behaviors they claim to explain, and 
thus have no role in a behavior analysis of self-control.! 


Skinner's Two-response Conceptualization 
of Self-Control 


Skinner was the first to apply the philosophy and theory of rad- 
ical behaviorism to behavior typically considered controlled by 
the self. In his classic textbook, Science and Human Behavior, 
Skinner (1953) devoted a chapter to self-control. 


When a man controls himself, chooses a course of action, 
thinks out the solution to a problem, or strives toward an 
increase in self-knowledge, he is behaving. He controls 
himself precisely as he would control the behavior of 
anyone else—through the manipulation of variables of 
which behavior is a function. His behavior in so doing 

is a proper object of analysis, and eventually it must be 
accounted for with variables lying outside the individual 
himself. (pp. 228-229) 


Enroll in a payroll deduction plan. 


e Take garbage and recycling cans from ¢ After backing the car out of the garage for work Thursday morning, pull the 


trash and recycling cans onto the space in the garage where you will park the 
car in the evening. 


Make a chart of the minutes you ride, and show it to a coworker each morning. 


e Write a 20-page term paper. e (1) Outline paper and divide it into five 4-page parts; (2) specify a due date for 

each part; (3) give roommate five $10 checks made out to a despised organiza- 

tion; (4) on each due date show a completed part of the paper to roommate and 
get one $10 check back. 


ee 


Skinner (1953) conceptualized self-control [Skinner’s 
analysis] as a two-response phenomenon: 


One response, the controlling response, affects variables 

in such a way as to change the probability of the other, the 
controlled response. The controlling response may manipu- 
late any of the variables of which the controlled response is 
a function; hence there are a good many different forms of 
self-control. (p. 231) 


Skinner (1953) described a wide variety of self-control 
techniques, including use of physical restraint (e.g., clapping a 
hand over one’s mouth to prevent yawning at an embarrassing 
moment), changing the antecedent stimulus (e.g., putting a box 
of candy out of sight to reduce overeating), and “doing some- 
thing else” (e.g., talking about another topic to avoid talking 
about a particular topic), to name a few. Since Skinner’s initial 
list of techniques, a variety of taxonomies of self-control tactics 
have been described (e.g., Kazdin, 2013; Watson & Tharp, 2014). 
All self-control—or self-management—tactics can be operation- 
alized in terms of two behaviors: (a) the target behavior a person 
wants to change (Skinner’s controlled response) and (b) the 
self-management behavior (Skinner’s controlling response) 
emitted to change the target behavior. Consider the examples in 
Figure 29.1. 


SELF-MANAGEMENT DEFINED 


There is nothing mystical about self-management.” Self- 
management is behavior a person emits to influence another 
behavior. But many of the responses a person emits each day 
affect other behaviors. Putting toothpaste on a toothbrush makes it 
likely that tooth brushing will occur soon. But we do not consider 
squeezing toothpaste on a brush an act of “self-management” just 
because it evokes tooth brushing. What gives some responses 
the special status of self-management? How can we distinguish 
self-management from other behavior? 

Numerous definitions of self-control or self-management 
have been proposed, many of which are similar to one offered by 
Thoresen and Mahoney (1974). They suggested that self-control 
occurs when, in the “relative absence” of immediate external 
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controls, a person emits a response designed to control another 
behavior. For example, a man shows self-control if, when home 
alone and otherwise free to do whatever he wants, he forgoes his 
usual peanuts and beer and rides his exercise bike for 30 minutes. 
According to definitions such as Thoresen and Mahoney’s, 
the man would not be credited with self-control if his wife were 
there reminding him not to overeat, praising him for riding the 
exercise bike, and marking his time and mileage ridden on a chart. 

What, however, if the man had asked his wife to prompt, 
praise, and chart his exercise? Would his having done so consti- 
tute a form of self-control? One problem with conceptualizing 
self-control as occurring only in the absence of “external 
control” is that it excludes those situations in which a person 
designs contingencies that entail external control for a behavior 
he desires to change. Another problem with the “relative absence 
of external controls” concept of self-control is that it creates a 
false distinction between internal and external controlling vari- 
ables when, in fact, all causal variables for behavior ultimately 
reside in the environment. 

Definitions of self-control such as Kazdin’s (2013), “those 
behaviors that an individual deliberately undertakes to achieve 
self-selected outcomes by manipulating antecedent and conse- 
quence events” (p. 627), are more functional for applied behavior 
analysts. With this definition, self-control occurs whenever a 
person purposely emits behavior that changes the environment 
to modify another behavior. Self-control is considered purpose- 
ful in the sense that a person labels (or tacts) her responses as 
designed to attain a specified result (e.g., to reduce the number 
of cigarettes smoked each day). 

We define self-management as the personal application of 
behavior change tactics that produces a desired improvement in 
behavior. This intentionally broad definition of self-management 
encompasses one-time self-management events, such as Raylene’s 
taping a note to her closet door to remind herself to wear her gray 
suit the next day, as well as complex, long-running self-directed 
behavior change programs in which a person plans and imple- 
ments one or more contingencies to change his behavior. It is 
also a functional definition in that the desired change in the target 
behavior must occur for self-management to be demonstrated. 

Self-management is a relative concept. A behavior change 
program may entail a small degree of self-management or be 
totally conceived, designed, and implemented by the person. 
Self-management occurs on a continuum along which the person 
controls one or all components of a behavior change program. 
When a behavior change program is implemented by a person 
or team (e.g., therapist, parent, school staff) on behalf of another 
(e.g., client, child, student), the external change agent manip- 
ulates motivating operations, arranges discriminative stimuli, 
provides response prompts, delivers differential consequences, 
and observes and records the occurrence or nonoccurrence of 
the target behavior. Some degree of self-management is involved 
whenever a person performs (that is to say, controls) any element 
of a program that changes his behavior. 

It is important to recognize that defining self-management 
as the personal application of behavior change tactics that results 
in a desired change in behavior does not explain the phenome- 
non. Our definition of self-management is descriptive only, and 
purposively broad. Although self-management tactics can be 


classified according to their emphasis on a given component of 
the three- or four-term contingency, or by their structural simi- 
larity with a particular principle of behavior (e.g., stimulus con- 
trol, reinforcement), it is likely that all self-management tactics 
involve multiple principles of behavior. Thus, when researchers 
and practitioners describe self-management tactics, they should 
provide a detailed statement of the procedures used. Behavior 
analysts should not ascribe the effects of self-management inter- 
ventions to specific principles of behavior in the absence of an 
experimental analysis demonstrating such relations. Only through 
such research will applied behavior analysis develop a more com- 
plete understanding of the mechanisms that account for the effec- 
tiveness of self-management.” 


Terminology: Self-Management or 
Self-Control? 


Although self-management and self-control often appear inter- 
changeably in the behavioral literature, we recommend that 
self-management be used in reference to a person acting “in some 
way in order to change subsequent behavior” (Epstein, 1997, 
p. 547, emphasis in original). We make this recommendation for 
three reasons. First, self-control is an “inherently misleading” 
term that implies that the ultimate control of behavior lies within 
the person (Brigham, 1980). Although Skinner (1953) acknowl- 
edged a person could achieve practical control over a given 
behavior by acting in ways that manipulate variables influencing 
that behavior, he argued that the controlling behaviors themselves 
are learned from the person’s interactions with the environment. 


A man may spend a great deal of time designing his own 
life—he may choose the circumstances in which he is to 
live with great care, and he may manipulate his daily envi- 
ronment on an extensive scale. Such activity appears to 
exemplify a high order of self-determination. But it is also 
behavior, and we account for it in terms of other variables 
in the environment and history of the individual. It is these 
variables which provide the ultimate control. (p. 240) 


In other words, the causal factors for “self-control” (i.e., 
controlling behaviors) are to be found in a person’s experiences 
with his environment. Beginning with the example of a person 
setting an alarm clock (controlling behavior) so as to get out 
of bed at a certain time (controlled behavior) and concluding 
with an impressive instance of self-control by the main charac- 
ter in Homer’s Odyssey, Epstein (1997) described the origin of 
self-control as follows: 


As is true of all operants, any number of phenom- 

ena might have produced the [controlling] behavior 
originally: instructions, modeling, shaping, or generative 
process (Epstein, 1990, 1996), for example. Its occurrence 
might have been verbally mediated by a self-generated rule 
(Tl bet I'd get up earlier if I set an alarm clock”), and 
that rule, in turn, might have had any number of origins. 
Odysseus had his men tie him to his mast (controlling 
behavior) to lower the probability that he would steer 
toward the Siren’s song (controlled behavior). This was an 
elegant instance of self-control, but it was entirely instruc- 
tion driven: Circe had told him to do it. (p. 547) 


Second, attributing the cause of a given behavior to 
self-control Can serve as an explanatory fiction. As Baum (2017) 
pointed out, self-control “seems to suggest controlling a [sepa- 
rate] self inside or [that there is] a self inside controlling exter- 
nal behavior. Behavior analysts reject such views as mentalistic. 
Instead they ask, ‘What is the behavior that people call “self- 
control”? (p. 167, words in brackets added). Sometimes self 
1s synonymous with mind and “not far from the ancient notion 
of homunculus—an inner person who behaves in precisely the 
ways necessary to explain the behavior of the outer person in 
whom he dwells” (Skinner, 1974, p. 120 

Third, laypeople and behavior analysts alike often use 
self-control to refer to a person’s ability to delay gratification. In 
operant terms, this definition of self-control [as impulse control] 
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entails responding to achieve a delayed, but larger or higher qual- 
ity, reinforcer instead of acting to obtain an immediate, less valu- 
able, reinforcer (Rachlin & Greene, 1972; see Box 29.1). Using 
the same term to refer to a tactic for changing behavior and to a 
certain type of behavior that may be an outcome of that tactic is 
confusing and logically faulty. Restricting the use of self-control 
to describe a certain type of behavior (i.e., not acting impulsively) 
may reduce the confusion caused by using self-control as the 
name for both an independent variable and dependent variable. 
In this sense, self-control can be a possible goal or outcome of 
a behavior change program irrespective of whether that behay- 
ior 1s the product of an intervention implemented by an external 
agent or the subject. Thus, a person can use self-management to 
achieve, among other things, self-control. 


BOX 29.1 


Do You Want $10 Now or $100 Next Year? 


One way that behavior analysts conceptualize self-control is 
based on what organisms do when offered choices between 
rewards of different value and when those rewards can be 
obtained. Given a choice between a small reward and a larger 
reward, humans and nonhuman laboratory animals will most 
often select the larger reward. Offered a choice between an 
immediate reward and a delayed reward of equal value, organ- 
isms tend to choose the immediate reward. Such simple deci- 
sions are easy to predict—who doesn’t want rewards sooner 
and have more rather than less?—and require no self-control 
(Green & Estle, 2003; Odum, 2011). 

But things get more difficult to predict, and typical of sit- 
uations we humans face every day, when the choice is between 
a smaller, sooner reward (SSR) and a larger, later reward 
(LLR). In behavioral research, choosing the SSR (e.g., smok- 
ing a cigarette) over the LLR (e.g., better health in the future) is 
considered impulsive behavior. Forgoing the SSR to obtain an 
LLR reward is viewed as self-control (Ainslie, 1974; Rachlin 
& Green, 1972). 

Delayed rewards, regardless of their significance and 
magnitude (e.g., enough money for a secure retirement), exert 
decreasing influence over choice-making behavior as a func- 
tion of their temporal distance from present circumstances. 
Both humans and nonhuman laboratory animals discount the 
value of delayed rewards; the greater the delay to the reward, 
the greater the discount (i.e., the less value or influence the 
reward has on current behavior. Behavior analysts call this 
phenomenon delay discounting (or temporal discounting) 
(Odum, 2011; Madden & Johnson, 2010). 

Children with attention-deficit hyperactivity disorder 
(ADHD; Neef et al., 2005), compulsive gamblers (Dixon, 
Marley, & Jacobs, 2003), heavy smokers (Bickel, Odum, & 
Madden, 1999), binge drinkers (Vuchinich & Simpson, 1998), 
people with obesity (Davis, Patte, Curtis, & Reid, 2010), and 
heroin addicts (Kirby, Petry, & Bickel, 1999) more steeply 
discount the value of delayed rewards than do comparison 


groups, suggesting increased likelihood of impulsiveness. 
Behavior analysts have researched three strategies for decreas- 
ing impulsivity: gradually increasing the delay to the LLR, 
engaging in intervening activities during the delay, and making 
a commitment response. 


Progressive Delay 


The progressive delay strategy, first developed to teach “self- 
control” to pigeons (Ferster, 1953; Mazur & Logue, 1978), 
essentially shapes preference for LLRs. Subjects initially choose 
between an SSR and LLR, with each reward available imme- 
diately (or after a brief and equal delay, such as a 2-sec delay). 
After the subject selects the LLR some criterion number of times, 
the choice trials are repeated with a slight delay to the LLR. The 
duration of delay to the LLR is gradually increased over sessions. 

Schweitzer and Sulzer-Azaroff (1988) reported one of 
the first applied studies using the progressive delay technique. 
Subjects were six preschool children described as impulsive 
by their teachers. Preassessment confirmed the children con- 
sistently chose SSRs over LLRs. Following treatment, five 
of the children chose the LLR more often and three children 
consistently preferred the LLR at all delay intervals tested. 
Progressive delay training has reduced impulsive choices by 
children with ADHD (Neef, Bicard, & Endo, 2001; Neef, 
Mace, & Shade, 1993), children and adults with autism (Dixon 
& Cummings, 2001), and adults with developmental disabili- 
ties (Dixon & Holcomb, 2000). 


Intervening Activities 


Engaging in intervening activities during the LLR delay interval 
may increase self-control. Binder, Dixon, and Ghezzi (2000) 
required the three children with ADHD to engage in an interven- 
ing verbal activity during the delay to the LLR. Two conditions 
with intervening activities during the delay alternated in multi- 
element fashion. In one condition, the children repeated aloud, 
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BOX 29.1 (continued) 


“If I wait a little longer, I will get the bigger one.” In the other 
condition, the children named pictures of objects depicted 
on flash cards. All children exhibited self-control by select- 
ing the delayed rewards during both intervening activity 
conditions. 

A self-control training procedure combining progres- 
sive delay and intervening activities has also been evaluated 
(Dixon et al., 1998; Dixon, Rehfeldt, & Randich, 2003; 
Dunkel-Jackson, Dixon, & Szekely, 2016). In one study, 
three adults with autism spectrum disorder were asked, “Do 
you want a little [reinforcer] now or a lot of [reinforcer] after 
[doing measuring cups or handwriting]”? (Dunkel-Jackson 
et al., 2016, p. 707). The delay to the LLR, and thus the 
duration of task engagement required, was then progres- 
sively increased. All three participants chose to engage in the 
intervening target behavior and earn the LLR on 79%, 77%, 
and 92% of the intervention trials. This approach not only 
enhanced self-control—it also increased task engagement. 


Commitment Response 


Azrin and Powell (1968) conducted a study to limit chain 
smoking that featured a specially designed cigarette case that 
locked after the smoker removed a cigarette. By choosing to 
only smoke cigarettes taken from the case, a participant could 


APPLICATIONS, ADVANTAGES, 
AND BENEFITS OF SELF-MANAGEMENT 


In this section, we identify four basic applications of self- 
management and describe numerous advantages and benefits 
that accrue to people who use self-management, practitioners 
who teach it others, and to society as a whole. 


Self-Management Applications 


Self-management can help a person be more effective and effi- 
cient in daily life, replace bad habits with good ones, accomplish 
difficult tasks, and achieve personal goals. 


Living a More Effective and Efficient Daily Life 


Raylene’s writing notes to herself and placing her library book 
on her car seat are examples of self-management techniques for 
overcoming forgetfulness or lack of organization. Most people 
use simple self-management techniques, such as making a shop- 
ping list before going to the store or creating “to-do” lists, as ways 
to organize their day; but few probably consider such ordinary 
efforts to be “self-management.” Although many of the most 
widely used self-management techniques can be considered com- 
mon sense, people with an understanding of basic principles of 
behavior can use that knowledge to apply those commonsense 
techniques more systematically and consistently in their lives. 


not smoke another cigarette for a predetermined amount of 
time, which increased from 6 minutes to 65 minutes over the 
course of the study. Rachlin and Green (1972) suggested that a 
smoker’s use of the locking case was a commitment response 
leading to self-control. Since the smoker is just about to have a 
cigarette, the value of a future cigarette is negligible, making it 
easy to shut the case and forgo the next cigarette for a period of 
time. Similarly, an employee’s commitment response of enroll- 
ing in a payroll saving plan at the beginning of the month, 
when money from future paychecks is unavailable (i.e., at a 
discounted value), is easier than depositing money from a pay- 
check in hand into a savings account (behavior leading to LLR) 
instead of impulsively spending it on another dinner out and 
tickets to a concert (the SSR). 

Rachlin (2016) distinguishes between hard commit- 
ments to future behavior (snapping shut the locking cigarette 
case) and soft commitments that create patterns of behavior 
over time that come into contact with temporally distant 
contingencies (e.g., | can smoke as many cigarettes as I want 
today, but I commit to smoking the same number of cigarettes 
for each of the next 6 days). 

To learn more about the experimental analysis, impli- 
cations, and potential applications of delay discounting, see 
Critchfield and Kollins (2001); Hursh, Madden, Spiga, DeLeon, 
and Francisco (2013); and Jacobs, Borrero, and Vollmer (2013). 


Breaking Bad Habits and Acquiring Good Ones 


Many behaviors we want to do more (or less) often and know 
that we should do (or not do) are caught in reinforcement traps. 
Baum (2017) suggested that impulsiveness, bad habits, and 
procrastination are products of naturally existing reinforcement 
traps in which immediate but smaller consequences have greater 
influence on our behavior than more significant outcomes that 
are delayed. Baum gave this description of a reinforcement trap 
for smoking: 


Acting impulsively leads to a small but relatively 
immediate reinforcer. The short-term reinforcement for 
smoking . . . lies in the effects of nicotine and social 
reinforcers such as appearing grown-up or sophisticated. 
The trouble with impulsive behavior lies in long-term ill 
effects. Gradually, over months or years, the bad habit 
takes its toll in consequences such as cancer, heart disease 
and emphysema. 

The alternative to impulsiveness, self-control, . . . 
refraining from smoking, . . . also leads to both short-term 
and long-term consequences. The short-term consequences 
are punishing but relatively minor and short-lived: with- 
drawal symptoms (e.g., headaches) and possibly social 
discomfort. In the long term, however, . . . refraining from 
smoking reduces the risk of cancer, heart disease, and 
emphysema; ultimately, it promotes health. (pp. 168-169) 


> 


Reinforcement traps are two-sided contingencies that 
promote bad habits while simultaneously working against the 
selection of behavior that is beneficial in the long term. Even 
though we may know the rules describing such contingencies, 
the rules are difficult to follow. Malott (1984) described weak 
rules as those with delayed, incremental, or unpredictable out- 
comes. An example of a weak rule is I’d better not smoke or 
I might get cancer someday and die. Even though the potential 
consequence—cancer and death—is a major one, it is far off in 
the future and not a sure thing even then, which severely limits 
its effectiveness as a behavioral consequence. So, the rule don’t 
smoke or you might get cancer is hard to follow. The deleterious 
effect of any one cigarette is small, so small that it is not even 
noticed. Emphysema and lung cancer are likely years and thou- 
sands of cigarettes away. How can one more puff hurt? 

Self-management provides one strategy for avoiding the 
deleterious effects of reinforcement traps. A person can use 
self-management tactics to arrange immediate consequences 
that will counteract the consequences currently maintaining the 
self-destructive behavior. 


Accomplishing Difficult Tasks 


Immediate outcomes that bring a person only infinitesimally 
closer to a significant, long-term outcome do not control behav- 
ior. Many self-management problems are the result of outcomes 
that are small, but of cumulative significance. Malott (2005a) 
argued that our behavior is controlled by the outcome of each 
individual response, not by the cumulative effect of a large 
number of responses. 


The natural contingency between a response and its out- 
come will be ineffective if the outcome of each individual 
response is too small or too improbable, even though the 
cumulative impact of those outcomes is significant. So, 
we have trouble managing our own behavior when each 
instance of that behavior has only an insignificant outcome, 
though the outcomes of many repetitions of that behav- 
ior will be highly significant. And we have little trouble 
managing our own behavior, when each instance of that 
behavior has a significant outcome, though that outcome 
might be greatly delayed. 

Most obese people in the United States know the 
rule describing that contingency: If you repeatedly overeat, 
you will become overweight. The problem is, knowing the 
overweight rule does not suppress this one instance of eat- 
ing that delicious fudge sundae topped with the whipped 
cream and the maraschino cherry, because a single instance 
of eating this dessert will cause no significant harm, and 
will, indeed, taste great. Knowledge of this rule describing 
the natural contingency exerts little control over gluttony. 

But the outcome of our behavior also needs to be 
probable. Many people have trouble following the buckle-up 
rule, even though they might get in a serious auto accident 
as soon as they pull onto the street. Even though the delay 
between the failure to buckle up and the accident could 
be only a few seconds, they fail to buckle up because the 
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probability of the accident is so low. However, if they 
were driving in a dangerous auto race or a dangerous 
stunt-driving demonstration, they would always buckle 
up, because the probability of a serious accident would be 
fairly high. (pp. 516-517) 


Just as a person who smokes just one more cigarette or 
eats another hot fudge sundae cannot detect that lung cancer or 
obesity is a noticeably closer outcome, graduate student Daryl 
does not perceive that writing one more sentence brings him 
closer to his goal of a completed master’s thesis. Daryl is capa- 
ble of writing one more sentence but he often has trouble getting 
started and working steadily on difficult tasks in which each 
response produces little or no apparent change in the size of the 
remaining task. So, he procrastinates. 

Applying self-management to this type of performance 
problem involves designing and implementing one or more 
contrived contingencies to compete with the ineffective natu- 
ral contingencies. The self-management contingency provides 
immediate consequences or short-term outcomes for each 
response or small series of responses. These contrived conse- 
quences increase the occurrence of the target responses, which 
over time produce the cumulative effects necessary to complete 
the task. 


Achieving Personal Goals 


People can use self-management to achieve personal goals, such 
as learning to play a musical instrument, learning a foreign lan- 
guage, running a marathon, sticking to a schedule of daily yoga 
sessions (Hammer-Kehoe, 2012), or simply taking some time each 
day to relax (Harrell, 2012) or listen to enjoyable music (Dams, 
2012). For example, a graduate student who wanted to become 
a better guitarist used self-management to increase the time he 
practiced scales and chords (Rohn, 2002). He paid a $1 fine toa 
friend for each day that he failed to practice playing scales and 
chords for a criterion number of minutes before going to bed. 
Rohn’s self-management program also included several contin- 
gencies based on the Premack principle (see Chapter 11). For 
example, if he practiced scales (ordinarily a low-rate behavior) 
for 10 minutes, he could play a song (a high-rate behavior). 


Advantages and Benefits of 
Self-Management 


A dozen potential advantages and benefits accrue to people 
who use self-management tactics and to practitioners who teach 
self-management skills to their clients and students. 


Self-Management Can Influence Behaviors Not 
Accessible to External Change Agents 


Self-management can be used to change behaviors that are inac- 
cessible to observation by others. Behaviors such as thoughts 
of self-doubt, obsessive thoughts, and feelings of depression 
are private events for which a self-managed treatment approach 
may be needed (e.g., Kostewicz, Kubina, & Cooper, 2000; 
Kubina, Haertel, & Cooper, 1994). 
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Even publicly observable target behaviors are not always 
emitted in settings and situations accessible to an external 
change agent. The behavior a person wishes to change may need 
to be prompted, monitored, evaluated, or reinforced or punished 
on a day-to-day or even minute-to-minute basis in all of the 
situations and environments that the person encounters. Most 
successful smoking, weight loss, exercise, and habit-reversal 
programs, while planned and directed by others, rely heavily 
on participants using various self-management techniques while 
away from the treatment setting. Many behavior change goals 
brought to clinicians pose the same challenge: How can an effec- 
tive contingency be arranged that follows the client at all times 
wherever she goes? For example, target behaviors designed to 
increase an employee’s self-esteem and assertiveness can be 
identified and practiced in the therapist’s office, but implemen- 
tation of an active contingency in the workplace is apt to require 
self-management techniques. 


External Change Agents Often Miss Important 
Instances of Behavior 


In most education and treatment settings, particularly large- 
group situations, many important responses go unnoticed by the 
person responsible for applying behavior change procedures. 
Classrooms, for example, are especially busy places where 
desired behaviors by students often go unnoticed because the 
teacher is engaged with other tasks and students. As a result, 
students miss opportunities to respond, or they respond and 
receive no feedback because, in a behavioral sense, the teacher 
is not there. However, students who have been taught to evaluate 
their own performance and provide their own feedback in the 
form of self-delivered rewards and error correction, including 
seeking assistance and praise from their teacher when needed 
(e.g., Alber & Heward, 2000; Rouse, Everhart-Sherwood, & 
Alber-Morgan, 2014), are not dependent on the teacher’s direc- 
tion and feedback for every learning task. 


Self-Management Can Promote the Generalization 
and Maintenance of Behavior Change 


A behavior change that (a) continues after treatment has 
ended, (b) occurs in relevant settings or situations other than 
the one(s) in which it was learned originally, or (c) spreads to 
other related behaviors has generality (Baer, Wolf, & Risley, 
1968). Important behavior changes without such generalized 
outcomes must be supported indefinitely by continued treat- 
ment.> When the student or client is no longer in the setting 
in which a behavior change was acquired, she may no longer 
emit the desired response. Particular aspects of the original 
treatment environment, including the person (teacher, thera- 
pist, parent) who administered the behavior change program, 
may have become discriminative stimuli for the newly learned 
behavior, enabling the learner to discriminate the presence or 
absence of certain contingencies across settings. Generaliza- 
tion to nontreatment environments is also hampered when the 
naturally existing contingencies in those settings do not rein- 
force the target behavior. 

Self-management may overcome challenges to achiev- 
ing generalized outcomes. Baer and Fowler (1984) posed and 


answered a pragmatic question related to the problem of promot- 
ing the generalization and maintenance of newly learned skills. 


What behavior change agent can go with the student to 
every necessary lesson, at all times, to prompt and reinforce 
every desirable form of the behavior called for by the cur- 
riculum? The student’s own “self” can always meet these 
specifications. (p. 148) 


A Small Repertoire of Self-Management Skills 
Can Control Many Behaviors 


A person who learns a few self-management tactics can con- 
trol a potentially wide range of behaviors. For example, self- 
monitoring—observing and recording one’s own behavior—has 
been used to increase on-task behavior (Clemons, Mason, 
Garrison-Kane, & Wills, 2016), academic productivity and 
accuracy (e.g., Rock, 2005), homework completion and accu- 
racy (Falkenberg & Barbetta, 2013), productivity at work 
(e.g., Christian & Poling, 1997), physical activity (Hayes & 
Van Camp, 2015), and reciprocal social conversations (Koegel, 
Park, & Koegel, 2014), and to decrease target behaviors such 
as cigarette smoking (Foxx & Brown, 1979) and stereotypy or 
tics (Crutchfield, Mason, Chambers, Wills, & Mason, 2015; 
Fritz, Iwata, Rolider, Camp, & Neidert, 2012; Tiger, Fisher, & 
Bouxsein, 2009). 


People with Diverse Abilities Can Learn 
Self-Management Skills 


People of wide-ranging ages and cognitive abilities have used 
self-management tactics successfully. Preschoolers (e.g., Sainato, 
Strain, Lefebvre, & Rapp, 1990), typically developing students 
from primary grades through high school (Busacca, Moore, & 
Anderson, 2015), students with learning disabilities (McDougall 
et al., 2017), students with emotional and behavior disorders 
(Bruhn, McDonald, & Kreigh, 2015), children with autism (Carr, 
Moore, & Anderson, 2014; Southall & Gast, 2011), and children 
and adults with intellectual and developmental disabilities (Grossi 
& Heward, 1998; Reynolds, Gast, & Luscre, 2014) have all used 
self-management successfully. Even college professors are 
capable of improving their performance with self-management 
(Malott, 2005a)! 


Some People Perform Better Under Self-Selected 
Tasks and Performance Criteria 


Participants in some studies have performed better with self- 
selected work tasks and consequences than under contingencies 
determined by others (e.g., Baer, Tishelman, Degler, Osnes, & 
Stokes, 1992; Olympia, Sheridan, Jenson, & Andrews, 1994; 
Parsons, Reid, Reynolds, & Bumgarner, 1990). For example, 
in three brief experiments with a single student, Lovitt and 
Curtiss (1969) found pupil-selected rewards and contingen- 
cies to be more effective than teacher-selected performance 
standards. In the first phase of Experiment 1, the subject, a 
12-year-old boy in a special education classroom, earned a 
teacher-specified number of minutes of free time based on the 
number of math and reading tasks he completed correctly. In 
the next phase of the study, the student was allowed to specify 


the number of correct math and reading items needed to earn 
each minute of free time. During the final phase of the study, 
the original teacher-specified ratios of academic production 
to reinforcement were again in effect. The student’s median 
academic response rate during the self-selected contingency 
phase was 2.5 correct responses per minute (math and reading 
tasks reported together), compared to correct rates of 1.65 and 
1.9 in the two teacher-selected phases. 

An experiment by Dickerson and Creedon (1981) used a 
yoked-control (Sidman, 1960) and between-group comparison 
experimental design with 30 second- and third-graders and 
found that pupil-selected standards resulted in significantly 
higher academic production of both reading and math tasks. 
Both the Lovitt and Curtiss (1969) and the Dickerson and 
Creedon studies demonstrated that self-selected reinforcement 
contingencies can be more effective than teacher-selected 
contingencies. However, research in this area also shows that 
simply letting children determine their own performance stan- 
dards does not guarantee high levels of performance; some 
studies have found that children select too-lenient standards 
when given the opportunity (Felixbrod & O’Leary, 1973, 
1974). But interestingly, in a study of student-managed home- 
work teams by Olympia and colleagues (1994), although 
students assigned to teams who could select their own per- 
formance goals chose more lenient accuracy criteria than 
the 90% accuracy criterion required of students on the teams 
that worked toward teacher-selected goals, the overall perfor- 
mance of the teams with student-selected goals was slightly 
higher than that of the teams working toward teacher-selected 
goals. More research is needed to determine the conditions 
under which students will self-select and maintain appropriate 
standards of performance. 


People with Good Self-Management Skills Contribute 
to Efficient and Effective Group Environments 


The overall effectiveness and efficiency of any group of people 
who share a working environment is limited when a single per- 
son is responsible for monitoring, supervising, and providing 
feedback for everyone’s performance. Use of self-management 
skills by students, teammates, band members, or employees 
without having to rely on teachers, coaches, band directors, 
or managers for every task, improves the overall performance 
of the entire group. In the classroom, for example, teachers 
traditionally assume full responsibility for setting performance 
criteria and goals for students, evaluating students’ work, 
delivering consequences for those performances, and managing 
students’ social behavior. The time required for these functions 
is considerable. Students who can self-score their work, pro- 
vide their own feedback using answer keys or self-checking 
materials (Bennett & Cavanaugh, 1998: Goddard & Heron, 
1998), and behave appropriately without teacher supervision 
enable their teacher to attend to other aspects of the curriculum 
and perform other instructional duties (Mitchem, Young, West, 
& Benyo, 2001; Saddler & Good, 2006). 

When Hall, Delquadri, and Harris (1977) conducted 
observations of elementary classrooms and found low levels 
of active student response, they surmised that higher rates of 
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academic productivity might actually be punishing to teachers. 
Even though considerable evidence links high rates of student 
responding with academic achievement (e.g., Hattie, 2009; 
MacSuga-Gage & Simonsen, 2015), generating more student 
responses in most classrooms results in the teacher grading 
more papers. The time savings produced by students with even 
the simplest self-management skills can be significant. In one 
study, a class of five elementary special education students 
completed as many arithmetic problems as they could during 
a 20-min session each day (Hundert & Batstone, 1978). When 
the teacher graded the papers, an average of 50.5 minutes was 
required to prepare and conduct the session and to score and 
record each student’s performance. When the students scored 
their own papers, the total teacher time needed to conduct the 
arithmetic period averaged 33.4 minutes. 


Teaching Students Self-Management Skills Provides 
Meaningful Practice in Other Curriculum Areas 


When students learn to define and measure behavior and to 
graph, evaluate, and analyze their own responses, they practice 
various math and science skills in a relevant way. When students 
are taught how to conduct self-experiments such as using 
A-B designs to evaluate their self-management projects, they 
receive meaningful practice in logical thinking and scientific 
methodology (Marshall & Heward, 1979; Moxley, 1998). 


Self-Management Is an Ultimate Goal of Education 


When asked to identify what education should accomplish for its 
students, most people—educators and laypeople alike—include 
in their reply the development of independent, self-directed 
citizens capable of behaving appropriately and constructively 
without the supervision of others. John Dewey (1939), one of 
America’s most influential educational philosophers, said, “the 
ideal aim of education is the creation of self-control” (p. 75). 
A student’s ability to be self-directed, with the teacher as a guide 
or facilitator, and his ability to evaluate his own performance 
have long been considered cornerstones of humanistic education. 
As Lovitt (1973) observed, the fact that systematic instruc- 
tion of self-management skills is not a regular part of most 
schools’ curricula is a paradox because “one of the expressed 
objectives of the educational system is to create individuals 
who are self-reliant and independent” (p. 139). Although self- 
control is expected and valued by society, it is seldom addressed 
directly by the school curriculum. Teaching self-management as 
an integral part of the curriculum requires that students learn a 
fairly sophisticated sequence of skills (e.g., Marshall & Heward, 
1979: McConnell, 1999). However, systematic teaching of 
self-management skills, if successful, is well worth the effort 
required; students will have an effective means of dealing with 
situations in which there is little, if any, external control. 


Self-Management Benefits Society 


Self-management serves two important functions for society 
(Epstein, 1997). First, citizens with self-management skills are 
more likely to fulfill their potential and make greater contribu- 
tions to society. Second, self-management helps people behave 
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in ways (e.g., purchasing fuel-efficient automobiles, taking 
public transportation) that forgo immediate reinforcers that are 
correlated with devastating outcomes so long deferred that only 
future generations will experience them (e.g., depletion of nat- 
ural resources and climate change). People who have learned 
self-management techniques that help them save resources, 
recycle, and burn less fossil fuel help create a better world for 
everyone. Self-management skills can provide people with the 
means of meeting the well-intentioned but difficult-to-follow 
rule, Think globally but act locally. 


Self-Management Helps a Person Feel Free 


Baum (2017) noted that people caught in reinforcement traps 
who recognize the contingency between their addictive behav- 
ior, impulsiveness, or procrastination and the eventual conse- 
quence that is likely to occur as a result of that behavior are 
unhappy and do not feel free. However, “Someone who escapes 
from a reinforcement trap, like someone who escapes from 
coercion, feels free and happy. Ask anyone who has kicked an 
addiction” (p. 193). 

Ironically, a person who skillfully uses self-management 
techniques derived from a scientific analysis of behavior— 
environment relations predicated on an assumption of deter- 
minism is more likely to feel free than is a person who believes 
that her behavior is a product of free will. In discussing the 
apparent disconnect between philosophical determinism and 
self-control, Epstein (1997) compared two people: one who 
has no self-management skills and one who is very skilled at 
managing his own affairs. 


First, consider the individual who has no self-control. In 
Skinner’s view, such a person falls prey to all immediate 
stimuli, even those that are linked to delayed punishment. 
Seeing a chocolate cake, she eats it. Handed a cigarette, she 
smokes it. . .. She may make plans, but she has no ability 
to carry them out, because she is entirely at the mercy of 
proximal events. She is like a sailboat blowing uncontrolla- 
bly inagale... 

At the other extreme, we have a skillful self- 
manager. He, too, sets goals, but he has ample ability 
to meet them. He has skills to set dangerous reinforcers 
aside. He identifies conditions that affect his behav- 
ior and alters them to suit him. He takes temporally 
remote possibilities into account in setting his priorities. 
The wind is blowing, but he sets the boat’s destination 
and directs it there. 

These two individuals are profoundly different. 

The first is being controlled in almost a linear fashion by 
her immediate environment. The second is, in a nontrivial 
sense, controlling his own life... . 

The woman who lacks self-control skills feels con- 
trolled. She may believe in free will (in fact, in our culture, 
it’s a safe bet that she does) but her own life is out of con- 
trol. A belief in free will only exacerbates her frustration. 
She should be able to will herself out of any jam, but 
“willpower” proves to be highly unreliable. In contrast, 
the self-manager feels that he is in control. Ironically, 


like Skinner, he may believe in determinism, but he not 
only feels that he is in control, he is in fact exercising 
considerably more control over his life than our impulsive 
subject. (p. 560, emphasis in original) 


Self-Management Feels Good 


A final, but by no means trivial, reason for learning self- 
management is that being in control of one’s life feels good. 
A person who arranges her environment in purposeful ways 
that support and maintain desirable behaviors not only will be 
more productive than she would otherwise be but also will feel 
good about herself. Seymour (2002) noted her personal feelings 
about a self-management intervention she implemented with the 
objective of running for 30 minutes 3 times per week. 


The guilt I had been feeling for the 25 past years had 

been burning strong within me. [As a former scholar- 

ship softball player] I used to exercise 3 hours per day 

(6 days per week) because that exercise paid for my school- 
ing. ... But now the contingencies were missing, and my 
exercising had gone completely down the tubes. Since the 
start of my project, I’ve run 15 out of the 21 times; so I’ve 
gone from 0% during baseline to 71% during my inter- 
vention. I’m happy with that. Because the intervention’s 
successful, the guilt I have felt for the past years has been 
lifted. I have more energy and my body feels new and 
strong. It’s amazing what a few contingencies can do to 
improve the quality of life in a big way. (pp. 7-12) 


ANTECEDENT-BASED 
SELF-MANAGEMENT TACTICS 


In this and the three sections that follow, we describe some of 
the many self-management tactics that behavioral researchers 
and clinicians have developed. Although no standard set of 
labels or classification scheme for self-management tactics 
has emerged, the techniques are often presented according to 
their relative emphasis on antecedents or consequences for the 
target behavior. An antecedent-based self-management tactic 
features the manipulation of events or stimuli antecedent to the 
target (controlled) behavior. Sometimes grouped under gen- 
eral terms such as environmental planning (Bellack & Hersen, 
1977; Thoresen & Mahoney, 1974) or situational induce- 
ment (Martin & Pear, 2015), antecedent-based approaches to 
self-management encompass a wide range of tactics, including 
the following: 


* Manipulating motivating operations to make a desired 

(or undesired) behavior more (or less) likely. 

Providing response prompts. 

* Performing the initial steps of a behavior chain to ensure 
being confronted later with a discriminative stimulus that 
will evoke the desired behavior. 

¢ Removing the materials required for an undesired behavior. 

* Limiting an undesired behavior to restricted stimulus 
conditions. 

* Dedicating a specific environment for a desired behavior. 


Manipulate Motivating Operations 


A motivating operation (MO) is an environmental condition or 
event that (a) alters the effectiveness of some stimulus, object, or 
event as a reinforcer and (b) alters the current rate of all behav- 
iors that have been reinforced by that stimulus, object, or event 
in the past (see Chapter 16). An MO that increases the effective- 
ness of a reinforcer and has an evocative effect on behaviors that 
have produced that reinforcer is called an establishing operation 
(EO); an MO that decreases the effectiveness of a reinforcer and 
has an abative (i.e., weakening) effect on behaviors that have 
produced that reinforcer is called an abolishing operation (AO).° 

The general strategy for incorporating an MO into a 
self-management intervention is to behave in a way (con- 
trolling behavior) that creates a certain state of motivation 
that, in turn, increases (or decreases, as desired) subsequent 
occurrences of the target behavior (controlled behavior). Imag- 
ine you have been invited to dinner at the house of someone 
known to be a first-rate cook. You want to obtain maximum 
enjoyment from what promises to be a special meal but are 
worried you will not be able to eat everything. Skipping lunch 
(controlling behavior) creates an EO that would increase the 
likelihood of your being able to enjoy the evening meal from 
appetizer through dessert (controlled behavior). Conversely, 
eating a meal just before going grocery shopping (controlling 
behavior) could serve as an AO that decreases the momentary 
value of ready-to-eat foods with high sugar and fat content as 
reinforcers and results in purchasing fewer such items at the 
supermarket (controlled behavior). 


Provide Response Prompts 


Creating stimuli that later function as extra cues and reminders for 
desired behavior is a simple and often effective self-management 
technique. Response prompts can take a wide variety of forms 
(e.g., visual, auditory, textual, symbolic), be permanent for reg- 
ularly occurring events (e.g., entering “Put trash out tonight!” 
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each Thursday on one’s calendar), or be one-time affairs, as 
when Raylene wrote, “gray suit today” on a sticky-note and 
attached it to her closet door where she was sure to see it when 
dressing for work in the morning. Dieters often put pictures 
of overweight people, or perhaps even unattractive pictures of 
themselves, on the refrigerator door, near the ice cream, in the 
cupboard—anyplace where they might look for food. Seeing the 
pictures may evoke other controlling responses, such as moving 
away from the food, phoning a friend, going for a walk, or mark- 
ing a point on an “I Didn’t Eat It!” chart. 

The same object can be used as a generic response prompt 
for various behaviors, such as putting a rubber band around the 
wrist as a reminder to do a certain task at some later time. This 
form of response prompt, however, will be ineffective if the per- 
son is unable later to remember what specific task the physical 
cue was intended to prompt. Engaging in some self-instruction 
when “putting on” a generic response prompt may help the per- 
son recall the task to be completed when seeing the cue later. For 
example, one of the authors of this text uses a small carabiner 
he calls a “memor-ring” as a generic response prompt. When 
the thought of an important task that can only be completed in 
another setting later in the day occurs to him, he clips the cara- 
biner to a belt loop or the handle of his briefcase and “activates 
the memor-ring” by stating the target task to himself 3 times 
(e.g., “borrow Nancy’s journal at Arps Hall,” “borrow Nancy's 
journal at Arps Hall,” “borrow Nancy’s journal at Arps Hall”). 
When he sees the “memor-ring” in the relevant setting later, 
it usually functions as an effective prompt for completing the 
targeted task. 

Self-arranged cues can also be used to prompt the occur- 
rence of a behavior the person wants to repeat in a variety of 
settings and situations. In this case, the person might seed his 
environment with supplemental response prompts. For example, 
a father who wants to increase the number of times he inter- 
acts with and praises his children might place colorful sticky- 
notes printed with the word ZAP! (see Figure 29.2) in various 


Figure 29.2 Contrived cues used 

by parents to prompt self-selected 
parenting behaviors. 

Originally developed by Charles Novak. 
From Working with Parents of Handicapped 
Children, by W. L. Heward, J. C. Dardig, 
and A. Rossett, 1979, p. 26, Columbus, 
OH: Charles E. Merrill. 
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places around the house where he will see them routinely—on 
the microwave, on the TV remote control, as a bookmark. Each 
time he sees a ZAP! cue, the father is reminded to make a com- 
ment or ask a question of his child, or to look for some positive 
behavior for which he can provide attention and praise. 
Smartphones, tablets, watches, and other portable digital 
devices can provide self-delivered audio and visual response 
prompts wherever target behavior should be emitted (Cullen 
& Alber-Morgan, 2015; Mechling, 2011). Individuals with 
autism and other disabilities have used digital devices to 
prompt acquisition and continued performance across a wide 
range of daily-living (Cannella-Malone, Brooks, & Tullis, 2013; 
Cannella-Malone, Chan, & Jimenez, 2017) and vocational tasks 
(Cullen, Alber-Morgan, Simmons-Reed, & Izzo, 2017). 


Perform the Initial Steps of a Behavior Chain 


Another self-management tactic featuring manipulation of 
antecedent events involves behaving in a manner that ensures 
your future behavior contacts a discriminative stimulus (S) that 
reliably evokes the target behavior. Although operant behavior 
is selected and maintained by its consequences, the moment- 
to-moment occurrence of many behaviors is influenced by the 
presence or absence of discriminative stimuli. 

Many tasks consist of response chains. Each response in a 
chain produces a change in the environment that serves both as 
a conditioned reinforcer for the response that preceded it and as 
an SP for the next response in the chain (see Chapter 23). Suc- 
cessful completion of a behavior chain requires each component 
response to be evoked by its related S?. By performing part of 
a behavioral chain (the self-management response) at one point 
in time, a person changes her environment so that she will be 
confronted later with an SP that will evoke the next response 
in the chain and will lead to the completion of the task (the 
self-managed response). Skinner (1983b) provided an excellent 
example of this tactic. 


Ten minutes before you leave your house for the day you 
hear a weather report: It will probably rain before you 
return. It occurs to you to take an umbrella (the sentence 
means quite literally what it says: The behavior of taking an 
umbrella occurs to you), but you are not yet able to execute 
it. Ten minutes later you leave without the umbrella. You 
can solve that kind of problem by executing as much of 

the behavior as possible when it occurs to you. Hang the 
umbrella on the doorknob, or put it through the handle of 
your briefcase, or in some other way start the process of 
taking it with you. (p. 240) 


Remove Items Necessary 
for an Undesired Behavior 


Altering the environment to make an undesired behavior less 
likely or, better yet, impossible to emit is another antecedent- 
based self-management tactic. The smoker who discards her 
cigarettes and the dieter who removes all cookies and chips from 
his house, car, and office, have, for the moment at least, effec- 
tively controlled their smoking and junk food eating. Although 
other self-management efforts will probably be needed to refrain 


from seeking and obtaining the harmful material, ridding one- 
self of the items needed to engage in an undesired behavior is 
a good start. 


Limit Undesired Behavior to Restricted 
Stimulus Conditions 


A person may be able to decrease an undesired behavior by 
limiting the setting or stimulus conditions where he engages in 
the behavior. To the extent that the restricted situation acquires 
stimulus control over the target behavior and access to the situa- 
tion is infrequent or otherwise not reinforcing, the target behav- 
ior will occur less frequently. Imagine a man who habitually 
touches and scratches his face (he is aware of this bad habit 
because his wife often nags him about it and asks him to stop). 
In an effort to decrease his face touching, the man decides that 
he will do two things. First, whenever he becomes aware that he 
is engaging in face touching, he will stop; second, at any time 
he can go to a bathroom and touch and rub his face for as long 
as he wants. 

Nolan (1968) reported the case of a woman who used 
restricted stimulus conditions in an effort to quit smoking. The 
woman noted she most often smoked when other people were 
around or when she watched television, read, or lay down to 
relax. It was decided she would smoke only in a prescribed 
place, and she would eliminate from that place the potentially 
reinforcing effects of other activities. She designated a specific 
chair as her “smoking chair” and positioned the chair so she 
could not watch TV or easily engage in conversation while 
sitting in it. She asked family members not to approach her or 
talk to her while she was in her smoking chair. She followed the 
procedure faithfully, and her smoking decreased from a base- 
line average of 30 cigarettes per day to 12 cigarettes per day. 
Nine days after beginning the program, the woman decided to 
try to reduce her smoking even further by making her smoking 
chair less accessible. She put the chair in the basement, and 
her smoking decreased to five cigarettes per day. A month after 
beginning the smoking chair program, the woman had quit 
smoking completely. 

Goldiamond (1965) used a similar tactic to help a man 
who continually sulked when interacting with his wife. The hus- 
band was instructed to limit his sulking behavior to his “sulking 
stool,” which was placed in the garage. The man went to the 
garage whenever he felt like sulking, sat on his stool and sulked 
for as long as he wanted, and left when he was finished sulking. 
The man found that his sulking decreased considerably when he 
had to sit on a stool in the garage to do it. 


Dedicate a Specific Environment 
for a Desired Behavior 


A person may achieve some degree of stimulus control over a 
behavior that requires diligence and concentration by reserving 
or creating an environment where he will only engage in that 
behavior. Students have improved their study habits and pro- 
fessors their scholarly productivity when they have selected a 
specific place in which to study, free of other distractions, and 
have not engaged in any other behaviors, such as daydreaming 


or letter writing, in that place (Goldiamond, 1965). Skinner 
(1981b) suggested this kind of stimulus control strategy when 
he offered the following advice to aspiring writers: 


Equally important are the conditions in which the behavior 
occurs. A convenient place is important. It should have 

all the facilities needed for the execution of writing. Pens, 
typewriters, recorders, files, books, a comfortable desk and 
chair. . . . Since the place is to take control of a particular 
kind of behavior, you should do nothing else there at any 
time. (p. 2) 


People who do not have the luxury of devoting a certain 
environment to a single activity can contrive a special stimulus 
that can be turned on and off in the multipurpose setting, as in 
this example provided by Watson and Tharp (2014): 


A man had in his room only one table, which he had to 
use for a variety of activities, such as writing letters, 
paying bills, watching TV, and eating. But when he 
wanted to do concentrated studying or writing, he always 
pulled his table away from the wall and sat on the other 
side of it. In that way, sitting on the other side of his 
table became the cue associated only with concentrated 
intellectual work. (p. 168) 


Most students have a personal computer they use for 
their academic work but also for writing and reading personal 
e-mails, conducting family business, browsing the Internet, 
playing games, shopping online, and so on. In a manner similar 
to the man who sat on the other side of his table when studying 
and writing, a student might display a particular background on 
the computer’s desktop that signals that only academic work iS 
to be done on the computer. For example, when he sits down to 
study or write, a student could replace the standard display on 
his computer desktop, a photo of his dog, let’s say, with a solid 
green background. The green desktop signals that only scholarly 
work will be done. Over time the “scholarly work” desktop may 
acquire a degree of stimulus control over the desired behav- 
ior. If the student wants to quit working and use the computer 
for something else, he must first change the background on the 
desktop. Doing so before he has fulfilled his time or productiv- 
ity goal for a work session may generate some guilt that can be 
escaped by returning to work. 

This tactic can also be used to increase a desired behavior 
that is not being emitted at an acceptable rate because of com- 
peting undesired behaviors. In one case, an adult with insomnia 
reportedly went to bed about midnight but did not fall asleep 
until 3:00 or 4:00 AM. Instead of falling asleep, the man would 
worry about mundane problems and turn on the television. 
Treatment consisted of instructing the man to go to bed when he 
felt tired, but he was not to stay in bed if he was unable to sleep. 
If he wanted to think about his problems or watch television, he 
could do so, but he was to get out of bed and go into another 
room. When he once more felt sleepy, he was to return to bed 
and try again to go to sleep. If he still could not sleep, he was 
to leave the bedroom again. The man reported getting up 4 or 
5 times a night for the first few days of the program, but within 
2 weeks he went to bed, stayed there, and fell asleep. 
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Self-monitoring has been the subject of more research and 
clinical applications than any other self-management strategy. 
Self-monitoring (also called self-recording or self-observation) 
is a procedure whereby a person systematically observes his 
behavior and records the occurrence or nonoccurrence of a 
target behavior. Originally conceived as a clinical assessment 
technique for collecting data on behaviors that only clients could 
observe and record (e.g., eating, smoking, fingernail biting), 
self-monitoring became a major therapeutic intervention in its 
own right because of the reactive effects it often produced. As 
discussed in Chapters 3 and 4, reactivity refers to the effects an 
assessment or measurement procedure has on a person’s behav- 
ior. In general, the more obtrusive the observation and measure- 
ment method, the greater the likelihood of reactivity. Maximum 
obtrusiveness exists when the person observing and recording 
the target behavior is the subject of the behavior change pro- 
gram, and reactivity is very likely. 

Although reactivity caused by a researcher’s measure- 
ment system represents uncontrolled variability in a study and 
must be minimized as much as possible, the reactive effects 
of self-monitoring are welcome from a clinical perspective. 
Not only does self-monitoring often change behavior, but also 
the change is typically in the educationally or therapeutically 
desired direction. 

Behavior analysts have taught adults to use self-monitoring 
to decrease smoking (Lipinski, Black, Nelson, & Ciminero, 1975, 
McFall, 1977), increase physical activity (Kurti & Dallery, 2013) 
and calorie expenditure (Donaldson & Normand, 2009), and 
stop biting their nails (Maletzky, 1974). Students with and with- 
out disabilities have used self-monitoring to be on task more 
often in the classroom (Legge, DeBar, & Alber-Morgan, 2010; 
Wills & Mason, 2014), decrease disruptive behaviors (Kamps, 
Conklin, & Wills, 2015; Martella, Leonard, Marchand-Martella, 
& Agran, 1993), improve reading comprehension (Crabtree, 
Alber-Morgan, & Konrad, 2010), increase performance in 
a variety of academic subject areas (Holifield, Goodman, 
Hazelkorn, & Heflin, 2010; Wolfe, Heron, & Goddard, 2000), 
and complete homework assignments (Falkenberg & Barbetta, 
2013). Classroom teachers have used self-monitoring to increase 
their use of specific praise statements during classroom instruc- 
tion (Hager, 2012; Simonsen, MacSuga, Fallon, & Sugai, 2013) 
and the fidelity with which they conduct instructional procedures 
(Belfiore, Fritts, & Herman, 2008; Pinkelman & Horner 2016; 
Plavnick, Ferreri, & Maupin, 2010). 

In one of the first published accounts of self-monitoring 
by children, Broden, Hall, and Mitts (1971) analyzed the 
effects of self-recording on the behavior of two eighth-grade 
students. Liza was earning a D- in history and exhibited poor 
study behavior during the lecture format class. Using 10-sec 
momentary time sampling for 30 minutes each school day, an 
observer seated in the back of the classroom (Liza was not told 
she was being observed) produced a 7-day baseline showing 
that Liza exhibited study behaviors (e.g., faced the teacher, took 
notes when appropriate) an average of 30% of the observation 
intervals, despite two conferences with the school counselor in 
which she promised to “really try.” Prior to the eighth session, 
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the counselor gave Liza a piece of paper with three rows of 
10 squares and directed her to record her study behavior “when 
she thought of it” during her history class sessions. Some aspects 
of study behavior were discussed at this time, including a defi- 
nition of what constituted studying. 


Liza was instructed to take the slip to class each day and to 
record a “+” in the square if she was studying or had been 
doing so for the last few minutes, and a “—” if she was not 
studying at the time she thought to record. Sometime before 
the end of the school day she was to turn it in to the coun- 
selor. (p. 193) 


Figure 29.3 shows the results of Liza’s self-monitoring. 
Her level of study behavior increased to 78% (as measured by 
the independent observer) and stayed at approximately that level 
during Self-Recording 1, quickly decreased to baseline levels 
when self-recording was terminated during Baseline 2, and 
averaged 80% during Self-Recording 2. During Self-Recording 
Plus Praise, Liza’s teacher attended to her whenever he could 
during history class and praised her for study behavior whenever 
possible. Liza’s level of study behavior during this condition 
increased to 88%. 

The bottom graph in Figure 29.3 shows the number of 
times per session the observer recorded teacher attention toward 


Liza. That teacher attention did not correlate in any apparent 
way with changes in Liza’s study behavior during the first four 
phases of the experiment suggests that teacher attention—which 
often exerts a powerful influence on student behavior—was not 
a confounding variable and that the improvements in Liza’s 
study behavior most likely can be attributed to the self-recording 
procedure. However, the effects of self-monitoring may have 
been confounded by the fact that Liza turned her self-recording 
slips in to the counselor before the end of each school day, and 
at weekly student-counselor conferences she received praise 
from the counselor for recording slips with a high percentage 
of plus marks. 

In the second experiment reported by Broden and 
colleagues (1971), self-monitoring was used by Stu, an eighth- 
grade student who talked out in class “continually.” An observer 
recorded Stu’s talk-outs during both parts of a math class that 
met before and after lunch. Self-recording was begun first during 
the before-lunch portion of the math class. The teacher handed 
Stu a slip of paper on which was marked a 2-inch by 5-inch 
rectangular box and the instruction, “Put a mark every time you 
talk out” (p. 196). Stu was told to use the recording slip and 
turn it in after lunch. At the top of the recording slip was a place 
for the student’s name and the date. No other instructions were 
given to Stu, nor were any consequences or other contingencies 
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Figure 29.3 Percentage of observed intervals in which an eighth-grade girl paid attention in class. 


Republished with permission of Society for the Experimental Analysis of Behavior, Inc., from The Effect of Self-Recording on 
the Classroom Behavior of Two Eighth-Grade Students, M. Broden, R. V. Hall, and B. Mitts, 1971, Journal of Applied Behavior 
Analysis, 4, p. 194. Permission conveyed through Copyright Clearance Center, Inc. 


applied to his behavior. Self-recording was later implemented 
during the second half of the math class. During the pre-lunch 
portion of the math class, Stu’s baseline rate of 1.1 talk-outs per 
minute was followed by a rate of 0.3 talk-outs per minute when 
he self-recorded. After lunch Stu talked out at an average rate 
of 1.6 times per minute before self-recording and 0.5 times per 
minute during self-recording. 

The combination reversal and multiple baseline design 
showed a clear functional relation between the self-monitoring 
procedure and Stu’s reduced talk-outs. During the final phase of 
the experiment, however, Stu talked out at rates equal to initial 
baseline levels, even though self-recording was in effect. Broden 
and colleagues (1971) suggested that the lack of effect produced 
by self-recording during the final phase may have resulted from 
the fact that “no contingencies were ever applied to differen- 
tial rates of talking out and the slips thus lost their effective- 
ness” (p. 198). It is also possible that the initial decreases in 
talk-outs attributed to self-recording were confounded by Stu’s 
expectations of some form of teacher reinforcement. 

As can be seen from these two experiments, it is extremely 
difficult to isolate self-monitoring as a straightforward, “clean” 
procedure; self-monitoring almost always entails other contin- 
gencies. Nevertheless, the various and combined procedures that 
constitute self-monitoring are effective in changing behavior. 


Self-Evaluation 


Self-monitoring is often combined with goal setting and self- 
evaluation. A person using self-evaluation (also called self- 
assessment) compares her performance with a predetermined 
goal or standard. Self-evaluation can be aided by self-graphing 
self-collected data (Kasper-Ferguson & Moxley, 2002; Stolz, 
Itoi, Konrad, & Alber-Morgan, 2008). For example, Grossi and 
Heward (1998) taught four restaurant trainees with developmen- 
tal disabilities to select productivity goals, self-monitor their 
work, and self-evaluate their performance against a competitive 
productivity standard (i.e., the typical rate at which nondisabled 
restaurant employees perform the task in competitive settings). 
Job tasks in this study included scrubbing pots and pans, load- 
ing a dishwasher, busing and setting tables, and mopping and 
sweeping floors. 

Self-evaluation training with each trainee took approxi- 
mately 35 minutes across three sessions and consisted of five 
parts: (1) rationale for working faster (e.g., finding and keep- 
ing a job in a competitive setting), (2) goal setting (trainee was 
shown a simple line graph of his baseline performance com- 
pared to the competitive standard and prompted to set a produc- 
tivity goal), (3) use of timer or stopwatch, (4) self-monitoring 
and how to chart self-recorded data on a graph that showed the 
competitive standard with a shaded area, and (5) self-evaluation 
(comparing his own work against the competitive standard and 
making self-evaluative statements, e.g., “I’m not in the shaded 
area. I need to work faster.” or “Good, I’m in the area.”). When 
4 trainee met his goal for 3 consecutive days, a new goal was 
selected. At that time, all four trainees had selected goals within 
the competitive range. 

The work productivity of all four trainees increased as a 
function of the self-evaluation intervention. Figure 29.4 shows 
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the results for one of the participants, Chad, a 20-year-old 
man with mild cognitive disabilities, cerebral palsy, and a sei- 
zure disorder. For the 3 months prior to the study, Chad had 
been training at two work tasks at the dishwashing job station: 
scrubbing cooking pots and pans and racking dishes (a six-step 
behavior chain for loading an empty dishwasher rack with dirty 
dishes). Throughout the study, pot scrubbing was measured 
during 10-min observation sessions both before and after the 
daily lunch operation. Racking dishes was measured using a 
stopwatch to time to the nearest second how long a trainee took 
to load a rack of dishes; 4 to 8 racks per session were timed 
during the busiest hour of the lunch shift. After each observation 
session during baseline, Chad received feedback on the accuracy 
and quality of his work. No feedback was given during baseline 
on the productivity of his performance. 

During baseline, Chad scrubbed an average of 4.5 pots and 
pans every 10 minutes, and none of his 15 baseline trials was 
within the competitive range of 10 to 15 pots. Chad’s pot scrub- 
bing rate increased to a mean of 11.7 pots during self-evaluation, 
with 76% of 89 self-evaluation trials at or above the competi- 
tive range. During baseline, it took Chad a mean of 3 minutes 
2 seconds to load one rack of dishes, and 19% of his 97 base- 
line trials were within the competitive range of 1 to 2 minutes. 
During self-evaluation, Chad’s time to load one rack improved 
to a mean of | minute 55 seconds, and 70% of the 114 self- 
evaluation trials were within the competitive range. At the end 
of the study, three of the four trainees stated that they liked to 
time and record their work performance. Although one trainee 
stated it was “too stressful” to time and record his work, he 
said that self-monitoring helped him show other people that 
he could do the work. 

Participants who have self-evaluated their behavior as 
captured on video have improved their posture while sitting at 
computer work stations (Sigurdsson & Austin, 2008) and their 
performance of complex yoga postures (Downs, Miltenberger, 
Biedronski, & Witherspoon, 2015). 


Self-Monitoring with Reinforcement 


Self-monitoring is often part of an intervention package that 
includes feedback and reinforcement for meeting either self- or 
teacher-selected goals (e.g., Bulla & Frieder, 2017; Christian 
& Poling, 1997; Olympia et al., 1994). The reinforcer may be 
self-administered or teacher delivered. For example, Koegel, 
Koegel, Hurley, and Frea (1992) taught four children with 
autism between the ages of 6 and 11 to obtain their own rein- 
forcers after they had self-recorded a criterion number of appro- 
priate responses to questions from others (e.g., “Who drove you 
to school today?”). 

Teacher-delivered reinforcement was a component of a 
self-monitoring intervention used by Martella and colleagues 
(1993) to help Brad, a 12-year-old student with mild intellectual 
disabilities, reduce the number of negative statements he made 
during classroom activities (e.g., “I hate this #*% @! calculator,” 
“Math is crappy”). Brad self-recorded negative statements 
during two class periods, charted the number on a graph, and 
then compared his count with the count obtained by a student 
teacher, If Brad’s self-recorded data agreed with the teacher at 
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Figure 29.4 Number of pots scrubbed in 10 minutes and the mean and range of minutes to load 
a rack of dishes in the dishwasher during baseline and self-evaluation conditions. Shaded bands 
represent the competitive performance range of restaurant workers without disabilities. The 
dashed horizontal line represents Chad’s self-selected goal; no dashed line means that Chad’s 
goal was within the competitive range. Vertical lines through data points on the bottom graph 
show the range of Chad's performance across 4 to 8 trials. 


Republished with permission of Division on Developmental Disabilities, from Using Self-Evaluation to Improve the Work 
Productivity of Trainees in a Community-Based Restaurant Training Program, T. A. Grossi and W. L. Heward, 1998, 
Education and Training in Mental Retardation and Developmental Disabilities, 33, p. 256. Permission conveyed through 


Copyright Clearance Center, Inc. 


a level of 80% or higher, he received his choice from a menu 
of “small” reinforcers (items costing 25 cents or less). If Brad’s 
self-recorded data agreed with the teacher’s data and were at 
or below a gradually declining criterion level for four consec- 
utive sessions, he was allowed to choose a “large” (more than 
25 cents) reinforcer (see Figure 30.16). 


Why Does Self-Monitoring Work? 


The behavioral mechanisms that account for the effectiveness of 
self-monitoring are not fully understood. Some behavior theo- 
rists suggest that self-monitoring is effective in. changing behav- 
ior because it evokes self-evaluative statements that serve either 
to reinforce desired behaviors or to punish undesired behav- 
iors. Cautela (1971) hypothesized that a child who records on 
a chart that he has completed his chores may emit covert ver- 
bal responses (e.g., “I am a good boy”) that serve to reinforce 
chore completion. Malott (1981) suggested that self-monitoring 
improves performance because of what he called guilt control. 
Self-monitoring less-than-desirable behavior produces covert 
guilt statements that can be avoided by improving one’s per- 
formance. That is, the target behavior is strengthened through 


negative reinforcement by escape and avoidance of the guilty 
feelings that occur when one’s behavior is “bad.” 

Descriptions of the self-monitoring techniques used by 
two famous authors apparently agreed with Malott’s guilt con- 
trol hypothesis. Novelist Anthony Trollope, writing in his 1883 
autobiography, stated: 


When I have commenced a new book; I have always pre- 
pared a diary, divided into weeks, and carried on for the 
period which I have allowed myself for the completion of 
the work. In this I have entered, day by day, the number of 
pages I have written, so that if at any time I have slipped 
into idleness for a day or two, the record of that idleness 
has been there staring me in the face, and demanding of 

me increased labour, so that the deficiency might be sup- 
plied. . . . [have allotted myself so many pages a week. The 
average number has been about 40. It has been placed as 
low as 20, and has risen to 112. And as a page is an ambig- 
uous term, my page has been made to contain 250 words; 
and as words, if not watched, will have a tendency to strag- 
gle, [ have had every word counted as I went. . . . There has 
ever been the record before me, and a week passed with an 


insufficient number of pages has been a blister to my eye 
and a month so disgraced would have been a sorrow to my 
heart. (from Wallace, 1977, p. 518) 


Guilt control also helped to motivate the legendary Ernest 
Hemingway. Novelist Irving Wallace (1977) reported the fol- 
lowing extract from an article by George Plimpton about the 
self-monitoring technique used by Hemingway. 


He keeps track of his daily progress—“so as not to 

kid myself’—on a large chart made out of the side of 

a cardboard packing case and set up against the wall 
under the nose of a mounted gazelle head. The numbers 
on the chart showing the daily output of words differ 
from 450, 575, 462, 1250, back to 512, the higher figures 
on days Hemingway put in extra work so he won’t feel 
guilty spending the following day fishing on the Gulf 
Stream. (p. 518) 


Exactly which principles of behavior are operating when 
self-monitoring changes the target behavior is not known because 
much of the self-monitoring procedure consists of private, covert 
behaviors. In addition to the problem of access to these private 
events, self-monitoring is usually confounded by other variables. 
Self-monitoring is often part of a self-management package in 
which contingencies of reinforcement, punishment, or both, are 
included, either explicitly (e.g., “If I run 10 miles this week, 
I can go to the movies.”) or implicitly (e.g., “I’ve got to show 
my record of calories consumed to my wife.”). Regardless of 
the principles of behavior involved, however, self-monitoring is 
often an effective procedure for changing one’s behavior. 


Self-Monitoring Guidelines and Procedures 


Practitioners should consider the following suggestions when 
implementing self-monitoring with their clients. 


Provide Materials That Make Self-Monitoring Easy 


If self-monitoring is difficult, cumbersome, or time-consuming, 
at best it will be ineffective and disliked by the participant, and at 
worst it may have negative effects on the target behavior. Provide 
participants with materials and devices that make self-monitoring 
as simple and efficient as possible. All of the low-tech devices 
and procedures for measuring behavior described in Chapter 4 
(e.g., paper and pencil, wrist counters, hand-tally counters, tim- 
ers, stopwatches) can be used for self-monitoring. For example, 
a teacher wanting to praise students at least 10 times during a 
class period could put 10 pennies in her pocket prior to the start 
of class. Each time she praises a student’s behavior, she moves 
one coin to another pocket. 

Recording forms for most self-monitoring situations should 
be very simple. Self-recording forms consisting of little more 
than a series of boxes or squares are often effective (e.g.., Broden 
et al., 1971). At various intervals, the participant might write a 
plus or minus, circle yes or no, or mark an X on a smiling face 
or sad face, in a kind of momentary time sampling procedure; or 
he could record a count of the number of responses made during 
a just-completed interval. 

Recording forms can be created for self-monitoring spe- 
cialized tasks or a chain of behaviors. Dunlap and Dunlap (1989) 
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taught students with learning disabilities to self-monitor the 
steps they used to solve subtraction problems with regrouping. 
Each student self-monitored his work by recording a plus or a 
minus next to each step on an individualized checklist of steps 
(e.g., “I underlined all the top numbers that were smaller than 
the bottom.”; “I crossed out only the number next to the under- 
lined number and made it one less.” [p. 311]) designed to prompt 
the student from committing specific types of errors. 

Lo (2003) taught elementary students who were at risk 
for behavioral disorders to use the form shown in Figure 29.5 
to self-monitor whether they worked quietly, evaluated their 
work, and followed a prescribed sequence for obtaining teacher 


3. “Do | need 
teacher?” 


> 


aa 


4. Raise my hand 


5. “How am | 
doing?” 
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Figure 29.5 Form used by elementary school students 

to self-monitor whether they worked quietly and followed 
a prescribed sequence for recruiting teacher assistance. 
From “Functional Assessment and Individualized Intervention Plans: 
Increasing the Behavioral Adjustment of Urban Learners in General 

and Special Education Settings" by Y. Lo, 2003. Unpublished doctoral 
dissertation. Columbus, OH: The Ohio State University. Reprinted by 
Permission. 
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assistance during independent seatwork activities. The form, 
which was taped to each student’s desk, functioned as a reminder 
to students of the expected behaviors and as a device on which 
they self-recorded those behaviors. 

‘“Countoons” are self-monitoring forms that illustrate the 
contingency with a series of cartoon-like frames. A countoon 
reminds young children of not only what behavior to record 
but also what consequences will follow if they meet specified 
performance criteria. Daly and Ranalli (2003) created six-frame 
countoons that enable students to self-record an inappropriate 
behavior and an incompatible appropriate behavior. In the coun- 
toon shown in Figure 29.6, Frames F1 and F4 show the student 
doing her math work, appropriate behavior that is counted in 
Frame F5. The criterion number of math problems to meet the 
contingency—in this case, 10—is also indicated in Frame F5. 
Frame F2 shows the student talking with a friend, the inappro- 
priate behavior to be counted in F3. The student must not chat 
more than 6 times to meet the contingency. The “What Hap- 
pens” frame (F6) depicts the reward the student will earn by 
meeting both parts of the contingency. Daly and Ranalli have 
provided detailed steps for creating countoons and using them 
to teach self-management skills to children. 

Self-monitoring forms can also be designed to enable 
self-recording of multiple tasks across days. For example, teach- 
ers working with secondary students might find the Classroom 
Performance Record (CPR) developed by Young and colleagues 
(1991) an effective way to assist students with monitoring their 
assignments, completion of homework, points earned, and citi- 
zenship scores. The form also provides students with informa- 
tion about their current status in the class, their likely semester 
grade, and tips for improving their performance. 

Self-monitoring interventions to improve fitness can be 
aided by smartwatches and other digital devices that automati- 
cally count steps, measure heart rate, and record variables such 


Figure 29.6 Example of a countoon that can 
be taped to a student’s desk as a reminder 
of target behaviors, the need to self-record, 
and the consequence for meeting the 
contingency. 

Republished with permission of Council for Excep- 
tional Children, from Using Countoons to Teach 
Self-Monitoring Skills, P. M. Daly and P. Ranalli, 2003, 
Teaching Exceptional Children, 35(5), p. 32. Permission 
conveyed through Copyright Clearance Center, Inc. 


Do my math work 


What | Do 


as time and distance (e.g., Hayes & Van Camp, 2015; Valbuena, 
Miltenberger, & Solley, 2015). In one study, college student 
athletes self-monitored their class attendance and arrival time by 
text messaging their academic counselors (Bicard, Lott, Mills, 
Bicard, & Baylot-Casey, 2012). 


Provide Supplementary Cues or Prompts 


Although some self-recording devices—a recording form taped 
to a student’s desk, a notepad for counting calories carried in a 
dieter’s shirt pocket, a golf counter on a teacher’s wrist—serve 
as continual reminders to self-monitor, additional prompts or 
cues to self-monitor are often helpful. Researchers and practi- 
tioners have used a variety of auditory, visual, and tactile stimuli 
as cues and prompts for self-monitoring. 

Auditory prompts in the form of prerecorded signals or 
tones have been widely used to cue self-monitoring in class- 
rooms (e.g., Todd, Horner, & Sugai, 1999). For example, in 
a study by Glynn, Thomas, and Shee (1973), second-graders 
placed a checkmark in a series of squares if they thought they 
were on task at the moment they heard a tape-recorded beep. A 
total of 10 beeps occurred at random intervals during a 30-min 
class period. Using a similar procedure with a digital device, 
Gulchak (2008) taught an 8-year-old boy diagnosed with emo- 
tional and behavior disorders to use a stylus to check Yes or No 
under the heading “Are You On-Task?” on a handheld computer 
when the computer’s alarm chimed at 10-min intervals. 

Ludwig (2004) used visual cues on the classroom board 
to prompt kindergarten children to self-record their productivity 
during a morning seatwork activity in which they wrote answers 
on individual worksheets to items, problems, or questions that 
the teacher had printed on a large whiteboard on the classroom 
wall. The work was divided into 14 sections by topic and covered 
a variety of curriculum areas (e.g., spelling, reading comprehen- 
sion, addition and subtraction problems, telling time, money). 


What Happens 


Chat with 
my friend 


de 238A S 


67 8 910] 


VW 12: 13. 14,15 


My Count 


At the end of each section, the experimenter drew a smiley face 
with a number from | to 14. The numbered smiley faces drawn 
on the board corresponded to 14 numbered smiley faces on each 
student’s self-monitoring card. 

Digital devices can provide unobtrusive visual prompts 
to self-monitor. Clemons and colleagues (2016) programmed a 
handheld tablet’s touchscreen to flash “Am I on-task?” followed 
by the choices “yes” and “no” at 30-sec or 1-min intervals. 
Three high school students with varying disabilities exhibited 
improved levels of on-task behavior when using the system in 
general and self-contained special education classrooms. 

Tactile prompts can also be used to signal self-recording 
opportunities. For example, the MotivAider(R) (www. 
habitchange.com) is a small electronic device that vibrates at 
fixed or variable time intervals programmed by the user. This 
device is excellent for signaling people to self-monitor or 
perform other self-management tasks (e.g., Legge et al., 2010) 
or to prompt practitioners to attend to the behavior of students or 
clients (Flaute, Peterson, Van Norman, Riffle, & Eakins, 2005). 

Prompts to self-monitor, whatever their form, should be as 
unobtrusive as possible so they do not disrupt the participant or 
others. As a general rule, practitioners should provide frequent 
prompts to self-monitor at the beginning of a self-management 
intervention and gradually reduce their number as the participant 
becomes more proficient and skilled with self-monitoring. 


Self-Monitor the Most Important Dimension 
of the Target Behavior 


Self-monitoring is measurement. As we saw in Chapter 4, there 
are various dimensions by which a given behavior can be mea- 
sured. What dimension of the target behavior should be self- 
monitored? A person should self-monitor the dimension that, 
should desired changes in its value occur, would yield the most 
direct and significant progress toward his self-management goal. 
For example, a man wanting to lose weight by eating less food 
could measure the number of bites he takes throughout the day 
(count), the number of bites per minute at each meal (rate), how 
much time elapses from the time he sits down at the table to when 
he takes the first bite (latency), how long he pauses between bites 
(interresponse time), and/or how long his meals last (duration). 
Although daily measures of each of these dimensions would pro- 
vide some quantitative information about the man’s eating, none 
of the dimensions would be as directly related to his goal as would 
the total number of calories he consumes each day. 

Numerous studies have examined the question of whether 
students should self-monitor their on-task behavior or their 
academic performance or productivity. Some researchers have 
found self-monitoring of attention more effective (Harris, 
Friedlander, Saddler, Frizzelle, & Graham, 2005; Kneedler 
& Hallahan, 1981), others have reported advantages for self- 
monitoring performance (Lam, Cole, Shapiro, & Bambara, 
1994: Maag, Reid, & DiGangi, 1993; Reid, 1996), and some 
have found the two procedures equally effective (Carr & Punzo, 
1993; Shimabukuro, Pratter, Jenkins, & Edelen-Smith, 2000). 
Although self-monitoring either attention or performance 
results in increased on-task behavior, students tend to complete 
more academic responses when self-monitoring productivity 
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than when self-recording on-task behavior (e.g., Maag et al., 
1993; Lam et al., 1994; Reid & Harris, 1993). Additionally, 
most students prefer self-monitoring academic performance to 
self-recording on-task behavior (e.g., “You could say the words 
while you counted them.” [Harris et al., 2005, p. 151]). 

In general, we recommend teaching students to self- 
monitor a measure of academic productivity (e.g., number 
of problems or items attempted, number correct) rather than 
whether they are on task. This is because increasing on-task 
behavior, whether by self-monitoring or by contingent rein- 
forcement, does not necessarily result in a collateral increase 
in productivity (e.g., Marholin & Steinman, 1977; McLaughlin 
& Malaby, 1972; Wolfe et al., 2000). By contrast, when pro- 
ductivity is increased, improvements in on-task behavior almost 
always occur. However, a student whose persistent off-task 
and disruptive behaviors create problems for him or others in 
the classroom may benefit more from self-monitoring on-task 
behavior, at least initially. 


Self-Monitor Early and Often 


In general, each occurrence of the target behavior should be 
self-recorded as soon as possible. However, the act of self- 
monitoring a behavior the person wants to increase should not 
disrupt “the flow of the behavior” (Critchfield, 1999). Self- 
monitoring behaviors that produce natural or contrived response 
products (e.g., answers on academic worksheets, words written) 
can be conducted after the session by permanent product mea- 
surement (see Chapter 4). 

Relevant aspects of some target behaviors can be self- 
monitored even before the behavior has occurred. Self-recording 
a response early in a behavior chain that leads to an undesired 
behavior the person wants to decrease may be more effective 
in changing the target behavior in the desired direction than 
recording on the terminal behavior in the chain. For example, 
Rozensky (1974) reported that when a woman who had been 
a heavy smoker for 25 years recorded the time and place she 
smoked each cigarette, there was little change in her rate of 
smoking. She then began recording the same information each 
time she noticed herself beginning the chain of responses that 
led to smoking: reaching for her cigarettes, removing a cigarette 
from the pack, and so on. She stopped smoking within a few 
weeks of self-monitoring in this fashion. 

Generally, a person should self-monitor more often at 
the beginning of a behavior change program. If and as per- 
formance improves, self-monitoring can occur less often. For 
example, Rhode, Morgan, and Young ( 1983) had students with 
behavioral disorders begin self-evaluating (on a 0- to 5-point 
scale) the extent to which they followed classroom rules and 
completed academic work at 15-min intervals throughout the 
school day. Over the course of the study, the self-evaluation 
intervals were increased gradually, first to every 20 minutes, 
then every 30 minutes, then once per hour. Eventually, the self- 
evaluation cards were withdrawn and students orally reported 
their self-evaluations. In the final self-evaluation condition, the 
students verbally self-evaluated their compliance with classroom 
rules and academic work on an average of every 2 days (.¢., a 
variable ratio [VR] 2-day schedule). 
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Reinforce Accurate Self-Monitoring 


Some researchers have reported little correlation between the 
accuracy of self-monitoring and its effectiveness in changing 
the behavior being recorded (e.g., Kneedler & Hallahan, 1981; 
Marshall, Lloyd, & Hallahan, 1993). It appears that accurate 
self-monitoring is neither a sufficient nor a necessary condi- 
tion for behavior change. For example, Hundert and Bucher 
(1978) found that, even though students became highly accu- 
rate in self-scoring their arithmetic, their performance on the 
arithmetic itself did not improve. In contrast, in the Broden 
and colleagues (1971) study, both Liza and Stu’s behavior 
improved even though their self-recorded data seldom matched 
the data from independent observers. Nevertheless, accurate 
self-monitoring is desirable, especially when participants are 
using their self-recorded data as the basis for self-evaluation or 
self-administered consequences. 

Although some studies have shown that young children 
can accurately self-record their behavior in the absence of exter- 
nal contingencies for accuracy (e.g., Ballard & Glynn, 1975; 
Glynn et al., 1973), other researchers have reported low agree- 
ment between the self-reported data and the data collected by 
independent observers (Kaufman & O'Leary, 1972; Turkewitz, 
O’Leary & Ironsmith, 1975). One factor that seems to affect 
the accuracy of self-scoring is the use of self-reported scores as 
a basis for reinforcement. Santogrossi, O’ Leary, Romanczyk, 
and Kaufman (1973) found that, when children self-evaluated 
their own work and those self-produced evaluations determined 
levels of token reinforcement, their self-monitoring accuracy 
deteriorated over time. Similarly, Hundert and Bucher (1978) 
found that students who previously had accurately self-scored 
their arithmetic assignments greatly exaggerated their scores 
when higher scores resulted in points that could be exchanged 
for prizes. 

Rewarding children for producing self-recorded data that 
match the data of an independent observer and spot-checking 
students’ self-scoring reports can increase the accuracy of 
self-monitoring by young children. Drabman, Spitalnik, and 
O’Leary (1973) used these procedures in teaching children with 
behavior disorders to self-evaluate their own classroom behavior. 


Now something different is going to happen. If you get a 
rating within plus or minus one point of my rating, you can 
keep all your points. You lose all your points if you are off 
my rating by more than plus or minus one point. In addi- 
tion, if you match my rating exactly you get a bonus point. 
(O’Leary, 1977, p. 204) 


After the children demonstrated that they could evaluate 
their own behavior reliably, the teacher began checking only 
50% of the children’s self-ratings by pulling names from a hat at 
the end of the period, then 33% of the children, then 25%, then 
12%, During the last 12 days of the study, she did not check any 
child’s self-rating. Throughout this period of reduced checks and 
finally no checks at all, the children continued to self-evaluate 
accurately. Rhode and colleagues (1983) used a similar faded 
matching technique. 

Additional examples and techniques for teaching self- 
monitoring to children can be found in Joseph and Konrad (2009); 
McCollow, Curiel, Davis, and Sainato (2016); and Rafferty (2010). 


SELF-ADMINISTERED CONSEQUENCES 


Arranging for specific consequences to follow occurrences (or 
nonoccurrences) of one’s behavior is a fundamental strategy of 
self-management. In this section, we review a variety of tactics 
people have used to self-reinforce and self-punish. First, we 
briefly examine some of the conceptual issues raised by the 
concept of “‘self-reinforcement.” 


Is Self-Reinforcement Possible? 


Skinner (1953) pointed out that self-reinforcement should 
not be considered synonymous with the principle of operant 
reinforcement. 


The place of operant reinforcement in self-control is not 
clear. In one sense, all reinforcements are self-administered 
since a response may be regarded as “producing” its rein- 
forcement, but “reinforcing one’s own behavior” is more 
than this. . . . Self-reinforcement of operant behavior pre- 
supposes that the individual has it in his power to obtain 
reinforcement but does not do so until a particular response 
has been emitted. This might be the case if a man denied 
himself all social contacts until he had finished a particular 
job. Something of this sort unquestionably happens, but 

is it operant reinforcement? It is roughly parallel to the 
procedure in conditioning the behavior of another person. 
But it must be remembered that the individual may at any 
moment drop the work in hand and obtain the reinforce- 
ment. We have to account for his not doing so. It may be 
that such indulgent behavior has been punished—say, with 
disapproval—except when a piece of work has just been 
completed. (pp. 237-238) 


In his discussion of self-reinforcement, Goldiamond 
(1976a) continued with Skinner’s example and stated that the 
fact that a person “does not cheat, but engages in the task, cannot 
be explained simply by resort to his self-reinforcement by social 
contact, which he makes available contingent on his finishing 
the job” (p. 510). In other words, the variables influencing the 
controlling response—in this case abstaining from social contact 
until the job is complete—must still be accounted for; simply 
citing self-reinforcement as the cause is an explanatory fiction. 

The issue is not whether procedures labeled as “self- 
reinforcement” often work to change behavior in ways that 
resemble the defining effect of reinforcement; they do. However, 
a careful examination of instances of self-reinforcement reveals 
that something more than, or different from, a straightforward 
application of positive reinforcement is involved (e.g., Brigham, 
1980; Catania, 2013; Goldiamond, 1976a, 1976b). As a tech- 
nical term, self-reinforcement (as also, self-punishment) is a 
misnomer, and the issue is not merely a semantic one, as some 
writers have suggested (e.g., Mahoney, 1976). Assigning the 
effectiveness of a behavior change tactic to a well-understood 
principle of behavior when something other than, or in addition 
to, that principle is operating overlooks relevant variables that 
may hold the key to a fuller understanding of the tactic. Consid- 
ering the analysis of a behavioral episode complete once it has 
been identified as a case of self-reinforcement precludes further 
search for other relevant variables. 


We agree with Malott (2005a, 2012; Malott & Shane, 
2014), who argued that performance-management contingen- 
cies, whether designed and implemented by oneself or by others, 
are best viewed as rule-governed analogs of reinforcement and 
punishment contingencies because the response-to-consequence 
delay is too great.’ Malott (2005a) provided the following exam- 
ples of self-management contingencies as analogs to negative 
reinforcement and punishment contingencies: 


[Consider the] contingency, Every day I eat more than 
1,250 calories, I will give someone else $5 to spend 
frivolously, like a despised roommate or despised charity, 
though for many of us, the loss of $5 to a beloved charity 

is sufficiently aversive that the thought of it will punish 
overeating. [This] contingency is an analog to a penalty 
[punishment] contingency, an analog because the actual 
loss of the $5 will normally occur more than | minute 

after exceeding the 1,250 calorie limit. Such analog pen- 
alty contingencies are effective in decreasing undesirable 
behavior. And to increase desirable behavior, an analog to 
an avoidance [negative reinforcement] contingency works 
well: Every day I exercise for one hour, I avoid paying a 

$5 fine. But, if you have not finished your hour by midnight, 
you must pay the $5. (p. 519, emphasis in original, words in 
brackets added) 


Self-Administered Consequences 
to Increase Behavior 


A person can increase the target behavior in a self-management 
program by applying contingencies that are analogs to positive 
and negative reinforcement. Self-selected consequences can 
be delivered directly (the self-manager accesses the consequence 
herself) or indirectly (another person provides the consequence). 


Self-Management Analogs of 
Positive Reinforcement 


A variety of target behaviors have improved when students 
determined the number of tokens, points, or minutes of free time 
earned based on self-assessment of their performance (Glynn, 
1970; Koegel et al., 1992; Olympia et al., 1994). In a self- 
experiment, Craig (2010) decreased the number of times he bit 
his fingernails by awarding himself points using a token-based 
differential reinforcement of other behavior (DRO) procedure. 
A wristwatch with a countdown timer signaled 1-hr DRO inter- 
vals. The effects of interventions featuring self-administered 
rewards are difficult to evaluate because they are typically con- 
founded by other variables. In a review of self-reinforcement 
research, Jones, Nelson, and Kazdin (1977) identified partic- 
ipants’ previous history, criterion setting, self-monitoring, 
self-evaluation, external contingencies on the self-reinforcing 
response, external contingencies on the target behavior, and 
actual or suspected surveillance as likely confounding variables. 

A study by Ballard and Glynn (1975) controlled for self- 
monitoring as a contributor to the effects of self-reinforcement. 
Third-graders were taught after a baseline condition to self-score 
and self-record several aspects of their writing—number of sen- 
tences, number of describing words, and number of action words. 
Self-monitoring had no effect on any of the variables measured, 
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even though the students handed in their counting sheets with 
their writing each day. The children were then given a notebook 
in which to record their points, which could be exchanged at the 
rate of one point per minute for each student’s choice of activities 
during an earned-time period each day. The self-reinforcement 
procedure resulted in large increases in each of the three depen- 
dent variables. 

In studies on self-recruited reinforcement, students are 
taught to periodically self-evaluate their work and then show 
it to their teacher and request feedback or assistance (Alber, 
Heward, & Hippler, 1999; Craft, Alber, & Heward, 1998; Rouse 
et al., 2014). Ina sense, students administer their own reinforcer 
by recruiting the teacher’s attention, which often results in praise 
and other forms of reinforcement (see Alber & Heward, 2000 
for a review). 

Todd and colleagues (1999) taught an elementary stu- 
dent to use a self-management system that included self- 
monitoring, self-evaluation, and self-recruited reinforcement. 
Kyle was a 9-year-old boy diagnosed with a learning disability 
and receiving special education services in reading, math, and 
language arts. Kyle’s individualized education program (IEP) 
also included several objectives for problem behavior (e.g., 
disrupting independent and group activities, teasing and taunt- 
ing classmates, and saying sexually inappropriate comments). 
The classroom teacher’s goal setting and daily evaluation had 
proven ineffective. An “action team” conducted a functional 
assessment (see Chapter 27) and designed a support plan that 
included the self-management system. 

Kyle was taught how to use the self-management system 
during two 15-min one-on-one training sessions in which he 
practiced self-recording numerous role-played examples and 
nonexamples of on-task and off-task behavior and learned 
appropriate ways to recruit teacher attention and praise. For 
self-monitoring, Kyle listened with a single earplug to a 50-min 
cassette tape on which 13 checkpoints (e.g., “check one,” “check 
two”) had been prerecorded on a variable interval (VI) 4-min 
schedule (ranging from 3- to 5-min intervals between check- 
points). Each time Kyle heard a checkpoint, he marked a plus 
(if he had been working quietly and keeping his hands, feet, and 
objects to himself) or a zero (if he had been teasing peers and/or 
not working quietly) on a self-recording card. 

Todd and colleagues (1999) described the recruitment of 
teacher praise and how Kyle’s special program was integrated 
into the reinforcement system for the entire class: 


Each time Kyle marked three pluses on his card he raised 
his hand (during instruction) or walked up to the teacher 
(during group project time) and requested feedback on his 
performance. The teacher acknowledged Kyle’s good work 
and placed a mark on his self-monitoring card to show 
where he should begin a new count of three pluses. In addi- 
tion to these within-session contingencies, Kyle could earn 
a self-manager sticker at the end of each class period if he 
had no more than two zeroes for the period. Stickers were 
earned by all students in the class for appropriate behavior 
and were pooled weekly for class rewards. Because these 
stickers were combined, Kyle’s stickers were valued by all 
the students and provided opportunities for positive peer 
attention. (p. 70) 
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Figure 29.7 Problem behaviors by a 9-year-old boy during 10-min probes across two class periods 
during baseline (BL) and self-management (SM) conditions. 
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Procedures for the second self-management phase (SM2) 
were the same as for the first phase; during the third phase 
(SM3), Kyle used a 95-min tape on which 16 checkpoints 
were prerecorded on a VI 5-min schedule (ranging from 4 to 
6 minutes). When Kyle used the self-management system, the 
percentage of intervals in which he engaged in problem behavior 
was much lower than baseline (BL) levels (see Figure 29.7). 
Large increases in on-task behavior and completion of aca- 
demic work were also noted. An important outcome of this 
study was that Kyle’s teacher praised his behavior more often. 
Significantly, the self-management intervention was initiated in 
Class Period B (bottom graph in Figure 29.7) at the teacher’s 
request because she had noted an immediate and “dramatic 
change” in Kyle’s performance when he began self-monitoring 


and recruiting teacher praise in Class Period A. This outcome is 
strong evidence for the social validity of the self-management 
intervention. 


Self-Management Analogs of Negative 
Reinforcement 


Many successful self-management interventions involve 
self-determined escape and avoidance contingencies analogous 
to negative reinforcement. Most of the case studies featured in 
Malott’s (2012) book on self-management, /’// Stop Procrasti- 
nating When I Get Around to It, feature escape and avoidance 
contingencies in which emitting the target behavior enabled the 


person to avoid an aversive event. The table below provides 
examples: 


Target Behavior/Objective Self-Management Contingency 


Make a daily journal Each day Kelly failed to make 
entry of interesting things _a journal entry, she had to do 
that happened so as to a friend's chores, including 

have events to include doing the dishes and laundry. 

in a weekly letter to her (Garner, 2012) 

parents. 

Four 5-hr exercise Pin $5 bill to the corkboard on 
sessions (e.g., mountain the refrigerator and give it to 
bike, aerobics, strength my roommate at the end of the 
building) per week; week if the exercise goal remains 


3 sessions must be during unmet. (Haroff, 2012) 
the week and 1 session 
on the weekend. 


Practice guitar for half an Do 50 sit-ups at 11:00 pm on 

hour each day of the week Sundays for each day that | did 

before 11:00 pm. not practice during the week. 
(Knittel, 2012) 


At stickK.com individuals can design personalized “com- 
mitment contracts” in which they specify behavior change goals, 
set dates for completion, select referees to verify self-reports of 
task completion, and escrow a self-determined amount of money 
to lose if the terms of the contract are not met. In response to 
the uneasiness some people may feel about using negative rein- 
forcement to control their behavior, Malott (2012) made the 
following case for building “pleasant aversive control” into self- 
management programs. 


Aversive control doesn’t have to be aversive! . . . Here’s 
what I think you need in order to have a pleasant aver- 
sive control procedure. You need to make sure that the 
aversive consequence, the penalty, is small. And you need 
to make sure the penalty is usually avoided—the avoid- 
ance response needs to be one that the person will readily 
make, most of the time, as long as the avoidance procedure 
is in effect. 

Our everyday life is full of such avoidance proce- 
dures, and yet they don’t make us miserable. You don’t 
have an anxiety attack every time you walk through a 
doorway, even though you might hurt yourself if you 
don’t avoid bumping into the doorjamb. And you don’t 
break out in a cold sweat every time you put the left- 
overs in the refrigerator, and thereby avoid leaving them 
out to spoil. So that leads us to the Self-Management 
Rule: Don’t hesitate to use an avoidance procedure to 
get yourself to do something you want to do anyway. 
Just be sure the aversive outcome is as small as possible 
(but not so small it’s ineffective) and the response is easy 
to make. (p. 80) 


Self-Administered Consequences 
to Decrease Behavior 


Self-administered consequences analogous to positive punish- 
ment or negative punishment can decrease the occurrence of 


undesirable behavior. 
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Self-Management Analogs of Positive Punishment 


A person can decrease the rate of an undesired behavior by fol- 
lowing each occurrence with the onset of painful stimulation 
or aversive activity. Mahoney (1971) reported a case study in 
which a man beleaguered by obsessive thoughts wore a heavy 
rubber band around his wrist. Each time he experienced an 
obsessive thought, he snapped the rubber band, delivering a 
brief, painful sensation to his wrist. A 15-year-old girl who had 
compulsively pulled her hair for 2 years, to the point of creating 
bald spots, also used contingent self-delivered snaps of a rubber 
band on her wrist to stop her habit (Mastellone, 1974). Another 
woman stopped her hair pulling by performing 15 sit-ups each 
time she pulled her hair or had the urge to do so (MacNeil & 
Thomas, 1976). 

Self-administering a positive practice overcorrection pro- 
cedure also qualifies as an example of self-administered posi- 
tive punishment. Heward, Dardig, and Rossett (1979) described 
the case of a teenage girl who frequently said don’t instead of 
doesn’t with the third person singular (e.g., “She don’t like that 
one.’”) and used this form of self-administered positive punish- 
ment to decrease the rate of her speech error. Each time the 
girl observed herself saying don’t when she should have said 
doesn’t, she covertly repeated the complete sentence she had 
just spoken 10 times in a row, using correct grammar. She wore 
a wrist counter that reminded her to listen to her speech and to 
keep track of the number of times she employed the positive 
practice procedure. 


Self-Management Analogs of Negative Punishment 


Self-administered analogs to negative punishment consist of 
arranging the loss of reinforcers (response cost) or denying oneself 
access to reinforcement for a specified period (time-out) contin- 
gent on the occurrence of the target behavior. Response cost and 
time-out contingencies are widely used self-management strat- 
egies. The most commonly applied self-administered response 
cost procedure is paying a small fine each time the target behavior 
occurs. In one study, smokers reduced their rate of smoking by 
tearing up a dollar bill each time they lit up (Axelrod, Hall, Weis, 
& Rohrer, 1971). Response cost procedures have also been used 
effectively with elementary school children who self-determined 
the number of tokens they should lose for inappropriate social 
behavior (Kaufman & O’Leary, 1972) or for poor academic work 
(Humphrey, Karoly, & Kirschenbaum, 1978). 

James (1981) taught an 18-year-old man who had stut- 
tered severely since the age of 6 to use a time-out from speaking 
procedure. Whenever he observed himself stuttering, the young 
man immediately stopped talking for at least 2 seconds, after 
which he could begin speaking again. His rate of dysfluencies 
decreased markedly. If talking is reinforcing, then this procedure 
might function as time-out (e.g., not allowing oneself to engage 
in a preferred activity for a period of time). 


Recommendations for Self-Administered 
Consequences 


Persons designing and implementing self-administered conse- 
quences should consider the following recommendations. 
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Select Small, Easy-to-Deliver Consequences 


Both rewards and penalties used in self-management programs 
should be small and easily delivered. A common mistake in 
designing self-management programs is selecting consequences 
that are big and grand. Although a person may believe that the 
promise of a large reward (or threat of a severe aversive event) 
will motivate her to meet her self-determined performance cri- 
teria, large consequences often work against the program’s suc- 
cess. Self-selected rewards and punishing consequences should 
not be costly, elaborate, time-consuming, or too severe. If they 
are, the person may not be able (in the case of grand rewards) 
or willing (in the case of severe aversive events) to deliver them 
immediately and consistently. 

In general, it is better to use small consequences that can 
be obtained immediately and frequently. This is particularly 
important with consequences intended to function as punishers, 
which—to be most effective—must be delivered immediately 
every time the behavior targeted for reduction is emitted. 


Set a Meaningful but Easy-to-Meet 
Criterion for Reinforcement 


When designing contingencies involving self-administered 
consequences, a person should guard against making the same 
two mistakes practitioners often make when implementing rein- 
forcement contingencies with clients: (1) setting expectations so 
low that improvement in the current level of performance is not 
necessary to obtain the self-administered reward, or (2) making 
the initial performance criterion too high (the more common 
mistake), thereby effectively programming an extinction contin- 
gency that may cause the person to give up on self-management 
altogether. Keys to the effectiveness of any reinforcement-based 
intervention are setting an initial criterion that ensures that the 
person’s behavior makes early contact with reinforcement 
and that continued reinforcement requires improvements over 
baseline levels. The criterion-setting formulae (Heward, 1980) 
described in Chapter 11 provide guidelines for making those 
determinations. 


Eliminate “Bootleg Reinforcement” 


Bootleg reinforcement—access to the specified reward or to 
other equally reinforcing items or events without meeting the 
response requirements of the contingency—is a common down- 
fall of self-management projects. A person with a comfortable 
supply of smuggled rewards is less likely to work hard to earn 
response-contingent rewards. 

Bootleg reinforcement is common when people use every- 
day preferred activities and treats as rewards in self-management 
programs. Although everyday expectations and enjoyments are 
easy to deliver, a person may find it difficult to withhold things 
he is used to enjoying on a regular basis. For example, a man 
used to unwinding at the end of the day by watching Baseball 
Tonight with a beer and a bowl of peanuts may not be consistent 
in making those daily treats contingent on meeting the require- 
ments of his self-management program. 

One method for combating this form of bootleg reinforce- 
ment is to retain free access to activities or items routinely enjoyed 


in the past and make alternative items or activities that are a cut 
above the usual contingent on meeting performance criteria. For 
example, each time the man in the previous scenario meets his 
performance criteria, he could replace his everyday brand of beer 
with his choice of a specialty beer from a collection reserved in 
the back of the refrigerator (e.g., Bell’s Two Hearted IPA). 


If Necessary, Put Someone Else in Control of 
Delivering Consequences 


Most ineffective self-management programs fail not because 
the controlling behavior is ineffective in controlling the con- 
trolled behavior, but because the contingencies that control the 
controlling behavior are not sufficiently strong. In other words, 
the person does not emit the controlling behavior consistently 
enough to realize its effects. What keeps a person from rational- 
izing that she did most of the behaviors she was supposed to 
and obtaining a self-determined reward anyway? What keeps a 
person from failing to deliver a self-determined aversive con- 
sequence? Too often, the answer to both questions is—nothing. 

A person who really wants to change her behavior but 
has difficulty following through with delivering self-determined 
consequences should enlist another person to serve as perfor- 
mance manager. A self-manager can ensure her self-designed 
consequences will be administered faithfully by creating a con- 
tingency in which the consequence for failing to meet the per- 
formance criterion is both aversive to her and a reinforcer for 
the person she has asked to implement the consequence. If the 
first person put in charge of the contingency fails to implement 
it as planned, the self-manager should find another person who 
will do the job. 


Christie wanted to walk on her dust-gathering treadmill 
20 minutes a day, 6 days a week. She tried her husband 
as her performance contractor, but he wasn’t tough 
enough; he always felt sorry for her. So she fired him 
and hired her son; she made his bed every time she failed 
to do her 20 minutes, and he showed no mercy. (Malott, 
2012, pr432) 


Internet-based self-deposit contracts for cigarette smok- 
ing reduction or abstinence (Dallery, Raiff, & Grabinski, 
2013; Jarvis & Dallery, 2017) or those created on stickK. 
com ensure that self-determined consequences are carried 
out. Kanfer (1976) called this kind of self-management deci- 
sional self-control: The person makes the initial decision to 
alter her behavior and plans how that will be accomplished, 
but then turns over the procedure to a second party to avoid 
the possibility of not emitting the controlling response. Kanfer 
made a distinction between decisional self-control and pro- 
tracted self-control, in which a person consistently engages in 
self-deprivation to effect the desired behavior change. Bellack 
and Hersen (1977) stated that decisional self-control “is gener- 
ally considered to be less desirable than protracted self-control, 
as it does not provide the individual with an enduring skill or 
resource” (p. 111). 

We disagree that a self-management program involving 
the help of other people is less desirable than one in which the 
self-manager does everything. First, a self-management program 


in which the contingencies are turned over to someone else may 
be more effective than trying to “go it alone” because the other 
person is More consistent in applying the consequences. In addi- 
tion, as a result of experiencing a successful self-management 
program in which he has chosen the target behavior, set per- 
formance criteria, determined a self-monitoring/self-evaluation 
system, and arranged to have someone else administer self- 
designed consequences, a person has acquired a considerable 
repertoire of self-management skills for future use. 


Keep It Simple 


A person should not create elaborate self-management con- 
tingencies if they are not needed. The same general rule that 
applies to behavior change programs designed on behalf of 
others—employ the least complicated and intrusive, yet effective, 
intervention—also applies to self-management programs. 


OTHER SELF-MANAGEMENT TACTICS 


Other self-management strategies used by behavior analysts are 
not so easily classified according to the four-term contingency. 
They include self-instruction, habit reversal, self-directed sys- 
tematic desensitization, and massed practice. 


Self-Instruction 


People talk to themselves all the time, offering encouragement 
(e.g., “I can do this; ve done it before”), congratulations (e.g., 
“Great shot, Daryl! You just crushed that 5-iron!””), and admon- 
ishment (e.g., “Don’t say that anymore; you hurt her feelings”) 
for their behavior, as well as specific instructions (e.g., “Pull 
the bottom rope through the middle”). Such self-statements can 
function as controlling responses—verbal mediators—that affect 
the occurrence of other behaviors. Self-instruction consists of 
self-generated verbal responses, covert or overt, that function as 
response prompts for a desired behavior. As a self-management 
tactic, self-instructions are often used to guide oneself through 
a behavior chain or sequence of tasks. 

Bornstein and Quevillon (1976) conducted a study fre- 
quently cited as evidence of the positive and lasting effects of 
self-instruction. They taught three hyperactive preschool boys a 
series of four types of self-instruction designed to keep them on 
task with classroom activities: 


1. Questions about the assigned task (e.g., “What does the 
teacher want me to do?”) 

2. Answers to the self-directed questions (e.g., “I’m sup- 
posed to copy that picture.”) 

3. Verbalizations to guide the child through the task at hand 
(e.g., “OK, first I draw a line through here.”) 


4. Self-reinforcement (e.g., “I really did that one well.’) 
During a 2-hr session, the children were taught to use 


self-instructions incorporating a sequence of training steps orig- 
inally developed by Meichenbaum and Goodman (1971): 


1. The experimenter modeled the task while talking aloud 
to himself. 
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2. The child performed the task while the experimenter pro- 
vided the verbal instructions. 


3. The child performed the task while talking aloud to 
himself with the experimenter whispering the instruc- 
tions softly. 


4. The child performed the task while whispering softly to 
himself with the experimenter moving his lips but making 
no sound. 


5. The child performed the task while making lip movements 
but no sound. 


6. The child performed the task while guiding his perfor- 
mance with covert instructions. (adapted from p. 117) 


During the training session, a variety of classroom tasks 
were used, ranging from simple motor tasks such as copy- 
ing lines and figures to more complex tasks such as block 
design and grouping tasks. The children showed a marked 
increase in on-task behavior immediately after receiving self- 
instruction training, and their improved behavior was main- 
tained over a considerable period. The authors suggested that 
the generalization obtained from the training setting to the 
classroom could have resulted from telling the children during 
training to imagine they were working with their teacher, not 
the experimenter. The authors hypothesized that a behavioral 
trapping phenomenon (Baer & Wolf, 1970) may have been 
responsible for the maintenance of on-task behavior; that is, 
the self-instructions may have initially produced better behay- 
ior, which in turn produced teacher attention that maintained 
the on-task behavior. 

Although some studies evaluating self-instruction failed 
to reproduce Bornstein and Quevillon’s (1976) impressive 
results (e.g. Billings & Wasik, 1985; Friedling & O'Leary, 
1979), other studies have yielded generally encouraging results 
(Burgio, Whitman, & Johnson, 1980; Hughes, 1992; Kosiewicz, 
Hallahan, Lloyd, & Graves, 1982). Self-instruction training has 
increased the rate of high school students’ initiating conver- 
sations with familiar and unfamiliar peers (Hughes, Harmer, 
Killian, & Niarhos, 1995; see Figure 30.5). 

Employees with disabilities have learned to self-manage 
their work performance by providing their own verbal prompts 
and self-instructions (Hughes, 1997). For example, Salend, 
Ellis, and Reynolds (1989) used a self-instruction strategy to 
teach four adults with severe cognitive disabilities to “talk while 
you work.” Productivity increased dramatically and error rates 
decreased when the women verbalized to themselves, “Comb 
up, comb down, comb in bag, bag in box” while packaging 
combs in plastic bags. Hughes and Rusch (1989) taught two 
supported employees working at a janitorial supply company 
how to solve problems by using a self-instruction procedure 
consisting of four statements: 


1. Statement of the problem (e.g., “Tape empty”) 


2. Statement of the response needed to solve the problem 
(e.g., “Need more tape’) 


3. Self-report (e.g., “Fixed it”) 


4. Self-reinforcement (e.g., “Good”) 
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O’Leary and Dubey (1979) summarized their review of 
self-instruction training by suggesting four factors that appear 
to influence its effectiveness with children. 


Self-instructions appear to be effective self-controlling 
procedures if the children actually implement the 
instructional procedure, if the children use them 

[the self-instructions] to influence behavior at which 
they are skilled, if children have been reinforced for 
adhering to their self-instructions in the past, and if the 
focus of the instructions is the behavior most subject to 
consequences. (p. 451) 


Habit Reversal 


In his discussion of self-control, Skinner (1953) included “doing 
something else” as a self-management tactic. In an interesting 
application of “doing something else,” Robin, Schneider, and 
Dolnick (1976) taught 11 primary-age children with emotional 
and behavioral disorders to control their aggressive behaviors 
by using the turtle technique: The children pulled their arms 
and legs close to their bodies, put their heads down on their 
desks, relaxed their muscles, and imagined they were turtles. 
The children were taught to use the turtle response whenever 
they believed an aggressive exchange with someone else was 
about to take place, when they were angry with themselves and 
felt they were about to have a tantrum, or when the teacher or a 
classmate called out, “Turtle!” 

Azrin and Nunn (1973) developed an intervention they 
called habit reversal, in which clients are taught to self- 
monitor their nervous habits and interrupt the behavior chain 
as early as possible by engaging in behavior incompatible 
with the problem behavior (i.e., doing something else). For 
example, when a nail biter observes herself beginning to bite 
her fingernails, she might squeeze her hand into a tight fist 
for 2 or 3 minutes (Azrin, Nunn, & Frantz, 1980) or sit on 
her hands (Long, Miltenberger, Ellingson, & Ott, 1999). Habit 
reversal is typically implemented as a treatment package that 
includes self-awareness training involving response detection 
and procedures for identifying events that precede and trig- 
ger the response and competing response training. Motivation 
techniques including self-administered consequences, social 
support systems, and procedures for promoting the generaliza- 
tion and maintenance of treatment gains may also be included. 
Habit reversal has proven an effective self-management tactic 
for a wide variety of problem behaviors (Miltenberger, Fuqua, 
& Woods, 1998). 

Mancuso and Miltenberger (2016) conducted a study 
using habit reversal with six female college students seeking 
to improve their public speaking skills. The students filled 
pauses by uttering nonsense syllables such as “uh,” “um,” or 
“er”; tongue clicking (pressing the tongue against the roof of 
the mouth and releasing the pressure to make an audible “tsk” 
or click sound); and interjecting the word “like.” These target 
behaviors were recorded during 3- to S-min speeches throughout 
the study. Each session the researcher randomly selected 2 top- 
ics from a list of 25 (e.g., my favorite movie, my first job) and 
instructed the participant to choose 1. The participant was then 


given 10 minutes to prepare an outline for a 5-min speech. Par- 
ticipants were allowed to use their outlines during their speeches. 

Intervention consisted of one-on-one habit reversal train- 
ing sessions with the researcher. Response detection began with 
each participant identifying occurrences of the target behaviors 
in 3-min video clips of her baseline speeches. The participant 
then practiced detecting the target behaviors during live speeches 
by raising her right hand each time she engaged in one of the 
behaviors and her left hand when becoming aware that she was 
about to emit one of the behaviors. Response detection training 
ended when the participant identified 100% of target behavior 
occurrences in one speech or 85% of the occurrences in two 
consecutive speeches. 

Next, each participant practiced emitting competing 
responses for each target behavior: for “uh,” “um,” or “er,” a 
3-sec silent pause; for tongue clicking, placing the tongue 
against the inside of the bottom teeth for 3 seconds; for interject- 
ing “like,” beginning the sentence again without “like.” Compet- 
ing response training concluded when the participant achieved 
an 80% reduction in the target behavior from her baseline mean 
during a speech. 

Habit reversal training resulted in an immediate and lasting 
decrease in filled pauses by the participants (see Figure 29.8). 
The participants emitted moderate to high rates of filled pauses 
during baseline (mean across participants of 7.4 responses per 
minute). Following habit reversal training, the mean rate of filled 
pauses by the participants was 1.4 responses per minute. Each 
participant maintained her improved performance in a follow-up 
speech 2 to 5 weeks later. 


Self-Directed Systematic Desensitization 


Systematic desensitization is a widely used behavior therapy 
treatment for anxieties, fears, and phobias that features the 
self-management strategy of engaging in an alternative behav- 
ior (i.e., doing something else). Originally developed by Wolpe 
(1958, 1973), systematic desensitization involves substituting 
one behavior, generally muscle relaxation, for the unwanted 
behavior—the fear and anxiety. The client develops a hierarchy 
of situations from the least to the most fearful and then learns to 
relax while imagining these anxiety-producing situations, first 
the least fearful situation, then the next fearful one, and so on. 
Figure 29.9 shows an anxiety-stimulus hierarchy that a person 
might develop in attempting to control a fear of cats. When a 
person is able to go completely through his hierarchy, imagining 
each scene in detail while maintaining deep relaxation and feel- 
ing no anxiety, he begins to expose himself gradually to real-life 
situations. 

Detailed procedures for achieving deep muscle relaxation, 
constructing and validating a hierarchy of anxiety- or fear-pro- 
ducing situations, and implementing a self-directed systematic 
desensitization program can be found in Martin and Pear (2015) 
and Wenrich, Dawley, and General (1976). 


Massed Practice 


Forcing oneself to perform an undesired behavior again and 
again, a technique called massed practice, will sometimes 
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Figure 29.8 Effects of habit reversal training 

on the rate of filled pauses during speeches 

Jen by six university students during baseline and 
after habit reversal training. The downward 
arrow represents sessions conducted after Anne 
received a booster session. Square data points 
represent 2- to 5-week follow-up speeches. 

From “Using Habit Reversal to Decrease Filled Pauses in Public 
Speaking.” by C. Mancuso and R. G. Miltenberger, 2016, 
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decrease the future occurrences of the behavior. Wolff (1977) 
reported an interesting case of this form of treatment by a 
20-year-old woman who engaged in a compulsive, ritualized 
routine of 13 specific security checks every time she entered 
her apartment (e.g., looking under beds, checking the closets, 
looking in the kitchen). She began her program by deliberately 
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going through the 13 steps in an exact order and then repeating 
the complete ritual 4 more times. After doing this for 1 week, 
she permitted herself to check the apartment if she wanted to 
but made herself go through the entire routine 5 times whenever 
she did any checking at all. She soon stopped performing her 
compulsive checking behavior. 
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Figure 29.9 Sequence of imaginary scenes concerning fear of cats that could be used for 


systematic self-desensitization. 


Directions 


. You're sitting in a comfortable chair in the safety of your home watching TV. 
. You'te watching a commercial for cat food—no cat is visible. 


. The commercial continues and a cat is now eating the food. 


. Aman is now petting the cat. 
. Aman is holding the cat and fondling It. 


. Awoman is holding the cat, and the cat is licking her hands and face. 

. You're looking out the window of your home and you see a cat on the lawn across the street. 
. You're sitting in front of your house, and you see a cat walk by on the sidewalk across the street. 
. You're sitting in your yard, and you see a cat walk by on your sidewalk. 


. Acat walks within 15 feet of you. 


. A friend of yours picks the cat up and plays with it. 


. Your friend is 10 feet away from you, and the cat is licking his face. 


. Your friend comes within 5 feet of you while he’s holding the cat. 


. Your friend stands 2 feet away and plays with the cat. 


. Your friend asks if you'd like to pet the cat. 


. Your friend reaches out and offers you the cat. 


. He puts the cat on the ground, and it walks over to you. 


. The cat rubs up against your leg. 

. The cat walks between your legs purring. 
. You reach down and touch the cat. 

. You pet the cat. 

. You pick up the cat and pet it. (p. 71) 


From "Self-Directed Systematic Desensitization: A Guide for the Student, Client, and Therapist” by W. W. Wen- 
rich, H. H. Dawley, and D. A. General, 1976, p. 71. Copyright 1976 by Behaviordelia, Kalamazoo, MI. Used by 
Permission. 


SUGGESTIONS FOR EFFECTIVE 
SELF-MANAGEMENT PROGRAMS 


Incorporating the suggestions that follow into the design and 
implementation of self-management programs should increase 
the likelihood of success. Although none of these guidelines 
has been rigorously examined through experimental analyses— 
research on self-management has a long way to go—each sug- 
gestion is consistent with procedures proven effective in other 
applications of behavior analysis and with “best practices” com- 
monly recommended in the self-management literature (e.g., 
Malott, 2012; Martin & Pear, 2015; Watson & Tharp, 2014). 


1. Specify a goal and define the behavior to be changed. 
2. Self-monitor the behavior. 


3. Contrive contingencies that will compete with natural 
contingencies. 


4. Go public with your commitment to change your behavior. 
5. Get a self-management partner. 


6. Continually evaluate your self-management program and 
redesign it as necessary. 


Specify a Goal and Define the Target Behavior 


A self-management program begins with identifying a personal 
goal or objective and the specific behavior changes necessary 
to accomplish that goal or objective. A person can use most of 
the questions and issues that practitioners should consider when 
selecting target behaviors for students or clients (see Chapter 3) 
to assess the social significance and prioritize the importance of 
a list of self-determined target behavior changes. 


Self-Monitor the Behavior 


A person should begin self-monitoring as soon as he has defined 
the target behavior. Self-monitoring prior to implementing any 
other form of intervention yields the same benefits as does tak- 
ing baseline data described in Chapter 7. 


1. Self-monitored baseline data provide information about 
antecedent-behavior-consequent correlations that may be 
helpful in designing an effective intervention. 


2. Self-monitored baseline data provide guidance for set- 


ting initial performance criteria for self-administered 
consequences. 


3. Self-monitored baseline data provide an objective basis 
for evaluating the effects of any subsequent interventions. 


. Another reason to begin self-monitoring as soon as pos- 
sible without using additional self-management tactics is that 


the desired improvement in behavior may be achieved by self- 
monitoring alone. 


Contrive Contingencies That Will Compete 
with Ineffective Natural Contingencies 


When self-monitoring alone does not result in the desired behav- 
ior change, the next step is designing a contrived contingency to 
compete with the natural contingencies. A person who imple- 
ments a contingency that provides immediate, definite conse- 
quences for each occurrence (or perhaps, nonoccurrence) of the 
target behavior greatly increases the probability of obtaining 
a previously elusive self-management goal. For example, a 
smoker who self-records and reports each cigarette smoked to 
his self-management partner, who provides contingent praise, 
rewards, and penalties, has arranged immediate, frequent, and 
more effective consequences for reducing his smoking behavior 


than those provided by the natural contingency: the threat of 


emphysema and death by lung cancer in the future, with neither 
outcome becoming appreciably more immediate or likely from 
one cigarette to the next. 


Go Public 


Publicly sharing the intentions of a self-management effort 
may enhance its effectiveness. When a person shares a goal or 
makes a prediction to others about her future behavior, she has 
arranged potential consequences—praise or condemnation—for 
her success or failure in meeting that goal. A person should 
state in specific terms what she intends to do and her deadline 
for completing it. Taking the idea of public commitment a step 
further, consider the powerful potential of public posting (e.g., 
an untenured junior faculty member could post a chart of her 
writing where her department chair or dean could see and com- 
ment on it). 


Malott (1981) called this the “public spotlight principle of 


self-management.” 


A public statement of goals improves performance. 
But just how do the social contingencies engaged 

by public commitment produce change? They 
increase the rewarding value of success and the aver- 
sive value of failure, I presume. But those outcomes 
are probably too delayed to directly reinforce prob- 
lem solving. Instead they must be part of the rules the 
student self-states at crucial times: “If I don’t make 
my goal, I'll look like a fool; but if I do make it, Pll 
look pretty cool.” Such rules then function as cues 
for immediate self-reinforcement of on-task behavior 
and self-punishment of off-task behavior. (Volume II, 


No. 18, p. 5) 


Skinner (1953) also theorized about the principles of 
behavior that operate when a person shares a self-management 
goal with important others. 
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By making it in the presence of people who supply aversive 
stimulation when a prediction is not fulfilled, we arrange 
consequences which are likely to strengthen the behavior 
resolved upon. Only by behaving as predicted can we 


escape the aversive consequences of breaking our resolu- 
tion. (p. 237) 


Get a Self-Management Partner 


Setting up a self-management exchange is a good way to 
involve another person whose differential feedback about how 
a self-management project is going can be effective as a behay- 
ioral consequence. Two people, each of whom has a long-range 
goal or a regular series of tasks to do, agree to interact with each 
other—in person or by phone, text, e-mail, or social media—on 
a daily or weekly basis, as determined by their schedules, pref- 
erences, the target behaviors, and each person’s progress. They 
share their self-monitoring data and exchange verbal praise or 
admonishments and perhaps even more tangible consequences 
contingent on performance. Malott (1981) reported a successful 
self-management exchange in which he and a colleague paid 
each other $1 every time either person failed to complete any 
one of a series of self-determined daily exercise, housekeeping, 
and writing tasks. Each morning they spoke on the telephone 
and reported their performance during the previous 24 hours. 

To help themselves and each other study for exams and 
complete research and writing tasks, a group of doctoral students 
created a self-management exchange they called the Disserta- 
tion Club (Ferreri et al., 2006). At weekly meetings, each group 
member reported data on the “scholarly behaviors” she had tar- 
geted (e.g., graphs of words written each day or number of hours 
studied). Group members provided social support to one another 
in the form of encouragement to continue working hard and 
praise for accomplishments, served as behavioral consultants 
to one another on the design of self-management interventions, 
and sometimes administered rewards and fines to one another. 
All six members of the group wrote and successfully defended 
their dissertations within the time lines they had set. 


Continually Evaluate and Redesign 
Program as Needed 


The biggest problem with self-management . . . is that the 
systems are liable to gradually deteriorate . . . So, you must 
always be on top of it. And you must be prepared with duct 
tape and bubble gum to stick the system back together. 
—R. W. Malott (2012, p. 31) 


Most self-management programs embody a pragmatic, data- 
based approach to personal problem solving more than rigorous 
research with an emphasis on experimental analysis and control. 
Like researchers, however, self-managers should be guided by 
the data. If the data show the program is not working satisfacto- 
rily, the intervention should be redesi ened. 

People who have learned to define, observe, record, and 
graph their own behavior can evaluate their self-management 
efforts with self-experiments. A simple A-B design (Chapter 7) 
provides a straightforward accounting of the results in a before- 
and-after fashion and is usually sufficient for self-evaluation. 
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Experimentally determining functional relations between 
self-managed interventions and their effects, though valuable, 
usually takes a back seat to the pragmatic goal of changing one’s 
behavior. However, the changing criterion design (Chapter 9) 
lends itself nicely not only to the stepwise increments in per- 
formance that are often a part of improving personal perfor- 
mance, but also to a clearer demonstration and understanding 
of the relation between the intervention and changes in the 
target behavior. 

In addition to data-based evaluation, people should judge 
the social validity of their self-management projects (Wolf, 
1978). Practitioners teaching self-management to others can help 
their clients assess the social validity of their self-management 
efforts by providing checklists of questions covering topics such 
as how practical their intervention was, whether they believed 
their self-management program affected their behavior in any 
unmeasured ways, and whether they enjoyed the project. 

One of the more important aspects of social validity for 
any behavior change program is the extent to which the results— 
measured changes in the target behavior—actually make a dif- 
ference in the lives of the participants. One approach to assessing 
the social validity of the results of a self-management program 
is to collect data on what Malott (2012) called benefit measures. 
For example, a person might measure: 


¢ Number of pounds lost as a benefit measure of eating less 
or exercising more 

¢ Improved lung capacity (peak airflow volume measured 
as cubic centimeters per second on an inexpensive device 
available at many pharmacies) as an outcome of reducing 
the number of cigarettes smoked 

¢ Decrease in time it takes to run a mile as a benefit of num- 
ber of miles run each day 

¢ Lower resting heart rate and faster recovery time as a ben- 
efit of aerobic exercise 

¢ Higher scores on foreign language practice exams as an 
outcome of studying 


In addition to assessing the social validity of self- 
management interventions, the positive results of benefit mea- 
sures may serve as consequences that reward and strengthen 
continued adherence to self-management. 

Box 29.2: “Take a Load Off Me” describes a self- 
management weight loss program that incorporated many of 
the tactics and suggestions described in this chapter. 


BEHAVIOR CHANGES BEHAVIOR 


Referring to a book on self-management he authored for a pop- 
ular audience, Epstein (1997) wrote: 


A young man whose life is in disarray (he smokes, drinks, 
overeats, loses things, procrastinates, and so on) seeks 
advice from his parents, teachers, and friends, but no one 
can help. Then the young man remembers his Uncle Fred 
(modeled shamelessly after Fred Skinner), whose life always 
seemed to be in perfect harmony. In a series of visits, Uncle 
Fred reveals to him the three “secrets” of self-management, 
all Ms: Modify your environment, monitor your behaviors, 
and make commitments. Fred also reveals and explains the 
“self-management principle”: Behavior changes behavior. 
After each visit, the young man tries out a new technique, 
and his life is changed radically for the better. In one scene, 
he sees a classroom of remarkably creative and insightful 
children who have been trained in self-management tech- 
niques in a public school. It is fiction, of course, but the 
technology is well established and the possibilities are 

well within reach. (p. 563, emphasis in original) 


Building upon Skinner’s (1953) conceptual analysis of 
self-control, applied behavior analysts have developed numer- 
ous self-management tactics and methods for teaching learners 
with diverse abilities how to apply them. Underlying all of these 
efforts and findings is the simple, yet profound principle that 
behavior changes behavior. 


BOX 29.2 


Take a Load Off Me: A Self-Managed Weight Loss Program 


Joe was a 63-year-old whose doctor had recently told him that 
the 195 pounds on his 5-foot 11-inch frame must be reduced to 
175 pounds or he would likely experience serious health prob- 
lems. Although Joe got regular physical exercise by keeping 
up the lawn and garden, cutting and hauling firewood, feeding 
the rabbits, and doing other chores that come with maintaining 
a farmhouse, his prodigious appetite—long considered one of 
the “best in the county”—had caught up with him. His doc- 
tor’s warning and the recent death of a high school classmate 
scared Joe enough to sit down with his son and plan a self- 
management weight-loss program. 

Joe’s program was a package intervention that included 
antecedent-based tactics, self-monitoring, self-evaluation, 
contingency contracting, self-selected and self-administered 
consequences, and contingency management by significant 
others. 


Goal 


To reduce the current weight of 195 pounds to 175 pounds at 
arate of 1 pound per week. 


Behavior Change Needed to Achieve Goal 


Reduce eating to a maximum 2100 calories per day. 
Rules and Procedures 


1. Mount the bathroom scale each morning before dressing 
or eating and chart the weight on “Joe’s Weight” graph 
taped to the bathroom mirror. 

2. Carry a notepad, pencil, and calorie counter in a pocket 
throughout the day and record the type and amount of 
all food and liquid (other than water) immediately after 
consumption. 
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3. Before going to bed each night, add the total calories and stamped postcards. Helen will verify your self-report 

Bete aoe the day and chart that number on by initialing the postcard and mailing it to Bill and Jill 
Joe’s Eating” graph taped to the bathroom mirror. on Monday. 

4. Do not waiver from Steps 1-3, regardless of weight loss ¢ Each Monday, if total calories consumed met the crite- 


or lack of weight loss. rion for at least 6 of the past 7 days, obtain an item or 


activity from “Joe’s Reward Menu.” 
Immediate Contingencies/Consequences 


* The calorie counter, notepad, and pencil in the pocket Intermediate Contingencies/Consequences 


provide continuously available response prompts. ¢ Eat within the daily calorie limit often enough and there 

° Recording all food and drink consumed provides an will be sufficient money in “Joe’s Garden Jar” to buy 
immediate consequence. seeds and plants for the spring vegetable garden. 

¢ When you have not eaten any food you thought about ¢ If your weight during the May visit to Ohio represents a 
eating, put a star on your notepad and give self-praise loss of at least | pound per week, be a guest of honor at 
(“Way to go, Joe!”’). a restaurant of your choice. 

Daily Contingencies/Consequences Long-term Consequences 

¢ If the calories consumed during the day do not exceed ¢ Feel better. 
the 2100 criterion, put 50 cents in “Joe’s Garden Jar.” ¢ Look better. 

e If total calories exceed the criterion, remove $1.00 from ¢ Be healthier. 
“Joe’s Garden Jar.” ¢ Live longer. 

¢ For each 3 days in a row that you meet the calorie crite- 
rion, add a bonus of 50 cents to “Joe’s Garden Jar.” Results 


¢ Have Helen initial your contract each day that you meet 


ae a Joe often struggled with the rules and procedures, but he stuck 
the calorie criterion. 


with his self-management program and lost 22 pounds (achiev- 

ing a weight of 173) in 16 weeks. Joe lived 30 active years 

following his adventure in self-management. He retained his 

¢ Each Sunday night, write the calories consumed each day weight at 175 pounds and enjoyed gardening, singing in a bar- 
for the previous week on one of the predated, addressed, bershop choir, and listening to the Cubs on the radio. 


Weekly Contingencies/Consequences 


The “Self” as Behavior Controller 6. Although self-control and self-management often appear 
interchangeably in the behavioral literature, we recom- 


mend that self-management be used in reference to a 
person acting in some way in order to change his sub- 
sequent behavior. Three reasons for this recommendation 
are as follows: 


1. We tend to assign causal status to events that immediately 
precede behavior, and when causal variables are not read- 
ily apparent in the immediate, surrounding environment, 
the tendency to turn to internal causes is strong. 


2. Hypothetical constructs such as willpower and drive are © Self-control implies that the ultimate control of behav- 
explanatory fictions that bring us no closer to understand- ior lies within the person, but the causal factors for 
ing the behaviors they claim to explain and lead to circular “self-control” are to be found in a person’s experiences 
reasoning. with his environment. 

3. Skinner (1953) conceptualized self-control as a two- ¢ Self-control as a causal factor can serve as an explana- 
response phenomenon: The controlling response attects tory fiction suggesting “a [separate] self inside or [that 
variables in such a way as to change the probability of the there is] a self inside controlling external behavior” 
other, the controlled response. (Baum, 2017, p. 167). 

* Laypeople and behaviorists alike use self-control to 
Self-Management Defined refer to a person’s ability to “delay gratification” by 


responding to achieve a delayed, but larger or higher 
quality, reward instead of acting to obtain an immediate, 
less valuable reward. 


4. We define self-management as the personal application of 
behavior change tactics that produces a desired change in 


behavior. 

5, Self-management is a relative concept. A behavior change 7. People discount the value of delayed rewards; the greater 
program may entail a small degree of self-management the delay, the less value or influence the reward has on 
or be totally conceived, designed, and implemented by current behavior. Behavior analysts call this phenomenon 


the person delay discounting (or temporal discounting). 
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Applications, Advantages, and Benefits 
of Self-Management 


8. 


Four uses of self-management are to 

¢ live a more effective and efficient daily life, 
¢ break bad habits and acquire good ones, 

¢ accomplish difficult tasks, and 


¢ achieve personal lifestyle goals. 


. Advantages and benefits of learning and teaching 


self-management skills include the following: 


¢ Self-management can influence behaviors not accessible 
to external change agents. 


External change agents often miss important instances 
of behavior. 


¢ Self-management can promote the generalization and 
maintenance of behavior change. 


¢ A small repertoire of self-management skills can control 
many behaviors. 


People with diverse abilities can learn self-management 
skills. 


¢ Some people perform better under self-selected tasks 
and performance criteria. 


People with good self-management skills contribute to 
more efficient and effective group environments. 


¢ Teaching students self-management skills provides 
meaningful practice for other areas of the curriculum. 


¢ Self-management is an ultimate goal of education. 
¢ Self-management benefits society. 
¢ Self-management helps a person feel free. 


¢ Self-management feels good. 


Antecedent-based Self-Management Tactics 


10. Antecedent-based self-management tactics feature the 


manipulation of events or stimuli antecedent to the target 
(controlled) behavior, such as the following: 


¢ Manipulating motivating operations to make a desired 
(or undesired) behavior more (or less) likely 


Providing response prompts 


¢ Performing the initial steps of a behavior chain to ensure 
being confronted later with a discriminative stimulus 
that will evoke the desired behavior 


Removing the materials required for an undesired behavior 


¢ Limiting an undesired behavior to restricted stimulus 
conditions 


* Dedicating a specific environment for a desired behavior 


Self-Monitoring 


11. 


12. 


Self-monitoring is a procedure whereby a person observes 
and responds to, usually by recording, the behavior he is 
trying to change. 


Originally developed as a method of clinical assessment 
for collecting data on behaviors that only the client could 


13: 


14. 


15. 


16. 


17. 


18. 


observe, self-monitoring evolved into the most widely 
used and studied self-management strategy because it 
often results in desired behavior change. 


Self-monitoring is often combined with goal setting and 
self-evaluation. A person using self-evaluation compares 
her performance with a predetermined goal or standard. 


Self-monitoring is often part of an intervention that 
includes reinforcement for meeting either self- or 
teacher-selected goals. 


It is difficult to determine exactly how self-monitoring 
works because the procedure necessarily includes, and 
is therefore confounded by, private events (covert verbal 
behavior); it often includes either explicit or implicit 
contingencies of reinforcement. 


Children can be taught to self-monitor and self-record their 
behavior accurately by means of a faded matching technique, 
in which the child is rewarded initially for producing data 
that match the teacher’s or parent’s data. Over time the child 
is required to match the adult’s record less often, eventually 
monitoring the behavior independently. 


Accuracy of self-monitoring is neither necessary nor 
sufficient to achieve improvement in the behavior being 
monitored. 

Suggested guidelines for self-monitoring are as follows: 
¢ Provide materials that make self-monitoring easy. 

¢ Provide supplementary cues or prompts. 


¢ Self-monitor the most important dimension of the target 
behavior. 


* Self-monitor early and often, but do not interrupt the 
flow of a desired behavior targeted for increase. 


¢ Reinforce accurate self-monitoring. 


Self-Administered Consequences 


19. 


20. 


21. 


As a technical term, self-reinforcement (as also, self- 
punishment) is a misnomer. Although behavior can be 
changed by self-administered consequences, the variables 
influencing the controlling response make such 
self-management tactics more than a straightforward 
application of operant reinforcement. 


Self-administered contingencies analogous to positive and 
negative reinforcement and positive and negative punish- 
ment can be incorporated into self-management programs. 


When designing self-management programs involving 
self-administered consequences, a person should: 
* Select small, easy-to-deliver consequences. 


* Set a meaningful but easy-to-meet criterion for 
reinforcement. 


* Eliminate “bootleg reinforcement.” 


If necessary, put someone else in control of delivery 
consequences. 


* Use the least complicated and intrusive contingencies 
that will be effective. 


Other Self-Management Tactics 


22. Self-instruction (talking to oneself) can function as a 


controlling response (verbal mediator) that affects the 
occurrence of other behaviors. 


23. Habit reversal is a multiple-component treatment package in 
which clients are taught to self-monitor their unwanted habits 
and interrupt the behavior chain as early as possible by engag- 
ing in a behavior incompatible with the problem behavior. 


24. Systematic desensitization is a behavior therapy treatment 
for anxieties, fears, and phobias that involves substituting 
one behavior, generally muscle relaxation, for the unwanted 
behavior—the fear and anxiety. Self-directed systematic 
desensitization involves developing a hierarchy of situations 
from the least to the most fearful and then learning to relax 
while imagining these anxiety-producing situations, first the 
least fearful situation, then the next fearful one, and so on. 


25. Massed practice, forcing oneself to perform an undesired 
behavior again and again, can decrease future occurrences 
of the behavior. 


(Wane DEERE ERO ASSETS SSE 


KEY TERMS 


habit reversal 


self-evaluation 
massed practice self-instruction 


self-control self-management 


eS ERENI SECRET 


MULTIPLE-CHOICE QUESTIONS 


1. The personal application of behavior change tactics that 
produces a desired change in behavior is: 


a. Self-management 

b. Managing behaviors 

c. Behavior management 

d. Behavior modification 

Hint: (See “The ‘Self’ as Controller of Behavior’) 


2. Self-management can help individuals to: 
a. Break bad habits and replace with good ones 
b. Accomplish difficult tasks 
c. Achieve personal goals 
d. All of these 


Hint: (See “Applications, Advantages, and Benefits of 
Self-Management’) 


3. Limiting an undesired behavior to restructure stimulus 
conditions 1s 
a. A consequent self-management tactic 
b. An antecedent-based self-management tactic 
c. Always used when possible to produce an effective 
self-management program 
d. None of these 
Hint: (See “Antecedent-Based Self-Management Tactics’) 
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Suggestions for Effective 
Self-Management Programs 


26. Following are six steps in designing and implementing a 
self-management program: 
Step 1: Specify a goal and define the behavior to be 
changed. 
Step 2: Self-monitor the behavior. 


Step 3: Create contingencies that will compete with natural 
contingencies. 


Step 4: Go public with the commitment to change 
behavior. 


Step 5: Get a self-management partner. 


Step 6: Continually evaluate and redesign the program as 
needed. 


Behavior Changes Behavior 


27. The most fundamental principle of self-management is 
that behavior changes behavior. 


self-monitoring 


systematic desensitization 


4. Self-monitoring is best defined as: 


a. A procedure whereby persons observe their own 
behavior and record only the occurrence of the target 
behavior 

b. A procedure whereby persons observe the behavior of 
someone else and systematically record the occurrence 
or nonoccurrence of the target behavior 

c. A procedure whereby persons observe their behavior 
systematically and record the occurrence or nonoccur- 
rence of the target behavior 

d. A procedure whereby persons observe their behavior 
and someone else systematically records the occurrence 
or nonoccurrence of the target behavior. 


Hint: (See “Self-Monitoring”) 

5. Suggested guidelines for effective and efficient self- 
monitoring include: 
a. Make materials cumbersome and sophisticated 


b. Never provide supplementary cues or prompts, this will 
make the individual become dependent on the prompts 
or cues rather than the target behavior 


c. Self-monitor the most important dimension of the target 
behavior 


d. Do not reinforce accurate self-monitoring 


Hint: (See “Self-Monitoring”) 
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6. 


ESSAY-TYPE QUESTIONS 
1. 
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Self-administered consequences that increase desired 
behavior include: 


a. Self-management analogs of positive reinforcement 
b. Self-management analogs of positive punishment 
c. Self-management analogs of negative punishment 
d. None of these 


Hint: (See “Self-Administered Consequences’) 


. If you wanted to decrease a desired behavior, what 


type of self-administered consequence would be 
most useful? 


a. Self-management analogs of positive reinforcement 
b. Self-management analogs of negative reinforcement 
c. Self-management analogs of positive punishment 

d. None of these 


Hint: (See “Self-Administered Consequences”) 


. A type of self-management tactic that typically involves 


the use of self-awareness training, competing response 
training, and motivation techniques is: 


a. Self-Instruction 
b. Habit Reversal 


2 SEL SSE SEES 


What is self-management and what are three of its 
potential advantages or benefits? 


Hint: (See “The ‘Self’ as a Controller of Behavior’ 
and “Applications, Advantages, and Benefits of 
Self-Management’) 


. What is self-monitoring and what procedures promote its 


effectiveness? 
Hint: (See “Self-Monitoring’’) 


. What are self-administered consequences and what is an 


example of one that would increase a desired or decrease 
an undesired behavior? 


Hint: (See “Self-Administered Consequences”) 


NOTES 


it, 


to 


Explanatory fictions and circular reasoning are discussed in Chapters 
and 11. 


. There is also nothing new about self-management. As Epstein (1997) noted, 


c. Massed Practice 
d. None of these 
Hint: (See “Other Self-Management Tactics”’) 


9. This type of self-management tactic involves self- 


generated verbal responses, covert or overt, that function 
as response prompts for a desired behavior. 


a. Self-instruction 

b. Self-directed systematic desensitization 

c. Habit reversal 

d. None of these 

Hint: (See “Other Self-Management Tactics’) 


10. The first step in conducting an effective self-management 


program is: 
a. Begin self-monitoring the behavior 


b. Contrive contingencies that will compete with natural 


contingencies 


c. Continually evaluate your self-management program 


and redesign it as necessary 


d. Specify a goal and define the behavior to be changed 


Hint: (See “Suggestions for Conducting an Effective 
Self-Management Program’’) 
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4. Name and describe an additional self-management tactic 


that is discussed within the text. 
Hint: (See “Other Self-Management Tactics”’) 


5. List and discuss the steps for conducting an effective 


1 


many self-control techniques outlined by Skinner were described by ancient 


Greek and Roman philosophers and have appeared in the teachings of organized 5: 


religions for millennia (cf. Bolin & Goldberg, 1979; Shimmel, 1977, 1979). 


. Fora variety of conceptual analyses of self-control/self-management from 
behavioral perspective, see Brigham (1983), Catania (1975, 2013), Goldiamond a 


a 6. 


(1976), Hughes and Lloyd (1993), Kanfer and Karoly (1972), Malott (1984, 


self-management program. 


Hint: (See “Suggestions for Conducting an Effective 
Self-Management Program’’) 
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1989, 2005a, b), Moore (2015), Rachlin (1995, 2016), and Watson and 
Tharp (2014). 
4. In early biological theory, homunculus was a fully formed human being 
thought to exist inside an egg or spermatozoon. 
Generalized behavior change is a defining goal of applied behavior analy- 
sis and the focus of Chapter 30. 
Motivating operations are described in detail in Chapter 16. 
The critical importance of immediacy in reinforcement is discussed in 
Chapter 11. 


PART 12 


Promoting Generalized 
Behavior Change 


It is one thing to implement interventions that change behavior; it is quite another to extend those 
behavior changes through time, across settings, or to new behaviors. Practitioners face no more 
challenging or important task than that of designing, implementing, and evaluating interventions 
that produce behavior changes that continue after the intervention is terminated, appear in relevant 
environments beyond the training setting, and engender related behaviors that were not taught 
directly. Chapter 30 defines the major types of generalized behavior change and describes strategies 
and tactics applied behavior analysts use to achieve them. 
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cHarter| BO 


Generalization and Maintenance 


of Behavior Change 


LEARNING OBJECTIVES 


¢ Define and provide examples of the different basic forms of generalized behavior change. 

¢ Define and provide examples of the different undesirable types of generalized behavior change. 
e Discuss planning techniques for generalized behavior change. 

e List and discuss strategies and tactics for promoting generalized behavior change. 


e Discuss how to modify and terminate successful interventions. 
e¢ Name and discuss the guiding principles for promoting generalized outcomes. 


Sherry’s teacher implemented an intervention that taught 
her to complete each part of multiple-part, in-school 
assignments before submitting them and beginning another 
activity. Now, 3 weeks after the program ended, most of 
the work Sherry submits as “finished” is incomplete and 
her stick-with-a-task-until-it’s-finished behavior is as poor 
as it was before the intervention began. 

Ricardo has just begun his first competitive job 
working as a copy machine operator in a downtown 
business office. In spite of his long history of distracti- 
bility and poor endurance, Ricardo had learned to work 
independently for several hours at a time in the copy room 
at the vocational training center. His employer, however, 
is complaining that Ricardo frequently stops working after 
a few minutes to seek attention from others. Ricardo may 
soon lose his job. 

Brian is a 10-year-old boy diagnosed with autism. 
In an effort to meet an objective on his individualized 
education program that targets functional language and 
communication skills, Brian’s teacher taught him to say, 
“Hello, how are you?” as a greeting. Now, whenever 
Brian meets anyone, he invariably responds with, “Hello, 
how are you?” Brian’s parents are concerned that their 
son’s language seems stilted and parrot-like. 


he mere passage of time resulted in Sherry losing her 

ability to complete assignments. A change of scenery 

threw Ricardo off his game; the excellent work habits 
he acquired at the vocational training center were absent at the 
community job site. Although Brian used his new greeting skill 
over time and with new people, its repetitive and restricted form 
did not serve him well. 
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Each of these three situations illustrates a common type of 
teaching failure. The most socially significant behavior changes 
last over time, are used in all relevant settings and situations, 
and are accompanied by other functionally related responses. 
The student who learns to count money and make change in the 
classroom today must be able to count and make change at the 
convenience store tomorrow and at the supermarket next month. 
The beginning writer who has been taught to write a few good 
sentences in school must be able to write many more meaningful 
sentences when writing notes or letters to family or friends. 

Applied behavior analysts face no more challenging or 
important task than that of designing, implementing, and eval- 
uating interventions that produce generalized behavior change. 
This chapter defines three major types of generalized behavior 
change and describes the strategies and tactics researchers and 
practitioners use most often to promote them. 


GENERALIZED BEHAVIOR CHANGE: 
DEFINITIONS AND KEY CONCEPTS 


Baer, Wolf, and Risley (1968) included generalized behavior 
change as one of seven defining characteristics of applied 
behavior analysis. 


A behavior change may be said to have generality if it 
proves durable over time, if it appears in a wide variety of 
possible environments, or if it spreads to a wide variety of 
related behaviors. (p. 96) 


In their seminal review paper, “An Implicit Technology 
of Generalization,’ Stokes and Baer (1977) also stressed those 
three facets of generalized behavior change—across time, set- 
tings, and behaviors—when they defined generalization as 


the occurrence of relevant behavior under different, 
nontraining conditions (i.e., across subjects, settings, 


people, behaviors, and/or time) without the scheduling of 
the same events in those conditions. Thus, generalization 
may be claimed when no extratraining manipulations are 
needed for extratraining changes; or may be claimed when 
some extra manipulations are necessary, but their cost is 
clearly less than that of the direct intervention. Generaliza- 
tion will not be claimed when similar events are necessary 
for similar effects across conditions. (p. 350) 


In other words, when a learner emits a trained behavior 
in places or at times without having to be retrained completely 
in those places or times, or emits functionally related behaviors 
that were not taught directly, generalized behavior change is 
evident. The following sections define and provide examples of 
these three basic forms of generalized behavior change: response 
maintenance, setting/situation generalization, and response 
generalization. 


Response Maintenance 


We define response maintenance as the extent to which a learner 
continues to perform a target behavior after a portion or all of the 
intervention responsible for the behavior’s initial appearance in 
the learner’s repertoire has been terminated.' For example: 


¢ Sayaka was having difficulty identifying the lowest com- 
mon denominator (LCD) when adding and subtracting 
fractions. Her teacher had Sayaka write the steps for find- 
ing the LCD on an index card and told her to refer to 
the card when needed. Sayaka began using the LCD cue 
card, and the accuracy of her math assignments improved. 
After using the cue card for a week, Sayaka said she no 
longer needed it and returned it to her teacher. The next 
day Sayaka correctly computes the LCD for every prob- 
lem on a quiz on adding and subtracting fractions. 

* On Loraine’s first day on the job with a residential land- 
scaping company, a coworker taught her how to use a 
long-handled tool to extract dandelions. Without further 
instruction, Loraine continues to use the tool correctly a 
month later. 

* When Derek was in the seventh grade, one of his teachers 
taught him to record his assignments and keep materials 
for each class in separate folders. Now a college sopho- 
more, Derek continues to apply those organizational skills 
to his academic work. 


Response maintenance was evident in Sayaka’s perfor- 
mance on a math quiz | day after the cue card intervention 
ended, in Loraine’s effective use of the weeding tool 1 month 
after training, and in Derek’s continued use of the organiza- 
tional skills he had learned years earlier. How long a newly 
learned behavior must be maintained depends on how long the 
learner’s environment presents opportunities or requirements 
to perform the behavior. If covertly reciting a phone number 
3 times immediately after hearing it enables a person without 
a cell phone to remember the number long enough to tap it in 
on a borrowed phone a few minutes later, sufficient response 
maintenance has been achieved. Other behaviors, such as 
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literacy, self-care, and social skills, must be maintained in a 
person’s repertoire for a lifetime. 

Meaningful response maintenance may be obtained with 
some components of the initial intervention in place. A study 
by Sprague and Horner (1984), in which six students with 
moderate to severe cognitive disabilities were taught to use 
vending machines, provides a good example of this point. The 
students were given cue cards to aid them in operating vending 
machines without assistance. The cue cards had food and drink 
logos on one side and pictures of quarters paired with prices on 
the other. The cards were used during instruction and gener- 
alization probes, but also were kept by the students at the end 
of the program. Five of the six students still carried a cue card 
and used vending machines independently 18 months after the 
study had ended. 


Setting/Situation Generalization 


Setting/situation generalization is the extent to which a learner 
emits a target behavior in a setting or stimulus situation that 
differs from the instructional setting in any meaningful way. 
For example: 


¢ While waiting for his new motorized wheelchair to arrive 
from the factory, Chaz used a computer simulation program 
and a joystick to learn how to operate his soon-to-arrive 
chair. When the new chair arrived, Chaz grabbed the 
joystick and immediately began zipping up and down 
the hall and spinning perfectly executed donuts. 

¢ Loraine was taught to pull weeds from flowerbeds and 
mulched areas. Although never having been instructed 
to do so, Loraine began removing weeds from lawns she 
crossed on her way to the flowerbeds. 

¢ After Brandy’s teacher taught her to read 10 different 
C-V-C-E words (e.g., bike, cute, made), Brandy could 
read other C-V-C-E words for which she had not received 
instruction (e.g., bite, cake, mute). 


A study by van den Pol and colleagues (1981) provides an 
excellent example of setting/situation generalization. They taught 
three young adults with multiple disabilities to eat independently 
in fast-food restaurants. All three students had previously eaten 
in restaurants, but could not order or pay for a meal without 
assistance. The researchers began by constructing a task analysis 
of the steps required to order, pay for, and eat a meal appropri- 
ately in a fast-food restaurant. Instruction took place in the stu- 
dents’ classroom and consisted of role playing each of the steps 
during simulated customer-cashier interactions and responding 
to questions about photographic slides showing customers at a 
fast-food restaurant performing the various steps in the sequence. 
The 22 steps in the task analysis were divided into four major 
components: locating, ordering, paying, and eating and exiting. 
After a student had mastered the steps in each component in the 
classroom, he was given “a randomly determined number of bills 
equaling two to five dollars and instructed to go eat lunch” at 
a local restaurant (p. 64). Observers stationed inside the restau- 
rant recorded each student’s performance of each step in the 
task analysis. The results of these generalization probes, which 
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were also conducted before training (baseline) and after training 
(follow-up), are shown in Figure 30.1. In addition to assessing the 
degree of generalization from the classroom, which was based on 
the specific McDonald’s restaurant used for most of the probes, 
the researchers conducted follow-up probes in a Burger King 
restaurant (also a measure of maintenance). 

This study exemplifies the pragmatic approach used by 
most applied behavior analysts to assess and promote generalized 
behavior change. The setting in which generalized responding is 
desired can contain one or more components of the intervention 
that was implemented in the instructional environment, but not 
all of the components. If the complete intervention program is 
required to produce behavior change in a novel environment, 
then no setting/situation generalization can be claimed. How- 
ever, if some component or components of the training program 
result in meaningful behavior change in a generalization setting, 
then setting/situation generalization can be claimed, provided 
it can be shown that the component or components used in the 
generalization setting were insufficient to produce the behavior 
change alone in the training environment. 

For example, van den Pol and colleagues taught Student 3, 
who was deaf, how to use a prosthetic ordering device in the 
classroom. The device, a plastic laminated sheet of cardboard 
with a wax pencil, had preprinted questions (e.g., “How much 
is... ?”), generic item names (e.g., large hamburger), and 
spaces where the cashier could write responses. Simply giving 
the student some money and the prosthetic ordering card would 
not have enabled him to order, purchase, and eat a meal inde- 
pendently. However, after classroom instruction that included 
guided practice, role playing, social reinforcement (“Good job! 
You remembered to ask for your change” [p. 64]), corrective 
feedback, and review sessions with the prosthetic ordering card 
produced the desired behaviors in the instructional setting. 
Student 3 was able to order, pay for, and eat meals in a restau- 
rant, aided only by the card. 


Distinguishing Between Instructional and 
Generalization Settings 


Instructional setting denotes the environment where teaching 
occurs, including all planned or unplanned aspects of the envi- 
ronment that may influence the learner’s acquisition and general- 
ization of the target behavior.” Planned elements are the stimuli 
and events the teacher has programmed to achieve initial behavior 
change and promote generalization. For example, planned ele- 
ments for a math lesson would include the specific problems and 
solution steps to be presented and the format and sequencing of 
those items. Unplanned aspects of the instructional setting are 
factors the teacher is not aware of or has not considered that might 
affect the acquisition and generalization of the target behavior. For 
example, the phrase how much in a word problem may acquire 
stimulus control over a student’s use of addition, even when the 
correct solution to the problem requires a different arithmetic 
operation. Or, perhaps a student always uses subtraction for the 
first problem on each page of word problems because a subtrac- 
tion problem has always been presented first during instruction. 
A generalization setting is any place or stimulus situation 
that differs from the instructional setting in some meaningful 
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Figure 30.1 Percentage of steps necessary to order a 
meal at a fast-food restaurant correctly performed by 
three students with disabilities before, during, and after 
instruction in the classroom. During follow-up, the closed 
triangles represent probes conducted at a Burger King 
restaurant using typical observation procedures, open 
triangles represent Burger King probes during which 
students did not know they were being observed, and open 
circles represent covert probes conducted in a different 
McDonald's 1 year after training. 
From “Teaching the Handicapped to Eat in Public Places: Acquisition, 
Generalization and Maintenance of Restaurant Skills” by R. A. van den Pol, 
B. A. lwata, M. T. Ivanic, T. J. Page, N. A. Neef, and F. P. Whitley, 1981, 


Journal of Applied Behavior Analysis, 14, p. 66. Reproduced with permission 
of John Wiley & Sons, Inc. 


way and in which performance of the target behavior is desired. 
There are multiple generalization settings for many target 
behaviors. The student who learns to solve addition and sub- 
traction word problems in the classroom must be able to solve 
similar problems at home, at the store, and in the dugout with 
teammates. 


Examples of instructional and generalization settings for 
six target behaviors are shown in Figure 30.2. When a person 
uses a skill in an environment physically removed from the 
setting where she learned it—as with behaviors 1 through 3 in 
Figure 30.2—it is easy to understand that event as an example 
of generalization across settings. However, many important gen- 
eralized outcomes occur across more subtle differences between 
the instructional setting and generalization setting. A general- 
ization setting does not have to be somewhere different from 
the physical location where instruction is provided. Students 
often receive instruction in the same place where they will need 
to maintain and generalize what they have learned. In other 
words, the instructional setting and generalization setting can, 
and often do, share the same physical location (as with behaviors 
4 through 6 in Figure 30.2). 


Distinguishing Between Setting/Situation 
Generalization and Response Maintenance 


Because any measure of setting/situation generalization is con- 
ducted after some instruction has taken place, it might be argued 
that setting/situation generalization and response maintenance 
are the same, or are inseparable phenomena at least. Most mea- 
sures of setting/situation generalization provide information on 
response maintenance, and vice versa. For example, the post- 
training generalization probes conducted by van den Pol and 
colleagues (1981) at the Burger King restaurant and at the sec- 
ond McDonald’s provided data on setting/situation generaliza- 
tion (i.e., to novel restaurants) and on response maintenance of 
up to 1 year. However, a functional distinction exists between 
setting/situation generalization and response maintenance, with 
each outcome presenting a somewhat different set of challenges 
for programming and ensuring enduring behavior change. When 
a behavior change produced in the classroom or clinic is not 
observed in the generalization environment, a lack of setting/ 
situation generalization is evident. When a behavior change 
produced in the classroom or clinic has occurred at least once 
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in the generalization setting and then ceases to occur, a lack of 
response maintenance is evident. 

An experiment by Koegel and Rincover (1977) illus- 
trates the functional difference between setting/situation 
generalization and response maintenance. Participants were 
three young boys with autism; each was mute, echolalic, 
or displayed no appropriate contextual speech. One-to-one 
instructional sessions were conducted in a small room with 
the trainer and child seated across from each other at a table. 
Each child was taught to imitate a series of responses (e.g., the 
trainer said, “Touch your [nose, ear]” or “Do this” and [raised 
his arm, clapped his hands]). Each 40-min session consisted 
of blocks of 10 training trials in the instructional setting alter- 
nated with blocks of 10 trials conducted by an unfamiliar adult 
standing outside, surrounded by trees. All correct responses 
in the instructional setting were followed by candy and social 
praise. During generalization trials, the children received the 
same instructions and model prompts as in the classroom, but 
no reinforcement or other consequences were provided for 
correct responses. 

Figure 30.3 shows the percentage of trials in which each 
child responded correctly in the instructional setting and in the 
generalization setting. All three children learned to respond to 
the imitative models in the instructional setting. At the end of 
the experiment, all three children showed 0% correct responding 
in the generalization setting, but for different reasons. Child | 
and Child 3 began emitting correct responses in the general- 
ization setting as their performances improved in the instruc- 
tional setting; however, their generalized responding was not 
maintained (most likely the result of the extinction condition in 
effect in the generalization setting). The imitative responding 
acquired by Child 2 in the instructional setting never generalized 
to the outside setting. Therefore, the 0% correct responding at 
the experiment’s conclusion confirms a lack of response main- 
tenance for Child 1 and Child 3, but for Child 2 it represents a 
failure of setting generalization. 


Figure 30.2 Examples of a possible instructional setting and a generalization setting for various 


target behaviors. 


Instructional Setting 

1. Raising hand when special education teacher 
asks a question in the resource room. 

2. Practicing conversational skills with speech 
therapist at school. 

3. Passing basketball during a team scrimmage 
on home court. 

4. Answering addition problems in vertical 
format at desk at school. 

5. Solving word problems with no distracter 
numbers on homework assignment. 

6. Operating package sealer at community job 

site in presence of supervisor. 


Generalization Setting 


1. Raising hand when general education teacher 
asks a question in the regular classroom. 


2. Talking with peers in town. 


3. Passing basketball during a game on the 
opponent's court. 

4. Answering addition problems in horizontal 
format at desk at school. 

5. Solving word problems with distracter 
numbers on homework assignment. 

6. Operating package sealer at community job 
site in absence of supervisor. 
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Figure 30.3 Correct responding 

by three children on alternating 
blocks of 10 trials in the 
instructional setting and 10 trials in 
the generalization setting. 


From “Research on the Differences 
Between Generalization and Maintenance 
in Extra-Therapy Responding” by 

R. L. Koegel and A. Rincover, 1977, Journal 
of Applied Behavior Analysis, 10, p. 4. 
Reproduced with permission of John Wiley 50 
& Sons, Inc. 


Percent Correct Trials 


Response Generalization 


Response generalization is the extent to which a learner emits 
untrained responses that are functionally equivalent to the 
trained target behavior. In other words, in response generaliza- 
tion useful variations of the target behavior appear for which 
no instructional contingencies have been applied. For example: 


¢ Traci wanted to earn some extra money by helping her 
older brother with his lawn mowing business. Her brother 
taught Traci to walk the mower up and down parallel 
rows that progressed from one side of a lawn to the other. 
Traci discovered that she could mow some lawns just as 
quickly by first cutting around the perimeter and then 
walking the mower in a concentric pattern toward the 
center of the lawn. 

¢ Loraine was taught to remove weeds with a long weed- 
removal tool. Although never taught or asked to do so, 
sometimes Loraine removes weeds with a hand trowel or 
with her bare hands. 

¢ Michael’s mother taught him how to take phone messages 
by using the pencil and notepaper next to the phone to 
write the caller’s name, phone number, and message. One 
day, Michael’s mother came home and saw her son’s tape 
recorder next to the phone. She pushed the play button and 
heard Michael’s voice say, “Grandma called. She wants to 
know what you'd like her to cook for dinner Wednesday. 
Mr. Stone called. His number is 555-1234. He said your 
car is ready to be picked up.” 


The study by Goetz and Baer (1973) described in Chapter 8 
of three preschool girls’ block building provides a good example 
of response generalization. During baseline, the teacher sat by 
each girl as she played with the blocks, watching closely but 
quietly, and displaying neither enthusiasm nor criticism for any 


Instructional Setting 


Generalization Setting 
Wa 


particular use of the blocks. During the next condition, each 
time the child placed or rearranged the blocks to create a form 
that had not appeared previously in that session’s constructions, 
the teacher commented with enthusiasm and interest (e.g., “Oh, 
that’s very nice—that’s different!’’). Another condition followed 
in which each repeated construction of a form within the session 
was praised (e.g., “How nice—another arch!”’). The study ended 
with a phase in which descriptive praise was again contingent 
on the construction of different block forms. All three children 
constructed more new forms with the blocks when form diver- 
sity was praised than under baseline or the praise-for-the-same- 
forms condition (see Figure 8.7). 

Even though specific responses produced reinforcement 
(i.e., the actual block forms that preceded each instance of 
teacher praise), other responses sharing that functional charac- 
teristic (i.e., being different from block forms constructed pre- 
viously by the child) increased in frequency as a function of the 
teacher’s praise. As a result, during reinforcement for different 
forms, the children constructed new forms with the blocks even 
though each new form itself had never before appeared and 
therefore could not have been reinforced previously. Reinforcing 
a few members of the response class of new forms increased the 
frequency of other members of the same response class. 


Generalized Behavior Change: A Relative 
and Intermingled Outcome 


Generalized behavior change is a relative concept; it exists along 
a continuum. At one end of the continuum is a minimum amount 
of generalized behavior change—the learner uses the new skill 
in a limited range of nontraining settings and situations where 
contrived response prompts and/or consequences still might need 
to be applied. At the other end of the continuum lies maximum 
generalized behavior change; that is, after all components of an 


intervention have been terminated, the learner emits the newly 
acquired target behavior and several functionally related behaviors 
not observed previously at every appropriate opportunity, in all rel- 
evant settings, and performs them indefinitely and independently. 

We have presented the three primary types of generalized 
behavior change individually to isolate their defining features, but 
they often overlap and occur in combination. Although it is possi- 
ble to obtain response maintenance without generalization across 
settings/situations or behaviors (i.e., the target behavior contin- 
ues to occur in the same setting in which it was trained after the 
training contingencies have been terminated), all meaningful mea- 
sures of setting generalization also entail some degree of response 
maintenance. And it is common for all three forms of generalized 
behavior change to be represented in the same instance. For exam- 
ple, during a relatively quiet shift at the widget factory on Monday, 
Joyce’s supervisor taught her to obtain assistance by calling out, 
“Ms. Johnson, I need some help.” Later that week (response main- 
tenance) when it was very noisy on the factory floor (setting/ 
situation generalization), Joyce signaled her supervisor by waving 
her hand back and forth (response generalization). 


Generalized Behavior Change 
Is Not Always Desirable 


It is hard to imagine any behavior important enough to target for 
systematic instruction for which response maintenance would be 
undesirable.? However, unwanted setting/situation generaliza- 
tion and response generalization occur often, and practitioners 
should design intervention plans to prevent or minimize such 
unwanted outcomes. Undesirable setting/situation generaliza- 
tion takes two common forms: overgeneralization and faulty 
stimulus control. 

Overgeneralization, a nontechnical but effectively descrip- 
tive term, refers to an outcome in which the behavior has come 
under the control of an overly broad stimulus class. That is, 
the learner emits the target behavior in the presence of stimuli 
that, although similar in some way to the instructional examples 
or situation, are inappropriate occasions for the behavior. For 
example, a student learns to spell division, mission, and fusion 
with the grapheme —sion. When asked to spell fraction, the stu- 
dent writes f-r-a-c-s-I-o-n. 

With faulty stimulus control, the target behavior comes 
under the restricted control of an irrelevant antecedent stimu- 
lus. For example, after learning to solve word problems such 
as “Ling has 3 books. Nikola has 5 books. How many books do 
they have in total?” by adding the numerals in the problem, the 
student adds the numerals in any problem containing “in total” 
(e.g., “Raluca has 3 candies. Corinne and Raluca have 8 candies 
in total. How many candies does Corrine have?”).4 

Undesired response generalization occurs when a learner’s 
untrained but functionally equivalent responses result in poor 
performance or undesirable outcomes. For example, although 
Jack’s supervisor at the widget factory taught him to operate 
the drill press with two hands because that is the safest method, 
sometimes Jack operates the press with one hand. One-handed 
and two-handed responses are functionally equivalent because 
both topographies produce widgets, but one-handed responses 
compromise Jack’s health and the factory’s safety record. Or, 
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perhaps some of her brother’s customers do not like how their 
lawns look after Traci has mowed them in concentric rectangles. 

Box 30.1: “Perspectives on the Sometimes Confusing and 
Misleading Terminology of Generalization” discusses the varied 
terms applied behavior analysts use to describe these outcomes. 


Other Types of Generalized Outcomes 


Other types of generalized outcomes that do not fit easily into cat- 
egories of response maintenance, setting/situation generalization, 
and response generalization have been reported in the behavior 
analysis literature. Complex components of a person’s repertoire 
sometimes appear quickly with little or no apparent direct condi- 
tioning. A variety of such emergent performances are described 
as stimulus equivalence relations (Sidman, 1994, 2000), recom- 
binative generalization (Axe & Sainato, 2010; Goldstein, 1983), 
bidirectional naming (Horne & Lowe, 1996; Miguel, 2016), and 
relational frame theory (Hayes, Barnes-Holmes, & Roche, 2001) 
(see chapters 18, 19, and 20). Another type of rapid learning 
that appears to be a generalized outcome of other events has 
been called contingency adduction, a process whereby behaviors 
initially established under one set of contingency conditions are 
recruited and recombined by a different set of contingencies and 
take on a new function in a person’s repertoire (Andronis, Layng, 
& Goldiamond, 1997; Layng, Twyman, & Stikeleather 2004). 

Sometimes an intervention applied to one or more peo- 
ple results in behavior changes in other people who were not 
directly treated by the contingencies. Generalization across 
subjects refers to changes in the behavior of people not directly 
treated by an intervention as a function of treatment contin- 
gencies applied to other people. This phenomenon, which has 
been described with a variety of related or synonymous terms— 
vicarious reinforcement (Bandura, 1971; Kazdin, 1973), ripple 
effect (Kounin, 1970), and spillover effect (Strain, Shores, & 
Kerr, 1976)—provides another dimension for assessing the 
generalization of treatment effects. For example, Fantuzzo 
and Clement (1981) examined the degree to which behavior 
changes would generalize from one child who received teacher- 
administered or self-administered token reinforcement during a 
math activity to a peer seated next to the child. 

Drabman, Hammer, and Rosenbaum (1979) combined 
four basic types of generalized treatment effects—(a) across 
time (i.e., response maintenance), (b) across settings (i.e., set- 
ting/situation generalization), (Cc) across behaviors (1.e., response 
generalization), and (d) across subjects—into a conceptual 
framework they called the generalization map. By viewing each 
type of generalized outcome as dichotomous (i.e., either present 
or absent) and by combining all possible permutations of the 
four categories, Drabman and colleagues arrived at 16 catego- 
ries of generalized behavior change ranging from maintenance 
(Class 1) to subject-behavior-setting-time generalization (Class 
16). Class 1 generalization is evident if the target behavior of 
the target subject(s) continues in the treatment setting after any 
“experiment-controlled contingencies” have been discontinued. 
Class 16 generalization, the “ultimate form” of generalization, is 
evidenced by “a change in a nontarget subject’s nontarget behav- 
ior which endures in a different setting after the contingencies 
have been withdrawn in the treatment setting” (p. 213). 
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BOX 30.1 


Perspectives on the Sometimes Confusing and Misleading Terminology of Generalization 


Applied behavior analysts use a variety of terms to describe 
behavior changes that appear as adjuncts or by-products of 
direct intervention. The overlapping and multiple meanings 
of these terms can lead to confusion and misunderstanding. 
For example, maintenance, the most frequently used term 
for a behavior change that persists after the intervention 
that brought it about has been withdrawn or terminated, is 
also the most common name for an experimental condition 
in which treatment has been discontinued or partially with- 
drawn. Applied behavior analysts should distinguish between 
response maintenance as a measure of behavior (1.e., a depen- 
dent variable) and maintenance as an experimental condition 
(i.e., an independent variable). Other terms found in the 
behavior analysis literature for continued responding after 
programmed contingencies have ended include durability, 
behavioral persistence, resurgence, and (often incorrectly) 
resistance to extinction. 

Response maintenance can be measured under extinction 
conditions, in which case the relative frequency of continued 
responding is described correctly as resistance to extinc- 
tion (see Chapter 24). However, using resistance to extinction 
to describe response maintenance in most applied situations is 
incorrect because reinforcement of some kind typically follows 
some occurrences of the target behavior in the post-treatment 
environment. 

Terms used in the applied behavior analysis literature 
for behavior changes that occur in nontraining settings or 
stimulus conditions include stimulus generalization, setting 
generalization, transfer of training, or simply, generalization. 
It is technically incorrect to use stimulus generalization to 
refer to the generalized behavior change achieved by many 
applied interventions. Stimulus generalization is a technical 
term referring to a specific behavioral process: a response 
that has been reinforced in the presence of a given stimulus 
occurring more often in the presence of different but similar 
stimuli under extinction conditions (Guttman & Kalish, 1956; 
see Chapter 17). Use of the term should be restricted to that 
phenomenon. 

The occurrence of behaviors that have not been trained 
directly may be referred to as collateral or side effects, response 
variability, induction, or concomitant behavior change. To 
further complicate matters, generalization is often used as a 
catchall term to refer to all three types of generalized behavior 
change. 

Johnston (1979) discussed some problems caused by 
using generalization to describe any desirable behavior change 
in a generalization setting. 


This kind of usage is misleading in that it suggests 
that a single phenomenon is at work when actually a 


number of different phenomena need to be described, 
explained, and controlled. . . . Carefully designing 
procedures to optimize the contributions of stimulus 
and response generalization would hardly exhaust our 
repertoire of tactics for getting the subject to behave 
in a desirable way in non-instructional settings. Our 
successes will be more frequent when we realize 

that maximizing behavioral influence in such set- 
tings requires careful consideration of all behavioral 
principles and processes. (pp. 1-2) 


° 


Inconsistent use of the “terminology of generalization’ 
can lead researchers and practitioners to incorrect assump- 
tions and conclusions regarding the principles and processes 
responsible for the presence or absence of generalized out- 
comes. Nevertheless, applied behavior analysts will probably 
continue to use generalization as a dual-purpose term, refer- 
ring sometimes to types of behavior change and sometimes to 
behavioral principles and processes that produce such changes. 
Stokes and Baer (1977) clearly indicated their awareness of the 
differences in definitions. 


The notion of generalization developed here is an 
essentially pragmatic one; it does not closely follow 

the traditional conceptualizations (Keller & Schoenfeld, 
1950; Skinner, 1953). In many ways, this discussion 
will sidestep much of the controversy concerning 
terminology. (p. 350) 


While discussing the use of naturally existing contin- 
gencies of reinforcement to maintain and extend programmed 
behavior changes, Baer (1999) explained his preference for 
using the term generalization: 


It is the best of the techniques described here and, inter- 
estingly, it does not deserve the textbook definition of 
“generalization.” It is a reinforcement technique, and 
the textbook definition of generalization refers to unre- 
inforced behavior changes resulting from other directly 
reinforced behavior changes. . . . [But] we are dealing 
with the pragmatic use of the word generalization, not 
the textbook meaning. We reinforce each other for using 
the word pragmatically, and it serves us well enough 

so far, so we shall probably maintain this imprecise 
usage. (p. 30, emphasis in original) 


In an effort to promote the precise use of the technical 
terminology of behavior analysis and as a reminder that the 
phenomena of interest are usually products of multiple prin- 
ciples and procedures, we use terms that focus on the type 
of generalized behavior change rather than the variables that 
bring it about. 


Although Drabman and colleagues recognized that “the 
classifications may prove arbitrary” (p. 204), they provided 
objectively stated rules for determining whether a given event 
fits the requirements of each of their 16 classifications. Regard- 
less of whether generalized behavior change consists of such 
distinctly separate and wide-ranging phenomena as detailed 
by Drabman and colleagues, their generalization map provided 
an objective framework with which researchers can assess the 
extended effects of behavioral interventions (Allen, Tarnowski, 
Simonian, Elliot, & Drabman, 1991). For example, Stevenson 
and Fantuzzo (1984) measured 15 of the 16 generalization map 
categories in a study of the effects of teaching a fifth-grade boy 
to use self-management techniques. They not only measured the 
effects of the intervention on the target behavior (math perfor- 
mance) in the instructional setting (school), but also assessed 
effects on the student’s math behavior at home, disruptive 
behavior at home and at school, both behaviors for a nontreated 
peer in both settings, and maintenance of all of the above. 


PLANNING FOR GENERALIZED 
BEHAVIOR CHANGE 


In general, generalization should be programmed, rather 
than expected or lamented. 
—Baer, Wolf, and Risley (1968, p. 97) 


In their review of 270 published studies relevant to general- 
ized behavior change, Stokes and Baer (1977) concluded that 
practitioners should always “assume that generalization does not 
occur except through some form of programming . . . and act 
as if there were no such animal as ‘free’ generalization—as if 
generalization never occurs ‘naturally,’ but always requires pro- 
gramming” (p. 365). Of course, generalization of some type and 
degree usually occurs, whether or not it is planned. Unplanned 
and unprogrammed generalization may be sufficient, but often it 
is not, particularly for many learners served by applied behavior 
analysts (e.g., children and adults with learning problems and 
developmental disabilities). And if left unchecked, unplanned 
generalized outcomes may be undesirable outcomes. 
Achieving optimal generalized outcomes requires thought- 
ful, systematic planning. This planning begins with two major 
steps: (1) selecting target behaviors that will meet natural contin- 
gencies of reinforcement and (2) specifying all desired variations 
of the target behavior and the settings/situations in which those 
behaviors should (and should not) occur after instruction has ended. 


Select Target Behaviors that will Meet Naturally 
Existing Contingencies of Reinforcement 


The everyday environment is full of steady, dependable, 
hardworking sources of reinforcement for almost all of 
the behaviors that seem natural to us. That is why they seem 


natural. 
—Donald M. Baer (1999, p. 15) 


Numerous criteria have been suggested for determining whether 
a proposed teaching objective is relevant or functional for the 
learner. Age appropriateness and the degree to which a skill 
represents normalization are often cited as important criteria 
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when selecting target behaviors for students with disabilities 
(e.g., Brown, McDonnell, & Snell, 2016). These criteria were 
discussed in Chapter 3, along with numerous other issues that 
should be considered when selecting and prioritizing target 
behaviors. In the end, however, there is just one ultimate crite- 
rion of functionality: A behavior is functional only to the extent 
that it produces reinforcement for the learner. This criterion 
holds no matter how important the behavior may be to the 
person’s health or welfare, or no matter how much teachers, 
family, friends, or the learner herself considers the behavior 
to be desirable. We repeat: A behavior is not functional if it 
does not produce reinforcement for the learner. We can say this 
another way: Behaviors that are not followed by reinforcers on 
at least some occasions will not be maintained. 

Ayllon and Azrin (1968) recognized this fundamental truth 
when they recommended that when selecting target behaviors 
practitioners follow the relevance-of-behavior rule: Choose only 
those behaviors to change that will produce reinforcers in the 
learner’s postintervention environment. Baer (1999) believed so 
strongly in the importance of this criterion that he recommended 
practitioners heed a similar rule: 


A good rule is to not make any deliberate behavior changes 
that will not meet natural communities of reinforcement. 
Breaking this rule commits you to maintain and extend the 
behavior changes that you want, by yourself, indefinitely. 
If you break this rule, do so knowingly. Be sure that you 
are willing and able to do what will be necessary. (p. 16, 
emphasis in original) 


Promoting for the generalization and maintenance of any 
behavior change for which a natural contingency of reinforce- 
ment exists, no matter the specific tactics employed, consists of 
getting the learner to emit the behavior in the generalization set- 
ting just often enough to contact the contingency. Generalization 
and maintenance of the behavior from that point forward, while 
not ensured, is a very good bet. For example, after receiving 
some basic instruction on how to operate the steering wheel, gas 
pedal, and brakes on a car, the naturally existing reinforcement 
and punishment contingencies involving moving automobiles 
and the road will select and maintain effective steering, acceler- 
ation, and braking. Few drivers need booster training sessions on 
the basic operation of the steering wheel, gas pedal, and brakes. 

We define a naturally existing contingency as any 
contingency of reinforcement (or punishment) that operates 
independent of the behavior analyst’s or practitioner’s efforts. 
Naturally existing contingencies include those that operate with- 
out social mediation (e.g., walking fast on an icy sidewalk is 
often punished by a slip and fall) and socially mediated con- 
tingencies contrived and implemented by other people in the 
generalization setting. From the perspective of a special educator 
who is teaching a set of targeted social and academic skills to 
students for whom the general education classroom represents 
the generalization setting, a token economy operated by the gen- 
eral education classroom teacher is an example of the latter type 
of naturally existing contingency. Although the general educa- 
tion teacher contrived the token economy, it represents a set of 
naturally existing contingencies because it already operates in 
the generalization setting. 
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We define a contrived contingency as any contingency of 
reinforcement (or punishment) designed and implemented by a 
behavior analyst or practitioner to achieve the acquisition, main- 
tenance, and/or generalization of a targeted behavior change. 
The token economy in the previous example is a contrived con- 
tingency from the perspective of the general education teacher 
who designed and implemented it. 

Practitioners are often charged with the difficult task of 
teaching important skills for which there are no dependable 
naturally existing contingencies of reinforcement. In such 
cases, practitioners should realize and plan for the fact that 
the generalization and maintenance of target behaviors will 
have to be supported, perhaps indefinitely, with contrived 
contingencies. 


Specify All Desired Variations of the Behavior 
and the Settings/Situations Where Those 
Behaviors Should (and Should Not) Occur 


This stage of planning for generalized outcomes includes iden- 
tifying all the desired behavior changes that need to be made 
and all the environments and stimulus conditions in which the 
learner should emit the target behavior(s) after direct training 
has ceased (Baer, 1999). For some target behaviors, the most 
important stimulus control for each response variation is clearly 
defined (e.g., reading C-V-C-E words) and restricted in num- 
ber (e.g., solving single-digit multiplication facts). For many 
important target behaviors, however, the learner is likely to 
encounter a multitude of settings and stimulus conditions in 
which the behavior, in a variety of response forms, is desired. 
Only by considering these possibilities prior to instruction can 
the behavior analyst design an intervention with the best chance 
of preparing the learner for them. 

In one sense, this component of planning for general- 
ized outcomes is similar to preparing a student for a future test 
without knowing the content or format of all the questions that 
will be on the test. The stimulus conditions and contingencies 
of reinforcement in the generalization setting(s) will provide 
that test to the learner. Planning involves trying to determine 
what the final exam will cover (type and form of questions), 
whether any trick questions will appear (e.g., confusing stimuli 
that might evoke the target response when it should not occur), 
and whether the learner will need to use his new knowledge or 
skill in different ways (response generalization). 


List All the Behaviors that Need to be Changed 


Listing all behaviors that require change is not an easy task, 
but a necessary one to obtain a complete picture of the teaching 
task ahead. For example, if the target behavior is teaching Brian, 
the young boy with autism, to greet people, he should learn a 
variety of greetings in addition to “Hello, how are you?” Brian 
may also need to learn behaviors to initiate and participate in 
conversations, such as responding to questions, taking turns, 
staying on topic, and so forth. Further, he may need to be taught 
when and to whom he should introduce himself. Only by having 
a complete list of all the desired forms of the behavior can the 
practitioner make meaningful decisions about which behaviors 
to teach directly and which to leave to generalization. 


The practitioner should determine whether and to what 
extent response generalization is desirable for all of the behav- 
ior changes listed, and then make a prioritized list of the varia- 
tions of the target behavior she would like to see as generalized 
outcomes. 


List All the Settings and Situations in Which 
the Target Behavior Should Occur 


A list should be made of all the desired settings and situations 
in which the learner will emit the target behavior if optimal 
generalization is achieved. Will Brian need to introduce him- 
self and talk with children his own age, to adults, to males and 
females? Will he need to talk with others at home, at school, 
in the lunchroom, on the playground? Will he be confronted 
with situations that may appear to be appropriate opportunities 
to converse but are not (e.g., an unknown adult approaches and 
offers candy) and for which an alternative response is needed 
(e.g., walking away, seeking out a known adult). (This kind of 
analysis often adds extra behaviors to the list of skills to be 
taught.) 

When all of the possible situations and settings have been 
identified, they should be prioritized according to their impor- 
tance and the client’s likelihood of encountering them. Further 
analysis of the prioritized environments should then be con- 
ducted. What discriminative stimuli usually set the occasion for 
the target behavior in these various settings and situations? What 
schedules of reinforcement for the target behavior are typical 
in these nontraining environments? What kinds of reinforcers 
are likely to be contingent on the emission of the target behav- 
ior in each of the settings? Only when the behavior analyst has 
answered all of these questions, if not by objective observation, 
then at least by considered estimation, can she begin to have a 
full picture of the teaching task ahead. 


Is All This Preintervention Planning Worth It? 


Obtaining the information just described requires considerable 
time and effort. Given limited resources, why not design an 
intervention and immediately begin trying to change the target 
behavior? It is true that many behavior changes do show gen- 
eralization, even though the extension of the trained behavior 
across time, settings, and other behaviors was unplanned and 
unprogrammed. When target behaviors have been chosen that 
are truly functional for the subject, and when those behaviors 
have been brought to a high level of proficiency under discrim- 
inative stimuli relevant to generalization settings, the chances 
of generalization are good. But what constitutes a high level of 
proficiency for certain behaviors in various settings? What are 
all of the relevant discriminative stimuli in all of the relevant 
settings? What are all the relevant settings? 

Without a systematic plan, a practitioner will usually be 
ignorant of the answers to these vital questions. Few behaviors 
important enough to target have such limited needs for general- 
ized outcomes that the answers to such questions are obvious. 
Just a cursory consideration of the behaviors, settings, and peo- 
ple related to a child introducing himself revealed numerous 
factors that may need to be incorporated into an instructional 
plan. A more thorough analysis would produce many more. In 


fact, a complete analysis will invariably reveal more behaviors to 
be taught than time or resources would ever allow. And Brian— 
the 10-year-old who is learning to greet people and introduce 
himself—in all likelihood needs to learn many other skills also, 
such as self-help, academic, and recreation and leisure skills, to 
name just a few. Why, then, create all the lists in the first place 
when everything cannot be taught anyway? Why not just train 
and hope?° 

Baer (1999) described six possible benefits of listing all 
the forms of behavior change and all the situations in which 
these behavior changes should occur. 


1. You now see the full scope of the problem ahead of you, 
and thus see the corresponding scope that your teaching 
program needs to have. 


2. If you teach less than the full scope of the problem, you do 
so by choice rather than by forgetting that some forms of 
the behavior could be important, or that there were some 
other situations in which the behavior change should or 
should not occur. 


2 


If less than a complete teaching program results in less 
than a complete set of behavior changes, you will not be 
surprised. 


4. You can decide to teach less than there is to learn, perhaps 
because that is all that is practical or possible for you to do. 


5. You can decide what is most important to teach. You can 
also decide to teach the behavior in a way that encourages 
the indirect development of some of the other forms of the 
desired behavior, as well as the indirect occurrence of the 
behavior in some other desired situations, that you will not 
or cannot teach directly. 


6. But if you choose the option discussed in number 5 above, 
rather than the complete program implicit in number 1, 
you will do so knowing that the desired outcome would 
have been more certain had you taught every desirable 
behavior change directly. The best that you can do is to 
encourage the behavior changes that you do not cause 
directly. So, you will have chosen the option in number 
5 either of necessity or else as a well-considered gamble 
after a thoughtful consideration of possibilities, costs, and 
benefits. (pp. 10-11) 


After determining which behaviors to teach directly and 
in which situations and settings to teach those behaviors, the 
behavior analyst is ready to consider strategies and tactics for 
promoting generalized outcomes. 


STRATEGIES AND TACTICS FOR PROMOTING 
GENERALIZED BEHAVIOR CHANGE 


Various conceptual systems and methodological taxonomies 
for promoting generalized behavior change have been proposed 
(e.g., Horner, Dunlap, & Koegel, 1988; Osnes & Lieblein, 2003; 
Stokes & Baer, 1977; Stokes & Osnes, 1989). The conceptual 
scheme presented here is informed by the work of those authors 
and others, and by our experiences designing, implementing, and 
evaluating methods for promoting generalized outcomes and 
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in teaching practitioners to use them. Although numerous 
methods and techniques have been demonstrated and given a 
variety of names, most tactics that effectively promote gener- 
alized behavior change can be classified under five strategic 
approaches: 


¢ Teach the full range of relevant stimulus conditions and 
response requirements. 

¢ Make the instructional setting similar to the generalization 
setting. 

¢ Maximize the target behavior’s contact with reinforce- 
ment in the generalization setting. 

¢ Mediate generalization. 

¢ Train to generalize. 


In the following sections we describe and provide exam- 
ples of 13 tactics used by applied behavior analysts to accom- 
plish these five strategies (see Figure 30.4). Although each 
tactic is described individually, most interventions to promote 
generalized behavior change combine these tactics (e.g., Bord, 
Sidener, Reeve, & Disener, 2017; Grossi, Kimball, & Heward, 
1994; Hughes, Harmer, Killina, & Niarhos, 1995; Trask-Tyler, 
Grossi, & Heward, 1994). 


Teach the Full Range of Relevant Stimulus 
Conditions and Response Requirements 


The most common mistake that teachers make, when they 
want to establish a generalized behavior change, is to teach 
one good example of it and expect the student to generalize 
from that example. 

—Donald M. Baer (1999, p. 18) 


Most important behaviors must be performed in various ways 
across a wide range of stimulus conditions. Consider a person 
skilled in reading, math, conversing with others, and cooking. 
That person can read thousands of different words; add, subtract, 
multiply, and divide any combination of numbers; make a mul- 
titude of relevant and appropriate comments when talking with 
others: and measure, combine, and prepare numerous ingre- 
dients in hundreds of recipes. Helping learners achieve such 
wide-ranging performances presents an enormous challenge to 
the practitioner. 

One approach to this challenge would be to teach every 
desired form of a target behavior in every setting/situation in 
which the learner may need that behavior in the future. Although 
this approach would eliminate the need to program for response 
generalization and setting/situation generalization (response 
maintenance would remain an issue), pursuing it is seldom 
possible and never practical. A teacher cannot provide direct 
instruction on every printed word a student may encounter, or 
teach a student every measuring, pouring, stirring, and sautéing 
movement needed to prepare every dish he may want to make 
later on. Even for skill areas for which it would be possible to 
teach every possible example—instruction could be provided on 
all 900 single-digit-times-two-digit multiplication problems—to 
do so would be impractical for many reasons, not the least of 
which is that the student needs to learn not only many other 
types of math problems but also skills in other curriculum areas. 
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1. Teach enough stimulus examples. 
2. Teach enough response examples. 


3. Program common stimuli. 
4. Teach loosely. 


. Set behavior traps. 


O ON OD 


Mediate Generalization 
10. Contrive a mediating stimulus. 
11. Teach self-management skills. 
Train to Generalize 
12. Reinforce response variability. 
13. Instruct the learner to generalize. 


A strategy called teach enough examples consists of 
teaching the student to respond to a subset of all of the possible 
stimulus and response examples and then probing the student’s 
performance on untrained examples.° For example, the general- 
ization of a student’s ability to solve two-digit-minus-two-digit 
arithmetic problems with regrouping can be assessed by asking 
the student to solve several problems of the same type for which 
no instruction or guided practice has been provided. If the results 
of this generalization probe show that the student responds 
correctly to untaught examples, then instruction can be halted on 
this class of problems. If the student performs poorly on the gen- 
eralization probe, the practitioner teaches additional examples 
before again assessing the student’s performance on a new set 
of untaught examples. This cycle of teaching new examples and 
probing with untaught examples continues until the learner con- 
sistently responds correctly to untrained examples representing 
the full range of stimulus conditions and response requirements 
found in the generalization setting. 


Teach Enough Stimulus Examples 


The tactic for promoting setting/situation generalization called 
teach enough stimulus examples involves teaching the learner to 
respond correctly to multiple examples of antecedent stimulus 
conditions and probing for generalization to untaught stimulus 
examples. A different stimulus example is incorporated into the 
teaching program each time a change is made in any dimen- 
sion of the instructional item itself or the environmental context 
in which the item is taught. Examples of four dimensions by 
which different instructional examples can be identified and 
programmed are the following: 


* The specific item taught (e.g., multiplication facts: 
7 X 2,4 X 5; letter sounds: a, f) 

* The stimulus context in which the item is taught (e.g., mul- 
tiplication facts presented in vertical format, in horizontal 


Figure 30.4 Strategies and tactics for promoting generalized behavior change. 


Teach the Full-Range of Relevant Stimulus Conditions and Response Requirements 
Make the Instructional Setting Similar to the Generalization Setting 


Maximize Contact with Reinforcement in the Generalization Setting 
5. Teach the target behavior to levels of performance required by naturally 
existing contingericies of reinforcement. 
. Program indiscriminable contingencies. 


. Ask people in the generalization setting to reinforce the target behavior. 
. Teach the learner to recruit reinforcement. 


format, in word problems; saying the sound of ¢ when it 
appears at the beginning and end of words: rab, bat) 
¢ The setting where instruction occurs (e.g., large-group 
instruction at school, collaborative learning group, home) 
¢ The person doing the teaching (e.g., classroom teacher, 
peer, parent) 


As a general rule, the more examples used during instruc- 
tion, the more likely the learner will respond correctly to untrained 
examples or situations. The number of examples that must be 
taught before sufficient generalization occurs will vary as a func- 
tion of factors such as the complexity of the target behavior being 
taught, the teaching procedures employed, the student’s oppor- 
tunities to emit the target behavior under the various conditions, 
the naturally existing contingencies of reinforcement, and the 
learner’s history of reinforcement for generalized responding. 

Sometimes teaching as few as two examples will produce 
significant generalization to untaught examples. Stokes, Baer, and 
Jackson (1974) taught a greeting response to four children with 
severe cognitive disabilities who seldom acknowledged or greeted 
other people. The senior author, working as a dormitory assistant, 
used unconditioned reinforcers (potato chips and M&M’S) and 
praise to shape the greeting response (at least two back-and-forth 
waves of a raised hand). Then this initial trainer maintained the 
newly learned hand wave by contriving three to six contacts per 
day with each of the children in various settings (e.g., playroom, 
corridor, dormitory, courtyard). Throughout the study, as many as 
23 different staff members systematically approached the children 
during different times of the day in different settings and recorded 
whether the children greeted them with a hand waving response. 
If a child greeted a prober with the waving response, the prober 
responded with “Hello, (name).” Approximately 20 such general- 
ization probes were conducted each day with each child. 

Immediately after learning the greeting response with 
Just one trainer, one of the children (Kerry) showed good 
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setting/situation generalization by using it appropriately in 
most of her contacts with other staff members. However, the 
other three children failed to greet staff members most of the generalization by increasing the number and range of conver- 
time, even though they continued to greet the original trainer —_ sational statements that were likely to be used . subsequent 
on virtually every occasion. A second staff member then began conversations. ) fi 

to reinforce and maintain the greeting responses of these three 
children. As a result of adding the second trainer, the children’s 
greeting behavior showed widespread generalization to the other 


Additionally, participants were encouraged to develop individ- 
ual adaptations of statements, which further promoted response 


Before, during, and after multiple-exemplar training, 
generalization probes were conducted of each participant’s use 
of self-instructions, eye contact, and initiating and responding 


staff members. Stokes and colleagues’ (1974) study is import- _ to conversation partners. The 23 to 32 different students who 
ant for at least two reasons. First, it demonstrated an effective served as conversation partners for each participant represented 
method for continual assessment of setting/situation generaliza- the full range of student characteristics in the school population 
tion across numerous examples (in this case, people). Second, (e.g., gender, age, ethnicity, students with and without disabil- 
the study showed that widespread generalization may result —_ ities) and included students who were known and unknown to 
from teaching just two examples. the participants prior to the study. The rate of conversation initia- 

tions by all four participants increased during multiple-exemplar 
Teach Enough Response Examples training to levels approximating those of general education stu- 


Instruction that provides practice with a variety of response dents and was maintained at those rates after the intervention 
topographies helps ensure the acquisition of desired response “@S terminated completely (see Figure 30.5). 

forms and promotes response generalization in the form of 

untrained topographies. Often called multiple-exemplar General Case Analysis 

training, this tactic typically incorporates both stimulus and Teaching a learner to respond correctly to multiple examples 
response variations.’ Multiple-exemplar training has promoted does not guarantee generalized responding to untaught exam- 
the generalization of a wide range of behaviors by diverse ples. To achieve optimal generalization and discrimination, 


learners: auditory-visual discrimination tasks (Carr, 2003), the behavior analyst must pay close attention to the specific 
empathy skills (Sivaraman, 2017), and sharing (Marzullo-Kerth, examples used during instruction; not just any examples will 
Reeve, Reeve, & Townsend, 2011) by children with autism; do. Maximally effective instructional design requires selecting 
food preparation (Trask-Tyler, Grossi, & Heward, 1994), teaching examples that represent the full range of stimulus sit- 
domestic skills (Neef, Lensbower, Hockersmith, DePalma, & uations and response requirements in the natural environment.” 
Gray, 1990), vocational skills (Horner, Eberhard, & Sheehan, General case analysis (also called general case strategy) is a 
1986), and requests for assistance (Chadsey-Rusch, Drasgow, systematic method for selecting teaching examples that repre- 


Reinoehl, Halle, & Collet-Klingenberg, 1993) by children and sent the full range of stimulus variations and response require- 
young adults with disabilities; and identifying definitions of ments in the generalization setting (Albin & Horner, 1988; 


behavior analysis terms by university students (Meindl, Ivy, Becker & Engelmann, 1978). 

Miller, Neef, & Williamson, ROIs )e A series of studies by Horner and colleagues demon- 
Four female high school students with moderate cog- strated the importance of teaching examples that systemat- 

nitive disabilities participated in a study by Hughes and col- ically sample the range of stimulus variations and response 


leagues (1995) that assessed the effects of an intervention — requirements the learner will encounter in the generalization 
they called multiple-exemplar self-instructional training on setting (e.g., Horner et al., 1986; Horner & McDonald, 1982; 
the acquisition and generalization of the students’ conver- _ Horner, Williams, & Steveley, 1987). In a classic example of 
sational interactions with peers. The students were recom- this line of research, Sprague and Horner (1984) evaluated the 
mended for the study because they initiated conversations effects of general case instruction on the generalized use of 
and responded to peers’ efforts to talk with them at “low or vending machines by six high school students with moderate 
nonexistent rates” and seldom maintained eye contact. One to severe cognitive disabilities. The dependent variable was 
of the students, Tanya, had recently been refused a job at a the number of vending machines each student operated cor- 
restaurant because of her “reticence and lack of eye contact rectly during generalization probes of 10 different machines 
during her job interview” (p. 202). located within the community. For a probe trial to be scored 

A key element of Hughes and colleagues’ intervention as correct, a student had to perform correctly a chain of five 
was practicing a wide variety of conversation starters and responses (i.e., insert the proper number of coins, activate 
statements with different peer teachers. Ten volunteer peer the machine for the desired item, and so on). The researchers 
teachers recruited from general education classrooms helped selected the 10 vending machines used to assess generaliza- 
teach conversation skills to the participants. The peer teachers tion because each student’s performance on those machines 
were male and female, ranged in grade level from freshmen to would serve as an index of his performance “across all vend- 
seniors, and represented African American, Asian American, and ing machines dispensing food and beverage items costing 
Euro-American ethnic groups. Instead of learning a few scripted between $.20 and $.75 in Eugene, Oregon” (p. 274). None 
conversation openers, the participants practiced using multiple of the vending machines used in the generalization probes 
examples of conversation starters selected from a pooled list was identical to any of the vending machines used during 
of conversation openers used by general education students. instruction. 
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Figure 30.5 Conversation initiations per minute by four high school students with 
disabilities to conversation partners with and without disabilities during generalization 
sessions. The shaded bands represent typical performance by general education students. 
From “The Effects of Multiple-Exemplar Training on High-School Students’ Generalized Conversational Interac- 


tions” by C. Hughes, M. L. Harmer, D. J. Killina, and F. Niarhos, 1995. Journal of Applied Behavior Analysis, 28, 
p. 210. Reproduced with permission of John Wiley & Sons, Inc. 


After a single-baseline probe verified each student’s 
inability to use the 10 vending machines in the community, a 
condition the researchers called “single-instance instruction” 
began. Under this condition, each student received individual 
training on a single vending machine located in the school until 
he used the machine independently for three consecutive correct 
trials on each of 2 consecutive days. Even though each student 
had learned to operate the training machine without errors, 
the generalization probe following single-instance instruction 
revealed little or no success with the vending machines in the 
community (see Probe Session 2 in Figure 30.6). The contin- 
ued poor performance by Students 2, 3, 5, and 6 on gener- 
alization probes that followed additional instruction with the 
single-instance training machine shows that “overlearning” 


on a single example does not necessarily aid generalization. 
Further evidence of the limited generalization obtained from 
single-instance instruction is the fact that seven of the eight 
total probe trials performed correctly by all students after 
single-instance instruction were on Probe Machine 1, the vend- 
ing machine that most closely resembled the machine on which 
the students had been trained. 

Next, multiple-instance training was implemented with 
Students 4, 5, and 6. The teaching procedures and performance 
criteria for multiple-instance training were the same as those 
used in the single-instance condition, except that each student 
received instruction until he reached criterion on three new 
machines. Sprague and Horner (1984) purposely selected vend- 
ing machines to use in the multiple-instance instruction that 
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Figure 30.6 Number of nontrained vending machines operated correctly by students 


across phases and probe sessions. 


From “The Effects of Single Instance, Multiple Instance, and General Case Training on Generalized 
Vending Machine Use by Moderately and Severely Handicapped Students,” by J. R. Sprague and 
RH. Horner, 1984, Journal of Applied Behavior Analysis, 17, Pp. 276. Reproduced with permission of 


John Wiley & Sons, Inc. 


were similar to one another and that did not sample the range 
of stimulus variations and response requirements that defined 
the vending machines in the community. After reaching train- 
ing criterion on three additional machines, Students 4, 5, and 
6 were still unable to operate the machines in the community. 
During the six probe sessions that followed multiple-instance 
instruction, these students correctly completed only 9 of the 
60 total trials. 

The researchers then introduced general case instruction 
in multiple-baseline fashion across students. This condition 
was the same as multiple-instance instruction, except that the 
three different vending machines used in general case training, 
when combined with the single-instance machine, provided 
the students with practice across the total range of stimulus 


conditions and response variations found in vending machines 
in the community. None of the training machines, however, 
was exactly the same as any machine used in the generaliza- 
tion probes. After reaching training criterion on the general 
case machines, all six students showed substantial improve- 
ments in their performance on the 10 untrained machines. 
Sprague and Horner (1984) speculated that Student 3’s poor 
performance on the first generalization probe following 
general case instruction was caused by a ritualistic pattern of 
inserting coins that he had developed during previous probe 
sessions, After receiving repeated practice on the coin inser- 
tion step during a training session between Probe Sessions 
5 and 6, Student 3’s performance on the generalization 
machines improved greatly. 
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Negative, or “Don’t Do It,” Teaching Examples 
Teaching a student where and when to use a new skill or bit of 
knowledge does not mean that he also knows where and when 
not to use this newly learned behavior. Brian, for example, needs 
to learn that he should not say, “Hello, how are you?” to people 
that he has greeted within the past hour or so. Learners must be 
taught to discriminate the stimulus conditions that signal when 
responding is appropriate from stimulus conditions that signal 
when responding is inappropriate. 

Instruction that includes “don’t do it” teaching examples 
intermixed with positive examples provides learners with prac- 
tice discriminating stimulus situations in which the target behav- 
ior should not be emitted (i.e., S4s) from those conditions when 
the behavior is appropriate. !° This sharpens the stimulus control 
necessary to master many concepts and skills (Engelmann & 
Carnine, 1982). 

Horner and colleagues (1986) incorporated “don’t do it” 
examples into a training program to teach four high school stu- 
dents with moderate to severe cognitive disabilities how to bus 
tables in cafeteria-style restaurants. To correctly bus a table, the 
student had to remove all dishes, silverware, and garbage from 
tabletop, chairs, and the floor under and around the table; wipe 
the tabletop; straighten chairs; and place dirty dishes and gar- 
bage in appropriate receptacles. In addition, the students were 
taught to inquire, through the use of cards, if a customer was 
finished with empty dishes. The three settings, one for training 
and two for generalization probes, differed in terms of size and 
furniture characteristics and configurations. 

Each training trial required the student to attend to 
the following stimulus features of a table: (a) the presence or 
absence of people at a table, (b) whether people were eating at 
the table, (c) the amount and/or status of food on dishes, (d) the 
presence or absence of garbage at a table, and (e) the location 
of garbage and/or dirty dishes at a table. Training consisted of 
30-min sessions involving six table types that represented the 
range of conditions most likely to be encountered in a cafete- 
ria-style restaurant. A trainer modeled correct table busing and 
verbally prompted correct responding. When errors occurred, the 
trainer stopped the student, re-created the situation, and provided 


TABLE 30.1 
Cafeteria-style Restaurants 


additional modeling and assistance. The six training examples 
consisted of four to-be-bused tables and two not-to-be-bused 
tables (see Table 30.1). 

Generalization probe sessions were conducted in two 
restaurants not used for training. Each student was presented 
with 15 probe tables: 10 to-be-bused tables and 5 not-to-be- 
bused tables. The results showed a functional relation between 
general case instruction that included not-to-be-bused tables and 
“immediate and pronounced improvement in the percentage of 
probe tables responded to correctly” (p. 467). 

Negative teaching examples are necessary when the 
learner must discriminate between appropriate and inappropriate 
conditions for a particular response. Practitioners should ask this 
question: Is responding always appropriate in the generalization 
setting(s)? If the answer is no, then “don’t do it” teaching exam- 
ples should be part of instruction. 

Does the instructional setting naturally or automatically 
include a sufficient number and range of negative examples? 
The teaching situation must be analyzed to answer this import- 
ant question. Practitioners may need to contrive some negative 
teaching examples. Practitioners should not assume the natural 
environment will readily reveal sufficient negative examples. 
Conducting training in the natural environment is no guarantee 
that learners will be exposed to stimulus situations that they are 
likely to encounter in the generalization environment after train- 
ing. For example, in the study on teaching table busing, Horner 
and colleagues (1986) noted that, “on some days the trainer 
needed actively to set up one or more table types to ensure that 
a student had access to a table type that was not ‘naturally’ avail- 
able” (p. 464). 

“Don’t do it” teaching examples should be selected and 
sequenced according to the degree to which they differ from pos- 
itive examples (i.e., Ss). The most effective negative teaching 
examples will share many of the relevant characteristics of the 
positive teaching examples (Horner et al., 1988). For example, 
the “don’t bus” tables used by Horner and colleagues (1986) 
shared many features with the “bus” tables (see Table 30.1). 
Such minimum difference negative teaching examples help the 
learner to perform the target behavior with the precision required 


Six Training Examples Used to Teach Students with Disabilities How to Bus Tables in 


— ee ee EEE 


Presence of Garbage: Location 
Training People and People Eating Dishes: Empty/ Present or of Garbage Correct 
Examples Possessions or Not Eating Part/New Food Not Present and Dishes Response 
1 O People + N/A Partial Present Table, Chairs Don’t Bus 
possessions 
2 O People N/A Partial Present Table, Floor, Chair Bus 
3 2 People Eating New food Present Table, Chair, Floor Don’t Bus 
4 O People N/A Empty Present Table, Floor Bus 
5 1 Person Not eating Empty Present Chair, Floor Bus 
6 2 People Not eating Empty Present Table Bus 


From “Teaching Generalized Table Bussing: The Importance of Negative Teaching Examples,” by R. H. Horner, J. M. Eberhard, and M. R. Sheehan, 1986 
Behavior Modification, 10, p. 465. Copyright 1986 by the Sage Publications, Inc. Used by permission. 


by the natural environment. Minimum difference teaching exam- 
ples help eliminate “generalization errors” due to overgeneral- 
ization and faulty stimulus control. 


Make the Instructional Setting Similar 
to the Generalization Setting 


Fresno State coach Pat Hill expects Ohio Stadium to be a 
new experience for the Bulldogs, who will visit for the first 
time. For a practice last week in FSU’s stadium, Hill hired 
a production company to blast the Ohio State fight song—at 
about 90 decibels—throughout the two-hour session. “We 
created some noise and atmosphere to give us a feel of a live 
game,” Hill said. 


—Columbus Dispatch (August 27, 2000)!! 


A basic strategy for promoting generalization is to incorpo- 
rate into the instructional setting stimuli the learner is likely to 
encounter in the generalization setting. The greater the similarity 
between the instructional environment and the generalization 
environment, the more likely the target behavior will be emitted 
in the generalization setting. The principle of stimulus general- 
ization states that a behavior is likely to be emitted in the pres- 
ence of stimuli that are very similar to the stimulus conditions in 
which the behavior was reinforced previously, but the behavior 
will likely not be emitted under stimulus conditions that differ 
significantly from the training stimulus. 

Stimulus generalization is a relative phenomenon: The 
more that stimulus conditions in the generalization setting 
resemble the stimulus conditions present during instruction, the 
greater the probability that the trained response will be emitted, 
and vice versa. A generalization setting that differs significantly 
from the instructional setting may not provide sufficient stim- 
ulus control over the target behavior. Such a setting may also 
contain stimuli that impede the target behavior because their 
novelty confuses or startles the learner. Exposing the learner to 
stimuli during instruction that are commonly found in the gen- 
eralization setting increases the likelihood that those stimuli will 
acquire some stimulus control over the target behavior and also 
prepares the learner for the presence of stimuli in the general- 
ization setting that have the potential of hindering performance. 
Two tactics used by applied behavior analysts to implement this 
basic strategy are program common stimuli and teach loosely. 


Program Common Stimuli 


Program common stimuli means incorporating typical fea- 
tures of the generalization setting into the instructional setting. 
Although behavior analysts have attached a special term to this 
tactic, successful practitioners in many fields have long used 
this technique for promoting generalized behavior change. 
For example, coaches (like Fresno State’s Coach Hill), music 
teachers, and theater directors hold scrimmages, mock auditions, 
and dress rehearsals to prepare their athletes, musicians, and 
actors to perform important skills in settings that include the 
sights, sounds, materials, people, and procedures that simulate 
as closely as possible those in the “real world.” 

Van den Pol and colleagues (1981) programmed common 
stimuli when they taught three young adults with disabilities 
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how to order and eat in fast-food restaurants. The researchers 
used numerous items and photos from actual restaurants to make 
the classroom simulate the conditions found in actual restau- 
rants. Plastic signs with pictures and names of various McDon- 
ald’s sandwiches were posted on the classroom wall, a table was 
transformed into a mock “counter” for role-playing transactions, 
and the students practiced responding to 60 photographic slides 
taken in actual restaurants showing examples of situations cus- 
tomers are likely to encounter. 

Bord and colleagues (2017) provide an example of pro- 
gramming common stimuli when teaching ice skating to a child 
with autism. The researchers set up a rollerblading environment 
to simulate the local ice-skating rink by arranging large cones in 
the same-sized oval as the ice-skating rink. They also included 
common features, such as the presence of people (e.g., center, 
sidelines, no people), the direction of skating (i.e., clockwise, 
counterclockwise) across sessions, and the presence and absence 
of music. 

Why go to all the trouble of simulating the generalization 
setting? Why not just conduct instruction in the generalization 
setting itself to ensure that the learner experiences all of the 
relevant aspects of the setting? First, conducting instruction in 
natural settings is not always possible or practical. The resources 
and time required to transport students to community-based set- 
tings may not be available. 

Second, community-based training may not expose stu- 
dents to the full range of examples they are likely to encounter 
later in the same setting. For example, students who receive in 
situ instruction for grocery shopping or street crossing during 
school hours may not experience the long lines at the checkout 
counters or heavy traffic patterns typical of evening hours. 

Third, instruction in natural settings may be less effective 
and efficient than classroom instruction because the trainer can- 
not halt the natural flow of events to contrive an optimal number 
and sequence of training trials needed (e.g., Neef, Lensbower, 
Hockersmith, DePalma, & Gray, 1990). 

Fourth, instruction in simulated settings can be safer, par- 
ticularly with target behaviors that must be performed in poten- 
tially dangerous environments or that have severe consequences 
if performed incorrectly (e.g., Miltenberger et al., 2005), or when 
children or people with learning problems must perform complex 
procedures. If the procedures involve invading the body or errors 
during practice are potentially hazardous, simulation training 
should be used. For example, Neef, Parrish, Hannigan, Page, and 
Iwata (1990) had children with neurogenic bladder complications 
practice performing self-catheterization skills on dolls. 

Programming common stimuli is a two-step process of (a) 
identifying salient stimuli that characterize the generalization 
setting(s) and (b) incorporating those stimuli into the instruc- 
tional setting. A practitioner can identify possible stimuli in the 
generalization setting to make common by direct observation or 
by asking people familiar with the setting. Practitioners should 
conduct observations in the generalization setting(s) and note 
prominent features of the environment that might be import- 
ant to include during training. When direct observation is not 
feasible, practitioners can obtain secondhand knowledge of the 
setting by interviewing or giving checklists to people who have 
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firsthand knowledge of the generalization setting—those who 
live, work in, or are otherwise familiar with the generalization 
setting(s) in question. 

When possible, practitioners should use the same reinforc- 
ers during instruction as those typically produced by the target 
behavior in the generalization environment. Once contacted in 
the natural environment, the reinforcers themselves may serve 
as a discriminative stimuli for the target behavior (e.g., Koegel 
& Rincover, 1977). 

If a generalization setting includes important stimuli that 
cannot be re-created or simulated in the instructional setting, 
then at least some training trials must be conducted in the gen- 
eralization setting. However, as pointed out previously, prac- 
titioners should not assume that community-based instruction 
will guarantee students’ exposure to all of the important stimuli 
common to the generalization setting. 


Teach Loosely 


Applied behavior analysts control and standardize intervention 
procedures to maximize their direct effects, and so the effects 
of their interventions can be interpreted and replicated by oth- 
ers. Yet restricting teaching procedures to a “precisely repeti- 
tive handful of stimuli or formats may, in fact, correspondingly 
restrict generalization of the lessons being learned” (Stokes & 
Baer, 1977, p. 358). To the extent that generalized behavior 
change can be viewed as the opposite of strict stimulus control 
and discrimination, one technique for facilitating generalization 
is to vary as many of the noncritical dimensions of the anteced- 
ent stimuli as possible during instruction. 

Teach loosely means randomly varying noncritical aspects 
of the instructional setting within and across teaching sessions. 
This tactic has two advantages or rationales for promoting gen- 
eralization. First, teaching loosely reduces the likelihood that a 
single or small group of noncritical stimuli will acquire exclu- 
sive control over the target behavior. A target behavior that inad- 
vertently comes under the control of a stimulus present in the 
instructional setting but not always present in the generalization 
setting may not be emitted in the generalization setting. Here are 
two examples of this type of faulty stimulus control: 


¢ Following teachers’ directions: A student with a history 
of receiving reinforcement for complying with teachers’ 
directions when they are given in a loud voice and accom- 
panied by a stern facial expression may not follow direc- 
tions that do not contain one or both of those noncritical 
variables. The discriminative stimulus for the student’s 
compliance with teacher directions should be the content 
of the teacher’s statements. 

* Assembling bicycle sprocket sets: A new employee at 
the bicycle factory inadvertently learns to assemble rear 
sprocket sets by putting a red sprocket on top of a green 
sprocket and a green sprocket on top of a blue sprocket 
because the sprocket sets on a particular bicycle model in 
production on the day she was trained were colored in that 
fashion. However, proper assembly of a sprocket set has 
nothing to do with the colors of the individual sprockets; 
the relevant variable is the relative size of the sprockets 


(i.e., the biggest sprocket goes on the bottom, the next 
biggest on top of that one, and so on). 


Systematically varying the presence and absence of non- 
critical stimuli during instruction greatly decreases the chance 
that a functionally irrelevant factor—a teacher’s tone of voice 
or sprocket color—will acquire control of the target behavior 
(Kirby & Bickel, 1988). 

A second rationale for loose teaching is that including a 
wide variety of noncritical stimuli during instruction increases 
the probability that the generalization setting will include some 
of the stimuli that were present during instruction. In this sense, 
loose teaching acts as a kind of catchall effort at programming 
common stimuli and makes it less likely that a student’s per- 
formance will be impeded or “thrown off” by the presence of a 
“strange” stimulus. 

Loose teaching applied to the previous two examples 
might entail the following: 


¢ Following teachers’ directions: During instruction, the 
teacher varies all of the factors mentioned earlier (e.g., 
tone of voice, facial expression), plus gives directions 
while standing, while sitting, from different places within 
the classroom, at different times of the day, while the stu- 
dent is alone and in groups, while looking away from the 
student, and so on. In each instance, reinforcement is con- 
tingent on the student’s compliance with the content of the 
teacher’s direction irrespective of the presence or absence 
of any of the noncritical features. 

¢ Assembling bicycle sprocket sets: During training, the 
new employee assembles sprocket sets containing sprock- 
ets of widely varying colors, after receiving the compo- 
nent sprockets in varied sequences, when the factory floor 
is busy, at different times during a work shift, with and 
without music playing, and so forth. Irrespective of the 
presence, absence, or values of any of these noncritical 
factors, reinforcement would be contingent on correct 
assembly of sprockets by relative size. 


Seldom used as a stand-alone tactic, loose teaching is often 
a recognizable component of interventions when generalization 
to highly variable and diverse settings or situations is desired. 
For example, Horner and colleagues (1986) incorporated loose 
teaching into their training program for table busing by sys- 
tematically but randomly varying the location of the tables, the 
number of people at the tables, whether food was completely or 
partially eaten, the amount and location of garbage, and so forth. 
Hughes and colleagues (1995) incorporated loose teaching by 
varying the peer teachers and varying the locations of training 
sessions. Loose teaching was evident in Bord and colleagues’ 
(2017) study when they varied the genre (e.g., rock, classical, 
no music) and volume (e.g., 80 db, 60 db, 40 db) of music in 
the instructional setting to reflect the range of music the learner 
might encounter in the generalization setting. Loose teaching is 
often a recognizable feature of language training programs that 
use milieu, incidental, and naturalistic teaching methods (e.g., 
Charlop-Christy & Carpenter, 2000; McGee, Morrier, & Daly, 
1999; Warner, 1992). 


out Few studies evaluating the effects of using loose teaching 
in isolation have been reported. One exception is an experiment 
by Campbell and Stremel-Campbell (1982), who evaluated 
the effectiveness of loose teaching as a tactic for facilitating 
the generalization of newly acquired language by two students 
with moderate cognitive disabilities. The students were taught 
the correct use of the words is and are in “wh” questions (e.g., 
“What are you doing?”’), yes/no reversal questions (e.g., “Is this 
mine?”), and statements (e.g., “These are mine?’’). Each stu- 
dent received two 15-min language training sessions conducted 
within the context of other instructional activities that were part 
of each child’s individualized education program, one during 
an academic task and the second during a self-help task. The 
student could initiate a language interaction based on the wide 
variety of naturally occurring stimuli, and the teacher could 
try to evoke a statement or question from the student by inten- 
tionally misplacing instructional materials or offering indirect 
prompts. Generalization probes of the students’ language use 
during two daily 15-min free-play periods revealed substantial 
generalization of the language structures acquired during the 
loose teaching sessions. 

The learner’s performance of the target behavior should 
be established under fairly restricted, simplified, and consistent 
conditions, before much “looseness” is introduced. This is par- 
ticularly important when teaching complex and difficult skills. 
Only noncritical (i.e., functionally irrelevant) stimuli should be 
“Joosened.” Practitioners should not inadvertently loosen stimuli 
that reliably function in the generalization setting as discrimi- 
native stimuli (SPs) or as “don’t do it” examples (S4s), Stimuli 
known to play important roles in signaling when and when not to 
respond should be systematically incorporated into instructional 
programs as teaching examples. A stimulus condition that may 
be functionally irrelevant for one skill may be a critical SP for 
another skill. 

Taking the notion of varying noncritical aspects of the 
instructional setting and procedures to its logical limit, Baer 
(1999) offered the following advice for loose teaching: 


¢ Use two or more teachers. 

¢ Teach in two or more places. 

¢ Teach from a variety of positions. 

¢ Vary your tone of voice. 

¢ Vary your choice of words. 

© Show the stimuli from a variety of angles, using some- 
times one hand and sometimes the other. 

¢ Have other persons present sometimes and not other times. 

* Dress quite differently on different days. 

¢ Vary the reinforcers. 

* Teach sometimes in bright light, sometimes in dim light. 

* Teach sometimes in noisy settings, sometimes in quiet ones. 

¢ Inany setting, vary the decorations, vary the furniture, and 
vary their locations. 

Vary the times of day when you and everyone else teach. 

+ Vary the temperature in the teaching settings. 

¢ Vary the smells in the teaching settings. 

* Within the limits possible vary the content of what’s being 


taught. 
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* Do all of this as often and as unpredictably as possi- 
ble. (p. 24) 


Of course, Baer (1999) was not suggesting that a teacher 
needs to vary all of these factors for every behavior taught. But 
building a reasonable degree of “looseness” into teaching is an 
important element of a teacher’s overall effort to program for 
generalization rather than train and hope. 


Comparing Teach Enough Examples, Program Common 
Stimuli, and Teach Loosely. Teach enough examples, program 
common stimuli, and teach loosely are three related tactics for 
promoting generalized outcomes based upon the principle of 
stimulus control. Each strategy entails the systematic selection, 
presentation, and variation of antecedent stimuli during instruc- 
tion. The definition and rationale for each tactic and examples 
of stimuli that might be included in a program to teach a novice 
driver how to parallel park are shown in Table 30.2. 

The three tactics differ from one another on the basis of 
how critical a particular antecedent stimulus is to the target 
behavior and the specificity with which the stimulus is identified 
and controlled during instruction. The tactics can be compared 
and contrasted as illustrated in Figure 30.7. At one end of the 
importance and specificity continuum is teach loosely, a tactic in 
which noncritical stimuli are randomly varied during and across 
teaching sessions. At the other end lies teach enough examples, 
which requires the identification and precise presentation of 
instructional examples representing critical features found in the 
generalization setting/situation. Relatively speaking, program- 
ming common stimuli falls in the middle of the continuum and 
overlaps with the tactics on either side. 

Judging where a given stimulus belongs on the impor- 
tance continuum can be puzzling. When teaching parallel 
parking, should the amount and flow of traffic be incorpo- 
rated into specific teaching examples or programmed as a 
common stimulus? Should passengers talking to the driver or 
one another be programmed as a common stimulus or var- 
ied among a collection of other noncritical stimuli as part of 
loose teaching? A good case could be made for either answer 
in both examples. That there are no right and wrong answers to 
these questions poses no hindrance to planning and delivering 
instruction for generalized behavior change. What is import- 
ant is taking all potentially relevant stimuli into account. We 
recommend practitioners integrate stimuli that fall in the over- 
laps between these strategies with this conservative approach: 
Build stimuli that might be teaching examples or common 
stimuli into the instructional plan as specific teaching exam- 
ples; program as common stimuli those that fall between pro- 
gram common stimuli and teach loosely. 


Maximize Contact with Reinforcement 
in the Generalization Setting 


Even though the learner emits a newly acquired target behay- 
ior in a generalization setting, generalization and maintenance 
will be short-lived if the behavior does not contact naturally 
existing contingencies of reinforcement. Five of the 13 tactics 
for promoting generalized behavior change described in this 
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TABLE 30.2 Teach Enough Examples, Program Common Stimuli, and Teach Loosely Compared 


Strategy Rationale 


Teach enough examples—Select 
and teach examples representing 
the full range of critically relevant 
stimulus variations and response 
requirements typically found in the 
generalization setting. 


setting. 


Program common stimuli—include 
Stimuli in the instructional setting 
that are known or are likely to exist 
in the generalization setting. 


situation generalization. 


Teach loosely—Vary functionally 
irrelevant stimuli within and across 
teaching sessions. 


Examples: Teaching Parallel Parking 


To increase the likelihood of correct ¢ Available parking spaces of different lengths 


responses to untaught examples e Parked car at front, back, or both ends of space 
encountered in the generalization 


Driving own car, partner's car, rental car 
e Ina hurry/ate for appointment 
e “No Parking” sign or hood over parking meter 


e Passenger questioning or challenging driver's ability to 
parallel park 


To increase the likelihood of setting/ ¢ Heavy traffic, little or no traffic 


e One or more passengers in car 
e Pedestrians on sidewalk 

¢ Radio on/off 

¢ Daytime or at night 


To (a) minimize the likelihood of an © Music, sports, or news on radio at different volumes 
irrelevant stimulus acquiring stimu-  » passenger(s) talking to driver or one another 

lus control over the target behavior 
and (b) program common stimuli to 
the extent the learner encounters 


e Weather: sun, rain, fog, snow 
e Expensive car(s) parked in front and/or back of space 


any of the “loosened” stimuli in the ¢ Police officer or police car nearby 


generalization settings. 


chapter involve some form of arranging or contriving for the 
target behavior to be reinforced in the generalization setting. 


Teach Behavior to Levels Required 
by Natural Contingencies 


Baer (1999) noted a common mistake by practitioners: failing 
to teach the behavior change to performance levels sufficient to 
meet natural contingencies of reinforcement. 


Behavior changes that seem to need generalization may 
only need better teaching. Try making the students fluent, 
and see if they still need further support for generaliza- 
tion. Fluency may consist of any or all of the following: 
high rate of performance, high accuracy of performance, 
fast latency given the opportunity to perform, and strong 
response. (p. 17) 


A new behavior may occur in the generalization setting 
but fail to make contact with naturally existing contingencies 
of reinforcement. Ensuring that a new behavior change meets 
naturally existing contingencies of reinforcement may require 
enhancing the learner’s performance of the target behavior 


in one or more of these dimensions: accuracy, rate, duration, 
latency, magnitude, and topography. For example, when given 
a worksheet to complete at his desk, a student’s behavior that is 
consistent with the following dimensions is unlikely to contact 
reinforcement for completing the task, even if the student has the 
ability to complete each worksheet item accurately. 


* Latency too long. A student who spends 5 minutes “‘day- 
dreaming” before he begins reading the directions may not 
finish in time to obtain reinforcement. 

¢ Rate too low. A student who needs 5 minutes to read the 
directions for an independent seatwork assignment that his 
peers read in less than | minute may not finish in time to 
obtain reinforcement. 

¢ Duration too brief. A student who can work without direct 
supervision for only 5 minutes at a time will not be able 
to complete any task requiring more than 5 minutes of 
independent work. 


The solution for this kind of generalization problem, while 
not always simple, is straightforward. Instruction must bring 
target behavior performance to levels commensurate with the 
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naturally occurring contingencies in the generalization setting. 
A good generalization plan specifies the performance necessary 
to meet naturally existing criteria for reinforcement. 


Program Indiscriminable Contingencies 


The clear, predictable, and immediate consequences that are 
so effective in helping learners acquire new behavior can work 
against generalized responding. This is most likely to occur when 
anewly acquired skill has not yet contacted naturally existing con- 
tingencies of reinforcement, and the learner can discriminate when 
the instructional contingencies are absent in the generalization set- 
tings. When the presence or absence of the controlling contingen- 
cies can be detected in the generalization setting (“The game’s off, 
no need to respond here/now.”), the learner may stop responding 
in the generalization setting, and the behavior change the prac- 
titioner worked so hard to develop may cease to occur before it 
can contact the naturally existing contingencies of reinforcement. 

An indiscriminable contingency is one in which the 
learner cannot discriminate whether or not the next response will 
produce reinforcement. As a tactic for promoting generalization 
and maintenance, programming indiscriminable contingencies 
involves contriving a contingency in the generalization setting 
in which (a) reinforcement is contingent on some, but not all, 
occurrences of the target behavior, and (b) the learner is unable 
to predict which responses will produce reinforcement. 

The basic rationale for programming indiscriminable con- 
tingencies is to keep the learner responding often enough and 
long enough in the generalization setting for the target behavior 
to make sufficient contact with the naturally existing contingen- 
cies of reinforcement. From that point on, the need to program 
contrived contingencies to promote generalization will be moot. 
Applied behavior analysts use two related techniques to program 
indiscriminable contingencies: intermittent schedules of rein- 
forcement and delayed rewards. 


Intermittent Schedules of Reinforcement. A newly learned 
behavior often must occur repeatedly over a period of time in 
the generalization setting before it contacts a naturally existing 
contingency of reinforcement. During that time, an extinction 
condition exists for responses emitted in the generalization set- 
ting. The current or most recent schedule of reinforcement for 
4 behavior in the instructional setting plays a significant role 
in how many responses will be emitted in the generalization 
setting prior to reinforcement. Behaviors that have been under 
continuous schedules of reinforcement (CRF) show limited 
response maintenance under extinction. When reinforcement is 
no longer available, responding is likely to decrease rapidly to 
pre-reinforcement levels. In contrast, behaviors with a history 
of intermittent schedules of reinforcement often continue to be 
emitted for relatively long periods after reinforcement is no lon- 
ger available (e.g., Dunlap & Johnson, 1985; Hoch, McComas, 
Thompson, & Paone, 2002). 

Koegel and Rincover (1977, Experiment II) demon- 
strated the effects of intermittent schedules of reinforcement 
on response maintenance in a generalization setting. The par- 
ticipants were six boys diagnosed with autism and severe to 
profound cognitive disabilities, ages 7 to 12 years, who had 
participated in a previous study and had shown generalized 
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responding in the extra-therapy setting used in that experiment 
(Rincover & Koegel, 1975). As in Experiment I by Koegel and 
Rincover (1977) described earlier in this chapter, one-on-one 
training trials were conducted with each child and the trainer 
seated at a table in a small room, and generalization trials were 
conducted by an unfamiliar adult standing outside on the lawn, 
surrounded by trees. The target behavior was (a) nonverbal imi- 
tation (e.g., raising arm) in response to an imitative model and 
the verbal instruction “Do this” and (b) touching a body part in 
response to verbal instructions such as “Touch your nose.” After 
acquiring an imitation response, each child was given additional 
trials on one of three randomly chosen schedules of reinforce- 
ment: CRF, fixed ratio (FR) 2, or FR 5. After these additional 
training trials, each child was taken outside to assess response 
maintenance. Once outside, trials were conducted until the 
child’s correct responding had decreased to 0% or maintained 
at 80% correct or above for 100 consecutive trials. 

Behaviors that were most recently on a CRF schedule 
in the instructional setting underwent extinction quickly in the 
generalization setting (see Figure 30.8). Generalized responding 
occurred longer for the FR 2 trained behavior, and longer still 
for behavior that had been shifted to an FR 5 schedule in the 
instructional setting. The results showed clearly that the sched- 
ule of reinforcement in the instructional setting had a predictable 
effect on responding in the absence of reinforcement in the gen- 
eralization setting: The thinner the schedule in the instructional 
setting, the longer the response maintenance in the generaliza- 
tion setting. 

The defining feature of all intermittent schedules of rein- 
forcement is that only some responses are reinforced, which 
means, of course, that some responses go unreinforced. One 
possible explanation for the maintenance of responding during 
periods of extinction for behaviors developed under intermittent 
schedules is the relative difficulty of discriminating that rein- 
forcement is no longer available. Thus, the unpredictability of 
an intermittent schedule may account for the maintenance of 
behavior after the schedule is terminated. 

Although all indiscriminable contingencies of rein- 
forcement involve intermittent schedules, not all schedules of 
intermittent reinforcement are indiscriminable. For example, 
although the FR 2 and FR 5 schedules of reinforcement used 
by Koegel and Rincover (1977) were intermittent, many learn- 
ers would soon be able to discriminate whether reinforcement 
would follow their next response. In contrast, a student whose 
behavior is being supported by a variable ratio (VR) 5 schedule 
of reinforcement cannot determine if his next response will 
be reinforced. 


Delayed Rewards. Stokes and Baer (1977) suggested that not 
being able to discriminate the time when a behavior will be rein- 
forced is similar to not being able to discriminate whether the 
next response will be reinforced. They cited an experiment by 
Schwarz and Hawkins (1970) in which each day after school a 
sixth-grade girl was shown videotapes of her behavior in that 
day’s math class and received praise and token reinforcement 
for improvements in her posture, reducing the number of times 
she touched her face, and speaking with sufficient volume to 
be heard by others. Reinforcement after school was contingent 
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Figure 30.8 Percent of correct responses by three children in a generalization setting 
as a function of the schedule of reinforcement used during the final sessions in an 


instructional setting. 


From “Research on the Differences between Generalization and Maintenance in Extra-Therapy 
Responding” by R. L. Koegel and A. Rincover, 1977, Journal of Applied Behavior Analysis, 10, p. 8. 


Reproduced with permission of John Wiley & Sons, Inc. 


on behaviors emitted during math class only, but comparable 
improvements were noted in spelling class as well. The gener- 
alization data were taken from videotapes of the girl’s behav- 
ior in spelling class and were never shown to her. Stokes and 
Baer hypothesized that because reinforcement was delayed (the 
behaviors that produced praise and tokens were emitted during 
math class but were not rewarded until after school), it may have 
been difficult for the student to discriminate when improved 
performance was required for reinforcement. They suggested 
that the generalization across settings of the target behaviors 
may have been a result of the indiscriminable nature of the 
response-to-reinforcement delay. 

Delayed rewards and intermittent schedules of reinforce- 
ment are alike in two ways: (a) Reinforcement is not delivered 
each time the target behavior is emitted (only some responses 
are followed by reinforcement), and (b) there are no clear stimuli 
to signal which responses will produce reinforcement. Delayed 
reward contingencies differ from intermittent reinforcement 
in that instead of the consequence immediately following an 
occurrence of the target behavior, the reward is provided after 
a period of time has elapsed (i.e., a response-to-reward delay). 
Receiving the delayed reward is contingent on the learner hav- 
ing performed the target behavior in the generalization setting 
during an earlier time period. With an effective delayed reward 
contingency, the learner cannot discriminate when (and perhaps 
where, depending on the details of the contingency) the target 
behavior must be emitted to obtain a reward later. As a result, the 
learner’s best chance to receive the reward later is to “be good 
all day” (Fowler & Baer, 1981). 

Two similar studies by Freeland and Noell (1999, 2002) 
investigated the effects of delayed rewards on the maintenance 


of students’ mathematics performance. Participants in the sec- 
ond study were two third-grade girls who had been referred by 
their teacher for help with mathematics. That target behavior for 
both students was writing answers to single-digit addition prob- 
lems with sums to 18. The researchers used a multiple-treatment 
reversal design to compare the effects of five conditions on 
the number of correct digits written as answers to single-digit 
addition problems during daily 5-min work periods (e.g., writ- 
ing “11” as the answer to “S + 6 = ?” counted as two digits 
correct). 


Baseline: Green worksheets; no programmed conse- 
quences. Students were told they could attempt as many 
or as few problems as they wanted. 

Reinforcement: Blue worksheets with a goal number at 
the top indicating the number of correct digits needed to 
choose a reward from the “goody box.” Each student’s 
goal number was the median number of correct digits on 
the last three worksheets. All worksheets were graded 
after each session, and reinforcement was contingent on 
meeting the performance goal. 

Delay 2: White worksheets with goal number; after every 
two sessions, one of the two worksheets completed by 
each student was randomly selected for grading; rein- 
forcement was contingent on meeting highest median of 
three consecutive sessions up to that point in the study. 
Delay 4: White worksheets with goal number and same 
procedures as Delay 2 except that worksheets were not 
graded until four sessions had been completed, at which 
time one of each student’s previous four worksheets was 
randomly selected and graded. 


* Maintenance: White worksheets with goal number as 
before; no worksheets were graded and no feedback or 
rewards for performance were given. 


The different-colored worksheets used for each condition 
in this study made it easy for the students to predict the likelihood 
of reinforcement. A green worksheet meant no feedback or other 
consequences no matter how many correct digits were written. 
A blue worksheet indicated that meeting the performance goal 
would earn a reward from the “goody box.” However, meeting 
the performance criterion on a white worksheet sometimes pro- 
duced reinforcement. This study demonstrates the importance 
of having contingencies in the instructional setting “look like” 
the contingencies in effect in the generalization setting(s) in two 
ways: (a) Both students showed large decreases in performance 
when baseline conditions were reinstated, and stopped respond- 
ing altogether during a second return to baseline; and (b) the stu- 
dents continued completing math problems at a high rate during 
the maintenance condition, even though no reinforcement was 
provided (see Figure 30.9). 

When the delayed (indiscriminable) contingencies were 
implemented, both students demonstrated levels of correct 
responding at or above levels during the reinforcement phase. 
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When the students were exposed to maintenance conditions, Amy 
maintained high levels of responding for 18 sessions with vari- 
able performance over the final six sessions, and Kristen showed 
a gradually increasing rate of performance over 24 sessions. The 
results demonstrated that behavior with an indiscriminable con- 
tingency can be maintained at the same rate as with a predictable 
schedule, and with greater resistance to extinction. 

Delayed consequences have been used to promote the set- 
ting/situation generalization and response maintenance of a wide 
range of target behaviors, including academic and vocational 
tasks by individuals with autism (Dunlap, Koegel, Johnson, & 
O’Neill, 1987); young children’s toy play, social initiations, 
and selection of healthy snacks (R. A. Baer, Blount, Dietrich, 
& Stokes, 1987; R. A. Baer, Williams, Osnes, & Stokes, 1984; 
Osnes, Guevremont, & Stokes, 1986); restaurant trainees’ 
responding appropriately to coworkers’ initiations (Grossi 
et al., 1994); and performance on academic tasks (Brame, 2001; 
Theodore, Bray, Kehle, & Jenson, 2001). 

Effective use of delayed consequences can reduce (or even 
eliminate in some instances) the learner’s ability to discriminate 
when a contingency is or is not in effect. As a result, the learner 
needs to “be good” (i.e., emit the target behavior) all the time. 
If an effective contingency can be made indiscriminable across 
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11, p. 13. Copyright 2002 by Human Sciences Press. 
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settings and target behaviors, the learner will also have to “be 
good” everywhere, with all of his or her relevant skills. 

Here are two examples of how the supervisor of an office 
cleaning crew could use an indiscriminable contingency to rein- 
force quality work by an employee. 


¢ At shift’s end, the employee pulls from the supervisor’s 
hat a slip of paper on which a room number and item to 
be cleaned are written (e.g., #566, carpet vacuumed). If 
the randomly selected item has been cleaned properly, the 
employee earns a reward. 

¢ The employee initials and leaves a sticky note on the door 
of each room she has cleaned. The supervisor collects the 
notes and initials those from properly cleaned rooms. At 
shift’s end, the employee draws a sticky-note from all the 
notes she left during that shift. If the randomly drawn note 
contains the supervisor’s initials, she gets a reward. 


Following are five examples of classroom applications of 
indiscriminable contingencies involving delayed rewards. Each 
application features an interdependent group contingency that 
makes group rewards contingent on the performance of ran- 
domly selected students (see Chapter 28). 


¢ Spinners and Dice I. Teachers can make academic seatwork 
periods more effective with spinners and dice. Every few 
minutes (e.g., on a variable interval [VI] 5-min schedule), 
the teacher (a) randomly selects a student’s name, (b) walks 
to that student’s desk and has the student twirl a spinner or 
roll a die, (c) counts backward from the worksheet problem 
or item the student is currently working on by the number 
shown on the spinner or die, and (d) gives a token to the stu- 
dent if that problem or item is correct. Students who imme- 
diately begin to work on the assignment and work quickly 
but carefully throughout the seatwork period are most likely 
to obtain reinforcers under this indiscriminable contingency. 
Spinners and Dice II. In a study by L. D. Heward, Park, 
and Alber-Morgan (2008), second-graders completed a 
variety of academic tasks on the board. At the end of the 
timed work session, one student spun a spinner that deter- 
mined the subject area (reading, time telling, social stud- 
ies, or language). A second student then rolled dice that 
indicated the item number. Each student who correctly 
answered the randomly selected item earned a reward. 

* Story Fact Recall Game. Many elementary teachers 
devote 20 to 30 minutes per day to sustained silent read- 
ing (SSR), a period when students can read silently from 
books of their choice (Temple, Ogle, Crawford, & Freppon, 
2018). An indiscriminable contingency can encourage stu- 
dents to read with purpose during SSR. At the end of the 
SSR period, the teacher asks several randomly selected stu- 
dents a question about the book they are reading. A student 
reading Chapter 3 of Elizabeth Winthrop’s The Castle in 
the Attic might be asked, “What did William give the Silver 
Knight to eat?” (Answer: bacon and toast). Correct answers 
are praised by the teacher, are applauded by the class, and 
earn a marble in a jar toward a reward for the whole class. 
Students do not know when they will be called on or what 
they might be asked (Brame, Heward, & Bicard, 2002). 


¢ The Jars. The teacher arranges three jars containing slips 


of paper or cards indicating different behaviors, perfor- 
mance criteria, and rewards. At the conclusion of a pre- 
announced time, the teacher randomly selects from one 
jar the subject area (e.g., math, science, social studies) 
or classroom rule (e.g., participation, no talk-outs), from 
a second jar the performance criteria (e.g., 70%, 80%, or 
90% accuracy; 10, 15, or 20 problems completed), and 
from a third jar the reward to be earned if the perfor- 
mance criterion is met. A fourth jar might include names 
of individual students or subgroups within the class who 
must meet the performance criterion for the class to 
earn the reward. Several studies have found “the jars” 
to be a fun and effective method to improve and gener- 
alize students’ academic performance and compliance 
with classroom rules (e.g., Kelshaw-Levering, Ster- 
ling-Turner, Henry, & Skinner, 2000; Theodore et al., 
2001; Zibreg Hargis, Pattie, Maheady, Budin, & Raf- 
ferty, 2017). 

Numbered Heads Together. Having small groups of stu- 
dents work together on a joint learning activity can be 
effective, but teachers should use procedures that motivate 
all students to participate. A collaborative learning tech- 
nique called Numbered Heads Together ensures that all 
students actively participate (Hunter et al., 2015; Kagan 
& Kagan, 2009). Students are seated in heterogeneous 
groups of three or four, and each student is given the num- 
ber 1, 2, 3, or 4. The teacher asks the class a question, 
and each group discusses the problem and comes up with 
an answer. Next, the teacher randomly selects a number 
from | to 4 and then calls on one or more students with 
that number to answer. It is important that every person in 
the group knows the answer to the question. This strategy 
promotes cooperation within the group rather than compe- 
tition. Because all students must know the answer, group 
members help each other understand not only the answer 
but also the how and why behind it. Finally, this strategy 
encourages individual responsibility. 

Intermittent Grading. Most students do not receive suf- 
ficient practice writing, and when students do write, the 
feedback they receive is often ineffective. One reason for 
this may be that providing detailed feedback on a daily 
composition by each student in the class requires more 
time and effort than even the most dedicated teacher 
can give. A procedure called intermittent grading offers 
one solution to this problem (Heward, Heron, Gardner, 
& Prayzer, 1991). Students write for 10 to 15 minutes 
each day, but instead of reading and evaluating every 
student’s paper, the teacher provides detailed feedback 
on a randomly selected 20% to 25% of students’ daily 
compositions. The students whose papers were graded 
earn points based on individualized performance criteria, 
and bonus points are given to the class contingent on 
the quality of the selected and graded papers (e.g., if 
the authors of four of the five papers that were graded 
met their individual criteria). Students’ papers that were 
graded can be used as a source of instructional examples 
for the next lesson. 


The effectiveness of a delayed reward tactic in promoting 
generalization and maintenance rests on (a) the indiscriminabil- 
ity of the contingency (i.e., the learner cannot tell exactly when 
emitting the target behavior in the generalization setting will 
produce a reward at a later time) and (b) the learner understand- 
ing the relation between emitting the target behavior and receiv- 
ing a reward at a later time for doing so. Box 30.2 describes how 
indiscriminable contingencies can be used to foster sustainable 
practices such as recycling and energy conservation. 


Guidelines for Programming Indiscriminable Contingencies. 
Practitioners should consider these guidelines when implement- 
ing indiscriminable contingencies: 


¢ Use continuous reinforcement during the initial stages 
of developing new behaviors or when strengthening 
little-used behaviors. 

¢ Systematically thin the schedule of reinforcement based 
on the learner’s performance (see Chapter 13). Remember 
that the thinner the schedule of reinforcement is, the more 
indiscriminable it is (e.g., an FR 8 schedule is more indis- 
criminable than an FR 2 schedule); and variable schedules 
of reinforcement (e.g., WR and VI schedules) are more 
indiscriminable than fixed schedules are (e.g., FR and 
fixed interval [FI] schedules). 

¢ When using delayed rewards, begin by delivering the 
reinforcer immediately following the target behavior and 
gradually increase the response-to-reinforcement delay. 

¢ Each time a delayed reward is delivered, explain to 
the learner that he is receiving the reward for specific 
behaviors he performed earlier. This helps to build and 
strengthen the learner’s understanding of the rule describ- 
ing the contingency. A delayed rewards intervention may 
not be effective with some learners who have significant 
intellectual disabilities. 


Set Behavior Traps 


Some contingencies of reinforcement are especially powerful, 
producing substantial and long-lasting behavior changes. Baer 
and Wolf (1970b) called such contingencies behavior traps. 
Using a mouse trap as an analogy, they described how a house- 
holder has only to exert a relatively small amount of behav- 
ioral control over the mouse—getting the mouse to smell the 
cheese—to produce a behavior change with considerable (in this 
case, complete) generalization and maintenance. 


A householder without a trap can, of course, still kill a 
mouse. He can wait patiently outside the mouse’s hole, 
grab the mouse faster than the mouse can avoid him, and 
then apply various forms of force to the unfortunate ani- 
mal to accomplish the behavioral change desired. But this 
performance requires a great deal of competence: vast 
patience, super-coordination, extreme manual dexterity, and 
a well-suppressed squeamishness. By contrast, a householder 
with a trap needs very few accomplishments: If he can 
merely apply the cheese and then leave the trap where the 
mouse is likely to smell that cheese, in effect he has guaran- 
teed general(ized) change in the mouse’s future behavior. 
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The essence of a trap, in behavioral terms, is that 
only a relatively simple response is necessary to enter the 
trap, yet once entered, the trap cannot be resisted in cre- 
ating general behavior change. For the mouse, the entry 
response is merely to smell the cheese. Everything proceeds 
from there almost automatically. (p. 321, emphasis added) 


Behavioral trapping is a fairly common phenomenon that 
everyone experiences from time to time. Behavior traps are par- 
ticularly evident in the activities students just cannot get (or do) 
enough of. The most effective behavior traps share four essential 
features: (a) They are “baited” with virtually irresistible rein- 
forcers that “lure” the student to the trap; (b) only a low-effort 
response already in the student’s repertoire is necessary to enter 
the trap; (c) interrelated contingencies of reinforcement inside 
the trap motivate the student to acquire, extend, and maintain 
targeted academic and/or social skills; and (d) they can remain 
effective for a long time because students show few, if any, sati- 
ation effects. 

Consider the case of the “reluctant bowler.” A young man 
is persuaded to fill in as a substitute for a friend’s bowling team. 
He has always regarded bowling as uncool. And bowling looks 
so easy on television that he does not see how it can be con- 
sidered a real sport. Nevertheless, he agrees to go, just to help 
out this one time. During the evening he learns that bowling 
is not as easy as he had always assumed (he has a history of 
reinforcement for athletic challenges) and that several people 
he would like to get to know are avid bowlers (i.e., it is a mixed 
doubles league). Within a week he has purchased a custom-fitted 
bowling ball, a bag, and shoes; practiced twice on his own; and 
signed up for the next league. 

The reluctant bowler example illustrates the fundamen- 
tal nature of behavior traps: easy to enter and difficult to exit. 
Some naturally existing behavior traps can lead to maladaptive 
behaviors, such as alcoholism, drug addiction, and juvenile delin- 
quency. The everyday term vicious circle refers to the natural con- 
tingencies of reinforcement that operate in destructive behavior 
traps. Practitioners, however, can learn to create behavior traps 
that help students develop positive, constructive knowledge and 
skills. Alber and Heward (1996) provided guidelines for creating 
successful traps and gave the following example of an elementary 
teacher creating a behavior trap that took advantage of a student's 
penchant for playing with baseball cards. 


Like many fifth graders struggling with reading and 
math, Carlos experiences school as tedious and unre- 
warding. With few friends of his own, Carlos finds 

that even recess offers little reprieve. But he does find 
solace in his baseball cards, often studying, sorting and 
playing with them in class. His teacher, Ms. Greene, long 
ago lost count of the number of times she had to stop an 
instructional activity to separate Carlos and his beloved 
baseball cards. Then one day, when she approached 
Carlos’ desk to confiscate his cards in the middle of a 
lesson on alphabetization, Ms. Greene discovered that 
Carlos had already alphabetized all the left-handed 
pitchers in the National League! Ms. Greene realized 
she’d found the secret to sparking Carlos’ academic 
development. 
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BOX 30.2 


Fostering Earth-friendly Behavior with Clueless Contingencies 


Earth’s average temperature in 2016 was the warmest ever 
recorded, and when 2019 began, the ten warmest years on 
record had all occurred since 1998 (National Aeronautics and 
Space Administration [NASA], 2019; National Oceanic and 
Atmospheric Administration [NOAA], 2019). This global 
warming has contributed to melting ice sheets and rising sea 
levels, an increased frequency of catastrophic floods, droughts, 
and forest fires, and has threatened the survival of some species 
(Intergovernmental Panel on Climate Change, 2018). Most of 
the warming results from increased concentrations of carbon 
dioxide (CO>) and other greenhouse gases in the atmosphere 
due to human activity since the mid-20" century, primarily 
the burning of fossil fuels (Intergovernmental Panel on Climate 
Change, 2014). 

Although some climate change deniers persist in com- 
plete disregard of scientific facts, the challenge has shifted 
from convincing people climate change is real to getting them 
to change their behavior in response to it (Thompson, 2010). 
Even if all fossil fuel emissions ceased today, our climate 
would continue warming for decades (Marcott, Shakun, Clark, 
& Mix, 2013). Successfully adapting to that reality will require 
massive shifts to renewable energy sources and changes in 
transportation infrastructure, food production, environmen- 
tal conservation, and economic practices on a global scale far 
beyond the behavior changes any of us can make now. But 
the changes we do make now can provide a behavioral wedge 
that gives society time to discover the technological fixes and 
implement the policy changes needed to make realities of car- 
bon neutrality and a truly sustainable society (Dietz, Gardner, 
Gilligan, Stern, & Vandenbergh, 2009). 


Rewarding Green Behavior 


Richard Thaler (2012), economist and co-author of Nudge: 
Improving Decisions About Health, Wealth, and Happiness, 
began a New York Times essay, “Making Good Citizenship 
Fun,” as follows: “Governments typically use two tools to 
encourage citizens to engage in civic behavior like paying their 
taxes, driving safely, or recycling their garbage: exhortation 
and fines. These efforts are often ineffective. So it might be 
a good time to include positive reinforcement.” Thaler then 
described several examples of reinforcement-based govern- 
ment campaigns to promote behavior that many people avoid, 
including obeying the speed limit in Sweden, paying taxes in 
China, and picking up dog feces off the sidewalks in Taiwan. 

Behavior analysts should be delighted that efforts to 
reward good behavior are catching on as a way to ameliorate 
social problems. Lotteries and related incentive systems are 
practically effective and conceptually consistent with behav- 
ioral principles. The overall impact of most such programs, 
however, is limited by a restricted footprint (limited setting, 
reward a single type of green behavior, reward individuals,) 
and predictability: “The game’s off. No need to respond now.” 


An indiscriminable contingency approach to enhancing 
existing green initiatives can be designed, either for individu- 
als or groups, to address multiple responses in many settings 
with many rewards. A green behavior change program featur- 
ing a well-designed indiscriminable contingency presents a 
radically different and enticing “if-then” contingency: When 
you can’t tell which of several green behaviors performed 
exactly where or when will produce a reward, the best strat- 
egy for optimizing reward is acting green in every way, every- 
where, all the time. 


Let's Play Conservation Clue 


Conservation Clue is a whodunit competition like the classic 
board game Clue’ ”, but instead of trying to evade getting caught, 
players leave clues of their good green deeds all over the place. 
Examples of target behaviors, measures, rewards, and gaming 
fun that could be part of Conservation Clue games played in res- 
idential, work, and community settings are shown in Figure A. 


1. Target Green Behaviors. Select oft-repeated behaviors 
such as turning off lights, unplugging electronics, or tak- 
ing public transit. Once-only behaviors such as buying 
a plug-in hybrid or installing solar panels do not lend 
themselves to this approach (though they can be incen- 
tivized in other ways; see Chance & Heward, 2010). 

2. Determine How to Detect and Measure the Behaviors. 
Some behaviors leave products that are readily mea- 
sured, such as the number of outlets or bulbs in use, or 
the weight or volume in a recycle bin. Other behaviors 
must be recorded as they occur, such as punches on a 
bus pass or tickets dispensed for bringing a reusable bag. 
Recruiting people to monitor and measure green actions 
can pay off with additional prompts, modeling, or inci- 
dental social reinforcement for the targeted behaviors. 
Members of student or community groups with environ- 
mental and sustainability missions would gladly serve as 
“Conservation Clue Detectives.” 

3. Ready the Playing Field. Just as athletes play better 
on a well-groomed diamond or rink, Conservation Clue 
gamers will behave greener in well-prepared environ- 
ments. Preparation entails two primary actions: minimiz- 
ing barriers to responding (e.g., putting recycling bins 
where they are easy to access) and alerting participants 
to the opportunity to play (in other words, providing 
prompts to engage in green behavior). 

4. Select Rewards. Examples of simple, relatively inexpen- 
sive rewards for a campus-based game of Conservation 
Clue include school insignia wear, bookstore discounts, 
lunch with the dean, concert tickets, and gift certificates 
for local green businesses. Virtual objects such as stars, 
trophies, and green leaves that gamers receive and collect 
online can be effective and fun rewards (Pritchard, 2010; 
Twyman, 2010). 


Settings 


Where We Live 
unplug 


e reduce water use 
reduce heat and A/C 


reduce solid waste (recycle & 
compost) 


Behaviors 
e 


e reuse containers 
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Where We Work 


Where We Shop 


e recycle paper/waste 


e carpool/take public 
transportation 


e power down idle equipment 
e use vehicles efficiently 
e reuse materials 


plugs in use 


bulb counts 
utility bills 
e meter readings 


Measures 
e 


e weight of recyclables 
e purchase receipts 


Seer 


e public transit receipts 
° GPS data from fleet vehicles 


¢ employees initial reused 
materials 


e green officer verification 


e buy locally sourced food 
and goods 


e buy products in bulk or reduced 
packaging 


e buy items made with recycled 
content 


e shop with reusable bags 


e store receipts 
e bar code scans 
e cashier verification 


e discounts at local shops 


e recycle bin stickers 
e utility bill discounts 


Rewards 


e point accrual 
e virtual rewards 


e recognition 

e extra breaks 

e junch with the boss 
e¢ company gear 

e cash bonuses 


a 


e reduced cost items 
e gift cards 


e charity donations in 
shopper's name 


e electronic fanfare 


e public posting 


jar drawings 


dice rolls 
e dart throws 


The Game 


e spinners 


e wheels of chance 


e lotteries 


e random item on sales receipts 
half-off 


e raffles 


_| 


Figure A Target behaviors, measures, rewards, and game methods that could be used in Conservation Clue, 


played where we live, work, and shop. 


indiscriminable contingency. Game rules and materials— 
spinners or dice or dart boards—should be selected to suit 
the setting and players’ interests; the crucial element is 
that multiple forms of green behavior are rewarded in a 
way that is utterly unpredictable . . . and fun. 


5. Make It a Game. The centerpiece of Conservation Clue 
is a game-like procedure for randomly determining which 
green behaviors, emitted where and when, will earn what 
rewards. The jars technique described earlier is one of 
countless ways of implementing this multidimensional 

(continued) 
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BOX 30.2 (continued) 


6. Evaluate, Revise, and Play Again. In the spirit of the 
science from which it is derived, Conservation Clue 
should be an empirical endeavor. Gather baseline data 
on target behaviors against which to compare any gains 
that subsequently come as a result of the playing. The 
game includes many variables to monitor and exper- 
iment with, such as which green behaviors are more 
malleable, which rewards are more preferred, and what 
time intervals between drawings are optimal. Player 
satisfaction should be assessed and suggestions for 
improvements sought. 


Conservation Clue is one way to back good intentions 
with consequential muscle. The behavior changes may seem 


Carlos was both astonished and thrilled to learn that 
Ms. Greene not only let him keep his baseball cards at his 
desk, but also encouraged him to “play with them” during 
class. Before long, Ms. Greene had incorporated baseball 
cards into learning activities across the curriculum. In 
math, Carlos calculated batting averages; in geography, he 
located the hometown of every major leaguer born in his 
state; and in language arts, he wrote letters to his favorite 
players requesting an autographed photo. Carlos began to 
make significant gains academically and an improvement 
in his attitude about school was also apparent. 

But school became really fun for Carlos when some 
of his classmates began to take an interest in his knowledge 
of baseball cards and all the wonderful things you could 
do with them. Ms. Greene helped Carlos form a classroom 
Baseball Card Club, giving him and his new friends oppor- 
tunities to develop and practice new social skills as they 
responded to their teacher’s challenge to think of new ways 
to integrate the cards into the curriculum. (p. 285) 


Ask People in the Generalization Setting 
to Reinforce the Target Behavior 


The problem may be simply that the natural community 
[of reinforcement] is asleep and needs to be awakened and 
turned on. 

—Donald M. Baer (1999, p. 16) 


Sometimes a potentially effective contingency of reinforcement 
in the generalization setting is not operating in a form available 
to the learner no matter how often or well she performs the target 
behavior. The contingency is there, but dormant. One solution 
for this kind of problem is to inform key people in the gener- 
alization setting of the value and importance of their attention 
to the learner’s efforts to acquire and use new skills and ask 
them to help. 

For example, a special education teacher who has been 
helping a student learn how to participate in class discussions by 
providing repeated opportunities for practice and feedback in the 
resource room could inform the teachers in the general education 


trivial at first. But many small actions can redefine what it 
means to act locally and have global impact. Imagine Conserva- 
tion Clue being played by the residents of London or Shanghai, 
drivers on the New Jersey Turnpike, or employees of a major 
corporation with a global footprint. What if Walmart’s more than 
2 million employees at the company’s 4253 stores around the 
globe engaged in a full-fledged game of Conservation Clue? 
What if they then invited the 200 million customers who shop 
at their stores each week to join in the fun? 


Adapted from “Sustaining Sustainability with Clueless Contingencies,” by 
W. L. Heward and J. W. Kimball, 2013, Sustain Magazine, 28, 4-15. Used 
with permission. 


classes that the student attends about the behavior change program 
and ask them to look for and reinforce any reasonable effort by 
the student to participate in their classrooms. A small amount of 
contingent attention and praise from these teachers may be all that 
is needed to achieve the desired generalization of the new skill. 

This simple but often effective technique for promot- 
ing generalized behavior was evident in the study by Stokes 
and colleagues (1974) in which staff members responded to 
the children’s waving response by saying, “Hello, (name).” 
Approximately 20 such generalization probes were conducted 
each day with each child. 

Williams, Donley, and Keller (2000) gave mothers of two 
preschool children with autism explicit instructions on providing 
models, response prompts, and reinforcement to their children 
who were learning to ask questions about hidden objects (e.g., 
“What’s that?” “Can I see it?’’). 

Contingent praise and attention from significant others can 
add to the effectiveness of other strategies already in place in 
the generalization environment. Broden, Hall, and Mitts (1971) 
showed this in the self-monitoring study described in Chapter 29. 
After self-recording had improved eighth-grader Liza’s study 
behavior during history class, the researchers asked Liza’s teacher 
to praise Liza for study behavior whenever he could during history 
class. Liza’s level of study behavior during this Self-Recording 
Plus Praise condition increased to a mean of 88% and was main- 
tained at levels nearly as high during a subsequent Praise Only 
condition (see Figure 29.3). 


Teach the Learner to Recruit Reinforcement 


Another way to “wake up” a potentially powerful but dormant 
natural contingency of reinforcement is to teach the learner to 
recruit reinforcement from significant others. For example, 
Seymour and Stokes (1976) taught delinquent girls to work 
more productively in the vocational training area of a residential 
institution. However, observation showed that staff at the insti- 
tution gave the girls no praise or positive interaction regardless 
of the quality of their work. The much-needed natural commu- 
nity of reinforcement to ensure the generalization of the girls’ 
improved work behaviors was not functioning. To get around 


this difficulty, the experimenters trained the girls to use a simple 
response that called the attention of staff members to their work. 
With this strategy, staff praise for good work increased. Thus, 
teaching the girls an additional response that could be used to 
recruit reinforcement enabled the target behavior to contact 
natural reinforcers that would extend and maintain the desired 
behavior change. 

Students of various ages and abilities have learned to 
recruit teacher and peer attention in classroom and community 
settings: preschoolers with developmental delays for complet- 
ing preacademic tasks and staying on task during transitions 
(Connell, Carta, & Baer, 1993; Stokes, Fowler, & Baer, 1978); 
and elementary and secondary students with learning disabilities 
(Alber, Heward, & Hippler, 1999; Wolford, Alber, & Heward, 
2001), behavioral disorders (Alber, Anderson, Martin, & Moore, 
2004; Morgan, Young, & Goldstein, 1983), and cognitive dis- 
abilities (Mank & Horner, 1987; Rouse, Everhart-Sherwood, & 
Alber-Morgan, 2014) who need help completing various aca- 
demic and social tasks and improving vocational skills. A study 
by Craft, Alber, and Heward (1998) provides a representative 
example of this research. 

Craft and colleagues (1998) assessed the effects of 
recruitment training on academic assignments for which stu- 
dents recruited teacher attention. Four elementary students were 
trained by their special education teacher (the first author) when, 
how, and how often to recruit teacher attention in the general 
education classroom. Training consisted of modeling, role play- 
ing, error correction, and praise in the special education class- 
room. The students were taught to show their work to the teacher 
or ask for help 2 to 3 times per session, and to use appropriate 
statements such as “How am I doing?” or “Does this look right?” 

Data on the frequency of student recruiting and teacher 
praise statements were collected during a daily 20-min home- 
room period in a general education classroom. During this 
period, the general education students completed a variety 
of independent seatwork tasks (reading, language arts, math) 
assigned by the general education teacher, while the four spe- 
cial education students completed spelling worksheets assigned 
by the special education teacher, an arrangement that had been 
established prior to the experiment. If students needed help with 
their assignments during homeroom, they took their work to the 
teacher’s desk and asked for help. 

The effects of recruitment training on the children’s fre- 
quency of recruiting and on the number of praise statements they 
received from the classroom teacher are shown in Figure 30.10. 
Recruiting across students increased from a mean rate of 0.01 to 
0.8 recruiting response per 20-min session during baseline to a 
mean rate of 1.8 to 2.7 after training. Teacher praise statements 
received by the students increased from a mean rate of 0.1 to 
0.8 praise statement per session during baseline to a mean rate 
of 1.0 to 1.7 after training. The ultimate meaning and outcome 
of the intervention was in the increased amount and accuracy of 
academic work by each student (see Figure 6.8). 

For a review of research on recruiting and suggestions 
for teaching children to recruit reinforcement from significant 
others, see Alber and Heward (2000). Box 30.3: “Look, Teacher, 
I’m All Finished!” provides suggestions for teaching students to 
recruit teacher attention. 
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Mediate Generalization 


Another strategy for promoting generalized behavior change 
is to arrange for some thing or person to act as a medium that 
ensures the transfer of the target behavior from the instructional 
setting to the generalization setting. Two tactics for imple- 
menting this strategy are contriving a mediating stimulus and 
teaching the learner to mediate her own generalization through 
self-management. 


Contrive a Mediating Stimulus 


One tactic for mediating generalization is to bring the target 
behavior under the control of a stimulus in the instructional set- 
ting that will function in the generalization setting to reliably 
prompt or aid the learner’s performance of the target behavior. 
The stimulus selected for this important role may exist already 
in the generalization environment, or it may be a new stimu- 
lus added to the instructional program that subsequently goes 
with the learner to the generalization setting. Whether it is a 
naturally existing component of the generalization setting or an 
added element to the instructional setting, to effectively medi- 
ate generalization, a contrived mediating stimulus must be (a) 
made functional for the target behavior during instruction and 
(b) easily transported to the generalization setting (Baer, 1999). 
The mediating stimulus is functional for the learner if it reliably 
prompts or aids the learner in performing the target behavior; 
the mediating stimulus is transportable if it easily goes with the 
learner to all important generalization settings. 

Naturally existing features of generalization settings that 
are used as contrived mediating stimuli may be physical objects 
or people. Van den Pol and colleagues (1981) used paper nap- 
kins, a common feature of any fast-food restaurant, as a contrived 
mediating stimulus. They taught students that a paper napkin was 
the only place to put food. In this way, the researchers eliminated 
the added challenge and difficulty of teaching the students to dis- 
criminate clean tables from dirty tables, to sit only at clean tables, 
or to wipe dirty tables, and then programming the generalization 
and maintenance of those behaviors. By contriving the special 
use of the napkin, only one response had to be trained, and nap- 
kins then served as a mediating stimulus for that behavior. 

Cariveau and Kodak (2017) contrived a common stimu- 
lus to promote the generalization and maintenance of academic 
engagement by second-graders during small-group writing and 
writing lessons. At the start of each lesson during intervention, 
the instructor placed in front of the students a yellow paper 
with a numeral written on it representing a performance goal 
(e.g., 8 indicated a goal of academic engagement for 80% of 
the session). At the end of each lesson, if a randomly selected 
student met the performance criterion, the entire group earned a 
reward. After a reversal design demonstrated a functional rela- 
tion between the intervention and increased academic engage- 
ment, the researchers implemented a maintenance condition that 
incorporated the common stimulus. “The instructor placed the 
yellow card with a goal written on it (i.e., the common stimuli) 
in view of all participants and said, “This is your goal for today.’ 
At the end of each session, the instructor did not provide infor- 
mation about whether the participants met their goal and did not 
deliver reinforcement.” (p. 126) 
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Figure 30.10 Number of recruiting responses (data points) and teacher praise statement (bars) per 20-min 
session. Target recruiting rate was two to three responses per session. Asterisks indicate when each 


student was trained in the resource room. 


From “Teaching Elementary Students with Developmental Disabilities to Recruit Teacher Attention in a General Education Classroom: 
Effects on Teacher Praise and Academic Productivity” by M. A. Craft, S. R. Alber, and W. L. Heward, 1998, Journal of Applied Behavior 
Analysis, 31, p. 407. Reproduced with permission of John Wiley & Sons, Inc. 


BOX 30.3 


“Look, Teacher, I’ 


m All Finished!” 


Teaching Students to Recruit Teacher Attention 


Classrooms are extremely busy places, and even the most con- 
scientious teachers can easily overlook students’ important 
academic and social behaviors. Research shows that teachers 
are more likely to pay attention to a disruptive student than to 
one who is working quietly and productively (Walker, 1997). 
It can be hard for teachers to recognize students who need help, 
especially low-achieving students who are less likely to ask for 
help (Newman & Golding, 1990). 

Although teachers in general education classrooms are 
expected to adapt instruction to serve students with disabil- 
ities, this is not always the case. Most secondary teachers 


interviewed by Schumm and colleagues (1995) believed that 
students with disabilities should take responsibility for obtain- 
ing the help they need. Thus, knowing how to politely recruit 
teacher attention and assistance can help students with disabil- 
ities function more independently and actively influence the 
quality of instruction they receive. 


Who Should Be Taught to Recruit? 


Although most students would probably benefit from learning 
to recruit teacher praise and feedback, here are some ideal can- 
didates for recruitment training: 


Withdrawn Willamena. Willamena seldom asks a teacher 


anything. Because she is so quiet and well behaved, her 
teachers sometimes forget she’s in the room. 


In-a-Hurry Harry. Harry is usually half-done with a task 
before his teacher finishes explaining it. Racing through his 
work allows him to be the first to turn it in. But his work 
is often incomplete and filled with errors, so he doesn’t 
hear much praise from his teacher. Harry would benefit 


from recruitment training that includes self-checking and 
self-correction. 


Shouting Shelly. Shelly has just finished her work, and she 
wants her teacher to look at it—right now! But Shelly doesn’t 
raise her hand. She gets her teacher’s attention—and disrupts 
most of her classmates—by shouting across the room. Shelly 
should be taught appropriate ways to solicit teacher attention. 


Pestering Pete. Pete always raises his hand, waits quietly 
for his teacher to come to his desk, and then politely asks, 
“Have I done this right?” But he repeats this routine a 
dozen or more times in a 20-min period, and his teachers 
find it annoying. Positive teacher attention often turns into 
reprimands. Recruitment training for Pete will teach him to 
limit the number of times he cues his teachers for attention. 


How to Get Started 


1. Identify target behaviors. Students should recruit 
teacher attention for target behaviors that are valued and 
therefore likely to be reinforced, such as writing neatly 
and legibly, working accurately, completing assigned 
work, cleaning up at transitions, and making contribu- 
tions when working in a cooperative group. 
Teach self-assessment. Students should self-assess their 
work before recruiting teacher attention (e.g., Sue asks 
herself, “Is my work complete?”). After the student can 
reliably distinguish between complete and incomplete 
work samples, she can learn how to check the accuracy 
of her work with answer keys or checklists of the steps or 
components of the academic skill, or how to spot-check 
two or three items before asking the teacher to look at it. 
Teach appropriate recruiting. Teach students when, 
how, and how often to recruit and how to respond to the 
teacher after receiving attention. 
© When? Students should signal for teacher attention 
after they have completed and self-checked a substan- 
tial part of their work. Students should also be taught 
when not to try to get their teacher’s attention (e.g., 
when the teacher is working with another student, 
talking to another adult, or taking the lunch count). 
¢ How? The traditional hand raise should be part of 
every student’s recruiting repertoire. Other methods 
of gaining attention should be taught depending on 
teacher preferences and routines in the general edu- 
cation classroom (e.g., have students signal they need 
help by standing up a small flag on their desks; expect 
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students to bring their work to the teacher’s desk for 
help and feedback). 

¢ How often? While helping Withdrawn Willamena learn 
to seek teacher attention, don’t turn her into a Pestering 
Pete. How often a student should recruit varies across 
teachers and activities (e.g., independent seatwork, 
cooperative learning groups, whole-class instruction). 
Direct observation in the classroom is the best way to 
establish an optimal rate of recruiting; it is also a good 
idea to ask the regular classroom teacher when, how, and 
with what frequency she prefers students to ask for help. 

¢ What to say? Students should be taught several state- 
ments that are likely to evoke positive feedback from 
the teacher (e.g., “Please look at my work.” “Did I do 
a good job?” “How am I doing?”). Keep it simple, but 
teach the student to vary her verbal cues so she will 
not sound like a parrot. 

* How to respond? Students should respond to their 
teacher’s feedback by establishing eye contact, smil- 
ing, and saying “Thank you.” Polite appreciation is 
very reinforcing to teachers. 

Role-play the complete sequence. Begin by providing 

students with a rationale for recruiting (e.g., the teacher 

will be happy you did a good job, you will get more work 
done, your grades might improve). Thinking aloud while 
modeling is a good way to show the recruiting sequence. 

While performing each step, say, “Okay, I’ve finished 

my work. Now I’m going to check it. Did I put my name 

on my paper? Yes. Did I do all the problems? Yes. Did I 

follow all the steps? Yes. Okay, my teacher doesn’t look 

busy right now. P'Il raise my hand and wait quietly until 
she comes to my desk.” Have another student pretend 
to be the regular classroom teacher and come over to 
you when you have your hand up. Say, “Mr. Patterson, 
please look at my work.” The helper says, “Oh, you 

did a very nice job.” Then smile and say, “Thank you, 

Mr. Patterson.” Role-play with praise and offer corrective 

feedback until the student correctly performs the entire 

sequence on several consecutive trials. 

Prepare students for alternate responses. Of course, 

not every recruiting attempt will result in teacher praise; 

some recruiting responses may even be followed by 
criticism (e.g., “This is all wrong. Pay better attention 
the next time.”). Use role playing to prepare students 
for these possibilities and have them practice polite 
responses (e.g., “Thank you for helping me with this”). 

Promote generalization to the regular classroom. The 

success of any recruitment training effort depends on the 

student actually using his or her new skill in the regular 
classroom. 


Republished with permission of Sage, from Recruit It or Lose It! Training Stu- 
dents to Recruit Contingent Teacher Attention, S. R. Alber and W. L. Heward, 
1997, Intervention in School and Clinic, 5, pp. 275-282. Permission conveyed 
through Copyright Clearance Center, Inc. 
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The maintenance condition was continued until the stu- 
dents’ mean level of academic engagement was 20% below the 
mean level during the second treatment phase for five consecutive 
sessions. Results for three of the study’s six participants are shown 
in Figure 30.11. Although their experimental design did not 
include an analysis of the yellow paper’s effect on maintenance 
of academic engagement, Cariveau and Kodak (2017) provided 
a promising demonstration of the potential power of a contrived 
common stimulus to mediate generalized behavior change. 

When choosing a stimulus to be made common to both 
teaching and social generalization setting(s), practitioners 
should consider using people. In addition to being a requisite 
feature of social settings, people are transportable and important 
sources of reinforcement for many behaviors. A study by Stokes 
and Baer (1976) is a good example of the potential influence 
of the evocative effects of the presence of a person in the 
generalization setting who had a functional role in the learner’s 
acquisition of the target behavior in the instructional setting. 
Two preschool children with learning disabilities learned 
word-recognition skills while working as reciprocal peer 
tutors for one another. However, neither child showed reliable 
generalization of those new skills in nontraining settings until 
the peer with whom he had learned the skills was present in the 
generalization setting. 

Some contrived mediating stimuli serve as more than 
response prompts; they are prosthetic devices that assist 
the learner in performing the target behavior. Such devices can 
be especially useful in promoting generalization and mainte- 
nance of complex behaviors and extend response chains by sim- 
plifying a complex situation. Three common forms are cue cards, 
visual activity schedules, and self-operated prompting devices. 


Cue Cards. Sprague and Horner (1984) gave the students in 
their study cue cards to aid them in operating a vending machine 
without another person’s assistance. The cue cards, which had 


food and drink logos on one side and pictures of quarters paired 
with prices on the other, not only were used during instruction 
and generalization probes but also were kept by the students at 
the end of the program. A follow-up 18 months after the study 
was completed revealed that five of the six students still carried 
a cue card and were using vending machines independently. 

In a study in which middle school students with learning 
disabilities learned to recruit peer assistance during cooperative 
learning group activities in the general education classroom, their 
special education teacher provided each student with a 3- X 5-in. 
laminated cue card that listed the recruiting steps (Wolford et al., 
2001; see Figure 30.12). The students practiced using the card in 
the special education class where recruiting training took place. 
They were encouraged to put the cue cards in their language arts 
notebooks used in the general education classroom and, if neces- 
sary, to look at the card during cooperative learning group activi- 
ties as a reminder of when and how to recruit. 


Visual Activity Schedules. Visual activity schedules use icons, 
photos, illustrations, and real objects to depict a sequence of 
events the learner will experience or tasks to complete. Some 
schedules also include performance criteria and a way to check 
off completed activities and tasks. Figure 30.13 shows two exam- 
ples of visual activity schedules. Visual activity schedules can 
facilitate transitions from one activity to the next, provide struc- 
ture for down time, help the learner understand what activities 
are associated with different physical spaces, and promote social 
interactions and communication skills (Banda, Grimmett, & Hart, 
2009; Morrison, Sainato, Benchaaban, & Endo, 2002; Pierce, 
Spriggs, Gast, & Luscre, 2013). A visual schedule can promote 
independence and self-determination. Individuals who learn to 
use visual activity schedules no longer require a teacher, parent, 
or staff member to tell them when to initiate one activity and when 
to move on to the next. Beyond a simple prompting tool, a visual 
schedule can promote independence and self-determination. 
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Figure 30.12 Middle school students could refer to a 
laminated card when seeking help from classmates during 
collaborative learning group activities. 


How to Ask for Help 
Is the teacher talking? 


Is the student talking to somone else? 


3. Get student's attention. 
— say their name quietly 
— say “Excuse me.” 

— tap them on shoulder 


Ask your question clearly. 


— Can you explain what we are supposed to do? 
— How does this look so far? 
— Can you see if | am doing this right? 

5. Say “Thank you.” 


Described in “Teaching Middle School Students with Learning 
Disabilities to Recruit Peer Assistance During Cooperative Learning 
Group Activities,” by T. Wolford, S. R. Alber, and W. L. Heward, 

2001, Learning Disabilities Research & Practice, 16, 161-173. 


MacDuff, Krantz, and McClannahan (1993) conducted 
one of the first behavior analytic experiments evaluating 
visual activity schedules. They taught four boys with autism, 
ages 9 to 14, to use visual activity schedules when perform- 
ing domestic-living skills such as vacuuming and table setting 
and for leisure activities such as using manipulative toys. The 
researchers reported that prior to training with photographic 
activity schedules, the boys required continuous supervision 
and verbal prompts to complete self-help, housekeeping, and 
leisure activities. They summarized the results and suggested 
a possible explanation for the maintenance and generalization 
as follows. 


When the study ended, all 4 boys were able to display 
complex home-living and recreational repertoires for 

an hour, during which time they frequently changed 
tasks and moved to different areas of their group 

home without adults’ prompts. Photographic activity 
schedules, .. . became functional discriminative stimuli 
that promoted sustained engagement after training ceased 
and fostered generalized responding to new activity 
sequences and novel leisure materials. (p. 97) 


Self-operated Prompting Devices. Numerous studies have 
shown that learners across a wide range of age and cogni- 
tive levels can learn to use mobile audio and video playback 
devices to independently perform a variety of academic, self- 
care, domestic, and employment tasks (e.g., Bouck, Satsangi, 
Muhl, & Bartlett, 2013; Grossi, 1998; Mays & Heflin, 201 1; 
Montgomery, Storey, Post, & Lemley, 2011; Trask-Tyler et al., 
1994). The popularity of personal mobile devices such as 
smartphones and tablets enables a person to listen to or view a 
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series of self-delivered response prompts in a private, normal- 
ized manner that does not impose on or bother others (Mechling 
2007; Savage, 2014). 


Teach Self-management Skills 


The most potentially effective approach to mediating general- 
ized behavior changes rests with the one element that is always 
present in every instructional and generalization setting— 
the learner herself. Chapter 29 described a variety of self- 
management tactics that people can use to modify their own 
behavior. The logic of using self-management to mediate gen- 
eralized behavior changes goes like this: If the learner can be 
taught a behavior (not the original target behavior, but another 
behavior—a controlling response from a self-management per- 
spective) that serves to prompt or reinforce the target behavior 
in all the relevant settings, at all appropriate times, and in all of 
its relevant forms, then the generalization of the target behavior 
is ensured. But as Baer and Fowler (1984) warned: 


Giving a student self-control responses designed to mediate 
the generalization of some critical behavior changes does 
not ensure that those mediating responses will indeed be 
used. They are, after all, just responses: they, too, need gen- 
eralization and maintenance just as do the behavior changes 
that they are meant to generalize and maintain. Setting 

up one behavior to mediate the generalization of another 
behavior may succeed—but it may also represent a problem 
in guaranteeing the generalization of two responses, where 
before we had only the problem of guaranteeing the gener- 
alization of one! (p. 149) 


Train to Generalize 


If generalization is considered as a response itself, then a 
reinforcement contingency may be placed on it, the same as 
with any other operant. 


—Stokes and Baer (1977, p. 362) 


Training “to generalize” was one of the eight proactive strategies 
for programming generalized behavior change in Stokes and 
Baer’s (1977) conceptual scheme. The quotation marks around 
“to generalize” signified that the authors were hypothesizing 
about the possibilities of treating “to generalize” as an operant 
response and that they recognized “the preference of behavior- 
ists to consider generalization to be an outcome of behavioral 
change, rather than as a behavior itself” (p. 363). Although con- 
siderable debate surrounds the validity of variable responding 
being an operant in its own right (e.g., de Souza Barba, 2012; 
Holth, 2012; Neuringer, 2003, 2004; Peleg, Martin, & Holth, 
2017), basic and applied research has demonstrated the prag- 
matic value of reinforcing varied responding (e.g., Neuringer & 
Jensen, 2013; Shahan & Chase, 2002). Two tactics that applied 
behavior analysts have used are reinforcing response variability 
and instructing the learner to generalize. 


Reinforce Response Variability 


Response variability can help a person solve problems. A person 
who can improvise by emitting a variety of responses is more 
likely to solve problems encountered when a standard response 
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form fails to obtain reinforcement (e.g., Arnesen, 2000; Marckel, 
Neef, & Ferreri, 2006; Miller & Neuringer, 2000; Neuringer, 
2004; Shahan & Chase, 2002). Response variability also may 
result in behavior that is valued because it is novel or creative 
(e.g., Holman, Goetz, & Baer, 1977; Neuringer, 2003; Pryor, 


Haag, & O'Reilly, 1969). Response variability may expose a 
person to sources of reinforcement and contingencies not acces- 
sible by more constricted forms of responding. The additional 
learning that results from contacting those contingencies further 
expands the person’s repertoire. 


An One direct way to promote desired response general- 
ization 1s to reinforce response variability when it occurs. 
The simplest approach entails differential reinforcement of 
new response forms as Goetz and Baer (1973) did by attend- 
ing to children when they constructed different blockbuilding 
forms (see Figure 8.7). Systematic replications of this procedure 
have ranged from increasing diverse block building by children 
with autism (Napolitano, Smith, Zarcone, Goodkin, & McAdam, 
2010) to varied punching and kicking techniques by martial arts 
students (Harding, Wacker, Berg, Rick, & Lee, 2004). 

The contingency between response variability and rein- 
forcement can be formalized with a lag reinforcement schedule. 
Reinforcement on a lag schedule is contingent on a response being 
different in some defined way from the previous response (a Lag | 
schedule) or a specified number of previous responses (Lag 2 or 
more). Lag reinforcement schedules have produced varied verbal 
responding by children with disabilities (Contreras & Betz, 2016; 
Heldt & Schlinger, 2012; see Wiskow & Donaldson’s 2016 study 
discussed in Chapter 13). 

Cammilleri and Hanley (2005) used a lag reinforcement 
contingency to increase varied selections of classroom activities 
by two typically developing girls, ages 5 and 7, who were selected 
to participate in the study because they spent the vast majority 
of their time engaged in activities not systematically designed 
to result in a particular set of skills within the curriculum. At the 
beginning of each 60-min session, the children were told they 
could select any activity and could switch activities at any time. 
A timer sounded every 5 minutes to prompt activity choices. 
During baseline, there were no programmed consequences for 
selecting any of the activities. Intervention consisted of a lag 
reinforcement schedule in which the first activity selection and 
each subsequent novel selection were followed by the teacher 
handing the student a green card that she could exchange later 
for 2 minutes of teacher attention (resulting in a Lag 12 contin- 
gency, which was reset if all 12 activities were selected within 
a session). 

Activity selections during baseline showed little variabil- 
ity, with both girls showing strong preferences for stackable 
blocks (Figure 30.14 shows results for one of the girls). When 
the lag contingency was introduced, both girls immediately 
selected and engaged in more diverse activities. The research- 
ers noted that ‘‘an indirect but important outcome of this shift 
in time allocation was a marked increase in the number of aca- 
demic units completed” (p. 115). 


Instruct the Learner to Generalize 


The simplest and least expensive of all tactics for promoting 
generalized behavior change is to “tell the subject about the 
possibility of generalization and then ask for it” (Stokes & Baer, 
1977. D2 50>). FOF example, Ninness, Fuerst, and Rutherford 
(1991) explicitly told three middle school students with emo- 
tional disturbance to use self-management procedures they had 
learned in the classroom to self-assess and self-record their 
behavior while walking from the lunchroom to the classroom. 
Hughes and colleagues (1995) used a similar procedure to pro- 
mote generalization: “At the close of each training session, peer 
teachers reminded participants to self-instruct when they wished 
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to talk to someone” (p. 207). Likewise, at the conclusion of each 
training session conducted in the special education classroom on 
how to recruit assistance from peers during cooperative learning 
groups, Wolford and colleagues (2001) prompted middle school 
students with learning disabilities to recruit peer assistance at 
least 2 times but not more than 4 times during each cooperative 
learning group in the language arts classroom. 

To the extent that generalizations occur and are themselves 
generalized, a person might then become skilled at generalizing 
newly acquired skills or, in the words of Stokes and Baer (1977), 
become a “generalized generalizer.” 


MODIFYING AND TERMINATING SUCCESSFUL 
INTERVENTIONS 


With most successful behavior change programs it is impossible, 
impractical, or undesirable to continue the intervention indefi- 
nitely. Withdrawal of a successful intervention should be carried 
out in a systematic fashion, guided by the learner’s performance 
of the target behavior in the most important generalization set- 
tings. Gradually shifting from the contrived conditions of the 
intervention to the typical, everyday environment will increase 
the likelihood that the learner will maintain the new behavior 
patterns. When deciding how soon and how swiftly to withdraw 
intervention components, practitioners should consider factors 
such as the complexity of the intervention, the ease or speed 
with which behavior changed, and the availability of naturally 
existing contingencies of reinforcement for the new behavior. 

This shift from intervention conditions to the postinter- 
vention environment can be made by modifying one or more 
of the following components, each representing one part of the 
three-term contingency: 


¢ Antecedents, prompts, or cue-related stimuli 
¢ Task requirements and criteria 
¢ Consequences or reinforcement variables 


Although the order in which intervention components are 
withdrawn may make little or no difference, in most programs it 
is probably best to make all task-related requirements as similar 
as possible to those of the postintervention environment before 
withdrawing significant antecedent or consequence components 
of the intervention. In this way, the learner will be emitting the 
target behavior at the same level of performance that will be 
required after the complete intervention has been withdrawn. 

A behavior change program carried out many years ago by 
a graduate student in one of our classes illustrates how the com- 
ponents of a program can be gradually and systematically with- 
drawn. A man with developmental disabilities took an inordinate 
amount of time to get dressed each morning (40 to 70 minutes 
during baseline), even though he possessed the skills needed to 
dress himself. Intervention began with a construction paper clock 
hung by his bed with the hands set to indicate the time by which 
he had to be fully dressed to receive reinforcement. Although the 
man could not tell time, he could discriminate whether the posi- 
tion of the hands on the real clock nearby matched those on his 
paper clock. Two task-related intervention elements were intro- 
duced to increase the likelihood of initial success. First, he was 
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Figure 30.14 Number of novel activity selections (top panel), percentage of intervals of engagement in 
programmed (italicized) and nonprogrammed activities (middle panel; shaded cells indicate activities for 
which there was some engagement), and number of academic units completed (bottom panel). 


From “Use of a Lag Differential Reinforcement Contingency to Increase Varied Selections of Classroom Activities” by 
A. P. Cammilleri and G. P. Hanley, 2005, Journal of Applied Behavior Analysis, 38, p. 114. Reproduced with permission of 


John Wiley & Sons, Inc. 


given fewer and easier clothes to put on each morning (e.g., no 
belt, slip-on loafers instead of shoes with laces). Second, based 
on his baseline performance, he was initially given 30 minutes to 
dress himself, even though the objective of the program was for 
him to be completely dressed within 10 minutes. An edible rein- 
forcer paired with verbal praise was used first on a continuous 
schedule of reinforcement. Figure 30.15 shows how each aspect 
of the intervention (antecedent, behavior, and consequence) was 
modified and eventually withdrawn completely, so that by the 
program’s end the man was dressing himself completely within 
10 minutes without the aid of extra clocks or charts or contrived 
reinforcement other than a naturally existing schedule of inter- 
mittent praise from staff members. 

Rusch and Kazdin (1981) described a method for sys- 
tematically withdrawing intervention components while 
simultaneously assessing response maintenance that they 


called “partial-sequential withdrawal.” Martella, Leonard, 
Marchand-Martella, and Agran (1993) used a partial-sequential 
withdrawal of various components of a self-monitoring inter- 
vention they had implemented to help Brad, a 12-year-old 
student with mild cognitive disabilities, reduce the number 
of negative statements (e.g., “I hate this @#!%ing calcula- 
tor,’ “Math is crappy”) he made during classroom activities. 
The self-management intervention consisted of Brad (a) self- 
recording negative statements on a form during two class 
periods, (b) charting the number on a graph, (c) receiving his 
choice from a menu of “small” reinforcers (items costing 25 
cents or less), and (d) being allowed to choose a “large” rein- 
forcer (more than 25 cents) when his self-recorded data were in 
agreement with the trainer and at or below a gradually reducing 
criterion level for four consecutive sessions. After the rate of 
Brad’s negative statements was reduced, the researchers began 
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a four-phase partial-sequential withdrawal of the intervention. In 
the first phase, charting and earning of “large” reinforcers were 
withdrawn; in the second phase, Brad was required to have zero 
negative statements in both periods to receive a daily “small” 
reinforcer; in the third phase, Brad used the same self-moni- 
toring form for both class periods instead of one form for each 
period, as before, and the small reinforcer was no longer pro- 
vided; and in the fourth phase (the follow-up condition), all 
components of the intervention were withdrawn except for the 
self-monitoring form without a criterion number highlighted. 
Brad’s negative statements remained low throughout the gradual 
and partial withdrawal of the intervention (see Figure 30.16). 

Another approach to terminating an effective intervention 
is gradually thinning the schedule of days or sessions in which 
the full intervention operates. For example, after Falkenberg 
and Barbetta (2013) determined that homework completion 
and accuracy by four elementary students had increased and 
remained high for several weeks as a function of an intervention 
consisting of self-monitoring and a brief daily conference with 
their special education teacher to review their self-monitoring 
sheets, the researchers moved to Fading I and II conditions. In 
Fading I, the students continued to self-monitor their homework 
performance daily but met with their teacher only twice per 
week; in Fading II, self-monitoring continued but conference 
with the teacher occurred just once per week. Students’ home- 
work completion accuracy remained high throughout the two 
fading conditions and a final maintenance phase in which the 
students were no longer prompted to self-monitor and no teacher 
conferences took place. 

A word of caution is in order regarding the termination 
of successful behavior change programs. Achieving socially 
significant improvements in behavior is a defining purpose of 
applied behavior analysis. Additionally, those improved behav- 
iors should be maintained and should show generalization to 
other relevant settings and behaviors. In most instances, achiev- 
ing optimal generalization of the behavior change will require 
most, if not all, of the intervention components to be withdrawn. 
However, practitioners, parents, and others responsible for 


by an adult with developmental disabilities. 


helping children learn important behaviors are sometimes more 
concerned with whether and with how a potentially effective 
intervention will eventually be withdrawn than they are with 
whether it will produce the needed behavior change. Consid- 
ering how a proposed intervention will lend itself to eventual 
withdrawal or blending into the natural environment is import- 
ant and consistent with everything recommended throughout 
this book. And clearly, when the choice is between two or more 
interventions of potentially equal effectiveness, the intervention 
that is most similar to the natural environment and the easiest to 
withdraw and terminate should be given first priority. However, 
an important behavior change should not be forsaken because 
complete withdrawal of the intervention required to achieve it 
is considered unlikely. Some level of intervention may always 
be required to maintain certain behaviors. 


GUIDING PRINCIPLES FOR PROMOTING 
GENERALIZED OUTCOMES 


Regardless of the specific tactics selected and applied, we believe 
that practitioners’ efforts to promote generalized behavior change 
will be enhanced by adherence to five guiding principles: 


1. Minimize the need for generalization as much as possible. 

2. Conduct generalization probes before, during, and after 
instruction. 

3. Involve significant others whenever possible. 

4. Promote generalization with the least intrusive, least 
costly tactics possible. 


5. Contrive intervention tactics as needed to achieve import- 
ant generalized outcomes. 


Minimize the Need for Generalization 


Practitioners should reduce the need for generalization to 
untaught skills, settings, and situations as much as possible. 
Doing so requires thoughtful, systematic assessment of what 
behavior changes are most important. Practitioners should 
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Figure 30.16 Number of negative (solid data points) and positive (open data points) statements 
in two class periods by a 12-year-old boy with disabilities during baseline, self-monitoring, and 
partial-sequential withdrawal conditions. Horizontal lines during the self-monitoring condition 
show changing criteria for teacher-delivered reinforcement. 


“Self-Monitoring Negative Statements” by R. Martella, |. J. Leonard, N. E. Marchand-Martella, and M. Agran, 1993, 


Journal of Behavioral Education, 3, p. 84. Copyright 1993 by Human Sciences Press. Reprinted by permission. 


prioritize the knowledge and skills that will most often be 
required of the learner and the settings and situations in which the 
learner will most often benefit from using those skills. In addition 
to the environment(s) in which the learner is currently function- 
ing, practitioners should consider the environments in which the 
learner will function in the immediate future and later in life. 

The most critical behavior changes should not be relegated 
to the not-for-certain technology of generalization. The most 
important skill-setting-stimulus combinations should always be 
taught directly and, when possible, taught first. For example, a 
training program to teach a young adult with disabilities to ride 
the public bus system should use the bus routes the person will 
take most often (e.g., to and from her home, school, job site, and 
community recreation center) as teaching examples. Instead of 
providing direct instruction on routes to destinations the learner 
may visit only occasionally after training, use those routes as 
generalization probes. Achieving a high level of response main- 
tenance on the trained routes will still be a challenge. 


Probe for Generalization Before, 
During, and After Instruction 


Generalization probes should be conducted prior to, during, and 
after instruction. 


Probe Before Instruction 


Probes conducted prior to instruction may reveal that the learner 
already performs some or all of the needed behaviors in the gen- 
eralization setting, thereby reducing the scope of the teaching 
task. It is a mistake to assume that because a learner does not 
perform a particular behavior in the instructional setting, she 
does not or cannot do it in the generalization setting(s). Prein- 
struction probe data are the only objective basis for knowing that 
the learner’s performance of the target behavior after instruction 
is, in fact, a generalized outcome. 

Additionally, probes before instruction enable observation 
of the contingencies that operate in the generalization setting. 


Knowledge of such information may contribute to a more effec- 
tive treatment or instruction. 


Probe During Instruction 


Probes during instruction reveal if and when generalization has 
occurred and if and when instruction can be terminated or shifted 
in focus from acquisition to maintenance. A teacher who finds 
that a student can solve untaught examples of a particular type of 
algebraic equation after receiving instruction on just a few exam- 
ples, and shifts instruction to the next type of equation in the 
curriculum, will cover more of the algebra program effectively 
than will a teacher who continues to present additional examples. 

Probes during instruction also show whether generaliza- 
tion is not occurring, thereby indicating that a change in instruc- 
tional strategies is needed. When Sprague and Horner (1984) 
observed that one student’s poor performance on a generaliza- 
tion probe was caused by a ritualistic pattern of inserting coins, 
they incorporated repeated practice on the coin insertion step 
during a training session and the student’s performance on the 
generalization task improved greatly. 

In a study on teaching children with autism to respond 
with empathy to others, Sivaraman (2017) randomly inter- 
spersed generalization probes during all treatment sessions. 
Probes were presented in the same manner as during baseline. 
The nontraining discriminative stimuli presented during each 
probe trial belonged to the same antecedent emotion category 
(frustration, sadness, or happiness) that was being trained. To 
assess the extent to which tackling by high school football play- 
ers generalized from practice with a dummy bag at walking 
speed to tackling a live ball carrier, Harrison and Pyles (2013) 
conducted probe sessions of four-tackle generalization probes at 
each stage of a shaping program (see Figure 22.4). 

Generalization probes can often be made more efficient by 
contriving opportunities for the learner to use her new knowl- 
edge or skill. For example, instead of waiting for (and perhaps 
missing) naturally occurring opportunities for the learner to use 
her new conversational skills in the generalization environment, 
a practitioner could enlist the assistance of a “confederate” peer 
to approach the learner. Ninness et al. (1991) used contrived 
generalization probes in which students were provoked or 
distracted to test the extent to which they used a set of self- 
management skills. 

Contrived generalization probes can also be used as the 
primary measure of acquisition and generalization. For example, 
Miltenberger and colleagues (2005) contrived opportunities to 
measure and teach children’s use of gun safety skills by placing 
a gun in home and school locations where the children would 
find it. If a child did not execute the target safety skills when 
finding a gun (i.e., do not touch it, get away from the gun, and 
tell an adult), the trainer entered the room and conducted an 
in situ training session, asking the child what he should have 
done and rehearsing the entire sequence 5 times. 


Probe After Instruction 


Probes after instruction has ended reveal the extent of response 
maintenance. The question of how long probes should be con- 
ducted after instruction has ended should be answered by factors 
such as the severity of the target behavior, the importance of 
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the behavior change to the person’s quality of life, the strength 
and consistency of the response maintenance obtained by 
probes to date, and so forth. The need for long-term assessment 
of maintenance is especially critical for severe behavior prob- 
lems. In some cases, maintenance probes over many months 
and several years may be indicated (e.g., Derby et al., 1997; 
Wagman, Miltenberger, & Woods, 1995). Reid, Parsons, and 
Jensen (2017) reported that after intensive staff training at a 
center for adolescents and adults with severe disabilities brought 
about desired performance (increased participant involvement in 
functional tasks), supervisors observed staff and provided per- 
formance feedback approximately once per week for 12 weeks 
and thereafter monthly. This program of ongoing monitoring 
and feedback maintained high levels of staff performance over 
follow-up observations spanning 30 years. 

If conducting systematic generalization and maintenance 
probes seems too difficult or too contrived, the practitioner 
should consider the relative importance of the target behav- 
ior to the student or client. If a behavior is important to target 
for intervention, then assessing the generalized outcomes of that 
intervention is worth whatever effort that assessment requires. 


Involve Significant Others 


All persons are potential teachers of all sorts of behavior 

changes. Just because you are designated as “teacher” 

or “behavior analyst” does not mean that you have an 

exclusive franchise on the ability to make deliberate 

behavior changes. In fact, there is no possibility of such 

a franchise. Everyone in contact contributes to everyone 

else’s behavior, both to its changes and its maintenance. 
—Donald M. Baer (1999, p. 12) 


Teaching for generalized outcomes is a big job, and practitioners 
should try to get as much help as they can. People are almost 
always around where and when important behaviors need to be 
prompted and reinforced; and when social behaviors are tar- 
geted, people are there by definition. 

In virtually every behavior change program, people other 
than the participant and the behavior analyst are involved, and 
their cooperation is crucial to the success of the program. Foxx 
(1996) stated that in programming successful behavior change 
interventions, “10% is knowing what to do; 90% is getting 
people to do it... . Many programs are unsuccessful because 
these percentages have been reversed” (p. 230). Although 
Foxx was referring to the challenge and importance of getting 
staff to implement programs with the consistency and fidel- 
ity needed for success, the same holds for the involvement of 
significant others. 

Baer (1999) suggested identifying others who will or may 
be involved in a behavior change program as active supporters 
or tolerators. An active supporter is someone naturally present 
in the generalization setting who helps promote the generaliza- 
tion and maintenance of the target behavior by doing specific 
things. Active supporters help facilitate the desired generalized 
outcomes by arranging opportunities for the learner to use or 
practice the new skill, giving cues and response prompts for the 
behavior, and providing reinforcement for performance of the 
target behavior. 
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Active supporters for a behavior change program to teach 
independent eating to a child with severe disabilities might 
include one or two key people in the school’s cafeteria, a vol- 
unteer or aide who works regularly with the child, the child’s 
parents, and an older sibling. These people are vital parts of the 
teaching team. If optimal generalization is to occur, the active 
supporters must see to it that the learner has many opportuni- 
ties to use the new skill in the generalization environments they 
share, and the natural reinforcers they control in those environ- 
ments (e.g., praise, touch, smiles, companionship) must be used 
as consequences for the target behavior. It is not necessary, and 
probably not desirable, to limit an active supporters list to school 
staff, family members, and peers. They may not be regularly 
available in all the environments and situations in which gener- 
alization is desired. 

A tolerator is someone in the generalization setting who 
agrees not to behave in ways that would impede the generaliza- 
tion plan. Tolerators should be informed that the learner will be 
using a new skill in the generalization environment and asked 
to be patient. In an independent eating program, a list of toler- 
ators would likely include some members of the child’s family, 
school cafeteria staff, and peers. Beyond the home and school 
environments, the behavior analyst should consider the possi- 
ble role of the general public with whom the child may find 
herself sharing a table or dining area in a public restaurant. The 
learner’s initial sloppiness and slowness (she may always be 
sloppier and slower than most people) as she makes the first 
attempts beyond the familiar contingencies of home and school 
might result in various responses from strangers that could 
punish the new eating skills. Being stared at, laughed at, talked 
about, told to hurry up, or even offered assistance could reduce 
the possibility of generalization. Certainly, the behavior ana- 
lyst can inform various school staff and family members of the 
ongoing eating program and request that they not interfere with 
the child’s attempts to eat independently. But the public at large 
is another issue. It is impossible to inform everyone of the pro- 
gram. However, by considering the types of intolerant behaviors 
that the learner may encounter in the generalization setting, the 
teaching program can be constructed to include practice under 
such intolerant conditions. Instructional trials might be contrived 
to reinforce the learner for ignoring rude remarks and continuing 
to eat independently. 


Use the Least Intrusive, Least Costly Tactics 
Possible 


Behavior analysts should use less intrusive and less costly 
tactics to promote generalization before using more intrusive 
and more costly tactics. As noted previously, simply reminding 
students to use their new skills in the generalization setting is the 
easiest and least expensive of all methods that might promote 
generalization. Although practitioners should never assume that 
telling the learner to generalize will produce the desired out- 
comes, neither should they fail to include such a simple and 
cost-free tactic. Similarly, incorporating some of the most rele- 
vant features of the generalization setting into the instructional 
setting (i.e., programming common stimuli) often helps produce 
the needed generalization and is less costly than conducting 


instruction in the natural setting (e.g., Neef, Lensbower et al., 
1990; van den Pol et al., 1981). 

Not only is using less costly tactics good conservation of 
the limited resources available for teaching, but also less intru- 
sive interventions with fewer moving parts are easier to with- 
draw. Systematic generalization probes will determine whether 
generalization has occurred and whether more elaborate and 
intrusive intervention and supports are needed. 


Contrive Intervention Tactics as Needed to 
Achieve Important Generalized Outcomes 


A practitioner should not be so concerned about intrusiveness 
that she fails to implement a potentially effective intervention or 
procedures that will achieve important outcomes for the learner. 
Therefore, if necessary, practitioners should disregard the previ- 
ous guideline and contrive as many instruction and generaliza- 
tion tactics as necessary to enable the learner to generalize and 
maintain critical knowledge and skills. 

Rather than lament the lack of generalization or blame the 
learner for his inability to show generalized behavior changes, 
the behavior analyst should work to arrange whatever socially 
valid contingencies may be needed to extend and maintain the 
target behavior. 


Some Final Words of Wisdom from Don Baer 


The most difficult and important challenge facing behavioral 
practitioners is helping learners achieve generalized change in 
socially significant behaviors. A behavior change—no matter 
how important initially—is of little value to the learner if it does 
not last over time, is not emitted in appropriate settings and 
situations, or occurs in restricted form when varied topographies 
are desired. 

Research during the past 40 years has developed and 
advanced the knowledge base of what Stokes and Baer (1977) 
described as an “implicit technology of generalization” into an 
increasingly explicit and effective set of strategies and tactics 
for promoting generalized behavior change. Knowledge of these 
methods, combined with knowledge of the basic principles and 
behavior change tactics described throughout this book, pro- 
vides behavior analysts with a powerful approach for helping 
people enjoy healthy, happy, and productive lives. 

We end this chapter with a dually wise observation by Don 
Baer (1999), in which he pointed out a fundamental truth about 
the relation between a person’s experience (in this case, the 
nature of a lesson) and what is learned or not learned from that 
experience. Like Skinner before him, Baer wisely reminded us 
not to blame the learner for not behaving as we think he should. 


Learning one aspect of anything never means that you 
will automatically know the rest of it. Doing something 
skillfully now never means that you will always do it well. 
Resisting one temptation consistently never means that 
you now have character, strength, and discipline. Thus, it 
is not the learner who is dull, learning disabled, or imma- 
ture, because all learners are alike in this regard: no one 
learns a generalized lesson unless a generalized lesson is 
taught. (p. 1) [emphasis in original] 
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SUMMARY 


Generalized Behavior Change: Definitions and 
Key Concepts 


1. 


10. 


11. 


Generalized behavior change has taken place if trained 
behavior occurs at other times or in other places without 
having to be retrained completely in those times or places, 


or if functionally related behaviors occur that were not 
taught directly. 


. Response maintenance refers to the extent to which a 


learner continues to perform a behavior after a portion or 
all of the intervention responsible for the behavior’s initial 
appearance in the learner’s repertoire has been terminated. 


. Setting/situation generalization refers to the extent to 


which a learner emits the target behavior in settings or sit- 
uations that are different from the instructional setting. 


. The instructional setting is the environment where instruc- 


tion occurs and encompasses all aspects of the environment, 
planned or unplanned, that may influence the learner’s 
acquisition and generalization of the target behavior. 


. A generalization setting is any place or stimulus situation 


that differs from the instructional setting in some meaning- 
ful way and in which performance of the target behavior 
is desired. 


. Response generalization refers to the extent to which a 


learner emits untrained responses that are functionally 
equivalent to the trained response. 


. Some interventions yield significant and widespread 


generalized effects across time, settings, and other behav- 
iors; others produce circumscribed changes in behavior 
with limited endurance and spread. 


. Undesirable setting/situation generalization takes two com- 


mon forms: overgeneralization, in which the behavior has 
come under control of a stimulus class that is too broad, and 
faulty stimulus control, in which the behavior comes under 
the control of an irrelevant antecedent stimulus. 


. Undesired response generalization occurs when any of a 


learner’s untrained but functionally equivalent responses 
produce undesirable outcomes. 


Other types of generalized outcomes (e.g., stimulus equiv- 
alence, contingency adduction, and generalization across 
subjects) do not fit easily into categories of response 
maintenance, setting/situation generalization, and response 
generalization. 


The generalization map is a conceptual framework for 
combining and categorizing the various types of general- 
ized behavior change. 


Planning for Generalized Behavior Change 


12. 


The first step in promoting generalized behavior changes 
is to select target behaviors that will meet naturally exist- 
ing contingencies of reinforcement. 
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13. 


14. 


15. 


16. 


A naturally existing contingency is any contingency of 
reinforcement (or punishment) that operates independent 
of the behavior analyst’s or practitioner’s efforts, including 
socially mediated contingencies contrived by other people 
and already in effect in the relevant setting. 


A contrived contingency is any contingency of reinforce- 
ment (or punishment) designed and implemented by a 
behavior analyst to achieve the acquisition, maintenance, 
and/or generalization of a targeted behavior change. 


Planning for generalization includes identifying all the 
desired behavior changes and all the environments in 
which the learner should emit the target behavior(s) after 
direct training has ceased. 


Benefits of developing the planning lists include a better 
understanding of the scope of the teaching task and an 
opportunity to prioritize the most important behavior 
changes and settings for direct instruction. 


Strategies and Tactics for Promoting Generalized 
Behavior Change 


17. 


18. 


19. 


20. 


21. 


23. 


Researchers have developed and advanced what 
Stokes and Baer (1977) called an “implicit technology 
of generalization” into an increasingly explicit and 
effective set of methods for promoting generalized 
behavior change. 


The strategy of teaching enough examples requires 
teaching a subset of all of the possible stimulus and 
response examples and then assessing the learner’s 
performance on untrained examples. 


A generalization probe is any measurement of a learner’s 
performance of a target behavior in a setting and/or 
stimulus situation in which direct training has not been 
provided. 


Teaching enough stimulus examples involves teaching 
the learner to respond correctly to more than one example 
of an antecedent stimulus and probing for generalization 
to untaught stimulus examples. 


As a general rule, the more examples the practitioner 
uses during instruction, the more likely the learner 
will be to respond correctly to untrained examples or 
situations. 


. Having the learner practice a variety of response 


topographies helps ensure the acquisition of desired 
response forms and promotes response generalization. 
Often called multiple-exemplar training, this tactic 
typically incorporates numerous stimulus examples 
and response variations. 


General case analysis is a systematic method for selecting 
teaching examples that represent the full range of stimulus 
variations and response requirements in the generalization 
setting. 
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24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


34. 


Negative, or “don’t do it,” teaching examples help learners 
identify stimulus situations in which the target behavior 
should not be performed. 


Minimum difference negative teaching examples, 
which share many characteristics with positive teaching 
examples, help eliminate “generalization errors” due to 
overgeneralization and faulty stimulus control. 


The greater the similarity between the instructional 
setting and the generalization setting, the more likely 
the target behavior will be emitted in the generalization 
setting. 


Programming common stimuli means including in the 
instructional setting stimulus features typically found 

in the generalization setting. Practitioners can identify 
possible stimuli to make common by direct observation in 
the generalization setting and by asking people who are 
familiar with the generalization setting. 


Teaching loosely—randomly varying noncritical aspects 
of the instructional setting within and across teaching 
sessions—(a) reduces the likelihood that a single or small 
group of noncritical stimuli will acquire exclusive control 
over the target behavior and (b) makes it less likely that 
the learner’s performance will be impeded or “thrown off” 
by the presence of a “strange” stimulus in the generaliza- 
tion setting. 


A newly learned behavior may fail to contact an existing 
contingency of reinforcement because it has not been 
taught well enough. The solution for this kind of gener- 
alization problem is to teach the learner to emit the target 
behavior at the rate, accuracy, topography, latency, dura- 
tion, and/or magnitude required by the naturally occurring 
contingencies of reinforcement. 


The use of intermittent schedules of reinforcement and 
delayed rewards can create indiscriminable contingen- 
cies, which promote generalized responding by making it 
difficult for the learner to discriminate whether the next 
response will produce reinforcement. 


Behavior traps are powerful contingencies of reinforce- 
ment with four defining features: (a) They are “baited” 
with virtually irresistible reinforcers; (b) only a low-effort 
response already in the student’s repertoire is needed 

to enter the trap; (c) interrelated contingencies of rein- 
forcement inside the trap motivate the student to acquire, 
extend, and maintain targeted skills; and (d) they can 
remain effective for a long time. 


One way to wake up an existing but inoperative contin- 
gency of reinforcement is to ask key people in the gen- 
eralization setting to attend to and praise the learner’s 
performance of the target behavior. 


. Another tactic for waking up a natural contingency of rein- 


forcement is to teach the learner how to recruit reinforce- 
ment in the generalization setting. 


One tactic for mediating generalization is to bring the 
target behavior under the control of a contrived stimulus 


ay 


36. 


Sie 


38. 


in the instructional setting that will reliably prompt or aid 
the learner’s performance of the target behavior in the 
generalization setting. 


Teaching a learner self-management skills with which 
he can prompt and maintain targeted behavior changes 
in all relevant settings at all times is the most potentially 
effective approach to mediating generalized behavior 
changes. 


The strategy of training to generalize is predicated on 
treating “to generalize” as an operant response class that, 
like any other operant, is selected and maintained by con- 
tingencies of reinforcement. 


One tactic for promoting response generalization is to 
reinforce response variability. On a lag reinforcement 
schedule, reinforcement is contingent on a response being 
different in some defined way from the previous response 
(a Lag | schedule) or a specified number of previous 
responses (Lag 2 or more). 


The simplest and least expensive tactic for promoting 
generalized behavior change is to tell the learner about 
the usefulness of generalization and then instruct him 
to do so. 


Modifying and Terminating Successful 
Interventions 


39. 


40. 


4i. 


With most successful behavior change programs it is 
impossible, impractical, or undesirable to continue the 
intervention indefinitely. 


The shift from formal intervention procedures to a normal 
everyday environment can be accomplished by gradually 
withdrawing elements constituting the three components 
of the training program: (a) antecedents, prompts, or 
cue-related stimuli; (b) task modifications and criteria; and 
(c) consequences or reinforcement variables. 


An important behavior change should not go unmade 
because complete withdrawal of the intervention required 
to achieve it may never be possible. Some level of inter- 
vention may always be required to maintain certain behav- 
iors, in which case attempts must be made to continue 
necessary programming. 


Guiding Principles for Promoting Generalized 
Outcomes 


42. 


Efforts to promote generalized behavior change will be 
enhanced by adhering to five guiding principles: 


* Minimize the need for generalization as much as 
possible. 


* Conduct generalization probes before, during, and after 
instruction. 


* Involve significant others whenever possible. 


* Promote generalized behavior change with the least 
intrusive, least costly tactics possible. 


* Contrive intervention tactics as needed to achieve 
important generalized outcomes. 
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KEY TERMS 


behavior trap 

contrived contingency 
contrived mediating stimulus 
general case analysis 
generalization 


generalization across subjects 
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generalization setting 
indiscriminable contingency 
instructional setting 

lag reinforcement schedule 
multiple exemplar training 


naturally existing contingency 
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programming common stimuli 
response generalization 
response maintenance 
setting/situation generalization 
teaching sufficient examples 


teaching loosely 


generalization probe 
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MULTIPLE-CHOICE QUESTIONS 


1. This is the extent to which a learner emits the target 
behavior in a setting or stimulus situation that is different 
from the instructional setting. 


a. Setting/situation maintenance 

b. Response generalization 

c. Setting/situation generalization 

d. Response maintenance 

Hint: (See “Generalized Behavior Change: Definitions 
and Key Concepts”’) 

. This is the extent to which a learner emits untrained responses 
that are functionally equivalent to the trained target behavior 
a. Setting/situation maintenance 
b. Response generalization 
c. Setting/situation generalization 
d. Response maintenance 
Hint: (See “Generalized Behavior Change: Definitions and 
Key Concepts’) 

. A learner’s untrained but functionally equivalent responses 
result in poor performance. This would be an example of: 
a. Overgeneralization 
b. Faulty stimulus control 
c. Undesirable outcome of setting/situation generalization 
d. Undesirable outcome of response generalization 
Hint: (See “Generalized Behavior Change: Definitions and 
Key Concepts’’) 

. When a target behavior comes under the restricted control 
of an irrelevant antecedent stimulus, it is an example of: 
a. Overgeneralization 
b. Faulty stimulus control 
c. Undesirable outcome of response maintenance 
d. Undesirable outcome of response generalization 
Hint: (See “Generalized Behavior Change: Definitions and 
Key Concepts’) 

. A contingency that operates without social mediation is 
referred to as: 

a. Natural contingency 


b. Contrived contingency 


c. Unplanned contingency 
d. Generalized contingency 


Hint: (See “Planning for Generalized Behavior Change”) 


. Choosing only those behaviors to change that will 


produce reinforcers in the post-intervention environment 
is referred to as: 


a. Teaching loosely 

b. Contrived contingency planning 
c. Programming common stimuli 
d. Relevance-of-behavior rule 


Hint: (See “Planning for Generalized Behavior Change”) 


. The actual number of examples needed to be taught when 


teaching sufficient stimulus examples to a learner will 
depend on: 


a. The learner’s opportunities to emit the target behavior 


under various conditions 
. Naturally existing contingencies of reinforcement 
c. The learner’s history of reinforcement for generalized 
responding 
d. All of these 


Hint: (See “Strategies and Tactics for Promoting 
Generalized Behavior Change’’) 


. When using multiple exemplar training, 


a. Only stimulus variations should be used 

b. Only response variations should be used 

c. Both stimulus and response variations should be used 
d. Neither stimulus nor response variations should be used 


Hint: (See “Strategies and Tactics for Promoting 
Generalized Behavior Change”) 


. Including typical features of the generalization setting into 


the instructional setting is referred to as: 
a. Teaching loosely 

b. Teaching negative examples 

c. General case analysis 

d. Programming common stimuli 


Hint: (See “Strategies and Tactics for Promoting Generalized 
Behavior Change”) 
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10. 


1. 


ESSAY-TYPE QUESTIONS 


Shifting from intervention to post-intervention conditions 
can be accomplished by modifying: 


a. Antecedents, prompts, or cue-related stimuli 
b. Task requirements and criteria 

c. Consequences or reinforcement variables 

d. All of these 


Hint: (See “Modifying and Terminating Successful 
Interventions’) 


11. A guiding principle for promoting generalized behavior 


change includes: 
a. Involving significant others whenever possible 


b. Promoting generalization with the least intrusive, least 
costly tactics possible 

c. Contriving intervention tactics as needed to achieve 
important generalized outcomes 

d. All of these 


Hint: (See “Guiding Principles for Promoting Generalized 
Outcomes”) 
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Name and discuss the three major types of generalized 
behavior change. 


Hint: (See “Generalized Behavior Change: Definitions 
and Key Concepts’) 


Discuss overgeneralization and faulty stimulus control, 
and how each can be detrimental to the generalized 
behavior change for a learner. 


Hint: (See “Generalized Behavior Change Is not always 
desirable’) 


Explain the most important criterion that the text 
discusses when planning for generalized behavior 
change. 


Hint: (See “Planning for Generalized Behavior 
Change’) 


NOTES 


V 


Wacker and colleagues (2017) proposed another conception of mainte- 
nance: the persistence of treatment effects when treatment is challenged. 


. Because the majority of the examples in this chapter are school based, we 


use the language of education. To help generalize the examples beyond 
schooling, read instruction as a synonym for treatment, intervention, or 
therapy; instructional setting as a synonym for clinical setting or therapy 
setting; and teacher as a synonym for practitioner, clinician, or therapist. 


. The unintended maintenance of problem behavior that may result from 


the manner in which interventions are withdrawn is an undesirable 
outcome (Ringdahl & St. Peter, 2017). See discussion of resurgence in 
Chapter 24. 


. Examples of faulty stimulus control caused by flaws in the design of 


instructional materials, and suggestions for detecting and correcting those 
flaws, can be found in J. S. Vargas (1984). 

Teaching a new behavior without developing and implementing a plan to 
facilitate its maintenance and generalization is so common that Stokes and 
Baer (1977) called it the “train and hope” approach to generalization. 
Other terms commonly used for this strategy for promoting generalized 
behavior change are training sufficient exemplars, training diversely, and 
multiple-exemplar training. 

LaFrance, Tarbox, and Holth (2019) proposed the term mixed operant 
instruction for teaching that features a rapid rotation between consecu- 
tive trials that target different operants. Teaching the same target (e.g., 
water) across verbal operants (e.g., mand, tact, intraverbal) can pro- 
mote functional interdependence between verbal operants and between 
speaker and listener repertoires (Fiorile & Greer, 2007; Greer, Stolfi, 


4. There are a number of strategies and tactics that can be 


utilized to promote generalized behavior change. What 
are three of these strategies, and how does each one aim 
to promote generalized behavior change for the learner? 


Hint: (See “Strategies and Tactics for Prompting 
Generalized Behavior Change”) 


5. When deciding how soon to withdraw an intervention or 


components of the intervention, what should be considered? 


Hint: (See “Modifying and Terminating Successful 
Interventions”’) 


6. What are the five guiding principles for promoting 


generalized behavior change? 


Hint: (See “Guiding Principles for Promoting Generalized 
Outcomes’) 


Chavez-Brown, & Rivera-Valdes, 2005; Nuzzolo-Gomez & Greer, 2004; 
Olaff, Ona, & Holth, 2017). 


8. Holth (2017) identifies two situations in which multiple-exemplar training 


is ineffective: when (a) there are no physical dimensions along which gen- 
eralized responding can emerge, or (b) the relation between an antecedent 
stimulus and an effective response is complex. 


9. Fora thorough and sophisticated treatment of the selection and sequencing 


of teaching examples for optimally effective and efficient curriculum 
design, see Engelmann and Carnine’s (1982) Theory of Instruction: 
Principles and Applications. Critchfield and Twyman (2014) and Twyman, 
Layng, Stikeleather, and Hobbins (2004) also discuss curriculum design 
from the standpoint of maximizing learning outcomes. 


10. Teaching examples used to help students discriminate when not to respond 


(i.e., S4s) are sometimes called negative examples and contrasted with 
positive examples (i.e., Ss). However, in our work in teaching this concept, 
practitioners have told us that the term negative teaching example suggests 
that the teacher is modeling or showing the learner how not to perform the 
target behavior. Instruction on the desired topography for some behaviors 
may be aided by providing students with models on how not to perform 
certain behavior (i.e., negative examples), but the function of “don’t do it” 
examples is to help the learner discriminate antecedent conditions that signal 
an inappropriate occasion for responding. 


11. Coach Hill’s effort to simulate the din opposing teams must deal with at “The 


Shoe” may have helped; his team scored twice. A bit of programming for 
generalization, however, could not overcome other variables, most notably the 
home team’s overwhelming prowess. Final score: Buckeyes 43, Bulldogs 10. 


PART 


Ethics 


In Chapter 31, Tom Freeman, Linda LeBlanc, and Jose Martinez-Diaz help behavior analysts address 
three fundamental questions concerning ethical practice: What is the right thing to do? What is worth 
doing? and What does it mean to be a good behavior analyst? The authors define ethics; outline 
standards of professional practice and a code of conduct for behavior analysts; describe strategies 
and tactics for providing ethically sound client services; and recommend how behavior analysts can 
achieve, maintain, and extend their professional competence. 
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Ethical and Professional Responsibilities 
of Applied Behavior Analysts 


Thomas R. Freeman, Linda A. LeBlanc, and Jose A. Martinez-Diaz 


LEARNING OBJECTIVES 


¢ Define ethics and state why ethical practice and codes are important. 
¢ Define and explain standards of professional practice relevant to behavior analysts. 


e Explain how professional competence can be ensured. 
e Identify and explain ethical issues in client services. 


e Explain considerations important to advocating for clients. 


thical dilemmas surface frequently in the practice 
of applied behavior analysis. Consider these 
situations: 


¢ Lee resides in a private, rural, for-profit community-based 
home for persons with intellectual disabilities. He informs 
Dr. Han, the facility director, that he wants to move to an 
affordable apartment in a nearby town. Such a move would 
represent a loss of income to the agency, might generate 
additional transition costs (e.g., moving expenses, future 
on-site supervision), and has the potential to be dangerous, 
given the spike in the crime rate in the area of town that 
Lee is considering. How could Dr. Han respond ethically 
to Lee’s request about moving without being biased by a 
conflict of interest? 

¢ Raluca has begun her supervised fieldwork in prepara- 
tion for becoming a Board Certified Behavior Analyst 
(BCBA®). Her supervisor, Anca, is already a BCBA. 
Recently, Anca has directed Raluca to do more advanced 
tasks (e.g., assessments, designing programs) that are 
beyond the material that she has encountered in her 
coursework. She has not been explicitly taught how to do 
the tasks that Anca is directing her to complete with cli- 
ents. Further, given circumstances, Anca is not directly 
observing her work or providing specific feedback. Raluca 
is concerned that she will make errors that could impact 
the quality of her client’s programming. What should 
Raluca do to ensure that she is learning new behavior 
analytic skills fluently and that her clients have optimal 
programming? What should Anca have done to mitigate 
this situation from developing in the first place? 


We thank Timothy E. Heron for his input, and acknowledge Matthew Normand 
for his contributions to a previous version of this chapter. 
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¢ Ms. Dougherty is a second-year teacher who is also pur- 

suing experience hours toward certification in behavior 
analysis. During an annual individualized education pro- 
gram (IEP) meeting, she perceives that a school district 
administrator, Mr. Doyle, is trying to “steer” the parents 
of a student with emotional disabilities into accepting a 
revised IEP. Mr. Doyle has removed the recommendation 
from the team for applied behavior analysis services and 
replaced it with his own suggestion for sensory-based 
calming techniques to be provided by a paraprofessional. 
Ms. Dougherty believes Mr. Doyle’s suggestion may be 
motivated, at least in part, by his desire to contain costs 
for his tightly budgeted school district, where professional 
behavior analysis services would represent a costlier 
intervention for this student. As a second-year teacher, 
Ms. Dougherty is concerned that if she speaks up, she 
might upset her principal and possibly lose her job. If she 
remains silent, however, and does not inform the parents 
of the team’s original recommendation, the student is 
unlikely to receive needed services. Since she is accruing 
supervised experience hours, and is, therefore, responsible 
for behaving in accordance with the Compliance Code of 
the BACB, how might Ms. Dougherty serve as an effec- 
tive advocate for the student, while attempting to retain 
both her faculty position and her credibility with school 
administrators? 

How should the behavior analyst respond in each of these 
situations? This chapter will help to address that question. We 
begin by defining ethics and discussing the importance of main- 
taining ethical and professional standards in the science and prac- 
tice of applied behavior analysis. We discuss ethical issues within 
the context of client services (e.g., informed consent, conflict of 
interest), ethical implications of new technologies, and the need 
to create professional networks to support ethical behavior. 
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WHAT IS ETHICS AND WHY IS IT IMPORTANT? 
Ethics Defined 


From a behavior analytic viewpoint, ethics is defined by specific 
rules of conduct. These rules can either be broadly stated, as in 
Koocher and Keith-Speigel’s (1998) guiding principles—(1) do 
no harm; (2) respect autonomy; (3) benefit others; (4) be just; 


(5) be truthful; (6) accord dignity; (7) treat others with care and 
compassion; (8) pursue excellence; and (9) accept accountability— 
or more specifically delineated, as in a professional code of con- 
duct. For applied behavior analysts, The Professional and Ethical 
Compliance Code for Behavior Analysts—hereafter referred to as 
The Compliance Code (see Figure 31.1)—includes 10 sections rel- 
evant to professional and ethical behavior across an array of areas. ! 


Figure 31.1 The Professional and Ethical Compliance Code of the Behavior Analyst Certification Board. 


_ 1.0 Responsible Conduct of Behavior Analysts 
1.01 Reliance on Scientific Knowledge 
1.02 Boundaries of Competence 


1.03 Maintaining Competence through Professional Development 


1.04 Integrity 
1.05 Professional and Scientific Relationships 


1.06 Multiple Relationships and Conflicts of Interest 


1.07 Exploitative Relationships 


| 2.0 Behavior Analysts’ Responsibility to Clients 
2.01 Accepting Clients 
2.02 Responsibility 
2.03 Consultation 
2.04 Third-Party Involvement in Services 
2.05 Rights and Prerogatives of Clients 
2.06 Maintaining Confidentiality 
2.07 Maintaining Records 
2.08 Disclosures 
2.09 Treatment/Intervention Efficacy 


2.10 Documenting Professional Work and Research 


2.11 Records and Data 


2.12 Contracts, Fees, and Financial Arrangements 


2.13 Accuracy in Billing Reports 
2.14 Referrals and Fees 
2.15 Interrupting or Discontinuing Services 


| 3.0 Assessing Behavior 
3.01 Behavior-Analytic Assessment 
3.02 Medical Consultation 
3.03 Behavior-Analytic Assessment Consent 
3.04 Explaining Assessment Results 
3.05 Consent-Client Records 


| 4.0 Behavior Analysts and the Behavior-Change Program 


4.01 Conceptual Consistency 

4.02 Involving Clients in Planning and Consent 
4.03 Individualized Behavior-Change Programs 
4.04 Approving Behavior-Change Programs 


4.05 Describing Behavior-Change Program Objectives 


4.06 Describing Conditions for Behavior-Change Program Success 
4.07 Environmental Conditions that Interfere with Implementation 
4.08 Considerations Regarding Punishment Procedures 


4.09 Least Restrictive Procedures 
4.10 Avoiding Harmful Reinforcers 


4.11 Discontinuing Behavior-Change Programs and Behavior-Analytic Services 


(continued) 
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Figure 31.1 (continued) 


5.0 Behavior Analysts as Supervisors 
5.01 Supervisory Competence 
5.02 Supervisory Volume 
5.03 Supervisory Delegation 


5.04 Designing Effective Supervision and Training 
5.05 Communication of Supervision Conditions 


5.06 Providing Feedback to Supervisees 
5.07 Evaluating the Effects of Supervision 


6.0 Behavior Analysts’ Ethical Responsibility to the Profession of Behavior Analysts 


6.01 Affirming Principles 
6.02 Disseminating Behavior Analysis 


7.0 Behavior Analysts’ Ethical Responsibility to Colleagues 


7.01 Promoting an Ethical Culture 


7.02 Ethical Violations by Others and Risk of Harm 


8.0 Public Statements 
8.01 Avoiding False or Deceptive Statements 
8.02 Intellectual Property 
8.03 Statements by Others 


8.04 Media Presentations and Media-Based Services 


8.05 Testimonials and Advertising 
8.06 In-Person Solicitation 


9.0 Behavior Analysts and Research 
9.01 Conforming with Laws and Regulations 
9.02 Characteristics of Responsible Research 
9.03 Informed Consent 


9.04 Using Confidential Information for Didactic or Instructive Purposes 


9.05 Debriefing 

9.06 Grant and Journal Reviews 
9.07 Plagiarism 

9.08 Acknowledging Contributions 
9.09 Accuracy and Use of Data 


10.0 Behavior Analysts’ Ethical Responsibility to the BACB 
10.01 Truthful and Accurate Information Provided to the BACB 
10.02 Timely Responding, Reporting, and Updating of Information Provided to the BACB 
10.03 Confidentiality and BACB Intellectual Property 


10.04 Examination Honesty and Irregularities 


10.05 Compliance with BACB Supervision and Coursework Standards 


10.06 Being Familiar with This Code 


10.07 Discouraging Misrepresentation by Non-Certified Individuals 


Retrieved February 22, 2019, from www.bacb.com/ethics-code/. Copyright 2014 Behavior Analyst Certification 
Board,® Inc. (BACB®). All rights reserved. Ver. December 17, 2018. Used with permission. 


Rules of ethics describe the personal behavior of individ- 
uals, not vague inherent traits or characteristics of individuals 
or groups. One should not refer to a person or an organization 
as being either ethical or unethical; these terms describe the 
behavior of an individual. 

The two main approaches to ethics are termed the deontolog- 
ical (1.€., acts-oriented) approach and the utilitarian or teleological 


(.e., results-oriented) approach. The deontological approach eval- 
uates the ethicality of behavior based solely on the nature of the 
act itself regardless of consequences (O’Donohue & Ferguson, 
2011; White, 1993). The utilitarian approach judges the rightness 
or wrongness of a behavior based on its consequences or results. 
Both approaches are important for behavior analysts. The 
deontological approach creates a firm and necessary set of rules 
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to guide ethical behavior. For example, The Compliance Code 
states that behavior analysts must not accept gifts. However, 
the utilitarian approach provides needed flexibility when imple- 
menting those rules of conduct. In relation to the prohibition 
regarding acceptance of gifts, it is not difficult to envision a 
situation where an adult with a developmental disability excit- 
edly presents her behavior analyst with a picture she has drawn, 
or a young client, shyly gives the behavior analyst a clay statue 
he made at school as a holiday gift. The behavior analyst may 
unnecessarily damage the clinical relationship by rejecting such 
gifts on the grounds of maintaining ethical purity. However, as 
Lattal and Clark (2007) stated, “Rules alone are neither spe- 
cific enough nor comprehensive enough to cover all aspects of 
a difficult ethical decision” (p. 6). The utilitarian approach thus 
provides a principled way to analyze and resolve ethical ques- 
tions that are not answered with a simple “yes” or “no.” One 
might consider the deontological approach to be analogous with 
“the letter of the law,” while the utilitarian approach attempts to 
address “the spirit of the law.” 

From a philosophical perspective, Skinner (1978, 2005) 
tended toward the utilitarian view, repeatedly suggesting that 
ethics govern behavior, and are therefore functionally related 
to the survival of the individual, group, culture, and species (de 
Melo, de Castro & de Rose, 2015). When various interests across 
groups or cultures are not in full agreement, or when different 
ethical rules conflict, complexities arise. For instance, a behavior 
that might serve the long-term survival interests of a group may 
nonetheless work to the short-term detriment of an individual. 
When a behavior analyst is confronted with a complex ethical 
question that presents competing, contradictory, or potentially 
harmful contingencies, effective navigation toward a fair outcome 
requires a well-calibrated ethical compass and reliable anchor. 
The behavior analyst must carefully consider who is likely to 
benefit, who might be potentially harmed, and what guidance is 
provided by the professional and ethical code of conduct. 

Behavior analysts’ decisions and actions must always be 
ethically grounded in the answers to three fundamental ques- 
tions: What is the right thing to do? What is worth doing? and 
What does it mean to be a good behavior analyst? (Reich, 1988; 
Smith, 1987, 1993). The answers to these questions guide each 
behavior analyst’s personal and professional practices based on 
an overriding principle: to help others learn how to improve their 
physical, emotional, social, and overall personal condition. As 
Corey, Corey, and Callanan (1993) stated, “The basic purpose of 
practicing ethically is to further the welfare of the client” (p. 4). 


What Is the Right Thing to Do? 


The answer to this question is influenced by many factors, 
including personal learning histories of right and wrong, the 
context within which applied behavior analysis is practiced, and 
the applicable ethical and legal rules of conduct. One makes a 
determination based on established principles, validated meth- 
ods, and contact with effective decisions made by other applied 
behavior analysts and professionals in related fields. The goal 
is to ensure the welfare of those entrusted to one’s care, the 
integrity and robust growth of the profession, and ultimately the 
survival of the culture (Skinner, 1953, 2002). 


Ethics is ultimately related to cultural practices. Ethical 
codes may vary between cultures or within a single culture over 
time. What is acceptable in one culture may not be acceptable 
in another (e.g., recreational consumption of alcohol). What is 


acceptable at one point in time may be completely unacceptable 
20 years later. 


Personal Histories. Each person represents a distinctly differ- 
ent learning history. Thus, each professional should strive to 
obtain initial and recurrent training and experiences to offset 
any bias or predisposition that may carry over from our personal 
or cultural background. For example, a behavior analyst who 
grew up watching her brother engage in self-injurious behavior 
(SIB) is now providing services to a child who exhibits similar 
SIB. She may recall her parents’ use of various procedures to 
help her brother (e.g., use of punishment, exclusion from family 
outings, psychotropic medication). The potential influence of 
such personal experience can be reduced effectively, in favor 
of an unbiased exploration of clinically appropriate treatment 
alternatives, if the analyst has had proper training in a range of 
assessment and treatment options. 

A practitioner’s cultural or religious background may also 
influence decisions about the right course of action. An individ- 
ual reared in a family culture based on a “spare the rod and spoil 
the child” philosophy may react differently to a child’s tantrum 
than if that same individual was raised by parents who took a 
“he’ll grow out of it” approach. With proper training in behavior 
analysis, his response will likely be less culturally biased and 
more reflective of a scientific evidence-based approach. 

Finally, a person’s professional training and clinical expe- 
rience can influence decision making through such subtle biases 
as the practitioner’s comfort with the use of familiar treatment 
options. This may lead to decisions that extend beyond proper 
discriminative control (i.e., the same interventions are used in 
cases where those treatment options are not best practice). Thus, 
the practitioner must remain vigilant, aware that personal his- 
tory, cultural background, and even clinical experience may exert 
undue influence on the proper delivery of professional service. 
To counteract these influences, the practitioner should regularly 
discuss professional issues with colleagues, obtain training, seek 
supervision or consultation when needed, read recent research 
literature, and review case studies relating to one’s area of pro- 
fessional focus (Bailey & Burch, 2016). If undue influence still 
presents a problem, the ethical practitioner, after consultation, 
should consider transferring the case to another provider. 


The Context of Practice. Applied behavior analysts work in 
schools, homes, community settings, job sites, centers, and other 
environments. Rules within these environments cover myriad 
response classes (e.g., attendance, use of sick time, use of spe- 
cific positive or aversive intervention strategies). Included in 
the rules are policy statements that are designed to help practi- 
tioners discriminate between legal issues and ethical issues. For 
example, some practices may be legal, but unethical. Breaking 
a professional confidence, accepting valued family “heirlooms” 
in lieu of payment for services, or engaging in consensual sexual 
relations with a recently discharged client over age 18 serve as 
examples of actions that are legal, but unethical. Other actions 
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are both illegal and unethical. For instance, misrepresenting per- 
sonal skills or promised services; overbilling for services; steal- 
ing the belongings of a client while rendering services; abusing 
a client physically, emotionally, sexually, or socially; and engag- 
ing in consensual sexual interaction with persons under the legal 
age of consent are all examples of actions that are both illegal 
and unethical (Greenspan & Negron, 1994). Behavior analysts 
who can discriminate between these legal and ethical distinc- 
tions are more likely to provide effective services, maintain a 
focused sensitivity toward clients, and not run afoul of the law 
or professional standards of conduct. 


Ethical Codes of Behavior. Professional organizations and 
associations generate or adopt ethical codes of behavior, to pro- 
vide clear guidelines for their members to follow when decid- 
ing a course of proper action in conducting their professional 
duties. These codes also typically outline the rules and standards 
by which graduated sanctions may be imposed on members 
who deviate from the professional code (e.g., reprimand, 
censure, expulsion from the organization). The Association 
of Professional Behavior Analysts has adopted the BACB’s 
Professional and Ethical Compliance Code for Behavior 
Analysts (Behavior Analyst Certification Board [BACB], 2014). 
More discussion of this point is found later in the chapter. 


What Is Worth Doing? 


Questions related to what is worth doing directly address the 
goals of practice (i.e., what we hope to accomplish) and the 
impact of the strategies used to pursue those goals. The answers 
to these questions are typically based on an evaluation of the 
existing exigencies, the need for intervention, and an analysis of 
social validity and cost-benefit ratios that relate to any proposed 
intervention. 


Existing Exigencies. An adult who engages in severe self- 
injurious behavior, a child with significant feeding problems, 
or a student who engages in highly disruptive behavior in class 
each presents a distinct problem-solving challenge to the behav- 
ior analyst. Behaviors that are self-injurious or that harm others 
are unquestionably worthy of swift assessment and interven- 
tion. Such dangerous behaviors must ethically be considered 
as targets for change before other less dangerous or disruptive 
behaviors are addressed. However, the need to expedite treat- 
ment is not an invitation to adopt a form of situational ethics, 
in which the promise of a quick result in the short term com- 
promises one’s consideration of the long-term effects of the 
intervention. For example, a positive punishment procedure may 
produce an immediate reduction in behavior, but had a careful 
assessment of the function of the behavior been completed, 
a less intrusive intervention could have been used. Behavior 
analysts must, therefore, strike a balance between immediate 
health and safety concerns, any likely short-term and long-term 
effects of an intervention, and insuring that any behavioral pro- 
cedures comply with The Compliance Code. 


Social Validity. Questions of social validity ask (a) whether the 
goals of the planned intervention are acceptable; (b) whether 


procedures are acceptable and aligned with best treatment 
practices (Peters & Heron, 1993); and (c) whether the obtained 
results show meaningful, significant, and sustainable change 
(Wolf, 1978). Most people would agree that teaching a school- 
age child to read is a desirable goal. Using a measurably 
effective instructional method such as direct instruction is 
procedurally sound. Demonstrating marked improvement in 
reading skill is a socially significant outcome. In every sense, 
teaching a child or adult to read meets the ethical test for social 
validity. The new reading skill has a positive effect on the life 
of the individual. 

However, we cannot make the same claim for every skill 
in every situation. Consider the following real-life examples: 
A 50-year-old woman with mobility, sight, and hearing prob- 
lems is being taught to identify various street signs; an elderly 
patient with Alzheimer’s disease is being taught to stack colored 
rings; a first-grade student with autism receives daily discrete 
trial instruction on discriminating various sports cars. In each of 
these cases, the present and future “worth” of the treatment with 
respect to resources, goals, and outcomes is questionable at best. 
From an ethical standpoint, the behavior analyst must carefully 
consider whether a particular goal should be accomplished, and 
whether the achievement of that goal would be sustainable in 
the natural environment. 


Cost-Benefit Ratio. A cost-benefit analysis is contextual. The 
behavior analyst must weigh the likely difference between the 
cost of a treatment or intervention (i.e., resources involved in 
planning, implementing, monitoring, and evaluating) and the 
projected benefits (i.e., likely future potential gains by the 
person). The benefit to the individual should be significant 
enough to justify the short- and long-term cost of providing the 
service. For example, we would likely find it ethically difficult 
to support a plan to transport a student with learning disabilities 
to an expensive out-of-state private school when comparable 
services are available as part of the local neighborhood pub- 
lic school system, especially if the move to a different school 
would not clearly result in significantly improved learning 
and social behavior outcomes. In another case, an 1 1th-grade 
student with developmental disabilities might not be best served 
by being placed in an academic, college-oriented program, 
when a more functional individualized curriculum would be 
far more likely to increase this student’s self-sufficiency, inde- 
pendence, and long-term prospects for employment. We would 
be acting unethically to assign this student to an environment 
where an overly ambitious academic program is likely to result 
in escape-related reflexive establishing operations, thereby 
evoking escape/avoidance behavior—especially when a more 
functional educational model would effectively help this stu- 
dent meet his individualized short-term and long-term goals. 
Sprague and Horner (1991) suggested that committees are best 
suited to address the thorny issues that often arise in a cost— 
benefit analysis. A committee-based decision-making process 
can effectively account for a wide range of perspectives that are 
placed into a hierarchy. This can help ensure that the opinions 
of those persons with the highest stake in the outcome are given 
the greatest consideration. 
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What Does It Mean to Be a Good Behavior Analyst? 


Being a good behavior analyst requires more than following 
professional codes of conduct. While adherence to The Compliance 
Code is necessary, it is not sufficient. Moreover, keeping the 
client’s welfare at the forefront of one’s decision-making process 
is vitally important, but not, in and of itself, sufficient. A good 
practitioner must be self-regulating (Rosenberg & Schwartz, 
2018). This means that the ethical practitioner seeks ways to cal- 
ibrate decisions over time, making sure to take into consideration 
a well-informed integration of values, contingencies, rights, and 
responsibilities (Smith, 1993). In addition, good behavior analysts 
maintain contact with the evolving professional literature, seek 
advanced training to improve their skills and practices using a sys- 
tematic method (Rosenberg & Schwartz, 2018). 


Why Is Ethics Important? 


Behavior analytic practitioners abide by ethical principles to 
(a) produce socially significant behavior change that produces 
contextually relevant and meaningful outcomes for the persons 
entrusted to their care (Hawkins, 1984), (b) reduce or elimi- 
nate specious or questionable interventions and the potential 
for harm (e.g., poor treatments, injuries to self or others), and 
(c) conform to the ethical standards of learned societies and 
professional organizations (https://bacb.com/ethics-code/). 
Skinner (1953) was concerned with the development of ethics, 
particularly in relation to those in society who require care and 
protection. His ethical perspective focused in part on the influ- 
ence of countercontrol, which he defined as occurring when 
someone responds to coercion or an aversive form of external 
control by showing “an emotional reaction of anger or frustra- 
tion including operant behavior which injures or is otherwise 
aversive to the controller” (p. 321). 

Countercontrol is often characterized by the individual 
“moving out of range, attacking, or passively resisting” 
(Delprato, 2002, p.192). Skinner stated that countercontrol is 
naturally evoked in individuals who have the capacity to rebel 
against exploitative or harsh controlling conditions, but that 
this capacity is limited or absent in certain groups, such as the 
elderly, children, individuals with an intellectual disability or a 
chronic mental health condition, or individuals who are incarcer- 
ated. (Skinner, 1976). O’ Donohue and Ferguson (2011) extend 
this conceptualization, concluding that, “in those circumstances 
where individuals are disadvantaged, such that they have no 
effective mechanisms of countercontrol, external agencies need 
to step in to offset this imbalance” (p. 495), which occurs by 
developing and enforcing ethical codes of conduct. 

Ethical codes not only provide guidance on what to 
do but also help practitioners avoid engaging in “situational 
ethics” whereby one acts (or does not act) based on expedi- 
ency, pressure, or misplaced priorities (Maxwell, 2003). By 
following professional ethical codes, the practitioner increases 
the likelihood that he or she will deliver appropriate and effec- 
tive services, which will produce a cumulative social benefit. 
Over time, these codified rules of conduct are refined through 
a selection-based process that serves to promote cultural sur- 
vival. Skinner directly addresses this broader perspective on the 


importance and function of ethical rules in his discussions on 
group control and cultural selection. 


Without help a person acquires very little moral or ethi- 
cal behavior under either natural or social contingencies. 
The group supplies supporting contingencies when it 
describes its practices in codes or rules, which tell the 
individual how to behave, and when it enforces those rules 
with supplementary contingencies . . . This is all part of the 
social environment called culture, and the main effect, as 
we have seen, is to bring the individual under the control of 
remoter consequences of his behavior. The effect has had 
survival value in the process of cultural evolution, since 
practices evolve because those who practice them are as a 
result better off. (Skinner, 2002, p. 173) 


Practitioners should recognize that behaving ethically 
may not result in direct and immediate reinforcement. Ethical 
action may in fact disadvantage the practitioner in the short 
term. For example, a behavior analyst may decline a highly 
promising business opportunity because it represents a poten- 
tial conflict of interest, or even the appearance of a conflict. 
In rejecting this financial opportunity, the behavior analyst 
exhibits behavior that has come under the control of a set of 
rules whereby he sets aside immediate gain, choosing instead 
to abide by ethical standards that, in the long run, elevate the 
entire community. According to Skinner, such actions pro- 
mote the survival of the group and, by extension, its cultural 
practices. Moreover, the benefit to the group redounds back 
upon the individual behavior analyst in the long run, since 
“reasonable control over the selfish behavior of the individual 
is matched by the advantages which he gains as a member of 
a group which controls the same selfish behavior in others” 
(Skinner, 1953, p. 327). 


STANDARDS OF PROFESSIONAL PRACTICE 
FOR APPLIED BEHAVIOR ANALYSTS 


What Are Professional Standards? 


Professional standards are written guidelines or rules that pro- 
vide direction for conducting the practices associated with an 
organized profession. Professional societies and certification or 
licensing boards develop, refine, and revise the standards that 
govern their profession to provide members with parameters for 
appropriate behavior in a dynamic and changing environment. 
In practice, organizations initially form task forces to develop 
standards that are reviewed and approved by their respective 
boards of directors and members. In addition to prescribing rules 
of conduct, most professional organizations exact sanctions on 
members who fail to follow the rules. Major code violations 
may result in expulsion from the organization, or revocation of 
certification or licensure. 

Five complementary and interrelated documents describe 
standards of professional conduct and ethical practice for applied 
behavior analysts. Two of these documents specify codes of 
conduct (Professional and Ethical Compliance C odes for Behavior 
Analysts, Ethical Principles of Psychologists and Code of 
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Conduct). Three additional documents are position statements 
(1.e., The Right to Effective Behavioral Treatment, Statement 
on Restraint and Seclusion, The Right to Effective Education). 
These documents were published on the following dates. 


¢ Professional and Ethical Compliance Code for Behavior 
Analysts (Behavior Analyst Certification Board [BACB], 
2014). 

Ethical Principles of Psychologists and Code of Conduct 
(American Psychological Association, 2010). 

The Right to Effective Behavioral Treatment (Association 
for Behavior Analysis International, 1989). 

Statement on Restraint and Seclusion (Association for 
Behavior Analysis International, 2010). 

The Right to Effective Education (Association for Behav- 
ior Analysis International, 1990). 


Professional and Ethical Compliance Code 
for Behavior Analysts 


According to the preamble in the Professional and Ethical Com- 
pliance Code for Behavior Analysts, this document “consolidates, 
updates, and replaces the former BACB’s Professional Disci- 
plinary and Ethical Standards and Guidelines for Responsible 
Conduct for Behavior Analysts” (2014, p. 1). The Compliance 
Code contains 71 elements organized into 10 different sections, 
“relevant to professional and ethical behavior of behavior analysts, 
along with a glossary of terms.” With an effective date of January 
1, 2016, “all BACB applicants, certificants, and registrants will 
be required to adhere to the Code” (2014, p. 1). (See Figure 31.1.) 

The Compliance Code has undergone several revisions 
over the past decade, changing from unenforceable guidelines, 
references, and resources to an enforceable document with 
disciplinary standards. Disciplinary standards are statements 
that affect intellectual property, malpractice, and an expanded 
compliance system for providing corrective, disciplinary, and 
revocation actions, depending on the circumstances (BACB, 
2014; Carr, Ratcliff, Nosik, & Johnston, in press). The entire 
Compliance Code not only serves as a reference for clients and 
employers in determining whether a behavior analyst’s behavior 
has been conducted in an appropriate, professional, and ethical 
manner (Carr et al., in press), but also provides a procedure for 
the supervised practice of aspiring behavior analysts, handling 
complaints, and enforcing the code of ethics. 


Figure 31.2 The right to effective behavioral treatment. 
The Right to Effective Behavioral Treatment 

. An individual has a right to a therapeutic environment. 

_ An individual has a right to services whose overriding goal is personal welfare. 
. An individual has a right to treatment by a competent behavior analyst. 

. An individual has a right to programs that teach functional skills. 

. An individual has a right to behavioral assessment and ongoing evaluation. 

. An individual has a right to the most effective treatment procedures available. 
Adapted from “The Right to Effective Behavioral Treatment,” by the Association for Behavior Analysis Interna- 


tional, 1989. Retrieved April 15, 2017, from https:/Avww.abainternational.org/about-us/policies-and-positions/. 
Copyright by the Association for Behavior Analysis International. Adapted with permission. 


Figure 31.3 Principal areas of endorsement from the ABAI 
Statement on Restraint and Seclusion. 


2. Individuals have a Right to Choose 
3. The Principle of Least Restrictiveness 


Adapted from “Statement on Restraint and Seclusion, 2010,” 

by the Association for Behavior Analysis International. Retrieved 
October 30, 2017, from https:/www.abainternational.org/about-us/ | 
policies-and-positions/. Copyright by the Association for Behavior 
Analysis International. Adapted with permission. 


The Right to Effective Behavioral Treatment 


The Association for Behavior Analysis International has 
published two position papers describing client’s rights. In 
1986, the Association established a task force to examine 
the rights of persons receiving behavioral treatment and how 
behavior analysts can ensure that clients are served appropri- 
ately. After 2 years of study, the task force outlined six basic 
client rights as the basis for directing the ethical and appro- 
priate application of behavioral treatment (see Figure 31.2) 
(Van Houten et al., 1988). 


Statement on Restraint and Seclusion 


Figure 31.3 shows a portion of the Association for Behavior 
Analysis International’s statement on restraint and seclusion. 
Overall, this statement underscores ABAI’s position that the 
principles of least restrictiveness, the individual’s right to choose, 
and the overall welfare of the individual are the highest priority. 


ENSURING PROFESSIONAL COMPETENCE 


Professional competence in behavior analysis is achieved 
through academic training that involves formal coursework, 
supervised practica, and mentored professional experience. 
Even though very few universities house departments of 
behavior analysis, many excellent behavior analysts have been 
trained through university-based master’s and doctoral-level 


programs housed in psychology, education, and other human 
service departments.” 
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Behavior analysis accreditation and credentialing bodies 
have specified minimum curriculum and supervised experience 
requirements for behavior analysts. Both the Behavior Analyst 
Certification Board (BACB) and the Association for Behavior 
Analysis International Accreditation Board have set minimum 
standards for what constitutes training in behavior analysis. The 
BACB credentials individual practitioners, while ABAI Accred- 
itation Board accredits university training programs and verifies 
that course sequences meet the instructional requirements for cer- 
tification as set by the BACB. Practitioners must not only meet 
the certification criteria but also pass a certification examination. 
The BACB has conducted an extensive occupational analysis 
and developed the Behavior Analyst Task List, now in its fourth 
edition—Behavior Analyst Certification Board (BACB), 2012. 
The Task List specifies the minimum content that all behav- 
ior analysts should master (BACB, 2012; Shook, Johnston, & 
Mellichamp, 2004; cf. Martinez-Diaz, 2003).° The fifth edition of 
the Task List (BACB, 2017) becomes effective January 1, 2022. 


Obtaining Certification and Licensure 


Potential consumers must be able to identify practicing behavior 
analysts who have demonstrated at least a minimum standard 
of training and competence (Moore & Shook, 2001; Shook & 
Favell, 1996; Shook & Neisworth, 2005). A professional creden- 
tial publicly identifies those who have met certain requirements 
in a field (Shook & Favell, 1996). Those seeking professional 
services often use a profession’s credential as a tool in selecting 
a provider, and employers may use the credential as an efficient 
means of hiring professionals (Carr et al., in press). This infor- 
mation, combined with the disciplinary action that can occur if 
a professional acts outside the scope of responsible professional 
conduct, is designed to enhance public protection. In the past, 
most behavior analysts in private practice were licensed in psy- 
chology, education, or clinical social work. The public had no 
way to determine whether the licensed professional had specific 
training in applied behavior analysis (Martinez-Diaz, 2003). 

Two distinct categories of professional credentials are 
important in the field of applied behavior analysis: professional 
certification and licensure. 


Professional Certification 


Professional certification is a voluntary credential issued by a 
private organization that 1s authorized by members of the profes- 
sion, not the government. The professional certifying body for 
applied behavior analysis is the BACB, whose board of direc- 
tors is composed of behavior analysts and a consumer represen- 
tative. This international credential crosses state and national 
borders, offering greater mobility to members of the profession 
whose credentials might otherwise be restricted to specific areas 
(Hall & Lunt, 2005). The eligibility requirements for certifi- 
cation are predicated upon educational degree and coursework 
requirements, supervised experiential training requirements, 
and passing a psychometrically sound examination (Johnston, 
Mellichamp, Shook & Carr, 2014). Like other professions, the 
BACB requires documented continuing educational experiences 
and professional development, with specific emphasis on the 
areas of ethics and supervision (Carr et al., in press). 


State/National-issued Licenses 


A license is a state, provincial, or national government doc- 
ument that is established by law and specifies the privileges 
and limits of professional practice (Green & Johnston, 2009). 
A license is a mandatory credential needed to practice, not a 
voluntary certificate of achievement. Standards for obtaining, 
maintaining, and legally revoking a license are developed by the 
designated regulatory board that may or may not include mem- 
bers of the profession. As laws are constantly changing within 
and across states or nationally, behavior analysts are responsible 
for remaining up to date with respect to any licensing regula- 
tions that govern their practice within that jurisdiction. 


Practicing Within One’s Areas of Competence 


Behavior analysts must practice within their areas of competence, 
based on professional training, experience, and performance. For 
example, a behavior analyst with extensive experience working 
with typically developing adolescents should not begin working 
with children with autism-related language delays or adults with 
developmental disabilities without obtaining additional active 
supervision. The behavior analyst must accrue adequate super- 
vised experience working with any new clinical population. 
Moreover, an individual with experience only in clinical practice 
should not begin to provide services in the area of organizational 
behavior management without additional training and supervi- 
sion. LeBlanc, Heinicke, and Baker (2012) provide guidance on 
expanding areas of practice in an ethically responsible manner. 

Even when practicing within one’s area of competence, 
situations may arise that exceed one’s training or experience. 
In these cases, the practitioner must either make a referral to 
another behavior analyst or consultant, or work in close con- 
sultation with another provider who has the necessary exper- 
tise. Gaps in one’s professional training may be addressed by 
attending workshops, seminars, classes, and other continuing 
education activities. However, professional situations can arise 
that are outside a BCBA’s experience (see Figure 31.4). At that 
point, behavior analysts should work with mentors, supervisors, 
or colleagues who can provide enhanced training, professional 
development, and guidance. 


Developing Competence in Supervision 
and Mentorship 


Recognizing the Need 


Behavior analysts often provide supervision for (a) young pro- 
fessionals who are pursuing certification as a behavior analyst, 
(b) individuals holding paraprofessional-level credentials (e.g., 
Registered Behavior Technician®), (c) certified behavior analysts 
who require supervision of their ongoing practice, and (d) non— 
behavior analysts. Effective supervision of others is critical to 
the quality of ongoing behavioral services. It contributes to the 
professional development of the supervisee, the continued growth 
of the supervisor, and the overall development of our field and its 
practice, and is recognized in the professional literature (Sellers, 
Valentino, & LeBlanc, 2016). The supervisor who also fully 
embraces the role of mentoring is positioned to serve as a life-long 
counselor through the many challenging professional situations 
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Figure 31.4 Seeking guidance to handle an unexpected ethical situation. 


Presenting Concern 


Background: Rhonda recently became employed as 

a senior behavior analyst for the state. She chairs the 
local program review committee in District 20, and 
monitors behavioral services delivered under state 
authorizations. Voting members of the committee 
must be Board Certified Behavior Analysts (BCBAs). 
Stanley, a long-time provider in District 20, who was 
grandfathered under the old state certification rules, 
always signed his programs in the past as “Certified 
Behavior Analyst.” However, in a recent presentation, 
Stanley signed his program (which was countersigned 
by the parents), “Stanley Smith, BCBA.” After 

the program review committee meeting, Rhonda 
approached Stanley and asked, “Stan, | see that you 
have signed off as a BCBA. Did you pass the certifi- 
cation exam recently?” Stan answered, “| sent in my 
certification. You should have it in my file.” Rhonda 
searched and checked the committee files, and other 
appropriate files, but did not find Stanley's certifi- 
cate. Rhonda then asked Stanley to provide a copy 

of his certification, on the chance that it was “lost in 
the system.” He did not comply. Then, she checked 
the board registry and he was not listed. She asked 
him to come into the office for a meeting to clarify 
the situation. He declined. Stanley then filed an age 
discrimination complaint with the state Equal Employ- 
ment Opportunity Commission office. What would 
be Rhonda’s best course of action to address this 
situation? 


Ethical Problem-solving Strategies 


Analysis: Stanley does not appear to be a BCBA, and therefore 

should not be signing documents as if he were. He did not pro- 

vide documentation of his certification, he is not listed on the 

public registry, and he declined to meet with Rhonda to explain the 
discrepancy between his statement (“I sent in my certification”) and 
the fact that he is not listed in the registry. He is likely in violation of 
at least The Compliance Code 1.04 Integrity, 7.02 Ethical Violations by 
others and risk of harm, 8.01 Avoiding false or deceptive statements, 
and possibly 1.02 Boundaries of competence. 


Action: Before taking further action that might escalate the matter, 
Rhonda would be wise to do two things: (1) She should attempt to 
verify that Stan indeed does not have a certificate of any kind that 
could have been legitimately issued in the past under the grandfather 
clause (basically ruling out that Stan's certificate was lost in the mail 
or the file somehow); and (2) she should seek guidance from her 
supervisor in the state service organization to ensure that she follows 
the proper regulations in dealing with this potential major violation. 
Under The Compliance Code 10.07 Discouraging misrepresentation 
by noncertified individuals, Rhonda has an ethical responsibility to 
report Stanley to the BACB immediately. Moreover, Rhonda may 
have to initiate an investigation by the state of Stanley’s billing 
practices, to be sure that he has been billing appropriately for the 
level of services he is providing. This may require her to report Stanley 
directly to the fraud division of a funder. In terms of the discrimina- 
tion complaint, she would be wise to save and file all documentation 
of her interactions with Stanley, beginning with the date of her first 
inquiry. Alerting her supervisor of the disposition of the complaint 
through carbon copies of e-mails or letters would help to ensure total 
transparency. 


that may arise throughout a career. Serving in the role of supervi- 
sor requires training, and The Compliance Code’s changing stan- 
dards underscore this need (LeBlanc & Luiselli, 2016). 

In response to the need to provide competent training, the 
BACB convened a workgroup on supervision that recommended 
several requirements for all certificants providing supervision 
for those already credentialed, or those pursuing credentials. 
The requirements were implemented in 2015 and included 
(a) specific training in supervisory practices prior to supervising 
others, (b) continuing education in the area of supervision, and 
(c) a registry that publicly links the BCBA supervisor to his or 
her supervisee(s). 

A special section of the journal Behavior Analysis in 
Practice includes several papers that outline practice recom- 
mendations for individual supervision, group supervision, 
and addressing problems that can arise during the supervisory 
relationship (LeBlanc & Luiselli, 2016; Sellers, LeBlanc, & 
Valentino, 2016; Sellers, Valentino, & LeBlanc, 2016; Turner, 
Fischer, & Luiselli, 2016; Valentino, Sellers, & LeBlanc, 2016). 


Developing the Supervisor’s Skill Set 


The supervisor bears broad responsibility in the supervisory 
relationship, including the obligation to shape core behavior 
analytic, conceptual, experimental, and applied repertoires 
(Sellers, Valentino, & LeBlanc, 2016; Turner et al., 2016). 


These professional skill sets also include social effectiveness 
training, public speaking to multiple audiences, time and stress 
management skills, problem-solving skills, and ethical decision 
making, among many others (Bailey & Burch, 2010). 
Developing these repertoires through in-service train- 
ing, mentoring experiences, or field-based practica, allows the 
behavior analyst to be more effective in influencing those who 
might adopt, implement, or disseminate effective behavioral 
programming. When these repertoires are actively modeled and 
explicitly taught to the supervisor-in-training, the likelihood of 
future professional success of her clients is increased. When 
these repertoires are compromised, the supervisee does not learn 
important skills, he may leave the professional altogether, the 
supervisory relationship may become strained or damaged, and 
the overall perception of behavior analysis may be hurt (Sellers, 
Valentino, & LeBlanc, 2016). Sellers, LeBlanc, and Valentino 
(2016) offer specific suggestions for detecting and remediating 
these issues throughout the supervisory period (see Figure 31.5). 
Section 5.0 of The Compliance Code addresses the spe- 
cific ethical guidelines for supervision by behavior analysts. This 
section establishes that the “supervisor must take full responsi- 
bility for all facets of this undertaking.” Sellers, Alai-Rosales, 
and MacDonald (2016) provide a rationale for each sub-code 
on supervision with examples of the concerns that might arise if 
the behavior analyst does not take appropriate action to comply 
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Figure 31.5 Identifying and managing performance issues in a supervisory context. 


Presenting Concern 


Background: Jasmina is a BCBA-D 
who supervises an aspiring behav- 
ior analyst, Barry, in his accrual of 
supervised fieldwork experience 
hours. Barry has been learning 
many new concepts and princi- 
ples along with specific applied 
procedures to use with clients. 
Barry implements procedures accu- 
rately during therapy sessions, but 
Jasmina has noticed that Barry is 
often late to therapy and supervi- 
sion sessions, makes many errors in 
data collection, and does not reli- 
ably turn in written work products 
on time. When he does turn in the 
work, it often has typographical 
errors or incomplete sentences. 
This week Jasmina realized that 
skill acquisition data on a client has 


not been graphed in several weeks. 


A casual review of the data sheets 
indicates that progress stalled 
over 3 weeks ago, and program 
changes are well overdue. 


Ethical Problem-solving Strategies 


Analysis: Jasmina realizes that her supervisee has problems in attending, time management 
and organization skills, and this is affecting his ability to provide appropriate, data-based 
clinical services. They are failing in their basic ethical responsibilities to clients. While she may 
catch many of his errors by providing close supervision, she expects that Barry will continue 
to struggle with these issues once he is no longer closely supervised and is practicing inde- 
pendently. Jasmina identifies her ethical responsibility to her supervisee, the field, and the 
current and future clients of this supervisee. She must speak with Barry and create a profes- 
sional development plan to remedy these concerns and help him establish the needed skills. 


As suggested by Sellers, LeBlanc and Valentino (2016), Jasmina should first reflect on 
whether she has clearly specified her performance expectations to Barry. If she has not 
yet given feedback about his late arrivals or poor written work, she should use basic 
instructions, modeling, rehearsal and feedback to explicitly address his specific problem- 
atic repertoires. If she has already provided training and feedback, and his behavior did 
not change, she may conclude that a more pervasive deficit in time management skills 
may be having an adverse impact on several of Barry's repertoires. 


Action: Jasmina should talk to him about this in clear and explicit terms, state her desire to 
help him succeed, and express her concern for the long-term interests of future clients if his 
problems are not resolved. She might ask whether he has had similar problems in the past. 
She might also ask to see his time management and scheduling system. If she determines 
that he has a more global problem with organization, which has a broad impact on his 
performance (e.g., no system for planning time and tracking tasks and deadlines), Sellers 

et al. provide recommendations to help develop a support plan. She could assign relevant 
readings (e.g., David Allen’s, Getting Things Done (2015)); they could review readings 
together, and she could help Barry begin to employ new effective strategies. They could 


with The Compliance Code. For example, sub-code 5.02 refers 
to appropriate supervisory volume (i.e., a volume of supervi- 
sory activities, which is commensurate with the ability to pro- 
vide high-quality, effective supervision leading to high-quality 
intervention services). Sellers, Alai-Rosales et al. provide a case 
example in which a well-intentioned BCBA gradually began 
to take on too many supervisees. Although her intent was to 
quickly build to capacity the BCBA-based services in her com- 
munity, taking on more supervisees than she could effectively 
oversee resulted in a poor training experience for many of these 
new professionals. Whether a practitioner overcommits based 
only on financial pressures or due to a genuine intent to benefit 
the field, the risk for a poor outcome is equally high. 

In another example, Sellers, Alai-Rosales et al. describe 
the importance of sub-code 5.05, which refers to the importance 
of communicating the conditions for supervision in writing at the 
outset of the supervisory relationship (i.e., in a supervision con- 
tract) and abiding by those communicated conditions. Conditions 
and expectations of the relationship must be clearly communicated 
when the relationship is established. If not, uncertainties arise, 
leading to disputes regarding required activities, performance 
criteria and evaluations, and whether or not a supervisor will 
sign off to verify that experience requirements have been met. 
If the supervisor and supervisee are not within the same organi- 
zational hierarchy (e.g., the supervisor does not directly oversee 
the clinical work of the supervisee), opportunities for mispercep- 
tion may be even more likely regarding valid activities, actions, 
and competencies required on the part of the supervisee. Clearly 


select an electronic schedule/notekeeping tool to establish a new system for Barry, with 
explicit reinforcement contingencies for success at each step in the development plan. 


delineated conditions in a contract are critical in producing an 
effective working relationship. Many ethical dilemmas can arise in 
supervision, and the supervisor must fully accept the comprehen- 
sive responsibility that goes along with this role. The supervisor 
must proactively seek to ensure the best possible outcome for the 
supervisee, while ensuring delivery of the highest quality clinical 
services possible. 

The ethics of developing a supervisor’s skill set applies 
to the second scenario at the beginning of the chapter. Anca is 
not providing Raluca adequate supervision. She would have 
done far better had she communicated, in writing, the condi- 
tions and expectations for both herself and the supervisee at 
the outset of their supervisory relationship, based on the stan- 
dards provided by the BACB—and followed those guidelines. 
Moreover, she should have assigned Raluca only to tasks for 
which she was adequately trained. At this point in the scenario, 
Raluca must inform Anca that she requires additional 
assistance, most likely in the form of hands-on supervision as 
required by the BACB’s own guidelines. If Anca’s supervision 
behavior does not immediately improve, Raluca will probably 
need to find a new, competent supervisor, and may need to 
report Anca to the BACB for providing inadequate supervision. 


Maintaining and Expanding 
Professional Competence 


Behavior analysts have an ethical responsibility to stay 
informed of advancements in the field. For example, 
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conceptual advances and technical innovations in areas such as 
functional analysis, motivating operations, and applications of 
the matching law have had profound implications for clinical 
and educational practice. Behavior analysts can maintain and 
expand their professional competence by earning continuing 
education credits, attending and participating in professional 
conferences, reading professional literature, and presenting 
cases to peer review and oversight committees. 


Continuing Education Units 


Behavior analysts can expand their professional competence and 
keep abreast of new developments by attending training events 
offering continuing education unit (CEU) credits. The BACB 
requires a minimum number of CEUs per recertification cycle 
to maintain certification. Types of continuing education include 
university coursework, CE issued by BACB-approved continu- 
ing education providers, instruction of either of the preceding 
types, CE issued by the BACB directly, scholarly activities such 
as publishing or reviewing articles for a journal, and taking and 
passing the BACB’s certification exam again. Continuing edu- 
cation units demonstrate that the behavior analyst has added 
relevant awareness, knowledge, and/or skills to his or her reper- 
toire. The BACB now requires CEUs in the areas of ethics and 
supervision during each renewal cycle, which is a testament to 
the growing importance of these two areas as critical to effective 
functioning of an applied behavior analyst. 


Attending and Presenting at Conferences 


Attending and participating in local, state, or national confer- 
ences enhance every behavior analyst’s skills. The axiom “You 
never learn something so well as when you have to teach it” 
continues to have value. Hence, participating in a conference 
helps practitioners refine their skills. 


Professional Reading 


Self-study is a fundamental way to maintain currency in an 
ever-changing field. In addition to routinely reading Behavior 
Analysis in Practice, the Journal of Applied Behavior Analysis 
and Perspectives on Behavior Science (formerly published as The 
Behavior Analyst), all behavior analysts should study other behav- 
ioral publications specific to their areas of expertise and interest. 


Oversight and Peer Review Opportunities 


When behavior analysts approach a difficult problem, they 
apply skills and techniques within their repertoire to address 
the problem. For example, a behavior analyst may successfully 
fade out the use of a protective helmet with a child who has long 
engaged in frequent and severe face-slapping by using differ- 
ential reinforcement of alternative behavior (DRA) to increase 
preferred-object holding and play activity by the child. Fading 
the safety-based helmet presents no ethical problems as long 
as the technique remains conceptually systematic, is tied to the 
published research literature, and is effective. However, even 
the most technically proficient and professionally careful prac- 
titioner is subject to contingencies that can lead to treatment 
drift or error. This is where oversight and peer review come 
into play. 


Many states have laws requiring that oversight be provided 
under specific circumstances, which are often defined by the type 
and severity of behaviors being addressed and/or the restrictive- 
ness or intrusiveness of the procedure(s) being proposed. How- 
ever, the presence or absence of laws in a particular jurisdiction 
should not determine our dedication to a process of peer review 
and oversight. This type of professional monitoring protects 
(1) the consumers of behavior analytic services by ensuring the 
use of valid and appropriate procedures; (2) the individual behav- 
ior analyst, by providing both helpful feedback and administra- 
tive support and back-up; and (3) the field in general, in reducing 
the likelihood of lawsuits or scandal based on the rogue use of 
untested, inappropriate, and possibly dangerous treatment options. 

When findings are presented before a group of colleagues, 
professionals, or other stakeholders outside the behavior analytic 
community, the practitioner must outline the use of procedures 
that comply with clinical and professional standards. Such pre- 
sentations also provide the behavior analyst an opportunity to 
clarify behavioral outcomes, present graphic displays that are 
easy to interpret, and explain the rationale for the various edu- 
cational or clinical choices in clear language, free of jargon. 


Making and Substantiating Professional Claims 


Sometimes overzealous or overly confident behavior analysts 
make preemptive claims that are not realistic. For example, 
asserting, “I am certain I can help your son” is almost certainly 
an unethical claim, especially prior to conducting a full func- 
tional assessment. A more ethical and appropriate statement 
may be, “I have had success working with other children who 
have profiles similar to your son’s.” A behavior analyst who is 
experienced in working with a particular population, skilled in 
the careful process of identifying functional relations, and well 
acquainted with the salient professional literature is less likely to 
make unsubstantiated or exceedingly ambitious claims. 

This professional standard also relates to those situations 
where an individual makes a false or deceptive claim to having a 
certification, license, educational background, or type of training 
that the individual does not have. Making dishonest statements 
or claims relating to one’s professional experience or credentials 
is always unethical and, as likely fraudulent conduct, may be 
illegal. This will be discussed more fully in the section below. 


ETHICAL ISSUES IN CLIENT SERVICES 


Although applied behavior analysis shares many ethical concerns 
with other disciplines, ethical considerations that are specific to 
behavior analysis do arise. Behavior analysts often address skills 
or problem behavior in contexts where privacy can be a signif- 
icant concern (e.g., toileting, dressing) or where best-practice 
guidelines suggest the use of certain evidence-based, but poten- 
tially controversial, interventions (e.g., use of an aversive pro- 
cedure to reduce a life-threatening behavior). Clinical decision 
making often poses ethical questions, which must be addressed 
prior to implementation (Herr, O’ Sullivan, & Dinerstein, 1999; 
Iwata, 1988; Repp & Singh, 1990). Reading and referring to The 
Compliance Code and the Task List on a regular basis helps the 
practitioner maintain a strong ethical foundation. 
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Informed Consent 


Any discussion of service delivery must begin with the issue of 
consent. A potential recipient of behavioral services, or a par- 
ticipant in a research study, must provide explicit written and 
signed permission before any assessment, treatment, or research 
participation may begin. This formal permission to begin any 
aspect of treatment—assessment, intervention, evaluation—is 
termed informed consent, and requires that full disclosure of 
all pertinent information be provided to the individual prior to a 
granting of permission. Figure 31.6 shows a basic example of an 
informed consent form in which such information is provided. 

Three tests must be met before informed consent can be 
considered valid: (a) The person must demonstrate the capac- 
ity to decide, (b) the person’s decision must be voluntary, and 
(c) the person must have adequate knowledge of all salient 
aspects of the treatment. 


Capacity to Decide 


To be considered capable of making an informed decision, a 
participant must have (a) an adequate mental process or faculty 
by which he or she acquires knowledge, (b) the ability to select 
and express his or her choices, and (c) the ability to engage in a 
rational process of decision making. Behavior analysts consider 
concepts such as “mental process or faculty” to be hypothetical 
constructs (i.e., mentalism), and there are no widely accepted 
evaluative tools for testing one’s preintervention capacity. 
However, capacity is most likely to be questioned if an indi- 
vidual “has impaired or limited ability to reason, remember, 
make choices, see the consequences of actions, and plan for 
the future” (O’ Sullivan, 1999, p. 13). A person is considered 
mentally incapacitated if a disability affects his or her ability to 


CLIENT: 


(Name of agency/provide 
Informed Consent Form 


understand the consequences of his or her actions (Turnbull & 
Turnbull, 1998). 

According to Hurley and O’ Sullivan (1999), “Capacity to 
give informed consent is a fluid concept and varies with each 
individual and proposed procedure” (p. 39). A person may have 
the capacity to consent to a positive reinforcement program that 
poses little or no risk, but may not have the capacity to make 
decisions on more complex treatment alternatives such as the 
manipulation of motivating operations or use of punishment. 
Practitioners should not assume that a person who is capable of 
giving informed consent in one set of circumstances can provide 
truly informed consent in all cases. 

Evaluation of capacity must be viewed within both a 
legal and a behavioral context. Courts have held that capacity, 
in relation to legal competence, requires “the subject to under- 
stand rationally the nature of the procedure, its risks, and other 
relevant information” (Kaimowitz v. Michigan Department of 
Mental Health, 1973, p. 150; Neef, Iwata, & Page, 1986). Thus, 
operationally, capacity is often determined in relation to the indi- 
vidual meeting the “knowledge” requirement (see below). The 
determination of capacity in persons with intellectual disabili- 
ties poses specific challenges, and the behavior analyst would 
benefit from consulting a legal expert whenever a question of 
capacity arises. 

When a person is deemed incapacitated, informed consent 
may be obtained either through a surrogate or a guardian. 


Surrogate Consent. Surrogate consent is a legal process by 
which another individual—the surrogate—is authorized to 
make decisions for a person deemed incompetent, based on the 
knowledge of what the incapacitated person would have wanted. 
Family members or close friends most often serve as surrogates. 


aa | 


DOB: 


PARENT/GUARDIAN: 


STATEMENT OF AUTHORITY TO CONSENT: | certify that | have the authority to legally consent to assess- 
ment, release of information, and all legal issues involving the above-named client. Upon request, | will 
provide (provider/agency) with proper legal documentation to support this claim. | further hereby agree 
that if my status as legal guardian should change, | will immediately inform (provider/agency) of this 
change in status and will further immediately inform (provider/agency) of the name, address, and phone 
number of the person or persons who have assumed guardianship of the above-named client. 


TREATMENT CONSENT: | consent for behavioral treatment to be provided for the above-named client by 
(provider/agency), and its staff. | understand that the procedures used will consist of manipulating anteced- 
ents and consequences to produce improvements in behavior. At the beginning of treatment behavior may 
get worse in the environment where the treatment is provided (e.g., “extinction burst”) or in other settings 
(e.g., “behavioral contrast”). As part of the behavioral treatment, physical prompting and manual guidance 
may be used. The actual treatment protocols that will be used have been explained to me. 


| understand that | may revoke this consent at any time. However, | cannot revoke consent for action that 
has already been taken. A copy of this consent shall be as valid as the original. 


DATE: 


WITNESS: 


DATE: 


Adapted from ABA Technologies, Inc., Melbourne, FL. Copyright ABA Technologi 


es Inc. Used with permission. 
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In most states, a surrogate’s authority is limited. A sur- 
rogate may not authorize treatment when the client is actively 
refusing treatment (e.g., when an adult with developmental 
disabilities refuses to sit in a dentist’s chair, a surrogate cannot 
consent to sedation); or for controversial medical procedures such 
as sterilization, abortion, or certain treatments for mental disor- 
ders (e.g., electroconvulsive therapy or psychotropic medication) 
(Hurley & O’ Sullivan, 1999). The surrogate is required to consider 
specific information when making decisions. For a person who is 
incapacitated, the landmark case of Superintendent of Belchertown 
State School v. Saikewicz (1977) crystallized factors that surro- 
gates must consider when making informed consent decisions 
for persons who cannot make their own decisions (Hurley & 
O’ Sullivan, 1999). Figure 31.7 lists necessary information for the 
two primary areas of concern for surrogates: (a) decision mak- 
ing for a person who is incapacitated and whose wishes may be 
known or inferred and (b) decision making for a person whose 
wishes are unknown and probably unknowable (e.g., a person 
with a profound and lifelong developmental disability). 


Guardian Consent. A guardian is an individual or organization 
named by order of the court to exercise any or all powers and 
rights over the person and/or estate of an individual (National 
Guardianship Association, 2016). Guardianship is a complex 
legal issue that varies by jurisdiction. Thus, we will address only 
two main points regarding guardianship. 

First, guardianship may be limited in any way that the court 
deems appropriate. In most states, a full guardian is responsible 
for essentially every important decision in a person’s life. The 


court, in protecting the rights of the individual, may determine 
that limited or temporary guardianship is more appropriate. For 
example, guardianship may apply only to financial or medical 
concerns, and may remain in effect only so long as a very spe- 
cific concern is at issue (e.g., need for surgery). Guardians may 
not deny basic human rights, such as the right to freedom from 
abuse or the right to privacy. 

Second, when treatment is deemed necessary for an indi- 
vidual who lacks capacity, but that individual refuses treatment, 
surrogate consent is insufficient. Only a guardian may provide 
oversight and consent in this situation. The greater the degree 
of guardianship, the less legal control a person has over his or 
her own life. Insofar as a major goal of behavior analysis is to 
increase the independence of those we serve, guardianship of 
any kind should be considered only when absolutely necessary, 
and at the most limited level that still resolves the issue. 

In all guardianship cases, the court is the ultimate decision- 
making body and may take any action, including revoking 
guardianship or determining who should serve as the guardian 
(O’Sullivan, 1999). 


Voluntary Decision 


Consent is considered voluntary when it is given in the absence 
of coercion, duress, or any undue influence and when it is issued 
with the understanding that it can be withdrawn at any time. 
“Revocation of consent has the same effects as an initial refusal 
to consent” (Yell, 1998, p. 274). 

Family members, doctors, support staff, or others may 
exert strong influence over a person’s willingness to grant or 


Figure 31.7 Factors surrogates must consider when making informed consent decisions for persons 


who are incapacitated. 


FOR A PERSON WHOSE WISHES MAY BE KNOWN OR INFERRED 


1. The person's current diagnosis and prognosis. 


. The person's expressed preference regarding the treatment issue. 


. The person's relevant religious or personal beliefs. 


. The person's behavior and attitude toward medical treatment. 


. The person's attitude toward a similar treatment for another individual. 


. The person's expressed concerns about the effects of his or her illness and treatment on 


family and friends. 


FOR A PERSON WHOSE WISHES ARE UNKNOWN AND PROBABLY UNKNOWABLE 


. The effects of the treatment on the physical, emotional, and mental functions of the person. 
. The physical pain that the person would suffer from the treatment or from the withholding or 


withdrawal of the treatment. 


. The humiliation, loss of dignity, and dependency the person is suffering as a result of the 


condition or as a result of the treatment. 


4. The effect the treatment would have on the life expectancy of the person. 
5. The person's potential for recovery, with the treatment and without the treatment. 
6. The risks, side effects, and benefits of the treatment. 


Adapted from “Informed Consent for Health Care,” by A. D. N. Hurley and J. L. O'Sullivan, 1999, in R. D. 
Dinerstein, S. S. Herr, and J. L. O'Sullivan (Eds.), A Guide to Consent, pp. 50-51. Washington DC, American 
Association on Intellectual and Developmental Disabilities. Copyright by The American Association on Intellectual 
and Developmental Disabilities. Used with permission. 
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deny consent (Hayes, Adams, & Rydeen, 1994). For example, 
individuals with developmental disabilities may be asked to 
make major decisions during interdisciplinary team meetings. 
The way questions are posed to the individual may suggest 
subtle or not-so-subtle pressure to provide consent (e.g., “This 
will really help you. You agree with all of us on this, right?”). 
Therefore, the practitioner should consider discussing issues 
requiring consent with the individual and an independent advo- 
cate in a smaller, private setting. The discussion should not 
be constrained by time limits, or indicate a need for a quick 
approval. The prospective service recipient should be given an 
opportunity to take time to think, discuss, and review all options 


with a trusted confidant. These steps help ensure that consent is 
voluntary. 


Knowledge of the Treatment 


A person considering behavioral services or participation in 
research must be provided information in clear, nontechnical 
language regarding: (a) all important aspects of the planned 
treatment, (b) all potential risks and benefits of the planned pro- 
cedure or research protocols, (c) all potential evidence-based 
alternative treatments, and (d) the right to discontinue treatment 
at any time. Once information has been provided, the individual 
must demonstrate an understanding of that information, 
which clearly relates back to an assessment of that individual’ s 
capacity. The person must be able to describe the procedures 
in his or her own words, and correctly answer questions about 
the plan. For example, if a time-out procedure is suggested, the 
client should be able to describe how and when the time-out 
will be implemented. Instead of saying, “I'll get in trouble if 
I hit another person,” the individual should say, “If I hit some- 
body, I have to go sit over there for 2 minutes, and then I can 
come back.” Figure 31.8 outlines additional information that the 


following tasks: 


who may provide counsel and/or assistance. 


Adapted with permission of the authors. 


The behavior analyst helps to ensure the validity and clarity of informed consent by completing the 


e Generates appropriate documents that are understandable in the person’s primary language and 
that make clear all elements of the program, including who will deliver the services, their qualifi- 
cations, under what conditions services will be rendered, the length of the proposed intervention, 
risks and benefits, and how the program will be evaluated. 


° Ensures that all consent documents are discussed in a private setting, and, if necessary, with a 
family member, third-party advocate, or other witness present, as chosen by the service recipient, 


* Schedules the consent meeting at a time and location that Is convenient for the service recipient. 
This meeting should allow adequate time for all questions to be addressed, and for adjustments 
or alternatives to the proposed treatments to be considered. 


© Documents in writing the responsibilities of the behavior analyst and the service recipient, 

including: How confidentiality will be handled (i.e., who has access to records); provisions for a 
renewed consent if, for example, new data emerge that have the potential to change the pro- 
posed intervention; and any issues associated with remuneration, communication (e.g., during 


and after the intervention), cancellation of appointments, or termination of services. 
© Provides a copy of the signed and dated consent form at the conclusion of the meeting. 


Adapted from Ethical and Risk Management Issues in the Practice of ABA, by E. Cipani, S. Robinson, and H. Toro, 
2003. Paper presented at the annual conference of the Florida Association for Behavior Analysis, St. Petersburg, FL. 


behavior analyst should provide to the potential service recipi- 
ent to ensure voluntary, informed consent (Cipani, Robinson, & 
Toro, 2003). 

As part of providing information for consent, the behav- 
ior analyst should always do a risk—benefit analysis—that is, 
weighing the potential harm to the client (or others) versus the 
benefits likely to result from implementing those procedures— 
prior to conducting the assessment or intervention.’ Bailey and 
Burch (2016) encourage conducting a risk—benefit analysis of four 
key areas: (1) the general risk factors for behavioral treatment, 
(2) the benefits of behavioral treatment, (3) the risk factors for 
each behavioral procedure, and (4) a reconciliation of the risks 
and benefits with key parties involved. This analysis should occur 
prior to initial or subsequent requests for informed consent, and 
repeatedly throughout the delivery of services. For example, a 
descriptive assessment has few risks, and can help determine 
factors that trigger or maintain behavior. However, it does not 
provide a clear demonstration of functional relations. 

Alternatively, a full functional analysis, in which anteced- 
ents and consequences are systematically manipulated, can 
clearly demonstrate functional relations, but is also likely to 
evoke a short-term increase in the problem behavior, with lim- 
ited potential for triggering a major behavioral episode, possible 
injury, or other iatrogenic (i.e., intervention-induced) effects. 

A risk—benefit analysis must always be done when choos- 
ing one evidence-based treatment option over another. This type 
of analysis assists the practitioner in choosing the least restric- 
tive option that is most likely to be effective. 


Treatment Without Consent 


Most states have policies authorizing a course of action when an 
individual cannot, or will not, provide informed consent for a nec- 
essary treatment. Typically, consent may be granted in the case 


Figure 31.8 Information for clients to ensure informed consent. 
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of a life-threatening emergency, or when a risk of serious harm is 
imminent. In selected jurisdictions, if a school district determines 
that certain services are necessary (e.g., special education pro- 
gramming), but the parents refuse, the district may have recourse 
through a process of progressive administrative review, mediation, 
and ultimately court action (Turnbull & Turnbull, 1998). As in all 
aspects of our work, providing treatment without consent requires 
the practitioner to consult and follow all applicable and current 
local, state, and national laws and regulations. These may differ 
across various jurisdictions and are subject to change over time. 


Confidentiality 


Professional relationships require confidentiality, meaning that any 
information regarding an individual receiving or having received 
services may not be discussed with, or otherwise made available to, 
any third party, unless that individual has provided explicit autho- 
rization for release of that information. Although confidentiality 
is a professional ethical standard for behavior analysts, it is also a 
legal requirement in many jurisdictions (Koocher & Keith-Spiegel, 
1998). Figure 31.9 shows a standard release of information (ROT) 
form. Note that the ROI form clearly specifies what can be shared 
and released, and the expiration date for such releases. 


Limits to Confidentiality and Legal Disclosures 


Prior to the initiation of services, the behavior analyst must explain 
the limits of confidentiality to the client, document the meeting, 
and ensure that the client signs and dates the document. Disclosure 
of confidential information may occur without consent for a valid 
purpose, defined by the following conditions: (1) as mandated by 


Figure 31.9 Release of information form. 


(Name of a 


CLIENT: 


RELEASE OF INFORMATION AND ASSESSMENT CONSENT FORM 


law, (2) to comply with a court order, (3) when credible information 
suggests imminent harm or danger to the client or another person, 
(4) when an immediate crisis develops, or (5) when third-party 
payment for services is involved. For instance, if a student informs 
a school-based behavior analyst in confidence that another stu- 
dent has a loaded gun in the building, then confidentiality must be 
breached to protect the safety of students, staff, and administrators 
(cf. Tarasoff v. Regents of the University of California, 1976). 
Whenever such a disclosure is necessary, one should limit the dis- 
closure to the absolute minimum information necessary to meet the 
requirement of the valid purpose. 

Limits to disclosure also relate to third-party funding 
sources, which are entities other than the client or client guardian 
that pay for services. Typically, third-party entities have only lim- 
ited access to confidential material pertaining to basic client infor- 
mation, the validation of the delivery of services, and payment for 
those services. Often, other treatment-specific information may 
only be provided to a third party if a formal release is signed by the 
“first-party” (1.e., the recipient of services or his or her guardian). 


Maintaining Confidentiality 


The practitioner must ensure that proper safeguards are in 
place to protect client confidentiality, including, but not lim- 
ited to, storing hard-copy records in locked file cabinets, using 
only password-protected computer files, using encrypted files 
whenever possible, avoiding transmission of identifiable client 
information across any wireless systems, and confirming the 
identity of, and authorization for, any potential recipient of 
client-specific information. 


DOB: 


PARENT/GUARDIAN NAME: 


Home School Other: 


| consent for the above-named client to participate in an assessment through (provider/agency). | consent to 
have the assessment with the above-named individual conducted at the following locations (Circle relevant ones) 


care in the above-named locations: 


Evaluations/Assessments: 


| understand and consent to have the individuals responsible for care in the above-named locations 
involved in the assessment of the above-named client. In order to coordinate the assessment with these 
individuals, | authorize the release of the following confidential records to the individuals responsible for 


IEP or Other Records: 


Other: 


PARENT/GUARDIAN: 


| understand that these records may contain psychiatric and/or drug and alcohol information. | understand 
that these records may also contain references to blood-borne pathogens (e.g., HIV, AIDS). | understand that 

| may revoke this consent at any time; however, | cannot revoke consent for action that has already been taken. 

A copy of this release shall be as valid as the original. THIS CONSENT AUTOMATICALLY EXPIRES 30 DAYS AFTER 
TERMINATION OF SERVICES or after ONE CALENDAR YEAR. 


DATE: 


Adapted from ABA Technologies, Inc., Melbourne, FL. Copyright ABA Technologies Inc. Used with permission. 
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When Use of Social Media Results in Breach of Confidentiality 


Rebecca, a young BCBA, is excited that her 7-year-old client 
Jarod is making so much progress in acquiring language. She 
takes a selfie with Jarod and his family at a school picnic, 
and posts it on Instagram, with the family’s verbal consent. 
Does this raise an ethical issue? Verbal consent is clearly not 
sufficient. Others may appear in the background of the photo, 
and have not given consent. Moreover, the family members 
might decide that they do not want to advertise to both friends 
and strangers that their child is the recipient of behavioral 
services (and thus a potential target for exploitation or abuse). 

Louis is an aging BCBA from a state residential facil- 
ity that closed 30 years ago, who now posts old pictures of 


staff and clients on a Twitter feed titled “Staff Reunion from 
State Institution X.” Great pictures! Everyone loves to see 
them and share memories. Many of the individuals in these 
pictures have already passed away. Has Louis breached con- 
fidentiality here? The answer is almost certainly “yes.” But 
is there real harm? What if Louis did a Google search on 
“images from State Institution X” and found pictures of his 
old clients already available on the Internet? Would post- 
ing those pictures on Twitter be an ethical breach? There 
is no way to evaluate the potential harm; but without a 
release from the client, this is an unauthorized breach of 
confidentiality. 


The Health Insurance Portability and Accountability Act 
of 1996 (HIPAA), Public Law 104-191, specifically pertains to 
confidentiality of medical information and records. Many ABA- 
based services are delivered under the umbrella of medically 
authorized services (e.g., commercial insurance, Medicaid, other 
state funding). In these cases, practitioners must be careful to 
follow all HIPAA requirements, lest they sustain hefty fines and 
legal sanctions. 

New and heretofore unforeseen challenges to confidenti- 
ality have arisen in recent years with the inexorable growth in 
the use of social media (see Box 31.1). Later in this chapter we 
will discuss issues relating to maintaining confidentiality in this 
expanding global landscape of public communication networks. 


Breaches of Confidentiality 


Breaches of confidentiality apply to two main response classes: 
(a) valid disclosures without consent and (b) unintentional 


Client: 


Figure 31.10 Example of abuse-reporting protocol form. 


(Name of agency/provider) 
Confidentiality Act/Abuse-Reporting Protocol 


| understand that all information related to the above-named client's assessment and treatment must be 
handled with strict confidentiality. No information related to the client, either verbal or written, will be 
released to other agencies or individuals without the express written consent of the client's legal guardian. 
By law, the rules of confidentiality do not hold under the following conditions: 


1. lf abuse or neglect of a minor, disabled, or elderly person is reported or suspected, the professional 
involved is required to report it to the Department of Children and Families for investigation. 


2. If, during the course of services, the professional involved receives information that someone's life is in 
danger, that professional has a duty to warn the potential victim. 


3. If our records, our subcontractor records or staff testimony are subpoenaed by court order, we are 
required to produce requested information or appear in court to answer questions regarding the client. 


This consent expires 1 year after signature date below. 


breaches resulting from carelessness, misunderstanding of con- 
fidentiality requirements, or negligent disregard. For example, 
an unintentional and careless breach might occur if a behavior 
analyst provides confidential information to a parent about a 
child’s status, without confirming that the parent requesting 
the information is the de facto legal guardian for the child. The 
behavior analyst should have been aware of who has authorized 
access, and release protected information only after confirming 
proper authorization. 

Legally, all professionals must report suspected child 
abuse. Even in cases where the child fears repercussions if the 
abuse is reported, professionals at any level must step forward 
and report such cases to the authorities. Maintaining confiden- 
tiality when a child’s health, safety, or welfare is involved does 
not apply. Figure 31.10 shows a sample form documenting the 
information provided to the client relating to abuse-reporting 
requirements. 


Parent/Legal Guardian 


Adapted from ABA Technologies, Inc., Melbourne, FL. Copyri 


Date 


ht ABA Technologies Inc. Used with permission. | 


ne 
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Figure 31.11 pPerining an aPeepebls PrceUmens environment. 


. The environment is engaging: SEN era is Sle oie eae peteacte are Piece reinforcement contingen- 
cies are effective; exploratory play and practice flourish; the environment is by definition humane. 


. Functional skills are taught and maintained: Environments are judged not by paperwork or records, but by observed evidence 
of training and progress; incidental teaching allows the natural environment to support skill acquisition and maintenance. 


. Behavior problems are ameliorated: A purely functional approach regarding both behaviors and procedures leads to effec- 
tive interventions; arbitrary labels based on topographies are replaced by individualized definitions based on function. 

. The environment is the least restrictive alternative. Again, this is functionally defined, based on freedom of movement and 
activity engagement parameters; community settings may actually be more restrictive than institutional ones, depending on 
their behavioral effects. 

_ The environment is stable. Changes in schedules, programs, peers, caregivers, etc., are minimized; skills are maintained; 
consistency and predictability valued. 


. The environment is safe: Physical safety is paramount; supervision and monitoring are adequate; peer review ensures that 


proper program procedures are based on function. 


. The client chooses to live there: Efforts are made to determine client choice; alternative environments are sampled. 


Republished with permission of Plenum Press, from Defining an Acceptable Treatment Environment, J. E. Favell and J. E. McGimsey, 1993, in R. Van 
Houten and S. Axelrod (Eds.), Behavior Analysis and Treatment 25-45). Permission Se bees schemas Clearance Center, Inc. 


Protecting the Client’s Dignity, 
Health, and Safety 


Dignity, health, and safety issues often center on the contin- 
gencies and physical structures present in the environments in 
which people live and work. The behavior analyst should be 
acutely aware of these issues. Favell and McGimsey (1993) pro- 
vided a list of acceptable characteristics of treatment environ- 
ments to ensure dignity, health, and safety (see Figure 31.11). 

Dignity can be examined by addressing the following 
questions: Do I honor the person’s choices? Do I provide ade- 
quate space for privacy? Do I look beyond the person’s dis- 
ability and treat the person with respect? Behavior analysts can 
help to ensure dignity for their clients by defining their own 
roles. By using operant principles of behavior, they teach skills 
that will enable learners to establish increasingly effective con- 
trol over the contingencies in their own natural environments. 
Everyone has the right to say yes, no, or sometimes nothing 
at all (cf. Bannerman, Sheldon, Sherman, & Harchik, 1990). 

Choice is a central principle in the delivery of ethical 
behavioral services. In behavioral terms, the act of making a 
choice requires that both behavioral and stimulus alternatives be 
possible and available (Hayes et al., 1994). Practitioners must 
provide the client with behavioral alternatives, and the person 
must be capable of performing the actions required by the alter- 
natives. To leave a room, a person must be physically capable of 
opening the door, and the door must not be blocked or locked. 
The term stimulus alternatives refers to the simultaneous pres- 
ence of more than one stimulus item from which to choose (e.g., 
choosing to eat an apple instead of an orange or pear). To have 
a fair choice, a client must have alternatives, must be able to 
perform each alternative, and must be able to experience the 
natural consequences of the chosen alternative. 

Negligence represents a failure to demonstrate professional 
integrity on the part of the provider, and threatens the client’s 
health and safety [see The Compliance Code 1.04(c)]. Negligence 
is typically exhibited as nonfeasance—not doing what ought to 


be done. For instance, a behavior analyst may take on too many 
cases, and not adequately monitor progress for those cases; or he 
may not have planned for adequate clinical coverage during his 
vacation, leaving clients without access to services during those 
weeks; or perhaps she abruptly terminates a case without con- 
sideration for the client’s need for a slower fade-out of services. 
Additionally, negligence can sometimes be demonstrated when 
acting in an imprudent fashion. For example, an inexperienced 
analyst might escort too many clients on a community outing, 
and lacking appropriate supervision, one of the clients wanders 
away. This was not the intention of the escort, but by not providing 
adequate supervision, a negligent act occurred. 

While negligence is typically unintentional, fraudulent 
conduct is characterized as intentional, willful, and deceitful 
behavior, and can cause harm to another individual. Misrepre- 
senting one’s credentials, training, or background to a recipient 
of services is a form-of fraud, as is overbilling or falsifying cli- 
ent records. Less obvious forms of fraudulent conduct, such as 
minor falsifications of raw data (e.g., a data collector fills in a 
data sheet well after the observational period has ended) may 
or may not result in formal sanctions, but still constitute a clear 
breach of professional ethics. When an investigation leads to 
a finding of negligence or fraudulent conduct on the part of a 
practitioner, professional sanctions are nearly certain, and legal 
action resulting in fines or even prison is possible, depending on 
the circumstances and the degree of demonstrated harm. 


COORDINATING WITH OTHER PROFESSIONALS 


Behavior analysts are often required to work with a myriad of 
other professionals, many of whom subscribe to scientifically 
supported treatments. Such collaborations among interdisci- 
plinary team members can result in a variety of positive educa- 
tional and behavioral outcomes for individuals with disabilities 
(Brodhead, 2015; Hunt, Soto, Maier, & Doering, 2003) and 
promote treatment fidelity as well (Kelly & Tincani, 2013). 
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For example, behavior analysts often coordinate with medical 
professionals, particularly when the client is receiving psycho- 
tropic medication to address problem behavior. Behavior analysts 
must be sensitive to the fact that the medical professionals follow 
their own professional ethical code, as does each professional 
member of any treatment team. Effective coordination between 
disciplines can only occur if communication among all treatment 
team members remains clear and frequent, particularly in relation 
to any proposed “changes in medication regimens, therapies, or 
programming.” (Newhouse-Oisten, Peck, Conway, & Frieder, 
2017, p. 148). 

Behavior analysts may also interact with staunch propo- 
nents of nonbehavioral treatments for which solid evidence of 
effectiveness is unknown, ignored, or unreliable and therefore 
invalid. In these situations, behavior analysts must call upon 
their technical skills, social interaction skills, and The Com- 
pliance Code when formulating a comprehensive plan for the 
client. The Compliance Code {2.09(d)] states, in part, that behav- 
ior analysts have a responsibility to “review and appraise the 
effects of any treatments about which they are aware that might 
impact the goals of the behavior-change program” (BACB, 
2014, p. 9). Newhouse-Oisten et al. (2017) suggest that behavior 
analysts must focus on two aspects of each suggested interven- 
tion: “whether or not the intervention is supported by research 
evidence, and whether or not the intervention change is compat- 
ible with other interventions” (p. 150). This type of analysis is 
a clear ethical imperative when determining best practice, but 
can sometimes cause conflict within a treatment team. Brodhead 
(2015) provides a series of yes-no questions in a flow chart 
for evaluating such unknown treatments in an unbiased and 


objective manner: (1) What is the nonbehavioral treatment? 
(2) Is the client’s safety at risk? (3) Are you familiar with the 
treatment? (4) Being now familiar with the treatment, have new 
safety concerns been revealed? (5) Is treatment success possible 
when nonbehavioral treatment is translated into behavioral pro- 
cesses? (6) Are the impacts to the client sufficient to justify the 
possibility of compromising the professional relationship? Brod- 
head also provides a checklist for analyzing proposed treatments 
and suggests options for ethical action by behavior analysts. 

An alternative, but similar, approach is provided by 
Schreck and Miller (2010), who provide a flow chart to help 
the practitioner evaluate the validity of three aspects of the pro- 
posed but unknown treatment: (1) the underlying theory, (2) the 
treatment techniques, and (3) the claims of effectiveness. 

Box 31.2 offers additional guidelines for professionals to 
help parents, teachers, and others to evaluate alternative treat- 
ment recommendations that may arise. 

The best interests of the client will be served when the 
behavior analyst uses one of these models, or an alternative effec- 
tive and structured approach, to help guide collaboration with 
other professionals in a way that enhances client services and 
reflects well on the field. However, in rare cases, the alternative 
approach is so disruptive to the effective delivery of behavioral 
services that the practitioner must withdraw from the case, pro- 
viding the client with clear and valid reasons for doing so. 

This approach to coordinating with other professionals can 
be applied to the third scenario described in the opening of the 
chapter. Ms. Dougherty is in a bind, but she only “perceives” the 
intent of the school administrator with whom she has to coordinate 
services. She should continue to advocate for the best interests of 


Guidelines for Professionals to Help Parents, Teachers, and Other Caretakers 


Parents, teachers, and other caretakers are not without resources 
when trying to ascertain whether a particular treatment meets 
the standard for best practice, or falls within the scope of a fad 
or untested intervention. Three resources follow. 

First, change agents can consult a trusted behavior ana- 
lyst and the corresponding Code of Conduct that he or she 
follows. The Compliance Code states that, “Behavior analysts 
always have the obligation to advocate for and educate the 
client about scientifically supported, most effective treat- 
ment procedures” [The Compliance Code 2.09(a)]. Moreover, 
The Compliance Code 6.02 states: “Behavior analysts promote 
behavior analysis by making information about it available 
to the public through presentations, discussions, and other 
media.” So, when change agents follow the advice of a behay- 
ior analyst, they increase the chances that the recommenda- 
tions they receive meet quality care criteria. 

Second, Schreck and Mazur (2008) suggest a series of 
six strategic techniques to counter the proliferation of fad/ 
alternative treatments: (1) compare the effectiveness of the 
treatment in question with ABA; (2) emphasize the long- 
term cost effectiveness of ABA; (3) publicize results of these 


comparisons via mass media; (4) market ABA to parents and 
employers; (5) find a way to short-circuit the search for an 
“easy way out” solution and the influence on this by mass 
media; and (6) educate BCBAs on the ethical dilemmas and 
consequences of using nonsupported treatments. 

Finally, Heron, Tincani, Peterson, and Miller (2005) 
suggest that all interventionists develop what noted astrono- 
mer, Carl Sagan, termed a “Baloney Detection Kit’—a kind 
of toolbox of conceptual instruments to promote skeptical 
thinking and logical analysis. Peters and Heron (1993) add 
that practitioners, when trying to judge the efficacy of any edu- 
cational or therapeutic approach, should consider answering 
these five questions: 


1. Does the model or program derive from a sound theo- 
retical base? 

2. Is the research methodology convincing and compelling? 

3. Is there consensus with existing literature? 

4. Is there evidence that outcome data are consistently 
produced? 

5. Is there evidence of social validity? 
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the student, but also should consult with her supervisor to assist 
with her advocacy, and suggest a systematic assessment of the 
suggested treatments to identify the most effective approach. 

When a behavior analyst makes a referral to another pro- 
fessional, there must never be referral fees or kick-backs. The 
practitioner should provide the client with a list of options, a 
choice of possible providers, and allow the client to make the 
final choice. No money, gifts, or favors should ever change 
hands when a referral from one professional results in a client 
contacting another professional. 


SOCIAL MEDIA AND NEW TECHNOLOGIES 


The use of social media is pervasive, with compelling evidence 
to show that a great majority of the world’s population has access 
to, and frequently uses, smartphones, tablets, pads, and other 
electronic devices to share information and interact with others 
(Greenwood, Perrin, & Duggan, 2016; Pagoto et al., 2016). In 
effect, social media and the Internet have become the marketplace 
and village square of the 21st century, where social interactions 
flourish. Admittedly, social media present positive opportunities 
for sharing information and discussing issues pertaining to behav- 
ioral science (e.g., Students of Behavior Analysis on Facebook). 
Several media platforms (e.g., Facebook, Twitter) offer private 
areas where postings may be viewed by an individual, or a select 
few, who are provided specific and limited access. Thus, a prac- 
titioner might send a private message to an individual or limited 
group, to assist in supporting treatment goals or to help maintain 
treatment fidelity (e.g., sending reminders to collect data). 

Moreover, new technologies provide individuals with the 
opportunity to view treatment sessions from remote locations, in 
real time, which can be useful for supervisors, parent/guardians, 
and behavior analysts in training. 

Still, all therapeutic disciplines must come to terms 
with the proper way to use social media, without breaching 


confidentiality or falling into ethical quicksand. Unsecured 
transmission networks are of significant concern, and behav- 
ior analysts must be well versed in the nature and scope of 
the security provided to protect their communications. Even in 
supposedly secure social media sites, users must be cautious, 
since data breaches are always possible and security settings 
are subject to change by the service provider. Thus, client- 
specific information must always be masked, such that any 
individual discussed can never be identified by name or loca- 
tion (Pagoto et al., 2016). While codes of professional ethics 
are only recently beginning to catch up with the challenges 
posed by these new media, O’Leary, Miller, Olive, and Kelly 
(2017) have provided useful recommendations for maintaining 
professional ethics in an emerging social media environment 
(see Box 31.3). Further, Box 31.4 presents ways to avoid the 
pitfalls of using social media. 


Helping the Client Select Outcomes 
and Behavior Change Targets 


The recipient of behavior analysis services (or the recipient’s 
guardian) should identify the ultimate goal(s) of the service, 
termed “outcomes.” Outcomes must be meaningful to the indi- 
vidual, contextually appropriate, and achievable. The goals 
should focus on a quality of life issue for the individual (e.g., 
obtaining employment, establishing long-term relationships, 
accessing community and educational settings) (Felce & 
Emerson, 2000; Risley, 1996). The behavior analyst should help 
the individual express her preferences and life goals, and iden- 
tify what behaviors she would need to do (and not do) to help 
accomplish those goals. Behaviors selected for change must 
benefit the individual recipient of services, not the practitioner 
or caregiver. 

Early on, behavior analysts were rightly criticized for 
focusing on behaviors that mainly benefited staff (e.g., Winett & 


Recommendations for Behavior Analysts’ Use of Social Media 


1. Real clients should be heavily disguised. Whenever 
talking about anything client specific, many details can 
be changed and the point of the discussion remains intact. 

2. Avoid making treatment recommendations. There is sim- 
ply not enough information on social media outlets to 
give specific advice. 

3. Refer readers back to the literature. Rather than mak- 
ing a treatment recommendation, recommend a specific 
paper, or work by particular authors. 

4. Write a disclaimer. Be sure that anyone reading anything 
you write, especially an answer to a question, cannot 
get the idea that there is a professional relationship. Be 
explicit in stating this. 


5. Provide resources. Beyond literature references, provide 
interested parties with ideas for search topics, websites, 
and the opportunity to discuss a post offline, to prevent 
misinterpretation by non—behavior analysts. 

6. Provide organizational training. This fits with The Com- 
pliance Code’s requirement that we create a culture of 
ethics. Training in a work setting on how to navigate 
the dangerous currents of social media can be vital in 
preventing potential problems. 


From “Blurred Lines: Ethical Implications of Social Media for Behavior Ana- 
lysts,” by P. N. O’Leary, M. M. Miller, M. L. Olive, and A. N, Kelly, 2017, 
Behavior Analysis in Practice, 10, pp. 45-51. Used with permission. 
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BOX 31.4 


The Pitfalls of Social Media 


Guy is a veteran behavior analyst who reads young BCBAs’ 
case-related questions on a Facebook page titled “Behavior 
Analysis Q&A!” and provides answers and treatment advice 
to help guide them along. Everyone on the Facebook page tells 
Guy how much his help is appreciated. Even if the posted ques- 
tions are carefully written so they contain no hints of client 
identity or specific setting, are there ethical issues involved 
here? Yes, many. The young practitioners might begin to rely 
on the sage advice provided by this “Facebook expert” and 
possibly others who provide quick answers. Those posing 
questions may, over time, become less likely to search the lit- 
erature, seek necessary supervision or mentoring, or obtain 
consultations. They may eventually engage in the same form 


Winkler, 1972). For example, an adult in a work setting might 
easily have had docile “compliance” identified as a target behav- 
ior, even though strengthening work skills and functional social 
interactions would have been of far greater benefit to that indi- 
vidual. Compliance, once commonly identified in behavior pro- 
grams, is not an appropriate goal by itself. The question must 
be posed: compliant to what? Teaching compliance is ethically 
justifiable only when it facilitates the development of other func- 
tional or social skills that will, in turn, contribute to that individ- 
ual’s obtaining long-term self-identified outcomes. 

This discussion on appropriate selection of goals, along 
with prior information on obtaining input from a guardian, sur- 
rogate, or advocate may be applied to the first scenario described 
in the opening of the chapter. Dr. Han could suggest that Lee 
meet with a trusted adviser or surrogate to review the pros and 
cons of the move from a residential program to an apartment. In 
addition, they could discuss the reasons Lee wants to move, to 
see if the current setting could be adjusted in ways that would 
meet his needs more effectively, without having to go through 
a move in residence. 


Maintaining Records 


Well-maintained records (e.g., assessments, treatment plans, 
progress notes, data) facilitate the provision of future services, 
meet agency and/or institutional requirements, ensure accurate 
billing, allow for future research, and help meet legal require- 
ments. Behavior analysts must typically maintain all case file 
records for 7 years, subject to individual, state, or jurisdictional 
requirements. The Compliance Code states that the behavior 
analyst is responsible for maintaining individual client confi- 
dentiality in terms of “creating, storing, accessing, transferring, 
or disposing of records under their control, whether these are 
written, automated, electronic, or in any other medium” [The 
Compliance Code 2.07(a)]. When destroying paper records, 


of “social media mentoring,” since Guy has modeled this as a 
form of appropriate professional behavior. Any advice given 
on a case-related question, no matter how wise it may be, is 
based on incomplete information. Others reading these posts, 
including non—behavior analysts, might misuse these treatment 
suggestions by inappropriately applying them to other indi- 
viduals with other behavior problems in other settings. Every- 
thing about Guy’s apparently supportive action violates the 
core focus of our clinical approach: to directly evaluate the 
behavior of the individual in context, and base treatment on 
identified functional relations. The behavior of this seasoned 
adviser may help a few people, but he will almost certainly 
never know the extent of the damage he may well have caused. 


use crosscut shredding. Destruction of electronic records 
requires that all stored files in all possible storage locations 
be fully expunged. Be aware that electronic transmission of 
confidential records across any unsecured medium (e.g., fax 
lines in public areas, e-mail) is prohibited under Public Law 
104-191 (HIPAA). 


ADVOCATING FOR THE CLIENT 
Providing Necessary and Needed Services 


Prior to initiating services, a behavior analyst has the respon- 
sibility to validate that a referral warrants further action. This 
poses the first ethical challenge to the practitioner: deciding 
whether to accept or reject the case. The decision to provide 
treatment may be divided into two sequential decisions: (1) Is 
the presenting problem amenable to behavioral intervention? 
and (2) Is the proposed intervention likely to be successful? 


Is the Problem Amenable to Behavior Treatment? 


To determine whether behavioral intervention is necessary and 
appropriate, the behavior analyst should seek answers to the 
following questions: 


1. Has the problem emerged suddenly? 
a. Might the problem have a medical cause? 
b. Has a medical evaluation been done? 


2. Is the problem with the client or with someone else? (For 
example, a student who has done well through fourth 
grade suddenly exhibits “behavior problems” in fifth 
grade, although she remains well behaved at home. Per- 
haps a simple change of teachers will solve the problem.) 


3, Have other interventions been tried? 
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4. Does the problem actually exist? (For example, a parent 
has serious concerns about a 3-year-old child who “refuses 
to eat” everything provided at every meal.) 


5. Can the problem be solved simply or informally? (For 
example, the same 3-year-old “elopes” through a back 
door that is left wide open all day.) 


6. Might the problem be better addressed by another disci- 
pline? (For example, a child with cerebral palsy may need 
adaptive equipment instead of behavioral treatment.) 


7. Is the behavioral problem considered an emergency? 


Is the Proposed Intervention Likely to Be Successful? 


Questions to ask when considering whether the intervention is 
likely to be successful include the following: 


1. Is the client willing to participate? 

2. Are the caregivers surrounding the client willing or able 
to participate? 

3. Has the behavior been successfully treated in the research 
literature? 


4. Is public support likely? 


5. Does the behavior analyst have the appropriate experience 
to deal with the problem? 


6. Will those most likely to be involved in implementing the 
program have adequate control of the critical environmen- 
tal contingencies? 


If the answer to all six of the questions posed in this 
second list is “yes,” then the behavior analyst can act. If the 
answer to any of these six questions is no, the behavior analyst 
should seriously consider declining to initiate treatment. 


Embracing the Scientific Method 


General questions related to what constitutes science and how 
it applies to the study and improvement of behavior were 
addressed in Chapter 1. The scientific method relates to ethics, 
at least in part, in its insistence that evidence-based practices 
utilize direct measurement to establish the effectiveness of 
methods that are grounded in peer-reviewed literature, and 
tests any emerging practice to assess its effectiveness before 
implementation. The importance of embracing evidence-based 
practices increases as questionable claims regarding effective 
“alternative” treatments become more widespread, and are 
seemingly accepted by the general public without critical exam- 
ination (Shermer, 1997). In Sagan’s (1996) view, extraordinary 
claims require extraordinary evidence, and as space engineer 
James Oberg wryly quipped, “In science keeping an open mind 
is a virtue, but not so open that your brains fall out” (cited in 
Sagan, 1996, p. 187). 

Applied behavior analysts should base their practice on 
two primary sources: scientific literature and frequent direct 
measurement of behavior in context. Peer-reviewed scientific 
reports published in reputable journals and well-referenced 
texts provide objective information on effective intervention 
strategies. Direct repeated measurement of behavior provides 


empirical data to inform our assessments and evaluations of 
the effectiveness of our work. The cornerstone of ethical prac- 
tice is providing effective services based on solid, replicated 
research evidence. 


Alternative/Fad Treatments 


The educational and therapeutic world abounds in a wide vari- 
ety of teaching techniques and approaches to address prob- 
lem behavior. Most of these so-called remedies have never 
been scientifically or empirically validated. As new, untested 
treatments arise, parents, teachers, and other caregivers who 
legitimately seek educational, social, or behavioral assistance 
can easily fall prey to ill-advised and uninformed pied pipers 
(Heron et al., 2005). For example, Kohn (1997) promotes the 
notion that repeated practice of a skill is harmful because it 
destroys a student’s motivation to learn, and ultimately results 
in the student being unhappy with school in general. This 
assertion has gained traction with the general public despite 
a complete lack of peer-reviewed evidence to validate the 
claim. On the contrary, decades of carefully controlled and 
replicated research have demonstrated that repeated practice 
is a key determinant in the mastery of a skill (Binder, 1996; 
Ericsson & Charness, 1994). 

Box 31.5 explores several issues and problems related to 
the continuing emergence of alternative and fad treatments. 


Evidenced-based Best Practice and 
Least Restrictive Alternatives 


Silvestri and Heward (2016) define evidence as “the outcome 
of the application of the scientific method to test the effective- 
ness of a claim, a theory, or a practice.” Moreover, they state 
that, “extraordinary evidence requires replication. A single 
study, anecdote, or theoretical article, no matter how impressive 
the findings or how complicated the writing, is not a basis for 
practice” and that “an extraordinary claim based on anything 
but proportionally strong evidence should bring skepticism” 
(pp. 149-150). 

An essential component of ethically driven behavior 
analysis is not only that interventions and related practices 
be evidence-based, but also that the most powerful, yet least 
intrusive methods be used first. Moreover, intervention plans 
must be designed, implemented, and evaluated systematically. 
If the individual fails to show progress, data systems should be 
reviewed and, if necessary, interventions modified. If progress 
is made, the intervention should be phased out and assessed 
for generalization and maintenance. In all phases of an inter- 
vention, data and direct observations should drive treatment 
decisions. Models and programs that meet the above standard 
are deemed “best practice” and can be judged by their meet- 
ing at least five standards: Does the model or program derive 
from a sound theoretical base? Is the research methodology 
convincing and compelling? Is there consensus with existing 
literature? Is there evidence that outcome data are consistently 
produced? Is there evidence of social validity? (Peters and 
Heron, 1993). The Compliance Code [{2.09(c)] also indicates 
that when “more than one scientifically supported treatment 
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Lotions, Potions, and False Notions 


“Cures livers, quivers, and 
shivers! It delivers! For 
fevers, heavers, and 
believers! Step right up!” 


“Lots of stuff 
| never heard of 
here... it's gotta be 


“I'm from Missouri, 
mister. Show me the 
proof!” 


The promotion and sale of ineffective treatments for a variety 
of ailments has a long history. “Snake-oil salesmen” were so 
common in the Wild West era of the United States that the 
term is now used to label any purveyor of fake cures or, more 
broadly, false solutions to problems. Individuals who are likely 
to seek behavior analytic services have often been confronted 
with a dizzying array of claims and counter-claims regarding 
a wide variety of alternative treatment options. Many prom- 
ise spectacularly positive results and even total “cures” for 
everything from confrontational teenage behavior to the full 
spectrum of autism-related disorders. Most of these claims 
have no solid foundation in research or peer-reviewed litera- 
ture. Treatments such as DIRFloortime, Doman-Delacato pat- 
terning treatment, aromatherapy, animal-assisted therapies, 


has been established, additional factors may be considered in 
selecting interventions, including, but not limited to, efficiency 
and cost-effectiveness, risks and side-effects of the interven- 
tions, client preference, and practitioner experience and train- 
ing” (2014, p. 9). Geiger, Carr, and LeBlanc (2010) describe 


“Maybe this could 
really help Billy” 


and auditory integration training (to name but a few) lack 
reputable scientific evidence base (Hines, 2001; Koocher & 
Gill, 2016; Mudford & Cullen, 2016). Other alt-treatments 
have been extensively studied and soundly debunked, as in the 
case of facilitated communication (Jacobsen, Foxx, & Mulick, 
2016; Lilienfeld, Marshall, Todd, & Shane, 2015). We expect 
other untested treatments to enter the marketplace, many likely 
to use new technology-based interventions that sound highly 
scientific but lack a firm basis in research and replication. 
When a behavior analyst is providing services to an individual 
who may be tempted to try one or more of these alt-treatments, 
he or she has a responsibility to assist that individual in evalu- 
ating the likely effectiveness and potential costs related to the 
proposed intervention. 


a similar approach to selecting among function-based treat- 
ments for problem behavior, taking into account consideration 
of implications for safety, the resources of the environment, 
and prioritization of improved quality of life and skills for the 
client (see Figure 31.12). 
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Figure 31.12 Selecting function-based interventions. 


Presenting Concern 


Background: Marco is a 3-year-old child diagnosed with autism 
spectrum disorder. He has very minimal verbal skills but is for- 
tunate enough to be enrolled in an early intensive behavioral 
intervention program. He is making progress in developing 


his parents, particularly when they are performing activities of 
daily living (e.g., dressing him) and hygiene maintenance (e.g., 
brushing his teeth). During these tantrums, he yells, cries, and 
engages in self-injurious behavior (e.g., scratching at his own 
arms) as well as assaultive behavior (e.g., scratching and pinch- 
ing others). Marco has begun to exhibit these same behaviors 
with the staff who work with him in his behavioral program. 
His BCBA has conducted a comprehensive functional assess- 
ment, including a functional analysis. The identified function 
of these behaviors is escape (i.e., negative reinforcement func- 
tion). The BCBA must now develop a behavior plan to reduce 
his problem behavior and increase his appropriate completion 
of tasks. This BCBA has read a number of articles from the lit- 
erature and discovered that many evidence-based interventions 
are function-based treatments for escape-maintained problem 
behavior, and he is unsure which of them would be best suited 
to this situation. 


CONFLICT OF INTEREST 


A conflict of interest occurs when a principal party, alone or in 
connection with family, friends, or associates, has a vested inter- 
est in the outcome of the interaction. The most common form of 
conflict arises in the form of dual- or multiple-role relationships. 
Conflicts arise when a person acting as a therapist enters into 
another type of relationship with the client, a family member, or 
a close associate of the client, or promises to enter into such a 
relationship in the future. These relationships may be financial, 
personal, professional (providing another service, for example), 
or otherwise beneficial to the therapist. 

Direct and frequent observations during assessment and 
intervention phases bring the behavior analyst into close contact 
with the client (and often family members, other professionals, 
and caregivers) in various natural settings. Personal relationships 
can be formed that may imperceptibly cross professional bound- 
aries. For example, family members may offer unsolicited gifts 
or make invitations to parties or other events. In every interaction 
related to their professional duties, behavior analysts must mon- 
itor their own behavior with vigilance, guarding against blurring 
the lines between personal and professional conduct. This is espe- 
cially true when treatment is provided within a private home. In 
service contexts, personal relationships of any kind can quickly 
develop ethical complications, and are to be avoided. 

Other professional conflicts of interest might arise as well. 
For example, a teacher should never hire a current student as an 


new skills more rapidly than others. He often has tantrums with 


Ethical Problem-solving Strategies 


Analysis and Action: Marco's BCBA identified a published 
clinical decision-making model for selecting function-based 
treatments for escape-maintained problem behavior 
(Geiger, Carr & LeBlanc, 2010). This model suggests that 
five core considerations should guide the determination 

of which intervention is best suited for a specific situation: 
Quality of life of the client, safety of everyone involved, 
resource constraints of the environment, existing skill sets of 
the client, and remaining skill needs of the client. The BCBA 
answered the progressive list of questions suggested in this 
model and determined that the optimal intervention would 
be functional communication training (FCT). The tasks that 
are occurring now are necessary and the instructional pro- 
cedures are generally quite good. Marco's staff and parents 
felt as though they could remain safe and tolerate a small 
amount of problem behavior as long as it is decreasing 

over time. Marco will comply with several demands, and it 
is acceptable to allow periodic breaks from tasks as part of 
a treatment plan. This allowed Marco’s BCBA to consider 
the most important skill to prioritize. Since Marco has very 
limited verbal skills, it is important to teach him a way to 
ask for a break or assistance when he finds himself in an 
aversive situation. So FCT is a great first intervention, and 
other interventions might be added if needed. 


employee in a business outside of school. That student’s perfor- 
mance in one area could potentially affect the teacher-student 
interaction in the other. A supervisee should be off limits as a 
potential romantic partner. A member of a peer review commit- 
tee must not participate in a review of his own work, or the work 
of his supervisees. The general rule to follow is that the practi- 
tioner should strive to avoid all potential conflicts of interest. A 
practitioner in doubt should consult with a supervisor or trusted 
and experienced confidant. 


CREATING A CULTURE OF ETHICAL PRACTICE 


Ethical or unethical behavior occurs in an environmental con- 
text, and environments can be arranged to support behavior 
that is ethical. Moreover, “the ability to sort through the issues 
and make decisions that most people would see as ethical is 
a skill that emphatically can and should be taught” (Lattal & 
Clark, 2007, p.13). Even the most proficient professionals ben- 
efit from robust environmental supports for problem-solving 
repertoires that help maintain ongoing ethical practice. To 
foster a culture of ethical practice (The Compliance Code 
7.01), organizations can commit to providing ethics-related 
trainings, support systems, and differential reinforcement of 
ethical behavior. Brodhead and Higbee (2012) outline vari- 
ous strategies for teaching and maintaining ethical behavior in 
a professional organization. They address the importance of 
initial training in ethics and ongoing training and supervision 
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BOX 31.6 


Does Your ABA Organization Have an Ethics Director? 


The Ethics Network at Linda LeBlanc’s clinical service 
organization includes a position of Ethics Director. This 
doctoral-level behavior analyst has extensive experience as a 
supervisor and extensive training in ethics. The Ethics Director 
serves as a mentor to the ethics and assistant ethics chairper- 
sons, each of whom fosters and supports ethics throughout 
the organization. The Ethics Network engages in four primary 
activities designed to sustain continuous discussion about 
ethics at all levels of the organization. 

First, the network director designed the initial ethics 
training for all members of the organization with a focus on 
(1) core principles that underlie all aspects of our ethical code 
and (2) structured practice in a six-step problem-solving pro- 
cess that includes detection and (3) an analysis of situations as 
potential ethical dilemmas. 

Second, the network director monitors an internal ethics 
hotline that can be accessed by any member of the organiza- 
tion who needs support with a difficult or ambiguous situation. 
Asking for assistance through this mechanism is differentially 
reinforced with swift assistance and praise for critical analysis 
of the situation, as prompted by the submission portal (i.e., 
the problem-solving steps). The ethics leadership team ana- 
lyzes topics and situations submitted to the hotline to hone 
team members’ skills in responding to ethical conundrums. 


that directly incorporates ethics as a professional development 
topic. In addition, they recommend using previous ethically 
complex situations as instructional opportunities to prevent 
recurrence of the same problem. 

In sum, continuous education and discussion about ethics 
and the contingencies that support analysis and assistance seek- 
ing are core features of an environment designed to maintain 
ethical behavior in an organization (see Box 31.6). When indi- 
vidual behavior analysts do not operate within an environment 
with other behavior analysts, they should initiate connections 
with other behavior analysts through professional organizations, 
conference sessions, or seminars to access support and consul- 
tation as needed. 


CONCLUSION 


Practicing ethically poses daily challenges. It requires vigilance, 
self-monitoring, and the ongoing dynamic application of one’s 
principles and professional code. Our profession has created 
4 code of ethical conduct and set consequences for violating 
those rules. But we can also create cultures of ethics in a wide 
variety of settings, to help train practitioners in ethical decision 
making and to support ongoing discussion of ethical issues. 


Those situations are used monthly to create instructional mate- 
rials called “discussion starters”; these include specific role 
plays that can be used in supervision activities throughout the 
organization. 

Third, semi-annual trainings occur with the clinical team 
to review material that is pertinent to the most challenging 
situations submitted to the hotline (e.g., ethical issues asso- 
ciated with serving families who are experiencing divorce or 
separation). 

Fourth, the ethics network director leads discussions 
with the administrative leaders of the organization at least 
annually to focus on specific parts of The Compliance Code 
that are directly pertinent to their ongoing activities (e.g., 
review of codes relevant to social media and nonsolicitation of 
testimonials with the marketing team). Since these individuals 
generally have very different training and backgrounds than 
behavior analysts, they may be likely to misrepresent important 
features of our code and our field uniess the behavior analyst 
accepts the responsibility to educate them about the underlying 
principles that lead us to value certain actions (Code 10.07). 

Establishing the Ethics Network has been personally and 
professionally rewarding, as it has led to improved commu- 
nication, enhanced problem solving, team collaboration, and 
higher quality service to our clients. 


In this way, we take an antecedent approach that promotes ethi- 
cal action and effectively prepares the behavior analyst for acting 
ethically at the moment when an ethical dilemma presents itself, 
the moment when unethical behavior is most likely to occur. 

Most ethical challenges, regardless of the details involved, 
may best be addressed by revisiting the three questions posed 
early in the chapter: What is the right thing to do? What iS 
worth doing? What does it mean to be a good behavior analyst? 
These three questions can serve as focal points for effective and 
ethical decision making. Addressing these questions honestly 
and without bias will help practitioners avoid many potential 
ethical pitfalls and keep their efforts centered on the interests of 
the client, the student, or others receiving their services. 

By practicing ethically, and by maintaining adherence to 
the scientific method and the principles, procedures, and dimen- 
sions of applied behavior analysis, practitioners have a ready 
source of valid, accurate, reliable, and believable data to inform 
decision making. As a result, they will be more likely to achieve 
the original and continuing promise of applied behavior analysis: 
to apply knowledge gained through research in both the 
experimental analysis of behavior and applied behavior analysis 
to help solve problems of social importance to each individual 
recipient of services. 
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SUMMARY 
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What Is Ethics and Why Is It Important? 


1. 


For the applied behavior analyst, ethics defines specific 
rules of conduct. 


. Two approaches to ethics are termed the deontological 


or “acts-oriented” approach, which evaluates only the act 
itself, and the utilitarian or “results-oriented” approach, 
which looks at the results or consequences of the act. Both 
approaches contribute to the ethical conduct of behavior 
analysts. 


. Ethics describes behaviors, practices, and decisions that 


address three basic and fundamental questions: What is the 
right thing to do? What is worth doing? What does it mean 
to be a good behavior analyst? 


. Ethics is important because it helps practitioners decide 


whether a course of action related to the professional prac- 
tice of behavior analysis is appropriate or inappropriate. 
Codes of ethics guide decisions on how to act even in the 
presence of outside pressures, priorities, or expediencies. 


. Ethical practices are derived from other behavior 


analysts and professionals to ensure the well-being of 
(a) clients, (b) the profession itself, and (c) the culture 

as a whole. Over time, practices become codified into 
ethical rules of conduct that may change in response to a 
changing world. 


. Personal histories, including cultural and religious 


experiences, influence how practitioners decide on a 
course of action in any given situation. 


. Practicing behavior analysts must be aware of the settings 


in which they work and the specific rules and ethical 
standards applicable to those settings. Behavior analysts 
are required to be fully familiar with all ethical and legal 
issues pertaining to their practice. 


Standards of Professional Practice for Applied 
Behavior Analysts 


8. 


9. 


10. 


11. 


Professional organizations adopt formalized statements 

of conduct, codes of professional behavior, or ethical 
standards of behavior to guide their members in making 
decisions. Also, standards exact sanctions when deviations 
from The Compliance Code occur. 


The client’s welfare is at the forefront of ethical decision 
making. 


Behavior analysts must take steps to ensure that their 
professional conduct is self-regulating. 


Professional standards, guidelines, or rules of practice 
are written statements that provide direction for conduct- 
ing the practices associated with an organization. 


12. Behavior analysts are guided by position statements 


and ethical codes such as the Professional and Ethical 


Compliance Codes for Behavior Analysts, Ethical 
Principles of Psychologists and Code of Conduct, 
The Right to Effective Behavioral Treatment, State- 
ment on Restraint and Seclusion, and The Right to 
Effective Education. 


. Behavior analysts are now held to The Compliance Code, 


and perceived violations of that code can be reported using 
a notice of alleged violation. 


Ensuring Professional Competence 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


Professional competence in applied behavior analysis is 
achieved through formal academic training that involves 
coursework, supervised practica, and mentored profes- 
sional experience. 


Behavior analysts must be truthful and accurate in all 
professional and personal interactions. 


Competence in supervision is critical, as most behavior 
analysts supervise the work of others in the context of 
their practice. 


Formal certification and/or licensure credentials in behav- 
ior analysis allow consumers to identify practitioners who 
have demonstrated at least a minimum standard of training 
and competence. 


Behavior analysts must practice within their areas of com- 
petence, training, and experience. 


Behavior analysts can develop additional competencies 
through supervised or mentored experience. 


Behavior analysts take responsibility for all facets of the 
undertaking when providing supervision and/or mentoring 
of other behavior analysts, individuals seeking certification 
in behavior analysis, paraprofessionals, or non—behavior 
analysts. 


Behavior analysts have a responsibility to maintain and 
expand professional competencies, and may do so using 
continuing education, participation in professional 
conferences, maintaining contact with current profes- 
sional literature, and/or participating in peer review 
committees. 


Ethical Issues in Client Services 


22. 


~) 
a 


Three tests must be met before informed consent can be 
considered valid: capacity to decide, voluntary decision 
and knowledge of the treatment. 


7. 


Confidentiality refers to the professional standard 
requiring that the behavior analyst not discuss or oth- 
erwise release information regarding someone in his 
or her care. Information may be released only with 
formal permission of the individual or the individual’s 
guardiaii. 
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24. Practicing behavior analysts have a responsibility to 


behavior ar Advocating for the Client 
protect a client’s dignity, health, and safety. Various rights 


must be observed and protected, including the right to 33. In deciding to provide services, the behavior analyst must 
make choices, the right to privacy, the right to a therapeutic determine that services are needed, that medical causes 
treatment environment, and the right to refuse treatment. have been ruled out, that the treatment environment will 
support service delivery, and that a reasonable expectation 
25. Negligence (not doing what one should) and fraudulent of success exists. 
conduct (willful deceit likely to cause harm) on the part ; 
of the behavior analyst pose a direct threat to the client’s Sat Se oe 2 pce Ce 
health and safety, and can result in professional sanctions SetpiaL decsigninies ¢l) demining aloe Daa 
anu lpmeat sigdlacinno antes amenable to behavioral intervention and (2) evaluating the 


likely success of an intervention. 


Coordinating with Other Professionals 35. The behavior analyst should advocate for the most 
effective, scientifically supported treatment procedures. 
The behavior analyst must also take into account the 
social validity of any proposed treatment procedures. 
The behavior analyst has the responsibility to objectively 
evaluate alternative treatments when necessary and ® 
27. Effective coordination between disciplines can only occur appropriate. 
if communication among all treatment team members 36 
remains clear and frequent. 


26. Behavior analysts must coordinate services with other 
professionals in a way that best meets the interests of the 
individuals receiving services, and that represents the field 
of behavior analysis in a positive and professional manner. 


. Behavior analysts embrace the scientific method, and 
promote evidence-based practices in all consumer-related 


28. The best interests of the client will be served when the services. 
behavior analyst guides collaboration with other profes- 
sionals in a way that enhances client services, and reflects Conflict of Interest 
well on the field. 


37. A conflict of interest occurs when a principal party, alone 
or in connection with family, friends, or associates, has a 


Social Media and New Technologies ; 
vested interest in the outcome of the interaction. 


29. The use of social media and the emergence of other 
new technologies present an increasing array of oppor- 
tunities, challenges, and potential ethical pitfalls for the 
behavior analyst. Special care must be taken to protect client 
confidentiality, and the behavior analyst should avoid making 


case-specific treatment recommendations in a public forum. 39. When coordinating with other professional service provid- 
ers, behavior analysts have a responsibility to review and 
appraise the effects of any treatments that mi ght affect the 
goals of the behavior change program. 


38. All sources of conflict of interest, particularly dual rela- 
tionships, are to be avoided. 


Creating a Culture of Ethical Practice 


30. New technologies allow practitioners to view treatment 
sessions from remote locations, in real time, which can 
be useful for supervisors, parents/guardians, and behavior 


analysts in training. 40. To foster a culture of ethical practice, organizations 

31. All therapeutic disciplines must come to terms with the can commit to providing ethics-related trainings, 
proper way to use social media, without breaching confi- support systems, and differential reinforcement of 
dentiality or ethical guidelines. ethical behaviors. 

32. The individual receiving services must be given the oppor- 41. Continuous education and discussion about ethics and the 
tunity to assist in selecting and approving treatment goals. contingencies that support analysis and assistance seeking 
Outcomes must be selected that are primarily aimed at are core features of an environment designed to maintain 
benefiting the individual receiving services. ethical behavior in an organization. 

KEY TERMS 

confidentiality ethical codes of behavior informed consent 


conflict of interest ethics 


838 Part 13 © Ethics 


a mc a SB gS SEU RNIB ESSERE EAE CR EATER RHEE ESS SSD RROOBES NNER TON 


MULTIPLE-CHOICE QUESTIONS 


1. When trying to determine the right thing to do in a given 6. What should you do if you are in a situation where the 


situation, which factors should you consider? 
a. Your own personal history in similar circumstances 
b. Your professional training and experience 
c. The rules of the local setting 
d 


. Ethical guidelines published by your professional 
organization 


e. All of these 
Hint: (See “What Is Ethics and Why Is It Important?”’) 


. “Ethics” encompasses: 
a. The primary goal is of furthering the welfare of the client 


b. Behaviors and decisions that revolve around the “right 
thing to do” 


c. Following the rules of professional codes and standards 
d. What is morally right 

e. All of these 

Hint: (See “What Is Ethics and Why Is It Important?”’) 

. Which of the following organizations does not have ethical 


guidelines, standards, or position statements that behavior 
analysts should follow, according to your text? 


a. American Psychological Association 

b. Association for Behavior Analysis 

c. Association for Direct Instruction 

d. Behavior Analyst Certification Board 

Hint: (See “Standards of Professional Practice for Applied 
Behavior Analysts”’) 

. Which of the following is not a right to effective education, as 
delineated by the Association for Behavior Analysis in 1990? 


a. Social and physical school environments that encourage 
and maintain academic achievement and progress 


b. Curriculum and objectives based on empirically validated 
hierarchies or sequences of instructional objectives 


c. Financial consequences for the school and its performance 
are based on subjective measures by the school board 


d. Specific mastery criteria that include both the accuracy 
and speed dimensions of fluent performance 


Hint: (See “Standards of Professional Practice for Applied 
Behavior Analysts’’) 

. Which of these does not help individuals gain professional 
competence in their field? 

a. Formal coursework 

. Supervised practica 

. Mentored professional experience 


- Attending nonaccredited workshops and training 


Oo. Gr Ge Gs: 


- Successfully completing accredited university training 
programs 


Hint: (See “Ensuring Professional Competence’’) 


demands of the situation exceed your training? 


a. Reassure the clients that you are a qualified behavior 
analyst and continue with the intervention you have in 
place. 


b. Tell the clients that you are not qualified to provide 
treatment and walk away. 


c. Seek additional training in the area of competence so as 
to improve your skills. 


d. Make a referral to another behavior analyst or consultant. 
e. The third and fourth answer choices are correct. 


Hint: (See “Ensuring Professional Competence’) 


. Joan is a behavior analyst providing services to a child 


with autism. She has been having a difficult time with 
the child’s family. She is out for coffee one day, when 
her cell phone rings. It is her mother, who asks how 
things are going. She tells her mother about her woes 
with this family, explaining that the intervention she 
has implemented doesn’t seem to be working and that 
she is having trouble with the parents, who seem to be 
questioning her skills. Her mother, who 1s a retired teacher 
(not a behavior analyst), asks, “Is this the same family 
that you had trouble with last year?” Joan replies, “No, 
this is a different family. This is the Jacksons. They are 
very wealthy and live on the upper side of town. You 
remember them. You met them at a school board meeting 
once.” Consider Joan’s behavior relative to issues of 
confidentiality. Which of the following statements is true? 
a. Joan committed a breach of confidentiality 
unintentionally, so this breech of confidentiality is 
not of concern. 


b. Joan committed a breach of confidentiality intentionally, 
and this is of great concern. 

c. Joan committed a breach of confidentiality intentionally, 
but it is not of great concern because her goal was to serve 
the greater good and avoid future problems for this child. 

d. No breach of confidentiality occurred because Joan did 
not give out any personally identifiable information. 


Hint: (See “Ethical Issues in Client Services”) 


- Michael is a behavior analyst working at an adult day 


treatment program, where the staff always involves their 
clients in planning their day programs. Today, Michael 

is holding a meeting with one of his clients, Dan, who 

has moderate mental disabilities and has been having 
trouble with his behavior during work periods. Dan has 
been getting up partway through the day and running out 
of the room, screaming. While he is out of the room, he 
often tears papers off the wall and destroys the agency’s 
property. The team is meeting with Dan today to determine 
how his intervention will be revised. Michael is proposing 
that a response cost intervention be imposed, where Dan 
will lose some of the money he has earned for every minute 
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he is out of the room. Dan is opposed to this intervention 
because he doesn’t want to be fined. Michael and the team 
say to Dan, “But this intervention is trying to help you. I 
guess if you don’t want to do this, we will have to revise 


9. Which of these are important considerations in deciding 
whether or not an intervention is likely to be successful? 


a. Is the client willing to participate? 


Rotminterveuticn so thar you'don't carn ae much money. b. Has the behavior been successfully treated in the research 
We need to pay for all the damage you have caused.” After pier 

discussing some other options that the team didn’t feel ¢. Is public support likely? 

would be as effective, Dan says he’d like to think about it, d. Will those involved in the treatment have control over the 


but his bus just pulled up and he needs to leave for the day. critical contingencies? 
The team tells him this needs to be decided today, or he e. All of these 
cannot return to the facility tomorrow. What ethical issue 


est {Rea parse etna Ertan Hint: (See “Advocating for the Client’) 
a. They have implemented intervention and assessment 10. What is a conflict of interest? 
without Dan’s consent. a. When a behavior analyst, either alone or in tandem with 
b. They are attempting to coerce Dan into the intervention, a client, family, friends or associates of the client has a 
interfering with his right to voluntarily participate in the vested interest in the outcome of the interaction. 
intervention. b. When all members of a team have a similar treatment 
c. They have breeched confidentiality by sharing this goal for an individual. 
information with Dan and staff. c. When one member of a team has a negative history with 
d. Michael has not shared all of the relevant treatment the behavior analyst and the two do not get along well. 
information with Dan so that he can make an informed d. When parents hire a behavior analyst to consult with their 
decision. child and to provide recommendations to a local school. 
Hint: (See “Ethical Issues in Client Services”) Hint: (See “Advocating for the Client”) 
ESSAY-TYPE QUESTIONS 
1. Why are ethics important? Specifically, what do ethical 4. Give an example of a situation where you might provide 
principles do to help guide the practitioner? treatment without consent from either an individual or a 
Hint: (See “What Is Ethics and Why Is It Important?”) guardian. 


: f Ey FESS . k Hint: (See “Ethical Issues in Client Services”) 
2. Explain the role of professional organizations in creating 


and enforcing professional ethical codes. 5. Prior to initiating treatment, the behavior analyst must take 
Hint: (See “Standards of Professional Practice for Applied into account a number of important considerations. List 
Behavior Analysts”) those considerations. 


ax “sri Hint: (See “Ad ting for the Client” 
3. Give an example of individuals practicing within their aa aie SOE eee ee 


area of competence and an example of persons practicing 
outside their area of competence. 


Hint: (See “Ensuring Professional Competence”) 


nn 


NOTES 

1. The Behavior Analyst Certification Board’s (BACB’s) Professional and 2. For a list of colleges and universities with graduate programs in behavior 
Ethical Compliance Code for Behavior Analysts (the “Code”) consolidates, analysis, see the ABAI’s Behavior Analysis Training Directory \https://www 
updates, and replaces the BACB’s Professional Disciplinary and Ethical abainternational.org/constituents/educators/training-directory.aspx]. 
Standards and Guidelines for Responsible Conduct for Behavior Analysts. 3. Information on the BACB’s certification requirements and process is 
The Compliance Code includes 10 sections relevant to professional and eth- available at: www.BACB.com. 
ical behavior of behavior analysts, along with a glossary of terms. Effective 4. Readers interested in early literature on risk-benefit analysis should see 
January 1, 2016, all BACB applicants, certificants, and registrants will be Spreat (1982) and Axelrod, Spreat, Berry, & Moyer (1993). 


required to adhere to The Compliance Code. Source: www.BACB.com. 


Epilogue 


From its earliest days, the story of applied behavior analysis has been one of continuous advancements 
in understanding how behavior works and how to change it for the better. Today’s behavior analysts 
help more people learn new skills, achieve performance goals, and replace problem behaviors with 
healthy alternatives in a wider range of settings and circumstances than ever before. We are confident 
tomorrow’s behavior analysts will be even more skillful and effective in improving the quality of 
people’s lives. 

Some of you will earn advanced degrees in behavior analysis. Your research, writing, and teaching 
will expand the knowledge base and practice of the science. Many of you will become professionals 
whose practice is informed by applied behavior analysis. Your skillful and ethical use of behavioral 
interventions will enhance the quality of the lives you touch. We hope all of you will bring what you 
have learned about behavior to your everyday lives. 

We cannot express our perspective on the potential of, and necessity for, applied behavior analysis 
to contribute to a healthier and happier human experience more eloquently than did Ellen Reese (1966) 
more than 50 years ago: 


The pertinent question is how to use our increasing knowledge of behavior to best advantage. . . . It 
is a question that concerns all of us, as all of us exercise control over our families and friends, and 
all of us delegate control to professionals in education, government, and law. We can close our eyes 
to the techniques of behavior control and risk everything, or we can use these techniques effectively 
to create the kind of world we want to live in. The latter course requires that we keep informed. 
Fortunately, the science of behavior is neither esoteric nor obscure. It is a young science, an exciting 
one, and an important one; for surely a proper study of mankind is his behavior. (p. 63) 


Applied behavior analysis is still a young, exciting, and vital science. Given humanity’s current challenges, 
its potential for good is more important than ever before. 

Thank you for reading our book. We hope you will continue learning about applied behavior 
analysis and use your growing knowledge to help make the world a better place. These endeavors are 
worthy of your talent, time, and effort. 
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GLOSSARY 


A 


A-B design A two-phase experimental design consisting of a pretreatment 
baseline condition (A) followed by a treatment condition (B). 


A-B-A design A three-phase experimental design consisting of an initial 
baseline phase (A) until steady state responding (or counter-therapeutic trend) 
is obtained, an intervention phase in which the treatment condition (B) is imple- 
mented until the behavior has changed and steady state responding is obtained, 
and a return to baseline conditions (A) by withdrawing the independent variable 
to see whether responding “reverses” to levels observed in the initial baseline 
phase. (See also A-B-A-B design, reversal design, and withdrawal design.) 


A-B-A-B design An experimental design consisting of (1) an initial 
baseline phase (A) until steady state responding (or counter-therapeutic trend) 
is obtained, (2) an initial intervention phase in which the treatment variable 
(B) is implemented until the behavior has changed and steady state responding is 
obtained, (3) a return to baseline conditions (A) by withdrawing the independent 
variable to see whether responding “reverses” to levels observed in the initial 
baseline phase, and (4) a second intervention phase (B) to see whether initial 
treatment effects are replicated. (Also called reversal design and withdrawal 
design.) 


abative effect (of a motivating operation) A decrease in the current 
frequency of behavior that has been reinforced by some stimulus, object, or 
event whose reinforcing effectiveness depends on the same motivating opera- 
tion. For example, food ingestion abates (decreases the current frequency of) 
behavior such as opening the fridge that has been reinforced by food. 


ABC recording See anecdotal observation. 


abolishing operation (AO) A motivating operation that decreases the rein- 
forcing effectiveness of a stimulus, object, or event. For example, the reinforcing 
effectiveness of food is abolished as a result of food ingestion. 


acceptance and commitment therapy (ACT) An evidence-based 
behavior therapy focusing on general well-being, defined as making reliable 
contact with high-priority positive reinforcers. 


accuracy (of measurement) The extent to which observed values, the data 
produced by measuring an event, match the true state, or true values, of the event 
as it exists in nature. (See also observed value and true value.) 


adapted alternating treatments design A variation of the multiclement 
design for comparing the efficiency of instructional procedures. The comparison 
phase of the design features the alternating application of two (usually) or more 
different teaching methods, each method applied to different but equivalent sets 
of instructional items. All items are topographically different members of the 
same response or skill class, such as reading printed words, defining vocabulary 
terms, spelling words, answering math problems, and stating history facts. 


add-in component analysis A method for conducting a component analysis 
in which components are assessed individually or in combination before the 
complete treatment package is presented. The add-in method can identify suf- 
ficient components. Sequence and floor or ceiling effects may mask the effects 
of components added in toward the end of the analysis. 


adjunctive behavior Behavior that occurs as a collateral effect of a schedule 
of periodic reinforcement for other behavior; time-filling or interim activities 
(e.g., doodling, idle talking, smoking, drinking) that are induced by schedules 
of reinforcement during times when reinforcement is unlikely to be delivered. 
(Also called schedule-induced behavior.) 


affirmation of the consequent A three-step form of reasoning that begins 
with a true antecedent-consequent (if-A-then-B) statement and proceeds as 
follows: (1) If A is true, then B is true; (2) B is found to be true; (3) therefore, 
A is true. Although other factors could be responsible for the truthfulness of A, 
a sound experiment affirms several if-A-then-B possibilities, each one reducing 
the likelihood of factors other than the independent variable being responsible 
for the observed changes in behavior. 


alternating treatments design See multielement design. 


alternative schedule (alt) Provides reinforcement when the response 


requirements of any of two or more simultaneously available component 
schedules are met. 


anecdotal observation A form of direct, continuous observation in 
which the observer records a descriptive, temporally sequenced account of all 
behavior(s) of interest and the antecedent conditions and consequences for those 


behaviors as those events occur in the client’s natural environment. (Also called 
ABC recording.) 


antecedent An environmental condition or stimulus change existing or occur- 
ring prior to a behavior of interest. 


antecedent exercise An antecedent intervention, implemented independently 
of occurrences of the problem behavior, that usually has clients engage in some 
effortful form of aerobic activity (e.g., walking, jogging, dancing, calisthenics, 
roller skating). Applied behavior analysts have used antecedent exercise in the 
treatment of many maladaptive behaviors such as self-injurious behavior (SIB), 
aggression, and diverse behaviors such as inappropriate vocalizations, repetitive 
movements, talking-out, out-of-seat, and stereotypic behaviors. 


antecedent intervention A behavior change strategy that manipulates 
antecedent stimuli based on (a) motivating operations (evocative and abative 
effects), (b) stimulus control (differential availability of reinforcement), and 
(c) contingency-independent interventions (e.g., protective equipment, and 
restraint). (See also functional communication training, high-probability 
request sequence, and noncontingent reinforcement. Contrast with antecedent 
control, a behavior change intervention that manipulates contingency-dependent 
consequence events to affect stimulus control.) 


antecedent stimulus class A set of stimuli that share a common relationship. 
All stimuli in an antecedent stimulus class evoke the same operant behavior, or 
elicit the same respondent behavior. (See also arbitrary stimulus class and 
feature stimulus class.) 


applied behavior analysis (ABA) The science in which tactics derived 
from the principles of behavior are applied to improve socially significant 
behavior and experimentation is used to identify the variables responsible for 
the improvement in behavior. 


arbitrarily applicable relational responding (AARR) Forming new 
stimulus classes with little or no reinforced practice. 


arbitrary relations Stimuli to which people respond in interlocked ways, 
not because of physical similarity, but because social-verbal reinforcement 
contingencies teach people to respond to them in this way. 


arbitrary stimulus class Antecedent stimuli that evoke the same response 
but do not resemble each other in physical form or share a relational aspect such 
as bigger or under (e.g., peanuts, cheese, coconut milk, and chicken breasts are 
members of an arbitrary stimulus class if they evoke the response “sources of 
protein”). (Compare to feature stimulus class.) 


artifact An outcome or result that appears to exist because of the way it is 
measured but in fact does not correspond to what actually occurred. 


ascending baseline A data path that shows an increasing trend in the 
response measure over time. (Compare to descending baseline.) 


audience Anyone who functions as a discriminative stimulus evoking verbal 
behavior. Different audiences may control different verbal behavior about the 
same topic because of a differential reinforcement history. Teens may describe 
the same event in different ways when talking to peers versus parents. 


autoclitic The autoclitic relation involves two interlocking levels of verbal 
behavior emitted in one utterance. One level is a primary response (¢.g., “The ice 
is solid’), while the other type is the secondary autoclitic response (e.g., adding 
“I think”). Autoclitic behavior benefits the listener by providing additional 
information regarding the primary response. 


autoclitic frame Provide structure among verbal operants in terms of order, 
agreement, grouping, and composition of larger units of verbal behavior such 
as sentences (see Palmer, 2016). Autoclitic frames help speakers generate novel 
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utterances. For example, experience with the frame “the girl’s (ball, dog, coat) 
may enable a child to say “the girl’s hat” without prior teaching. 


autoclitic mand An autoclitic mand involves supplemental control by an 
MO related to some aspect of the speaker’s primary verbal response. The auto- 
clitic mand (e.g., “Believe me, they are wrong.”) enjoins a listener to take some 
specific action regarding the primary response, and the autoclitic behavior is 
reinforced by that action. 


autoclitic tact An autoclitic tact is controlled by some nonverbal feature of 
the primary response or its controlling variables, and the autoclitic response 
informs the listener of that feature. This behavior is reinforced by listeners who 
provide generalized reinforcement. 


automatic contingencies Skinner (1957) used “automatic” to identify cir- 
cumstances in which behavior is evoked, shaped, maintained, or weakened by 
environmental variables occurring without direct manipulation by other people. 
All behavioral principles (e.g., reinforcement, extinction, punishment) can affect 
our behavior automatically. 


automatic punishment Punishment that occurs independent of the social 
mediation by others (1.e., a response product serves as a punisher independent 
of the social environment). 


automatic reinforcement Reinforcement that occurs independent of the 
social mediation of others (e.g., scratching an insect bite relieves the itch). 


automaticity (of reinforcement) Refers to the fact that behavior is 
modified by its consequences irrespective of the person’s awareness; a person 
does not have to recognize or verbalize the relation between her behavior and 
a reinforcing consequence, or even know that a consequence has occurred, for 
reinforcement to “work.” (Contrast with automatic reinforcement.) 


aversive stimulus In general, an unpleasant or noxious stimulus; more 
technically, a stimulus change or condition that functions (a) to evoke a behavior 
that has terminated it in the past, (b) as a punisher when presented following 
behavior, and/or (c) as a reinforcer when withdrawn following behavior. 


avoidance contingency A contingency in which a response prevents or 
postpones the presentation of a stimulus. (Compare to escape contingency.) 


B-A-B design A three-phase experimental design that begins with the treat- 
ment condition. After steady state responding has been obtained during the ini- 
tial treatment phase (B), the treatment variable is withdrawn (A) to see whether 
responding changes in the absence of the independent variable. The treatment 
variable is then reintroduced (B) in an attempt to recapture the level of respond- 
ing obtained during the first treatment phase. 


backup reinforcers Preferred items, activities, or privileges that participants 
obtain by exchanging earned tokens in a token economy. 


backward chaining A teaching procedure in which a trainer completes all 
but the last behavior in a chain, which is performed by the learner, who then 
receives reinforcement for completing the chain. When the learner shows com- 
petence in performing the final step in the chain, the trainer performs all but the 
last two behaviors in the chain, the learner emits the final two steps to complete 
the chain, and reinforcement is delivered. This sequence is continued until the 
learner completes the entire chain independently. 


backward chaining with leaps ahead A backward chaining procedure in 
which some steps in the task analysis are skipped; used to increase the efficiency 
of teaching long behavior chains when there is evidence that the skipped steps 
are in the learner’s repertoire. 


bar graph A simple and versatile graphic format for summarizing behavioral 
data; shares most of the line graph’s features except that it does not have distinct 
data points representing successive response measures through time. (Also 
called a histogram.) 


baseline A condition of an experiment in which the independent variable is not 
present; data obtained during baseline are the basis for determining the effects of 
the independent variable; a control condition that does not necessarily mean the 
absence of instruction or treatment, only the absence of a specific independent 
variable of experimental interest. 


baseline logic A term sometimes used to refer to the experimental reasoning 
inherent in single-subject experimental designs; entails three elements: 
prediction, verification, and replication. (See also steady state strategy.) 


behavior That portion of an organism’s interaction with its environment that 
involves movement of some part of the organism (Johnston & Pennypacker, 
2009, p. 31). (See also operant behavior, respondent behavior, response, 
and response class.) 


Behavior Analyst Certification Board’s (BACB’s) Professional 
and Ethical Compliance Code for Behavior Analysts Consolidates, 
updates, and replaces the BACB’s Professional Disciplinary and Ethical 
Standards and Guidelines for Responsible Conduct for Behavior Analysts. 
The Compliance Code includes 10 sections relevant to professional and ethical 
behavior of behavior analysts, along with a glossary of terms. Effective January 
1, 2016, all BACB applicants, certificants, and registrants will be required to 
adhere to The Compliance Code. 


behavior chain A sequence of responses in which each response produces a 
stimulus change that functions as conditioned reinforcement for that response 
and as a discriminative stimulus for the next response in the chain; reinforcement 
for the last response in a chain maintains the reinforcing effectiveness of the 
stimulus changes produced by all previous responses in the chain. 


behavior chain interruption strategy An intervention that relies on 
the participant’s skill to initially perform all the critical elements of a chain 
independently, but the chain is then interrupted, or a link in the chain is made 
unavailable at a predetermined time, so that another behavior can be prompted 
or emitted. 


behavior chain with a limited hold A contingency that specifies a time 
interval by which a behavior chain must be completed for reinforcement to be 
delivered. 


behavior change tactic A technologically consistent method for changing 
behavior derived from one or more principles of behavior (e.g., response cost is 
derived from the principle of negative punishment); possesses sufficient gener- 
ality across subjects, settings, and/or behaviors to warrant its codification and 
dissemination. 


behavior checklist An itemized list that provides descriptions of specific 
skills (usually in hierarchical order) and the conditions under which each skill 
should be observed. Some checklists are designed to assess one particular behay- 
ior or skill area. Others address multiple behaviors or skill areas. Most use a 
Likert scale to rate responses. 


behavior trap An interrelated community of contingencies of reinforcement 
that can be especially powerful, producing substantial and long-lasting behavior 
changes. Effective behavior traps share four essential features: (a) They are 
“baited” with virtually irresistible reinforcers that “lure” the student to the trap; 
(b) only a low-effort response already in the student’s repertoire is necessary to 
enter the trap; (c) once inside the trap, interrelated contingencies of reinforce- 
ment motivate the student to acquire, extend, and maintain targeted academic 
and/or social skills; and (d) they can remain effective for a long time because 
students show few, if any, satiation effects. 


behavior-altering effect (of a motivating operation) Either (a) an 
increase in the current frequency of behavior that has been reinforced by some 
stimulus, object, or event, called an evocative effect; or (b) a decrease in the 
current frequency of behavior that has been reinforced by some stimulus, 
object, or event, called an abative effect. For example, the current frequency 
of behavior that has been reinforced with food, such as opening the fridge, is 
evoked (increased) or abated (decreased) by food deprivation or food ingestion, 
respectively. 


behavioral assessment A form of assessment that includes indirect and 
direct procedures such as interviews, checklists, and tests to identify and define 
the specific target behavior. In addition to identifying behavior(s) to change, 
comprehensive behavioral assessment can uncover functional relations between 
variables; it provides context on the resources, assets, significant others, compet- 
ing contingencies, maintenance and generalization factors, and reinforcers (or 
punishers) that can be combined to improve the efficiency of an intervention. 


behavioral contract See contingency contract. 


behavioral contrast The phenomenon in which a change in one component 
of a multiple schedule that increases or decreases the rate of responding on that 


component is accompanied by a change in the response rate in the opposite 
direction on the other, unaltered component of the schedule. 


behavioral cusp A behavior that has sudden and dramatic consequences 
that extend well beyond the idiosyncratic change itself because it exposes the 


person to new environments, reinforcers, contingencies, responses, and stimulus 
controls. (See also pivotal behavior.) 


behavioral inflexibility An insensitivity to external stimuli occurring 
when private events interfere with well-being behaviors on which high-priority 
positive reinforcers are contingent. 


behavioral momentum Describes the resistance to change in a behavior’s 
rate of responding following an alteration in reinforcement conditions. The 
momentum metaphor has also been used to describe the effects produced by the 
high-probability (high-p) request sequence. 


behaviorism The philosophy of a science of behavior; there are various 


forms of behaviorism. (See also methodological behaviorism and radical 
behaviorism. ) 


believability The extent to which the researcher convinces herself and others 
that the data are trustworthy and deserve interpretation. Measures of interob- 
server agreement (IOA) are the most often used index of believability in applied 
behavior analysis. (See also interobserver agreement [IOA].) 


bidirectional naming (BiN) A higher-order verbal cusp consisting of the 
fusing together of the speaker and listener repertoires in bidirectional relations 
(Horne & Lowe, 1996). A new word acquired as listener can generate a tact 
without further training, and a new word acquired as a tact can generate a listener 
relation without further training (these effects are consistent with emergent 
symmetry and mutual entailment). 


bonus response cost A procedure for implementing response cost in which 
the person is provided a reservoir of reinforcers that are removed in predeter- 
mined amounts contingent on the occurrence of the target behavior. 


brief functional analysis An analysis in which only one or two 5- to 10-min 
sessions are conducted for each condition. A convincing demonstration of func- 
tion may be achieved by either alternating a condition that produces problem 
behavior with one that does not or conducting a contingency reversal. Brief 
functional analysis may reveal a functional relation in fewer sessions than a full 
functional analysis. 


C 


calibration The process of comparing the data produced by a measurement 
system to a known standard or true value and, when sources of error are found, 
using that information to correct or improve the measurement system. 


causal relations If-then relationships (e.g., if A, then B; if B, then C) that are 
a central feature of understanding and doing science. With respect to stimulus 
relations, causal relations can define the structure of a stimulus class or define 
the behavior function through which stimuli in a class are transformed. 


celeration The change (acceleration or deceleration) in rate of responding 
over time; based on count per unit of time (rate); expressed as a factor by which 
responding is accelerating or decelerating (multiplying or dividing); displayed 
with a trend line on a Standard Celeration Chart. Celeration is a generic term 
without specific reference to accelerating or decelerating rates of response. (See 
also Standard Celeration Chart.) 


celeration time period A unit of time (e.g., per week, per month) in which 
celeration is plotted on a Standard Celeration Chart. (See also celeration and 
celeration trend line.) 


celeration trend line The celeration trend line is measured as a factor by 
which rate multiplies or divides across the celeration time periods (e.g., rate per 
week, rate per month, rate per year, and rate per decade). (See also celeration.) 


chained schedule A schedule of reinforcement in which the response 
requirements of two or more basic schedules must be met in a specific sequence 
before reinforcement is delivered; a discriminative stimulus is correlated with 
each component of the schedule. 

chaining Various methods for linking specific sequences of stimuli and 
responses to form new performances. (See also backward chaining, backward 
chaining with leaps ahead, behavior chain, and forward chaining.) 
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changing criterion design An experimental design in which an initial base- 
line phase is followed by a series of treatment phases consisting of successive 
and gradually changing criteria for reinforcement or punishment. Experimental 


control is evidenced by the extent the level of responding changes to conform 
to each new criterion. 


class expansion A new member is added to a demonstrated stimulus equiva- 
lence class as the result of teaching a new conditional discrimination. 


class merger Independent equivalence classes are combined as the result of 
teaching a new but interrelated conditional discrimination. 


class-specific reinforcement A match-to-sample procedure in which not 
only is the correct comparison choice conditional on the sample stimulus, but the 
type of consequence delivered is, too; class-specific consequences themselves 
become members of the equivalence classes. 


clicker training A term popularized by Pryor (1999) for shaping behavior 
using conditioned reinforcement in the form of an auditory stimulus. A handheld 
device produces a click sound when pressed. The trainer pairs other forms of 
reinforcement (e.g., edible treats) with the click sound so that the sound becomes 
a conditioned reinforcer. 


codic A type of verbal behavior where the form of the response is under the 
functional control of a verbal stimulus with point-to-point correspondence, but 
without formal similarity. There is also a history of generalized reinforcement. 


combinatorial entailment A relation involving two stimuli that both 
participate in mutual entailment with some common third stimulus (See also 
mutual entailment). 


Compliance Code See the Behavior Analyst Certification Board’s 
(BACB’s) Professional and Ethical Compliance Code for Behavior Analysts. 


component analysis Any experiment designed to identify the active ele- 
ments of a treatment condition, the relative contributions of different variables 
in a treatment package, and/or the necessary and sufficient components of an 
intervention. Component analyses take many forms, but the basic strategy is to 
compare levels of responding across successive phases in which the intervention 
is implemented with one or more components left out. 


compound schedule A schedule of reinforcement consisting of two or more 
elements of continuous reinforcement (CRF), the four intermittent schedules of 
reinforcement (FR, VR, FI, VI), differential reinforcement of various rates of 
responding (DRH, DRL), and extinction. The elements from these basic sched- 
ules can occur successively or simultaneously and with or without discriminative 
stimuli; reinforcement may be contingent on meeting the requirements of each 
element of the schedule independently or in combination with all elements. 


compound verbal discrimination Involves two or more verbal SPs 
(convergent multiple control) that each independently evoke behavior, but when 
they both occur in the same antecedent configuration, a different SP is generated, 
and a more specific behavior is evoked. 


concept A stimulus class whose members share a common set of features. 


concept formation A complex example of stimulus control that requires stimulus 
generalization within a class of stimuli and discrimination between classes of stimuli. 


concurrent chains design An experimental design in which participants 
are presented with two or more response options; each option is associated with 
4 distinctive discriminative stimulus and leads to a different set of treatment 
procedures. (Also called concurrent schedule design.) 


concurrent schedule (conc) A schedule of reinforcement in which two 
or more contingencies of reinforcement (elements) operate independently and 
simultaneously for two or more behaviors. 


conditional discrimination Performance in a match-to-sample procedure 
in which discrimination between the comparison stimuli is conditional on, or 
depends on, the sample stimulus present on each trial. 


conditional probability The likelihood that a target behavior will occur ina 
given circumstance; computed by calculating (a) the proportion of occurrences 
of behavior that were preceded by a specific antecedent variable and (b) the 
proportion of occurrences of problem behavior that were followed by a specific 
consequence. Conditional probabilities range from 0.0 to 1.0; the closer the con- 
ditional probability is to 1.0, the stronger the relationship is between the target 
behavior and the antecedent/consequence variables. 
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conditioned motivating operation (CMO) A motivating operation 
whose value-altering effect depends on a learning history. For example, because 
of the relation between locked doors and keys, having to open a locked door is 
a CMO that makes keys more effective as reinforcers, and evokes behavior that 
has obtained such keys. 


conditioned negative reinforcer A previously neutral stimulus change 
that functions as a negative reinforcer because of prior pairing with one or more 
negative reinforcers. (See also negative reinforcer; compare to unconditioned 
negative reinforcer.) 


conditioned punisher A previously neutral stimulus change that func- 
tions as a punisher because of prior pairing with one or more other punishers. 
(Sometimes called secondary or learned punisher; compare to unconditioned 
punisher.) 


conditioned reflex A learned stimulus—response functional relation consist- 
ing of an antecedent stimulus (e.g., sound of refrigerator door opening) and 
the response it elicits (e.g., salivation); each person’s repertoire of conditioned 
reflexes is the product of his or her history of interactions with the environment 
(ontogeny). (See also respondent conditioning and unconditioned reflex.) 


conditioned reinforcer A stimulus change that functions as a reinforcer 
because of prior pairing with one or more other reinforcers. (Sometimes called 
secondary or learned reinforcer.) 


conditioned stimulus (CS) The stimulus component of a conditioned 
reflex; a formerly neutral stimulus change that elicits respondent behavior only 
after it has been paired with an unconditioned stimulus (US) or another CS. 


confidentiality Describes a situation of trust insofar as any information regard- 
ing a person receiving or having received services may not be discussed with or oth- 
erwise made available to another person or group, unless that person has provided 
explicit authorization, usually written and signed, for release of such information. 


conflict of interest A situation in which a person in a position of responsibil- 
ity or trust has competing professional or personal interests that make it difficult 
to fulfill his or her duties impartially. 


confounding variable An uncontrolled factor known or suspected to exert 
influence on the dependent variable. 


conjunctive schedule (conj) A schedule of reinforcement that is in effect 
whenever reinforcement follows the completion of response requirements for 
two or more schedules of reinforcement. 


consequence A stimulus change that follows a behavior of interest. Some 
consequences, especially those that are immediate and relevant to current moti- 
vational states, have significant influence on future behavior; others have little 
effect. (See also punisher and reinforcer.) 


constant time delay A procedure for transferring stimulus control from 
contrived response prompts to naturally existing stimuli. After the student has 
responded correctly to several 0-sec delay trials, after which presentation of the 
response prompt follows the instructional stimulus by a predetermined and fixed 
delay (usually 3 or 4 seconds) for all subsequent trials. (See also progressive 
time delay). 


contextual control The situation or context in which a stimulus (or stimulus 
class) occurs determines its function. More specifically: a type of stimulus con- 
trol requiring three levels of antecedent stimuli, such that the functions of the 
stimuli in a conditional discrimination vary depending on the context. Contextual 
control training requires a five-term contingency. It allows for the same stimuli 
to be members of more than one equivalence class, depending on the context. 


contextual stimulus Signals the type of relational responding that will be 
reinforced. 


contingency Refers to dependent and/or temporal relations between operant 
behavior and its controlling variables. (See also contingent and three-term 
contingency.) 


contingency contract A mutually agreed-upon document between parties 
(e.g., parent and child) describing a contingent relationship between the comple- 
tion of specified behavior(s) and access to specified reinforcer(s). (Also called 
behavioral contract.) 


contingency reversal Exchanging the reinforcement contingencies for 
two topographically different responses. For example, if Behavior A results in 


reinforcement on an FR 1 schedule of reinforcement and Behavior B results in 
reinforcement being withheld (extinction), a contingency reversal consists of 
changing the contingencies such that Behavior A now results in extinction and 
Behavior B results in reinforcement on an FR 1 schedule. 


contingency-shaped behavior Behavior acquired by direct experience 
with contingencies. 


contingent Describes reinforcement (or punishment) that is delivered only 
after the target behavior has occurred. 


contingency space analysis A graphic display of the probability of one 
event (e.g., a particular consequence) given the occurrence (or not) of another 
event (e.g., the presence or absence of a particular behavior). Contingencies are 
considered positive (i.e., a specific consequence is more likely to occur), nega- 
tive (i.e., a specific consequence is less likely to occur), or neutral (i.e., a specific 
consequence is neither more or less likely to occur). 


contingent observation A procedure for implementing time-out in which 
the person is repositioned within an existing setting such that observation of 
ongoing activities remains, but access to reinforcement is lost. 


continuous measurement Measurement conducted in a manner such 
that all instances of the response class(es) of interest are detected during the 
observation period. 


continuous reinforcement (CRF) A schedule of reinforcement that 
provides reinforcement for each occurrence of the target behavior. 


contrived contingency Any contingency of reinforcement (or punishment) 
designed and implemented by a behavior analyst or practitioner to achieve the 
acquisition, maintenance, and/or generalization of a targeted behavior change. 
(Contrast with naturally existing contingency.) 


contrived mediating stimulus Any stimulus made functional for the target 
behavior in the instructional setiing that later prompts or aids the learner in 
performing the target behavior in a generalization setting. 


copying text An elementary verbal operant involving a written response that 
is evoked by a written verbal discriminative stimulus that has formal similarity 
and a history of generalized reinforcement. 


count A simple tally of the number of occurrences of a behavior. The obser- 
vation period, or counting time, should always be noted when reporting count 
measures. 


countercontrol Behavior evoked by coercion or aversive forms of external 
control; takes many forms, including escape, attack, passive resistance; “an emo- 
tional reaction of anger or frustration including operant behavior which injures 
or is otherwise aversive to the controller” (Skinner, 1953, p. 321). 


counting time The period of time in which a count of the number of responses 
emitted is recorded. 


cumulative record A type of graph on which the cumulative number of 
responses emitted is represented on the vertical axis; the steeper the slope of the 
data path, the greater the response rate. 


cumulative recorder A device that automatically draws cumulative records 
(graphs) that show the rate of response in real time; each time a response is 
emitted, a pen moves upward across paper that continuously moves at a constant 
speed. 


D 


data The results of measurement, usually in quantifiable form; in applied 
behavior analysis, it refers to measures of some quantifiable dimension of a 
behavior. The word data is the plural form of datum. 


data path The level and trend of behavior between successive data points; 
created by drawing a straight line from the center of each data point in a given 
data set to the center of the next data point in the same set. 


deictic relation A relation between the self, as one stimulus, and other stimuli 
from the external world. 


delay discounting A phenomenon in which delayed rewards, regardless of 
their significance and magnitude (e.g., enough money for a secure retirement), 
exert decreasing influence over choice-making behavior as a function of their 
temporal distance from present circumstances. Both humans and nonhuman 


laboratory animals discount the value of delayed rewards; the greater the delay 
to the reward, the greater the discount (i-e., the less value or influence the reward 
has on current behavior). (Also called temporal discounting.) 


delayed multiple baseline design A variation of the multiple base- 
line design in which an initial baseline, and perhaps intervention, begin for 
one behavior (or setting, or subject), and subsequent baselines for additional 
behaviors begin in a staggered or delayed fashion. 


dependent group contingency A contingency in which reinforcement for 
all members of a group is dependent on the behavior of one member of the group 
or the behavior of a select group of members within the larger group. 


dependent variable The measured behavior in an experiment to determine 
if it changes as a result of manipulations of the independent variable; in applied 
behavior analysis, it represents some measure of a socially significant behavior. 
(See also target behavior; compare to independent variable.) 


deprivation The state of an organism with respect to how much time has 
elapsed since it has consumed or contacted a particular type of reinforcer; also 
refers to a procedure for increasing the effectiveness of a reinforcer (e.g., with- 
holding a person’s access to a reinforcer for a specified period prior to a session). 
(See also motivating operation [MO]; contrast with satiation.) 


derived stimulus relations Responding indicating a relation (e.g., same 
as, opposite, different from, better than) between two or more stimuli that 
emerges as an indirect function of related instruction or experience. (Also called 
emergent stimulus relations.) 


descending baseline A data path that shows a decreasing trend in the 
response measure over time. (Compare to ascending baseline.) 


descriptive functional behavior assessment Direct observation of 
problem behavior and the antecedent and consequent events under naturally 
occurring conditions. 


determinism The assumption that the universe is a lawful and orderly place 
in which phenomena occur in relation to other events and not in a willy-nilly, 
accidental fashion. 


differential reinforcement Reinforcing only those responses within a 
response class that meet a specific criterion along some dimension(s) (i.e., 
frequency, topography, duration, latency, or magnitude) and placing all other 
responses in the class on extinction. (See also differential reinforcement of 
alternative behavior, differential reinforcement of incompatible behavior, 
differential reinforcement of other behavior, stimulus discrimination train- 
ing, and shaping.) 


differential reinforcement of alternative behavior (DRA) A proce- 
dure for decreasing problem behavior in which reinforcement is delivered for 
a behavior that serves as a desirable alternative to the behavior targeted for 
reduction and withheld following instances of the problem behavior (e.g., rein- 
forcing completion of academic worksheet items when the behavior targeted 
for reduction is talk-outs). 


differential reinforcement of diminishing rates (DRD) A schedule of 
reinforcement in which reinforcement is provided at the end of a predetermined 
interval contingent on the number of responses emitted during the interval being 
fewer than a gradually decreasing criterion based on the individual’s perfor- 
mance in previous intervals (e.g., fewer than five responses per 5 minutes, fewer 
than four responses per 5 minutes, fewer than three responses per 5 minutes). 


differential reinforcement of high rates (DRH) A schedule of rein- 
forcement in which reinforcement is provided at the end of a predetermined 
interval contingent on the number of responses emitted during the interval being 
greater than a gradually increasing criterion based on the individual’s perfor- 
mance in previous intervals (e.g., more than three responses per 5 minutes, more 
than five responses per 5 minutes, more than eight responses per 5 minutes). 


differential reinforcement of incompatible behavior (DRI) A pro- 
cedure for decreasing problem behavior in which reinforcement is delivered for 
a behavior that is topographically incompatible with the behavior targeted for 
reduction and withheld following instances of the problem behavior (e.g., sitting 
in seat is incompatible with walking around the room). 


differential reinforcement of low rates (DRL) A schedule of reinforce- 
ment in which reinforcement (a) follows each occurrence of the target behavior 
that is separated from the previous response by a minimum interresponse time 
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(IRT), or (b) is contingent on the number of responses within a period of time 
not exceeding a predetermined criterion. Practitioners use DRL schedules to 
decrease the rate of behaviors that occur too frequently but should be main- 


tained in the learner’s repertoire. (See also full-session DRL, interval DRL, 
and spaced-responding DRL.) 


differential reinforcement of other behavior (DRO) A procedure 
for decreasing problem behavior in which reinforcement is contingent on the 
absence of the problem behavior during or at specific times (i.e., momentary 
DRO; sometimes called differential reinforcement of zero rates of responding 
or omission training; see also fixed-interval DRO, fixed-momentary DRO, 
variable-interval DRO, and variable-momentary DRO.) 


direct measurement Occurs when the behavior that is measured is the same 


as the behavior that is the focus of the investigation. (Contrast with indirect 
measurement.) 


direct replication An experiment in which the researcher attempts to 
duplicate exactly the conditions of an earlier experiment. 


disciplinary standards Statements describing corrective, disciplinary, 
and revocation actions, depending on the circumstances for violations of a 
professional; see the BACB Compliance Code. 


discontinuous measurement Measurement conducted in a manner such 
that some instances of the response class(es) of interest may not be detected. 


discrete trial Any operant whose response rate is controlled by a given 
opportunity to emit the response. Each discrete response occurs when an 
opportunity to respond exists. Discrete trial, restricted operant, and controlled 
operant are synonymous technical terms. (Contrast with free operant.) 


discriminated avoidance A contingency in which responding in the 
presence of a signal prevents the onset of a stimulus from which escape is a 
reinforcer, (See also discriminative stimulus, discriminated operant, free- 
operant avoidance, and stimulus control.) 


discriminated operant An operant that occurs more frequently under some 
antecedent conditions than under others. (See also discriminative stimulus [s?] 
and stimulus control.) 


discriminative stimulus (S?) A stimulus in the presence of which a given 
behavior has been reinforced and in the absence of which that behavior has 
not been reinforced; as a result of this history, an S? evokes operant behavior 
because its presence signals the availability of reinforcement (See also differ- 
ential reinforcement, stimulus control, stimulus delta (S41, and stimulus 
discrimination training.) 


discriminative stimulus for punishment (S””) A stimulus in the pres- 
ence of which a given behavior has been punished and in the absence of which 
that behavior has not been punished; as a result of this history, the behavior 
occurs less often in the presence of the SPP than in its absence. 


distinction relation Responding jointly to two stimuli on the basis of their 
differences. 


double-blind control A procedure that prevents the subject and the 
experimenter(s) from detecting the presence or absence of the treatment vari- 
able: used to eliminate confounding of results by subject expectations, parent 
and teacher expectations, differential treatment by others, and observer bias. 
(See also placebo control.) 


DRI/DRA reversal technique An experimental technique that demon- 
strates the effects of reinforcement; it uses differential reinforcement of an 
incompatible or alternative behavior (DRI/DRA) as a control condition instead 
of a no-reinforcement (baseline) condition. During the DRI/DRA condition, 
the stimulus change used as reinforcement in the reinforcement condition 
is presented contingent on occurrences of a specified behavior that is either 
incompatible with the target behavior or an alternative to the target behavior. 
A higher level of responding during the reinforcement condition than during 
the DRI/DRA condition demonstrates that the changes in behavior are the result 
of contingent reinforcement, not simply the presentation of or contact with the 
stimulus event. (Compare to DRO reversal technique and noncontingent 
reinforcement [NCR] reversal technique.) 


DRO reversal technique An experimental technique for demonstrat- 
ing the effects of reinforcement by using differential reinforcement of other 
behavior (DRO) as a control condition instead of a no-reinforcement (baseline) 
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condition. During the DRO condition, the stimulus change used as reinforce- 
ment in the reinforcement condition is presented contingent on the absence of 
the target behavior for a specified time period. A higher level of responding 
during the reinforcement condition than during the DRO condition demon- 
strates that the changes in behavior are the result of contingent reinforcement, 
not simply the presentation of or contact with the stimulus event. (Compare 
to DRI/DRA reversal technique and noncontingent reinforcement [NCR] 
reversal technique.) 


drop-out component analysis A method for conducting a component 
analysis in which the investigator presents the treatment package and then sys- 
tematically removes components. If the treatment’s effectiveness wanes when 
a component is removed, the researcher has identified a necessary component. 
(Contrast with add-in component analysis). 


duplic A type of verbal behavior where the form of the response is under the 
functional control of a verbal stimulus with formal similarity, and a history of 
generalized reinforcement. 


duration The total time that behavior occurs; measured by elapsed time from 
the onset of a response to its end point. 


E 


echoic An elementary verbal operant involving a vocal response that is evoked 
by a vocal verbal S? that has formal similarity between an auditory verbal stim- 
ulus and an auditory verbal response product, and a history of generalized 
reinforcement. 


ecological assessment An assessment protocol that acknowledges complex 
interrelationships between environment and behavior. An ecological assessment 
is a method for obtaining data across multiple settings and persons. 


elementary verbal operants Michael’s (1982) term for Skinner’s (1957) 
taxonomy of five different types of speaker behavior (i.e., expressive language) 
distinguished by their antecedent controlling variables and related history of 
consequences: mand, tact, intraverbal, duplic, and codic. (See also echoic, 
copying text, taking dictation, and textual behavior). 


emergent stimulus relations Stimulus relations that are not taught directly 
but emerge as an indirect function of related instruction or experience. (Also 
called derived stimulus relations.) 


empiricism The objective observation of the phenomena of interest; objective 
observations are “independent of the individual prejudices, tastes, and private 
opinions of the scientist. . . . Results of empirical methods are objective in that 
they are open to anyone’s observation and do not depend on the subjective belief 
of the individual scientist” (Zuriff, 1985, p. 9). 


enriched environment An intervention that provides noncontingent access 
to preferred sources of reinforcement (e.g., toys, games, social and recreation 
activities). This noncontingent access to preferred reinforcers arranges a com- 
petition between the enriched environment and the stimulation provided by the 
problem behavior. 


environment The conglomerate of real circumstances in which the organism 
or referenced part of the organism exists; behavior cannot occur in the absence 
of environment. 


environmental enrichment (EE) An antecedent intervention that provides 
noncontingent access to preferred sources of reinforcement (e.g., toys, games, 
social and recreation activities). This noncontingent access to preferred sources 
of reinforcement arranges a competition between the EE intervention and the 
stimulation provided by the problem behavior 


equal-interval scale A scale in which equal distances on the axis represent 
equal absolute amounts of the variable plotted on the axis (e.g., behavior change 
on an equal-interval vertical axis). 


equivalence test A probe for the emergence of untrained stimulus—stimulus 
relations that evaluates both symmetry and transitivity simultaneously. 


equivalence-class formation The emergence of accurate responding 
to untrained and nonreinforced stimulus—stimulus relations following the 
reinforcement of responses to some stimulus-stimulus relations. Requires 
successful performances on three types of probe trials—reflexivity, symmetry, 
and transitivity—in the absence of reinforcement. (Sometimes called stimulus 
equivalence.) 


errorless learning A variety of techniques for gradually transferring stimu- 
lus control with a minimum of errors. 


escape contingency A contingency in which a response terminates (produces 
escape from) an ongoing stimulus. (Compare to avoidance contingency.) 


escape extinction Behaviors maintained with negative reinforcement are 
placed on escape extinction when those behaviors are not followed by termina- 
tion of the aversive stimulus; emitting the target behavior does not enable the 
person to escape the aversive situation. 


establishing operation (EO) A motivating operation that increases the 
effectiveness of some stimulus, object, or event as a reinforcer. For example, 
food deprivation establishes food as an effective reinforcer. 


ethical codes of behavior Documents generated or adopted by professional 
organizations that provide clear guidelines for their members when deciding a 
course of proper action in conducting their professional duties. 


ethics Behaviors, practices, and decisions that address such basic and 
fundamental questions as the following: What is the right thing to do? What’s 
worth doing? What does it mean to be a good behavior analytic practitioner? The 
Behavior Analyst Certification Board’s (BACB’s) Professional and Ethical Com- 
pliance Code for Behavior Analysts details ethical practice for behavior analysts. 


event recording Measurement procedure for obtaining a tally or count of the 
number of times a behavior occurs. 


evocative effect (of a motivating operation) An increase in the current 
frequency of behavior that has been reinforced by some stimulus, object, or 
event whose reinforcing effectiveness depends on the same motivating opera- 
tion. For example, food deprivation evokes (increases the current frequency of) 
behavior such as opening the fridge that has been reinforced by food. 


exact count-per-intervai IOA The percentage of total intervals in which 
two observers recorded the same count; the most stringent description of IOA 
for most data sets obtained by event recording. 


exclusion (training) A procedure for building new arbitrary conditional 
discriminations based on the robust finding that learners will select a novel 
comparison stimulus over a known one in the presence of a novel sample. 


exclusion time-out A procedure for implementing time-out in which, con- 
tingent on the occurrence of a target behavior, the person is removed physically 
from the current environment for a specified period. 


experiment A carefully controlled comparison of some measure of the 
phenomenon of interest (the dependent variable) under two or more different 
conditions in which only one factor at a time (the independent variable) differs 
from one condition to another. 


experimental analysis of behavior (EAB) A natural science approach 
to the study of behavior as a subject matter in its own right founded by 
B. F. Skinner; methodological features include rate of response as a basic depen- 
dent variable, repeated or continuous measurement of clearly defined response 
classes, within-subject experimental comparisons instead of group design, visual 
analysis of graphed data instead of statistical inference, and an emphasis on 
describing functional relations between behavior and controlling variables in 
the environment over formal theory testing. 


experimental control Two meanings: (a) the outcome of an experiment that 
demonstrates convincingly a functional relation, meaning that experimental con- 
trol is achieved when a predictable change in behavior (the dependent variable) 
can be reliably produced by manipulating a specific aspect of the environment 
(the independent variable); and (b) the extent to which a researcher maintains 
precise control of the independent variable by presenting it, withdrawing it, and/ 
or varying its value, and also by eliminating or holding constant all confound- 
ing and extraneous variables. (See also confounding variable, extraneous 
variable, and independent variable.) 


experimental design The particular type and sequence of conditions in a 
study so that meaningful comparisons of the effects of the presence and absence 
(or different values) of the independent variable can be made. 


explanatory fiction A fictitious or hypothetical variable that often takes the 
form of another name for the observed phenomenon it claims to explain and 
contributes nothing to a functional account or understanding of the phenome- 
non, such as “intelligence” or “cognitive awareness” as explanations for why 


an organism pushes the lever when the light is on and food is available but does 
not push the lever when the light is off and no food is available. F 


a 


external validity The degree to which a study’s findings have generality 
to other subjects, settings, and/or behaviors. (Compare to internal validity.) 


extinction The discontinuing of a reinforcement of a previously reinforced 
behavior (i.e., responses no longer produce reinforcement); the primary effect 
is a decrease in the frequency of the behavior until it reaches a prereinforced 
level or ultimately ceases to occur. (See also extinction burst and spontaneous 
recovery; compare to respondent extinction.) 


extinction burst An increase in the frequency of responding when an extinc- 
tion procedure is initially implemented. 


extinction-induced variability Phenomenon in which diverse and novel 
forms of behavior are sometimes observed during the extinction process. 


extraneous variable Any aspect of the experimental setting (e.g., lighting, 


temperature) that must be held constant to prevent unplanned environmental 
variation. 


F 


fading A procedure for transferring stimulus control in which features of an 
antecedent stimulus (e.g., shape, size, position, color) controlling a behavior are 
gradually changed to a new stimulus while maintaining the current behavior; 
stimulus features can be faded in (enhanced) or faded out (reduced). 


feature stimulus class Stimuli that share common physical forms or 
structures (e.g., made from wood, four legs, round, blue) or common relative 
relationships (e.g., bigger than, hotter than, higher than, next to). (Compare to 
arbitrary stimulus class.) 


fixed interval (FI) A schedule of reinforcement in which reinforcement is 
delivered for the first response emitted following the passage of a fixed duration 
of time since the last response was reinforced (e.g., on an FI 3-min schedule, the 
first response following the passage of 3 minutes is reinforced). 


fixed ratio (FR) A schedule of reinforcement requiring a fixed number of 
responses for reinforcement (e.g., an FR 4 schedule of reinforcement follows 
every fourth response). 


fixed-interval DRO (FI-DRO) A DRO procedure in which reinforcement 
is available at the end of intervals of fixed duration and delivered contingent on 
the absence of the problem behavior during each interval. (See also differential 
reinforcement of other behavior [DRO].) 


fixed-momentary DRO (FM-DRO) A DRO procedure in which 
reinforcement is available at specific moments in time, which are separated 
by a fixed amount of time, and delivered contingent on the problem not 
occurring at those moments. (See also differential reinforcement of other 
behavior [DRO].) 


fixed-time schedule (FT) A schedule for the delivery of noncontingent 
stimuli in which a time interval remains the same from one delivery to the next. 


formal similarity Occurs when the controlling antecedent stimulus and the 
response or response product (a) share the same sense mode (e.g., both stimulus 
and response are visual, auditory, or tactile) and (b) physically resemble each 
other, Verbal relations with formal similarity are echoic, copying a text, and 
imitation as it relates to sign language. 


forward chaining A method for teaching behavior chains that begins with 
the learner being prompted and taught to perform the first behavior in the task 
analysis; the trainer completes the remaining steps in the chain. When the learner 
shows competence in performing the first step in the chain, he is then taught to 
perform the first two behaviors in the chain, with the training completing the 
chain. This process is continued until the learner completes the entire chain 
independently. 


fraudulent conduct Intentional, willful, and deceitful behavior; such 
behavior can cause harm to others. 


free operant Any operant behavior that results in minimal displacement of 
the participant in time and space. A free operant can be emitted at nearly any 
time: it is discrete, it requires minimal time for completion, and it can produce 
a wide range of response rates. Examples in ABA include (a) the number of 
words read during a 1-min counting period, (b) the number of hand slaps per 
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6 seconds, and (c) the number of letter strokes written in 3 minutes. (Contrast 
with discrete trial.) 


free-operant avoidance A contingency in which responses at any time 
during an interval prior to the scheduled onset of an aversive stimulus delays 


the presentation of the aversive stimulus. (Contrast with discriminated 
avoidance.) 


frequency How often a behavior occurs. Some behavior analysts use 
frequency to mean rate (a ratio of responses per standard unit of time); others 
use frequency as a synonym for count. 


full-session DRL A procedure for implementing DRL in which reinforcement 
is delivered at the end of the session if the total number of responses emitted 


during the session does not exceed a criterion limit. (See also differential 
reinforcement of low rates [DRL].) 


function-altering effect (relevant to operant relations) A change in 
an organism’s repertoire of MO, stimulus, and response relations, caused by 
reinforcement, punishment, an extinction procedure, or a recovery from pun- 
ishment procedure. Respondent function-altering effects result from the pairing 
and unpairing of antecedent stimuli. 


function-based definition Designates responses as members of the targeted 
response class solely in terms of their common effect on the environment. 


functional analysis A term with two meanings in contemporary behavior 
analysis literature. In its original and most fundamental usage, functional 
analysis denotes demonstrations of functional relations between environmental 
variables and behavior. In the context of determining the purposes (functions) 
of problem behavior for an individual, functional analysis entails experimentally 
arranging antecedents and consequences representing those in the person’s 
natural routines so that their separate effects on problem behavior can be 
observed and measured. 


functional behavior assessment (FBA) A systematic method of assessment 
for obtaining information about the purposes (functions) a problem behavior 
serves for a person; results are used to guide the design of an intervention for 
decreasing the problem behavior and increasing appropriate behavior. 


functional communication training (FCT) An antecedent intervention 
in which an appropriate communicative behavior is taught as a replacement 
behavior for problem behavior usually evoked by an establishing operation 
(EO); involves differential reinforcement of alternative behavior (DRA). 


functional relation A verbal statement summarizing the results of an exper- 
iment (or group of related experiments) that describes the occurrence of the 
phenomena under study as a function of the operation of one or more specified 
and controlled variables in the experiment in which a specific change in one 
event (the dependent variable) can be produced by manipulating another event 
(the independent variable), and that the change in the dependent variable was 
unlikely the result of other factors (confounding variables); in behavior analysis 
expressed as b = f (x1), (x2),..., where b is the behavior and x1, x2, etc., are 
environmental variables of which the behavior is a function. 


functionally equivalent Serving the same function or purpose; different 
topographies of behavior are functionally equivalent if they produce the same 
consequences. 


G 


general case analysis A systematic process for identifying and selecting 
teaching examples that represent the full range of stimulus variations and 
response requirements in the generalization setting(s). (See also multiple- 
exemplar training and teaching enough examples.) 


generalization A generic term for a variety of behavioral processes and 
behavior change outcomes. (See also generalization gradient, generalized 
behavior change, response generalization, response maintenance, setting/ 
situation generalization, and stimulus generalization.) 

generalization across subjects Changes in the behavior of people not 
directly treated by an intervention as a function of treatment contingencies 
applied to other people. 

generalization probe Any measurement of a learner’s performance of a 
target behavior in a setting and/or stimulus situation in which direct training 
has not been provided. 
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generalization setting Any place or stimulus situation that differs in some 
meaningful way from the instructional setting and in which performance of the 
target behavior is desired. (Contrast with instructional setting.) 


generalized behavior change A behavior change that has not been taught 
directly. Generalized outcomes take one, or a combination of, three primary 
forms: response maintenance, stimulus/setting generalization, and response 
generalization. (Sometimes called generalized outcome.) 


generalized conditioned reinforcer A conditioned reinforcer that as 
a result of having been paired with many other reinforcers does not depend 
on an establishing operation for any particular form of reinforcement for its 
effectiveness. 


generalized imitation A term often used when a learner imitates a wide 
variety of unprompted, untrained, non-reinforced modeled behaviors in different 
settings and situations. Generalized imitation could be considered a higher order 
response class in which participants imitate novel models without training and 
that are not predictive of reinforcement. 


generative learning A behavioral effect whereby previously acquired 
speaker and listener skills enable or accelerate the acquisition of other speaker 
and listener skills, without dependence on direct teaching or a history of 
reinforcement. 


generic (tact) extension A tact evoked by a novel stimulus that shares all of 
the relevant or defining features associated with the original stimulus. 


Good Behavior Game An interdependent group contingency in which a 
group is divided into two or more teams that compete against each other and/or 
a specified criterion. The team with fewest marks at the end of the game earns a 
reinforcer. Each team is also told that it can earn a reinforcer if it has fewer than 
a specified number of marks. 


graph A visual format for displaying data; reveals relations among and 
between a series of measurements and relevant variables. 


group contingency A contingency in which reinforcement for all mem- 
bers of a group is dependent on the behavior of (a) a person within the group, 
(b) a select group of members within the larger group, or (c) each member 
of the group meeting a performance criterion. (See also dependent group 
contingency, independent group contingency, and interdependent group 
contingency.) 


H 


habilitation Habilitation (adjustment) occurs when a person’s repertoire has 
been changed such that short- and long-term reinforcers are maximized and 
short- and long-term punishers are minimized. 


habit reversal A multiple-component treatment package for reducing 
unwanted habits such as fingernail biting and muscle tics; treatment typically 
includes self-awareness training involving response detection and procedures 
for identifying events that precede and trigger the response; competing response 
training; and motivation techniques including self-administered consequences, 
social support systems, and procedures for promoting the generalization and 
maintenance of treatment gains. 


habituation A decrease in responsiveness to repeated presentations of a 
stimulus; most often used to describe a reduction of respondent behavior as a 
function of repeated presentation of the eliciting stimulus over a short span of 
time; some researchers suggest that the concept also applies to within-session 
changes in operant behavior. 


hero procedure A term sometimes used for a dependent group contingency 
(i.e., a reward for the group is contingent upon the behavior of an individual 
group member). 


hierarchical relation A nested stimulus relation in which a category, 
subsuming multiple stimuli, is itself a member of a higher-order category 
subsuming multiple stimuli. 


high-probability (high-p) request sequence An antecedent intervention 
in which two to five easy tasks with a known history of learner compliance 
(the high-p requests) are presented in quick succession immediately before 
requesting the target task, the low-p request. (Also called interspersed requests, 
pretask requests, or behavioral momentum.) 


high-probability instructional sequence (high-p). A procedure for 
improving compliance and diminishing escape-maintained problem behaviors. 
The practitioner presents two to five easy-to-follow (high-p) instructions for 
which the participant has a history of compliance. When the learner complies 
with several high-p instructions, the practitioner immediately presents a target 
instruction (i.e., low-p). 


higher-order conditioning Development of a conditioned reflex by pairing 
of a neutral stimulus (NS) with a conditioned stimulus (CS). (Also called 


secondary conditioning.) 


higher-order operant class Behavior defined in terms of general relations 
between antecedents and responses, rather than in terms of specific stimuli 
and responses; examples include generalized imitation, manding, instruction 
following, naming, and relational framing. 


history of reinforcement An inclusive term referring in general to all 
of a person’s learning experiences and more specifically to past conditioning 
with respect to particular response classes or aspects of a person’s repertoire. 
(See also ontogeny.) 


hypothetical construct A presumed but unobserved process or entity 
(e.g., Freud’s id, ego, and superego). 


imitation Behavior that (a) is occasioned by another person’s model of the 
behavior (or symbolic representation thereof), (b) has formal similarity with 
the model, (c) follows the modeled behavior closely in time, and (d) the model 
is the primary controlling variable for the imitative behavior. An imitative 
behavior is a new behavior emitted following a novel antecedent event (i.e., 
the model). (See also formal similarity; contrast with echoic.) 


imitation training A systematic, research-based set of procedures for 
teaching a nonimitative learner to imitate models of novel behaviors. 


impure tact A verbal operant involving a response that is evoked by both 
an MO and a nonverbal stimulus; thus, the response is part mand and part tact. 
(See also mand and tact.) 


independent group contingency A contingency in which reinforcement 
for each member of a group is dependent on that person’s meeting a performance 
criterion that is in effect for all members of the group. 


independent variable The variable that is systematically manipulated by the 
researcher in an experiment to see whether changes in the independent variable 
produce reliable changes in the dependent variable. In applied behavior analysis, 
it is usually an environmental event or condition antecedent or consequent to the 
dependent variable. (Sometimes called the intervention or treatment variable; 
compare to dependent variable.) 


indirect functional assessment Structured interviews, checklists, rating 
scales, or questionnaires used to obtain information from people who are 
familiar with the person exhibiting the problem behavior (e.g., teachers, par- 
ents, caregivers, and/or the individual himself or herself); used to identify 
conditions or events in the natural environment that correlate with the problem 
behavior. 


indirect measurement Occurs when the behavior that is measured is in 
some way different from the behavior of interest; considered less valid than 
direct measurement because inferences about the relation between the data 
obtained and the actual behavior of interest are required. (Contrast with direct 
measurement.) 


indiscriminable contingency A contingency that makes it difficult for the 
learner to discriminate whether the next response will produce reinforcement. Prac- 
titioners use indiscriminable contingencies in the form of intermittent schedules 
of reinforcement and delayed rewards to promote generalized behavior change. 


informed consent Formal permission—usually written, signed, and dated— 
given by a potential recipient of behavioral services or participant in a research 
study; requires that full disclosure of all pertinent information be provided to the 
individual prior to a granting of permission. To give consent, the person must (a) 
demonstrate the capacity to decide, (b) do so voluntarily, and (c) have adequate 
knowledge of all salient aspects of the treatment. 


instructional setting The environment where instruction occurs: includes 
all aspects of the environment, planned and unplanned, that may influence the 


learner’s acquisition, maintenance, and generalization of the target behavior. 
(Contrast with generalization setting.) 


interdependent group contingency A contingency in which reinforce- 
ment for all members of a group is dependent on each member of the group 
meeting a performance criterion that is in effect for all members of the group. 


intermittent schedule of reinforcement (INT) A contingency of rein- 


forcement in which some, but not all, occurrences of the behavior produce 
reinforcement. 


internal validity The extent to which an experiment shows convincingly 
that changes in behavior are a function of the independent variable and not the 
result of uncontrolled or unknown variables. (Compare to external validity.) 


interobserver agreement (IOA) The degree to which two or more inde- 


pendent observers report the same observed values after measuring the same 
events. 


interresponse time (IRT) A measure of temporal locus, defined as the 
elapsed time between two successive responses. 


interrupted chain procedure A procedural variation of the behavior chain 
interruption strategy, entails arranging the environment such that the learner 
is unable to continue the chain at a predetermined point until responding to a 
prompt (e.g., “What do you want?”). 


interval DRL A procedure for implementing DRL in which the total session 
is divided into equal intervals and reinforcement is provided at the end of each 
interval in which the number of responses during the interval is equal to or below 
a criterion limit. (See also differential reinforcement of low rates [DRL].) 


interval-by-interval IOA An index of the agreement between obsery- 
ers for data obtained by interval recording or time sampling measurement; 
calculated for a given session or measurement period by comparing the two 
observers’ recordings of the occurrence or nonoccurrence of the behavior in 
each observation interval and dividing the number of intervals of agreement 
by the total number of intervals and multiplying by 100. (Also called the 
point-by-point or total interval IOA; compare to scored-interval LOA and 
unscored-interval IOA.) 


interview-informed synthesized contingency analysis A variation of 
functional analysis designed to increase efficiency. In the test condition, multi- 
ple contingencies are implemented simultaneously (e.g., attention and escape) 
when the problem behavior is demonstrated. In the control condition, those same 
contingencies are presented noncontingently and continuously. 


intraverbal An elementary verbal operant involving a response that is evoked 
by a verbal discriminative stimulus that does not have point-to-point correspon- 
dence with that verbal stimulus. The intraverbal is the opposite of the echoic, 
in that the words emitted by one speaker do not match the words of another 
speaker. Intraverbal behavior constitutes the basis for social interaction, conver- 
sations, and much of academic and intellectual behavior. Questions are mands, 
and answers are intraverbal. 


intraverbal control Some verbal stimuli only supplement other more critical 
antecedents, but nonetheless they play a causal role in evoking behavior. Palmer 
(2016) recommends “In cases in which the verbal antecedent is, by itself, insuf- 
ficient to evoke the relevant response, we should speak of ‘intraverbal control,’ 
usually as one of a number of concurrent controlling variables” (p. 99). 


irreversibility A situation that occurs when the level of responding observed 
in a previous phase cannot be reproduced even though the experimental condi- 
tions are the same as they were during the earlier phase. 


J 


joint control A phenomenon in which two separate, but interrelated forms 
of a person’s own verbal behavior, combine to acquire stimulus control of a 
response that would not have occurred in the absence of either. 


L 

lag reinforcement schedule A schedule of reinforcement in which rein- 
forcement is contingent on a response being different in some specified way 
(e.g., different topography) from the previous response (e.g., Lag 1) ora speci- 
fied number of previous responses (e.g., Lag 2 or more). 


latency See response latency. 


Glossary G-9 


latency-based functional analysis An analysis in which each session is 
terminated as soon as a problem behavior occurs. The index of problem behavior 


is the latency from onset of the establishing operation to the first occurrence of 
the problem behavior. 


least-to-most response prompts A technique of transferring stimulus 
control in which the practitioner gives the participant an opportunity to perform 
the response with the least amount of assistance on each trial. The participant 
receives greater degrees of assistance with each successive trial without a correct 
response. The procedure for least-to-most prompting requires the participant 
to make a correct response within a set time limit (e.g., 3 seconds) from the 
presentation of the natural S?. If the response does not occur within the spec- 
ified time, the applied behavior analyst presents the natural SP and a response 
prompt of least assistance, such as a verbal response prompt. If after the same 
specified time limit (e.g., another 3 seconds) the participant does not make a 
correct response, the analyst gives the natural SP and another response prompt, 
such as a gesture. The participant receives partial or full physical guidance if the 
less intrusive prompt does not evoke a correct response. 


level The value on the vertical axis around which a series of behavioral 
measures converge. 


level system A component of some token economy systems in which partic- 
ipants advance up (or down) through a succession of levels contingent on their 
behavior at the current level. The performance criteria and sophistication or 
difficulty of the behaviors required at each level are higher than those of pre- 
ceding levels; as participants advance to higher levels, they gain access to more 
desirable reinforcers, increased privileges, and greater independence. 


limited hold A situation in which reinforcement is available only during a 
finite time following the elapse of an FI or VI interval; if the target response does 
not occur within the time limit, reinforcement is withheld and a new interval 
begins (e.g., on an FI 5-min schedule with a limited hold of 30 seconds, the 
first correct response following the elapse of 5 minutes is reinforced only if 
that response occurs within 30 seconds after the end of the S-minute interval). 


line graph Based on a Cartesian plane, a two-dimensional area formed by the 
intersection of two perpendicular lines. Any point within the plane represents a spe- 
cific relation between the two dimensions described by the intersecting lines. It is 
the most common graphic format for displaying data in applied behavior analysis. 
listener Someone who provides reinforcement for a speaker’s verbal behav- 
ior. A listener may also serve as an audience evoking verbal behavior. The 
distinction between listener and speaker is often blurred by the fact that much 
of a listener's behavior may involve becoming a speaker at the coyert level 
(e.g., thinking about what was said). A speaker may also serve as her own 
listener. (Contrast with speaker.) 


listener discriminations When verbal SP evokes a specific nonverbal 
behavior, due to a history of reinforcement. 


local response rate The average rate of response during a smaller period of 
time within a larger period for which an overall response rate has been given. 
(See also overall response rate.) 


M 


magnitude The force or intensity with which a response is emitted; provides 
important quantitative parameters used in defining and verifying the occurrence 
of some response classes. Responses meeting those criteria are measured and 
reported by one or more fundamental or derivative measures such as frequency, 
duration, or latency. (Sometimes called amplitude.) 


maintenance Two different meanings in applied behavior analysis: (a) the 
extent to which the learner continues to perform the target behavior after a por- 
tion or all of the intervention has been terminated (i.e., response maintenance), 
a dependent variable or characteristic of behavior; and (b) a condition in which 
treatment has been discontinued or partially withdrawn, an independent variable 
or experimental condition. 


mand An elementary verbal operant involving a response of any form that is 
evoked by an MO and followed by specific reinforcement. Manding allows a 
speaker to get what she wants or refuse what she does not want. 


massed practice A self-directed behavior change technique in which the 
person forces herself to perform an undesired behavior (e.g., a compulsive ritual) 
repeatedly, which sometimes decreases the future frequency of the behavior. - 
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matching law The allocation of responses to choices available on concurrent 
schedules of reinforcement; rates of responding across choices are distributed 
in proportions that match the rates of reinforcement received from each choice 
alternative. 


matching-to-sample procedure A discrete trial procedure for investigat- 
ing conditional relations and stimulus equivalence. A matching-to-sample trial 
begins with the participant making a response that presents or reveals the sample 
stimulus; next, the sample stimulus may or may not be removed, and two or 
more comparison stimuli are presented. The participant then selects one of the 
comparison stimuli. Responses that select a comparison stimulus that matches 
the sample stimulus are reinforced; no reinforcement is provided for responses 
selecting the nonmatching comparison stimuli. 


mean count-per-interval IOA The average percentage of agreement 
between the counts reported by two observers in a measurement period com- 
posed of a series of smaller counting times; a more conservative measure of 
IOA than total count IOA. 


mean duration-per-occurrence IOA An IOA index for duration per 
occurrence data; also a more conservative and usually more meaningful 
assessment of IOA for total duration data calculated for a given session or 
measurement period by computing the average percentage of agreement of 
the durations reported by two observers for each occurrence of the target 
behavior. 


measurement The process of applying quantitative labels to describe and dif- 
ferentiate objects and natural events. Measurement in applied behavior analysis 
involves three steps: (a) identifying the behavior to be measured, (b) defining 
the behavior in observable terms, and (c) selecting an appropriate observation 
and data-recording method. 


measurement bias Nonrandom measurement error; a form of inaccurate 
measurement in which the data consistently overestimate or underestimate the 
true value of an event. 


measurement by permanent product A method of measuring behavior 
after it has occurred by recording the effects that the behavior produced on the 
environment. 


mentalism An approach to explaining behavior that assumes that a mental, 
or “inner,” dimension exists that differs from a behavioral dimension and that 
phenomena in this dimension either directly cause or at least mediate some 
forms of behavior, if not all. 


metaphorical (tact) extension A tact evoked by a novel stimulus that 
shares some, but not all, of the relevant features of the original stimulus. 


methodological behaviorism A philosophical position that views behav- 
ioral events that cannot be publicly observed as outside the realm of science. 


metonymical (tact) extension A tact evoked by a novel stimulus that 
shares none of the relevant features of the original stimulus configuration, but 
some irrelevant yet related feature has acquired stimulus control. 


mixed schedule (mix) A compound schedule of reinforcement consisting 
of two or more basic schedules of reinforcement (elements) that occur in an 
alternating, usually random, sequence; no discriminative stimuli are correlated 
with the presence or absence of each element of the schedule, and reinforcement 
is delivered for meeting the response requirements of the element in effect at 
any time. 


modeling A behavior change strategy in which learners acquire new skills by 
imitating demonstrations of the skills by live or symbolic models. The model 
shows, demonstrates, or conveys exactly the behavior the learner is expected 
to perform. Models can be live demonstrations or symbolic representations of 
the desired behavior. 


momentary time sampling A measurement method in which the presence 
or absence of behaviors is recorded at precisely specified time intervals. (Con- 
trast with interval recording.) 


most-to-least response prompts A technique of transferring stimulus con- 
trol in which the practitioner physically guides the participant through the entire 
performance sequence, and then gradually reduces the level of assistance in 
successive trials. Customarily, most-to-least prompting transitions from physi- 
cal guidance to visual prompts to verbal instructions, and finally to the natural 
stimulus without prompts. 


motivating operation (MO) An environmental variable that (a) alters 
(increases or decreases) the reinforcing or punishing effectiveness of some 
stimulus, object, or event; and (b) alters (increases or decreases) the cur- 
rent frequency of all behavior that has been reinforced or punished by that 
stimulus, object, or event. (See also abative effect, abolishing operation [AO], 
behavior-altering effect, establishing operation [EO], evocative effect, 
value-altering effect.) 


motor imitation (relating to sign language) A type of duplic verbal 
behavior in which the form of a motor response is under the functional control 
of a visual verbal S? that has formal similarity between a verbal stimulus and a 
verbal response product, and a history of generalized reinforcement. 


multielement design An experimental design in which two or more condi- 
tions (one of which may be a no-treatment control condition) are presented in 
rapidly alternating succession (e.g., on alternating sessions or days) independent 
of the level of responding; differences in responding between or among con- 
ditions are attributed to the effects of the conditions. (Also called alternating 
treatments design, concurrent schedule design, and multiple schedule design.) 


multiple baseline across behaviors design A multiple baseline design 
in which the treatment variable is applied to two or more different behaviors of 
the same subject in the same setting. 


multiple baseline across settings design A multiple baseline design in 
which the treatment variable is applied to the same behavior of the same subject 
across two or more different settings, situations, or time periods. 


multiple baseline across subjects design A multiple baseline design in 
which the treatment variable is applied to the same behavior of two or more 
subjects (or groups) in the same setting. 


multiple baseline design An experimental design that begins with the con- 
current measurement of two or more behaviors in a baseline condition, fol- 
lowed by the application of the treatment variable to one of the behaviors while 
baseline conditions remain in effect for the other behavior(s). After maximum 
change has been noted in the first behavior, the treatment variable is applied in 
sequential fashion to each of the other behaviors in the design. Experimentai 
control is demonstrated if each behavior shows similar changes when, and only 
when, the treatment variable is introduced. (See also multiple baseline across 
behaviors design, multiple baseline across settings design, and multiple 
baseline across subjects design.) 


multiple control (of verbal behavior) There are two types of multiple 
control. Convergent multiple control occurs when a single verbal response is a 
function of more than one variable (i.e., what is said has more than one antecedent 
source of control). Divergent multiple control occurs when a single antecedent 
variable affects the strength of more than one response. 


multiple-exemplar training Instruction that provides the learner with prac- 
tice with a variety of stimulus conditions, response variations, and response 
topographies to ensure the acquisition of desired stimulus control response 
forms; used to promote both setting/situation generalization and response gen- 
eralization. (See also teaching enough examples.) 


multiple probe design A variation of the multiple baseline design that fea- 
tures intermittent measures, or probes, during baseline. It is used to evaluate the 
effects of instruction on skill sequences in which it is unlikely that the subject can 
improve performance on later steps in the sequence before learning prior steps. 


multiple schedule (mult) A compound schedule of reinforcement consisting 
of two or more basic schedules of reinforcement (elements) that occur in an alter- 
nating, usually random, sequence; a discriminative stimulus is correlated with the 
presence or absence of each element of the schedule, and reinforcement is deliy- 
ered for meeting the response requirements of the element in effect at any time. 


multiple treatment interference The effects of one treatment on a 


subject’s behavior being confounding by the influence of another treatment 
administered in the same study. 


multiple treatment reversal design Any experimental design that uses 
the experimental methods and logic of the reversal tactic to compare the effects 
of two or more experimental conditions to baseline and/or to one another 
(e.g., A-B-A-B-C-B-C, A-B-A-C-A-D-A-C-A-D, A-B-A-B-B+C-B-B+C). 


mutual entailment A bidirectional stimulus relation in which one direction 
(e.g., if A, then R) is directly learned and the other (if B, then A) is derived, 


N 


naive observer An observer who is unaware of the study’s purpose and/ 
or the experimental conditions in effect during a given phase or observation 


period. Data obtained by a naive observer are less likely to be influenced by 
observers’ expectations. 


naturally existing contingency Any contingency of reinforcement (or pun- 
ishment) that operates independent of the behavior analyst’s or practitioner’s 
efforts; includes socially mediated contingencies contrived by other people and 
already in effect in the relevant setting. (Contrast with contrived contingency.) 


negative punishment A response behavior followed immediately by the 
removal of a stimulus (or a decrease in the intensity of the stimulus) that results 
in similar responses occurring less often. (Contrast with positive punishment.) 


negative reinforcement A contingency in which the occurrence of a 
response is followed immediately by the termination, reduction, postponement, 
or avoidance of a stimulus, and which leads to an increase in the future occur- 
rence of similar responses. 


negative reinforcer A stimulus whose termination (or reduction in intensity) 
functions as reinforcement. (Contrast with positive reinforcer.) 


negligence Failure to demonstrate professional integrity on the part of the 
provider that threatens the client’s health and safety; typically exhibited as 
nonfeasance—not doing what ought to be done. 


neutral stimulus (NS) A stimulus change that does not elicit respondent 


behavior. (Compare to conditioned stimulus [CS] and unconditioned stim- 
ulus [US].) 


nodal stimulus (node) A stimulus set that must be held in common across 
a minimum of two conditional discriminations to provide a basis for all equiv- 
alence properties. 


nonequivalence relations Derived stimulus relations in which stimuli are 
related on some basis other than “sameness.” 


noncontingent reinforcement (NCR) A procedure in which stimuli with 
known reinforcing properties are presented on fixed-time (FT) or variable-time 
(VT) schedules completely independent of behavior; often used as an antecedent 
intervention to reduce problem behavior. (See also fixed-time schedule [FT], 
variable-time schedule [VT].) 


noncontingent reinforcement (NCR) reversal technique An exper- 
imental control technique that demonstrates the effects of reinforcement by 
using noncontingent reinforcement (NCR) as a control condition instead of a 
no-reinforcement (baseline) condition. During the NCR condition, the stimulus 
change used as reinforcement in the reinforcement condition is presented on a 
fixed or variable time schedule independent of the subject’s behavior. A higher 
level of responding during the reinforcement condition than during the NCR 
condition demonstrates that the changes in behavior are the result of contingent 
reinforcement, not simply the presentation of or contact with the stimulus event. 
(Compare to DRI/DRA reversal technique and DRO reversal technique.) 


nonconcurrent multiple baseline across participants design An 
experimental design that consists of a related series of A-B (baseline-interven- 
tion) sequences conducted across participants at different points in time; often 
used to assess treatment effects when concurrent measurement of participants’ 
behavior is not possible. 


nonexclusion time-out A procedure for implementing time-out in which, 
contingent on the occurrence of the target behavior, the person remains within 
the setting, but does not have access to reinforcement, for a specified period. 


normalization As a philosophy and principle, the belief that people with 
disabilities should, to the maximum extent possible, be physically and socially 
integrated into the mainstream of society regardless of the degree or type of 
disability. As an approach to intervention, the use of progressively more typical 
settings and procedures “to establish and/or maintain personal behaviors which 
are as culturally normal as possible” (Wolfensberger, 1972, p. 28). 


0) 


observed value A measure produced by an observation and measurement 
system. Observed values serve as the data that the researcher and others will 
interpret to form conclusions about an investigation. (Compare to true value.) 
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observer drift Any unintended change in the way an observer uses a mea- 
surement system over the course of an investigation that results in measurement 
error; often entails a shift in the observer’s interpretation of the original defini- 
tions of the target behavior subsequent to being trained. (See also measurement 
bias and observer reactivity.) 


observer reactivity Influence on the data reported by an observer that results 
from the observer’s awareness that others are evaluating the data. (See also 
measurement bias and observer drift.) 


ontogeny The history of the development of an individual organism during its 
lifetime. (See also history of reinforcement; compare to phylogeny.) 


operant behavior Behavior that is selected, maintained, and brought under 
stimulus control as a function of its consequences; each person’s repertoire of 
operant behavior is a product of his history of interactions with the environment 
(ontogeny). 


operant conditioning The basic process by which operant learning occurs; 
consequences (stimulus changes immediately following responses) result in an 
increased (reinforcement) or decreased (punishment) frequency of the same 
type of behavior under similar motivational and environmental conditions 
in the future. (See also motivating operation, punishment, reinforcement, 
response class, and stimulus control.) 


overall response rate The rate of response over a given time period. (See 
also local response rate.) 


overcorrection A behavior change tactic based on positive punishment in 
which, contingent on the problem behavior, the learner is required to engage 
in effortful behavior directly or logically related to fixing the damage caused 
by the behavior. Forms of overcorrection are restitutional overcorrection and 
positive practice overcorrection. (See also positive practice overcorrection 
and restitutional overcorrection.) 


overselective stimulus control A condition in which the range of discrim- 
inative stimuli, or stimulus features controlling behavior, is extremely limited; 
often interferes with learning. (Also called stimulus overselectivity.) 


overshadowing Occurs when the most salient component of a compound 
stimulus arrangement controls responding and interferes with the acquisition of 
stimulus control by the more relevant stimulus. 


P 


parametric analysis An experiment designed to discover the differential 
effects of a range of values of an independent variable. 


parsimony The practice of ruling out simple, logical explanations, experimen- 
tally or conceptually, before considering more complex or abstract explanations. 


partial-interval recording A time sampling method for measuring behay- 
ior in which the observation period is divided into a series of brief time inter- 
vals (typically from 5 to 10 seconds). The observer records whether the target 
behavior occurred at any time during the interval. Partial-interval recording is 
not concerned with how many times the behavior occurred during the interval 
or how long the behavior was present, just that it occurred at some point during 
the interval: tends to overestimate the proportion of the observation period that 
the behavior actually occurred. 


partition time-out A variation of time-out from positive reinforcement 
in which contingent on the occurrence of the target behavior, the participant 
remains in the time-in setting, but his view of the setting is restricted by a panel 
or cubicle, or a select space is arranged to serve as the time-out area (1.e., a 
carpet, a corner). Sometimes called select space time-out. 


percentage A ratio (i.e., a proportion) formed by combining the same 
dimensional quantities, such as count (number + number) or time (duration = 
duration; latency + latency); expressed as a number of parts per 100; typically 
expressed as a ratio of the number of responses of a certain type per total number 
of responses (or opportunities or intervals in which such a response could have 
occurred). A percentage presents a proportional quantity per 100. 


perspective shifting Responding as if from the vantage point of another 
person, place, or time than the personal here and now. 


philosophic doubt An attitude that the truthfulness and validity of all scien- 
tific theory and knowledge should be continually questioned. 
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phylogeny The history of the natural evolution of a species. (Compare to 
ontogeny.) 


pivotal behavior A behavior that, when learned, produces correspond- 
ing modifications or covariation in other untrained behaviors. (Compare to 
behavioral cusp.) 


placebo control A procedure that prevents a subject from detecting the pres- 
ence or absence of the treatment variable. To the subject, the placebo condition 
appears the same as the treatment condition (e.g., a placebo pill contains an 
inert substance but looks, feels, and tastes exactly like a pill that contains the 
treatment drug). (See also double-blind control.) 


planned activity check (PLACHECK) A variation cf momentary time 
sampling in which the observer records whether each person in a group is 
engaged in the target behavior at specific points in time; provides a measure 
of “group behavior.” 


planned ignoring A procedure for implementing time-out in which social 
reinforcers—usually attention, physical contact, and verbal interaction—are 
withheld for a brief period contingent on the occurrence of the target behavior. 


point-to-point correspondence A relation between the stimulus and 
response or response product that occurs when the beginning, middle, and end 
of the verbal stimulus matches the beginning, middle, and end of the verbal 
response. The verbal relations with point-to-point correspondence are echoic, 
copying text, imitation as it relates to sign language, textual, and transcription. 


positive practice overcorrection A form of overcorrection in which, con- 
tingent on an occurrence of the target behavior, the learner is required to repeat 
a correct form of the behavior, or a behavior incompatible with the problem 
behavior, a specified number of times; entails an educative component. (See 
also overcorrection and restitutional overcorrection.) 


positive punishment A response followed immediately by the presentation 
of a stimulus that decreases the future frequency of the behavior. (Contrast with 
negative punishment.) 


positive reinforcement A response followed immediately by the presenta- 
tion of a stimulus change that results in similar responses occurring more often. 
(Contrast with negative reinforcement.) 


positive reinforcer A stimulus whose presentation or onset functions as 
reinforcement. (Contrast with negative reinforcer.) 


postreinforcement pause The absence of responding for a period of time 
following reinforcement; an effect commonly produced by fixed interval (FI) 
and fixed ratio (FR) schedules of reinforcement. 


practice effects Improvements in performance resulting from opportunities 
to perform a behavior repeatedly so that baseline measures can be obtained. 


pragmatism A philosophical position asserting that the truth value of a state- 
ment is determined by how well it promotes effective action; pragmatism is a 
primary criterion by which behavior analysts judge the value of their findings. 


precision teaching An instructional approach that involves (a) pinpointing 
the skills to be learned; (b) measuring the initial frequency or rate per minute at 
which the student can perform those skills; (c) setting an aim, or goal, for the 
child’s improvement; (d) using direct, daily measurement to monitor progress made 
under an instructional program; (e) charting the results of those measurements on a 
Standard Celeration Chart; and (f) changing the program if progress is not adequate. 


prediction A statement of the anticipated outcome of a presently unknown 
or future measurement; one of three components of the experimental reasoning, 
or baseline logic, used in single-subject research designs. (See also replication 
and verification.) 


Premack principle A principle that states that making the opportunity 
to engage in a high-probability behavior contingent on the occurrence of a 
low-frequency behavior will function as reinforcement for the low-frequency 
behavior. Sometimes called “Grandma’s Law.” (See also response-deprivation 
hypothesis.) 


principle of behavior A statement describing a functional relation between 
behavior and one or more of its controlling variables with generality across 
organisms, species, settings, behaviors, and time (e.g., extinction, positive 
reinforcement); an empirical generalization inferred from many experiments 
demonstrating the same functional relation. 


private events Covert events typically accessible only to the person expe- 
riencing them. Skinner’s radical behaviorism holds three major assumptions 
about private events: (a) private events are behavior; (b) behavior that takes 
place within the skin is distinguished from other (“public”) behavior only by its 
inaccessibility; and (c) private behavior is influenced by (i.e., is a function of) 
the same kinds of variables as publicly accessible behavior. 


procedural fidelity The extent to which procedures in all phases and condi- 
tions of an experiment, including baseline, are implemented correctly. (Contrast 
with treatment integrity). 


programming common stimuli A tactic for promoting setting/situation 
generalization by making the instructional setting similar to the generalization 
setting; the two-step process involves (1) identifying salient stimuli that char- 
acterize the generalization setting and (2) incorporating those stimuli into the 
instructional setting. 


progressive-ratio (PR) schedule of reinforcement A variation of the 
fixed ratio (FR) schedule of reinforcement that increases the ratio requirements 
incrementally within the session; PR schedule requirements are changed using 
(a) arithmetic progressions to add a constant number to each successive ratio, 
or (b) geometric progressions to add successively a constant proportion of the 
preceding ratio. 


progressive schedule of reinforcement A schedule that systematically 
thins each successive reinforcement opportunity independent of the individu- 
al’s behavior; progressive ratio (PR) and progressive interval (PI) schedules are 
thinned using arithmetic or geometric progressions. 


progressive time delay A procedure for transferring stimulus control 
from contrived response prompts to naturally existing stimuli that starts with 
simultaneous presentation of the natural stimulus and the response prompt (i.e., 
0-sec delay). The number of 0-sec trials depends on the task difficulty and the 
functioning level of the participant. Following the simultaneous presentations, 
the time delay is gradually and systematically extended. 


punisher A stimulus change that decreases the future occurrence of behav- 
ior that immediately precedes it. (See also aversive stimulus, conditioned 
punisher, and unconditioned punisher.) 


punishment A basic principle of behavior describing a response—consequence 
functional relation in which a response is followed immediately by a stimulus 
change that decreases future occurrences of that type of behavior.(See also 
negative punishment and positive punishment.) 


R 


radical behaviorism A form of behaviorism that attempts to understand 
all human behavior, including private events such as thoughts and feelings, in 
terms of controlling variables in the history of the person (ontogeny) and the 
species (phylogeny). 


range-bound changing criterion design A variation of the changing 
criterion design in which each intervention sub-phase includes a lower and an 
upper criterion within which the participant is expected to perform. 


rate A fundamental measure of how often behavior occurs expressed as count 
per standard unit of time (e.g., per minute, per hour, per day) and calculated 
by dividing the number of responses recorded by the number of standard units 
of time in which observations were conducted. Some behavior analysts use 
rate and frequency interchangeably; others use frequency to mean count. (See 
frequency and count.) 


ratio scale A scale in which equal distances on a graph’s axis correspond to 
equal ratios of change in the variable plotted on the axis. 


ratio strain A behavioral effect associated with abrupt increases in ratio 
requirements when moving from denser to thinner reinforcement schedules: 
common effects include avoidance, aggression, and unpredictable pauses or 
cessation in responding. 


reactivity Effects of an observation and measurement procedure on the behay- 
ior being measured. Reactivity is most likely when measurement procedures 
are obtrusive, especially if the person being observed is aware of the observer’s 
presence and purpose. 


recombinative generalization “Differential responding to novel combi- 


nations of stimulus components that have been included previously in other 
stimulus contexts” (Goldstein, 1983, p. 280). 


recovery from punishment The occurrence of a previously punished type 
of response without its punishing consequence; analogous to the extinction of 


previously reinforced behavior and has the effect of undoing the effect of the 
punishment. 


reflex A stimulus—response relation consisting of an antecedent stimulus 
and the respondent behavior it elicits (e.g., bright light—pupil contraction). 
Unconditioned and conditioned reflexes protect against harmful stimuli, help 
regulate the internal balance and economy of the organism, and promote 
reproduction. (See also conditioned reflex, respondent behavior, respondent 
conditioning, and unconditioned reflex.) 


reflexive conditioned motivating operation (CMO-R) A stimulus that 
acquires MO effectiveness by preceding some form of worsening or improve- 
ment. It is exemplified by the warning stimulus in a typical escape—avoidance 
procedure, which establishes its own offset as reinforcement and evokes all 
behavior that has accomplished that offset. 


reflexivity A type of stimulus-to-stimulus relation in which the learner, 
without any prior training or reinforcement for doing so, selects a comparison 
stimulus that is the same as the sample stimulus (e.g., A = A). Reflexivity 
would be demonstrated in the following matching-to-sample procedure: The 
sample stimulus is a picture of a tree, and the three comparison stimuli are a 
picture of a mouse, a picture of a cookie, and a duplicate of the tree picture used 
as the sample stimulus. The learner selects the picture of the tree without spe- 
cific reinforcement in the past for making the tree-picture-to-tree-picture match. 
(Also called generalized identity matching; see also stimulus equivalence; 
compare to transitivity and symmetry.) 


reinforcement A basic principle of behavior describing a response— 
consequence functional relation in which a response is followed immediately 
by a stimulus change that results in similar responses occurring more often. (See 
also negative reinforcement and positive reinforcement.) 


reinforcer A stimulus change that increases the future frequency of behavior 
that immediately precedes it. (See also positive reinforcer, conditioned 
reinforcer and unconditioned reinforcer.) 


reinforcer assessment Refers to a variety of direct, empirical methods for 
presenting one or more stimuli contingent on a target response and measuring 
their effectiveness as reinforcers. 


reinforcer-abolishing effect (of a motivating operation) A decrease in 
the reinforcing effectiveness of a stimulus, object, or event caused by a motivat- 
ing operation. For example, food ingestion abolishes (decreases) the reinforcing 
effectiveness of food. 


reinforcer-establishing effect (of a motivating operation) An 
increase in the reinforcing effectiveness of a stimulus, object, or event caused 
by a motivating operation. For example, food deprivation establishes (increases) 
the reinforcing effectiveness of food. 


relational frame Any specific type of arbitrarily applicable relational 
responding. 


relational frame theory (RFT) A theory of derived stimulus relations 
proposing that stimulus relations are inherently verbal and that accumulated 
experience with relational exemplars creates generalized repertoires of relating. 


relevance of behavior rule Holds that only behaviors likely to produce 
reinforcement in the person’s natural environment should be targeted for change. 


reliability (of measurement) Refers to the consistency of measurement, 
specifically, the extent to which repeated measurement of the same event yields 
the same values. 


repeatability Refers to the fact that a behavior can occur repeatedly through 
time (i.e., behavior can be counted); one of the three dimensional quantities 
of behavior from which all behavioral measurements are derived. (See also 
celeration, count, frequency, rate, temporal extent, and temporal locus.) 


repertoire All of the behaviors a person can do; or a set of behaviors relevant 
to a particular setting or task (e.g., gardening, mathematical problem solving). 


replication (a) Repeating conditions within an experiment to determine the 
reliability of effects and increase internal validity. (See also baseline logic, 
prediction, and verification.) (b) Repeating whole experiments to determine 
the generality of findings of previous experiments to other subjects, settings, 
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and/or behaviors. (See also direct replication, external validity, and system- 
atic replication.) 


research question A statement of what the researcher seeks to learn by con- 
ducting the experiment; may be presented in question form and is most often 
found in a published account as a statement of the experiment’s purpose. All 
aspects of an experiment’s design should follow from the research question. 


resistance to extinction The relative frequency with which operant behavior 
is emitted during extinction. 


respondent behavior The response component of a reflex; behavior that 


is elicited, or induced, by antecedent stimuli. (See also reflex and respondent 
conditioning.) 


respondent conditioning A stimulus—stimulus pairing procedure in which 
a neutral stimulus (NS) is presented with an unconditioned stimulus (US) until 
the neutral stimulus becomes a conditioned stimulus that elicits the conditioned 
response. (Also called classical or Pavlovian conditioning; see also conditioned 
reflex and higher order conditioning.) 


respondent extinction The repeated presentation of a conditioned stimulus 
(CS) in the absence of the unconditioned stimulus (US); the CS gradually loses 
its ability to elicit the conditioned response until the conditioned reflex no longer 
appears in the individual’s repertoire. 


response A single instance or occurrence of a specific class or type of 
behavior. Technical definition: an “action of an organism’s effector. An effector 
is an organ at the end of an efferent nerve fiber that is specialized for altering 
its environment mechanically, chemically, or in terms of other energy changes” 
(Michael, 2004, p. 8). (See also response class.) 


response blocking A procedure in which the therapist physically intervenes 
as soon as the learner begins to emit a problem behavior, to prevent completion 
of the targeted behavior. 


response class A group of responses of varying topography, all of which 
produce the same effect on the environment. 


response cost The response-contingent loss of a specific number of positive 
reinforcers (e.g., a fine) that decreases the frequency of similar responses in the 
future; a form of negative punishment. 


response differentiation A behavior change produced by differential 
reinforcement: Reinforced members of the current response class occur with 
greater frequency, and unreinforced members occur less frequently (undergo 
extinction); the overall result is the emergence of a new response class. 


response generalization The extent to which a learner emits untrained 
responses that are functionally equivalent to the trained target behavior. 
(Compare to response maintenance and setting/situation generalization.) 


response interruption and redirection (RIRD) A procedural variation 
of response blocking that involves interrupting stereotypic behavior at its onset 
and redirecting the individual to complete high-probability behaviors instead. 


response latency A measure of temporal locus; the elapsed time from the 
onset of a stimulus (e.g., task direction, cue) to the initiation of a response. 


response maintenance The extent to which a learner continues to perform 
the target behavior after a portion or all of the intervention responsible for the 
behavior’s initial appearance in the learner’s repertoire has been terminated. 
(Often called maintenance, durability, behavioral persistence, and (incorrectly) 
resistance to extinction; compare to response generalization and setting/ 
situation generalization.) 


response prompts Prompts that operate directly on the response to cue 
a correct response. The three major forms of response prompts are verbal 
instructions, modeling, and physical guidance. 


response-deprivation hypothesis A model for predicting whether 
contingent access to one behavior will function as reinforcement for engaging 
in another behavior based on whether access to the contingent behavior rep- 
resents a restriction of the activity compared to the baseline level of engagement. 
(See also Premack principle.) 

restitutional overcorrection A form of overcorrection in which, contingent 


on the problem behavior, the learner is required to repair the damage or return 
the environment to its original state and then to engage in additional behavior 
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to bring the environment to a condition vastly better than it was in prior to the 
misbehavior. (See also overcorrection and positive practice overcorrection.) 


restraint Three common types of restraint—personal restraint, protective 
equipment restraint, and self-restraint—appear in the behavior analysis litera- 
ture. With personal restraint, therapists physically restrict the problem behavior 
by holding the body parts (e.g., response blocking) that produce the problem 
behavior. With protective equipment restraints, therapists often use protective 
equipment restraints (e.g., padded helmets, arm splints, boxing gloves, padded 
hand mitts, safety belts, safety straps) to protect individuals from causing seri- 
ous injury. Persons with self-injurious behavior (SIB) often apply se/f-restraint, 
such as sitting on hands, placing arms between folded legs, and wrapping arms 
in tight clothing. 


resurgence Refers to the reoccurrence of a previously reinforced behavior 
when reinforcement for an alternative behavior is terminated or decreased and 
to the three-phase procedure that produces the effect: (1) A target behavior is 
reinforced, (2) the target behavior is placed on extinction and reinforcement 
provided for an alternative behavior, and (3) both responses are placed on 
extinction. 


reversal design Any experimental design in which the researcher attempts 
to verify the effect of the independent variable by “reversing” responding to a 
level obtained in a previous condition; encompasses experimental designs in 
which the independent variable is withdrawn (A-B-A-B) or reversed in its focus 
(e.g., DRI/DRA). (See also A-B-A design, A-B-A-B design, B-A-B design, 
DRI/DRA reversal technique, DRO reversal technique, and noncontingent 
reinforcement [NCR] reversal technique.) 


risk—-benefit analysis A comparison prior to conducting a behavioral 
assessment or intervention of the potential harm to the client (or others) versus 
the benefits likely to result from the procedures. 


rule-governed behavior Behavior controlled by a rule (i.e., a verbal state- 
ment of an antecedent-behavior-consequence contingency); enables human 
behavior (e.g., fastening a seat belt) to come under the indirect control of 
temporally remote or improbable, but potentially significant consequences 
(e.g., avoiding injury in an auto accident). Often used in contrast to contingency- 
shaped behavior, a term used to indicate behavior selected and maintained by 
controlled, temporally close consequences. 


S 


satiation A decrease in the frequency of operant behavior presumed to be the 
result of continued contact with or consumption of a reinforcer that has followed 
the behavior; also refers to a procedure for reducing the effectiveness of a rein- 
forcer (e.g., presenting a person with copious amounts of a reinforcing stimulus 
prior to a session). (See also motivating operation; contrast with deprivation.) 


scatterplot A two-dimensional graph that shows the relative distribution of 
individual measures in a data set with respect to the variables depicted by the 
x- and y-axes. Data points on a scatterplot are not connected. 


scatterplot recording A procedure for recording the extent to which a target 
behavior occurs more often at particular times than others; involves dividing the 
day into blocks of time, and using different symbols on an observation form to 
indicate the level of the target behavior (e.g., a lot, some, or not at all). 


schedule of reinforcement A rule specifying the environmental arrangements 
and response requirements for reinforcement; a description of a contingency of 
reinforcement. 


schedule thinning Changing a contingency of reinforcement by gradually 
increasing the response ratio or the extent of the time interval; it results in a 
lower rate of reinforcement per responses, time, or both. 


science A systematic approach to the understanding of natural phenomena (as evi- 
denced by description, prediction, and control) that relies on determinism as its fun- 
damental assumption, empiricism as its primary rule, experimentation as its basic 
strategy, replication as a requirement for believability, parsimony as a value, and 
philosophic doubt as its guiding conscience. 


scored-interval IOA An interobserver agreement index based only on the 
intervals in which either observer recorded the occurrence of the behavior; calcu- 
lated by dividing the number of intervals in which the two observers agreed that 
the behavior occurred by the number of intervals in which either or both observers 


recorded the occurrence of the behavior and multiplying by 100. Scored-interyal 
IOA is recommended as a measure of agreement for behaviors that occur at low 
rates because it ignores the intervals in which agreement by chance is highly 
likely. (Compare to interval-by-interval IOA and unscored-interval IOA.) 


selection by consequences The fundamental principle underlying operant 
conditioning; the basic tenet is that all forms of (operant) behavior, from sim- 
ple to complex, are selected, shaped, and maintained by their consequences 
during an individual’s lifetime; Skinner’s concept of selection by consequences 
is parallel to Darwin’s concept of natural selection of genetic structures in the 
evolution of species. 


selection-based verbal behavior A category of verbal behavior in which the 
speaker points to or selects a particular stimulus; what is conveyed to the listener 
is the information on the stimulus selected. (Contrast with topography-based 
verbal behavior) 


selectionism A theory that all forms of life naturally and continually evolve 
as a result of the interaction between function and the survival value of that 
function. Operant selection by consequences is the conceptual and empirical 
foundation of behavior analysis. 


self-contract Contingency contract that a person makes with himself or herself, 
incorporating a self-selected task and reward as well as personal monitoring of 
task completions and self-delivery of the reward. 


self-control [impulse analysis] A person’s ability to “delay gratification” by 
emitting a response that will produce a larger (or higher quality) delayed reward 
over a response that produces a smaller but immediate reward. (Sometimes called 
impulse control.) 


self-control [Skinner’s analysis] Skinner (1953) conceptualized 
self-control as a two-response phenomenon: The controlling response affects 
variables in such a way as to change the probability of the controlled response. 
(See also self-management.) 


self-evaluation A procedure in which a person compares her performance of 
a target behavior with a predetermined goal or standard; often a component of 
self-management. (Sometimes called self-assessment.) 


self-instruction Self-generated verbal responses, covert or overt, that function 
as rules or response prompts for a desired behavior; as a self-management tactic, 
self-instruction can guide a person through a behavior chain or sequence of tasks. 


self-management The personal application of behavior change tactics that 
produces a desired change in behavior. 


self-monitoring A procedure whereby a person systematically observes his 
behavior and records the occurrence or nonoccurrence of a target behavior. 
(Also called self-recording or self-observation.) 


semilogarithmic chart A two-dimensional graph with a logarithmic scaled 
y axis so that equal distances on the vertical axis represent changes in behavior 
that are of equal proportion. (See also Standard Celeration Chart.) 


sensory extinction A procedure by which behaviors maintained by automatic 
reinforcement are placed on extinction by masking or removing the sensory 
consequence. 


sequence effects The effects on a subject’s behavior in a given condition that 
are the result of the subject’s experience with a prior condition. 


setting/situation generalization The extent to which a learner emits the tar- 
get behavior in a setting or stimulus situation that is different from the instructional 
setting. 


shaping Using differential reinforcement to produce a series of gradually 
changing response classes; each response class is a successive approximation 
toward a terminal behavior. Members of an existing response class are selected 
for differential reinforcement because they more closely resemble the termi- 
nal behavior. (See also differential reinforcement, response class, response 
differentiation, and successive approximations.) 


simple discrimination Responding is under stimulus control of a single 


antecedent stimulus condition; described by the three-term contingency: 
SP—>R— SFr, 


simple verbal discrimination A single-component word or phrase evokes 


a nonmatching intraverbal response (e.g., upon hearing “Ready, set... ,” a child 
says “go,”) 


simple-to-complex testing protocol An arrangement in which types of 
stimulus equivalence probes are introduced sequentially, beginning with symme- 
try, followed by transitivity (if relevant), and then combined tests for equivalence. 


single-case designs A wide variety of research designs that use a form 
of experimental reasoning called baseline logic to demonstrate the effects of 
the independent variable on the behavior of individual subjects. (Also called 
single-subject, within-subject, and intra-subject designs; see also baseline logic, 
changing criterion design, multielement design, multiple baseline design, 
reversal design, and steady state strategy.) 


social validity Refers to the extent to which target behaviors are appropriate, 


intervention procedures are acceptable, and important and significant changes 
in target and collateral behaviors are produced. 


socially mediated contingency (reinforcement) A contingency in which 


an antecedent stimulus and/or the consequence for the behavior is presented by 
another person. 


solistic (tact) extension A verbal response evoked by a stimulus property 
that is only indirectly related to the proper tact relation (e.g., Yogi Berra’s classic 
malapropism: “Baseball is ninety percent mental; the other half is physical.” 


spaced-responding DRL A procedure for implementing DRL in which 
reinforcement follows each occurrence of the target behavior that is separated 
from the previous response by a minimum interresponse time (IRT). (See also 
differential reinforcement of low rates [DRL].) 


spatial relation Responding jointly to two stimuli on the basis of their 
juxtaposition in space. 


speaker Someone who engages in verbal behavior by emitting mands, tacts, 
intraverbals, autoclitics, etc. A speaker is also someone who uses sign lan- 
guage, gestures, signals, written words, codes, pictures, or any form of verbal 
behavior. (Contrast with listener.) 


split-middle line of progress A line drawn through a series of graphed data 
points that shows the overall trend in the data; drawn through the intersections 
of the vertical and horizontal middles of each half of the charted data and then 


adjusted up or down so that half of all the data points fall on or above and half 


fall on or below the line. 


spontaneous recovery A behavioral effect associated with extinction in 
which the behavior suddenly begins to occur after its frequency has decreased 
to its prereinforcement level or stopped entirely. 


stable baseline Data that show no evidence of an upward or downward trend; 
all of the measures fall within a relatively small range of values. (See also steady 
state responding.) 


Standard Celeration Chart A multiply—divide chart with six base-10 
(or X 10, + 10) cycles on the vertical axis that can accommodate response rates 
as low as 1 per 24 hours (0.000695 per minute) to as high as 1000 per minute. 
It enables the standardized charting of celeration, a factor by which rate of behav- 
ior multiplies or divides per unit of time. (See also semilogarithmic chart.) 


steady state responding A pattern of responding that exhibits relatively 
little variation in its measured dimensional quantities over a period of time. 


steady state strategy Repeatedly exposing a subject to a given condition 
while trying to eliminate or control extraneous influences on the behavior and 
obtaining a stable pattern of responding before introducing the next condition. 


stimulus “An energy change that affects an organism through its receptor 
cells” (Michael, 2004, p. 7). 


stimulus blocking Occurs when a competing stimulus blocks the evocative 
function of a stimulus that has acquired stimulus control over the behavior. 
(Sometimes called masking.) Stimulus blocking can be mitigated by rearranging 
the physical environment, making instructional stimuli appropriately intense, 
and consistently reinforcing behavior in the presence of the instructionally 
relevant discriminative stimuli. 


stimulus class A group of stimuli that share specified common elements 
along formal (e.g., size, color), temporal (e.g., antecedent or consequent), and/ 
or functional (e.g., discriminative stimulus) dimensions. 


stimulus control A situation in which the frequency, latency, duration, or 
amplitude of a behavior is altered by the presence or absence of an antecedent 
stimulus. (See also discrimination and discriminative stimulus.) 
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stimulus delta (S*) A stimulus in the presence of which a given behavior 
has not produced reinforcement, or has produced reinforcement of lesser quality, 
in the past. (Contrast with discriminative stimulus [S?].) 


stimulus discrimination When one stimulus (the S”) signals the availabil- 
ity of reinforcement and the absence of that stimulus (the S4) signals a zero 
or reduced chance of reinforcement, responses will occur more often in the 
presence of the S? than in its absence (the S4). 


stimulus discrimination training The conyentional procedure requires 
one behavior and two antecedent stimulus conditions. Responses are reinforced 


in the presence of one stimulus condition, the S°, but not in the presence of the 
other stimulus, the S*. 


stimulus equivalence The emergence of accurate responding to untrained 
and nonreinforced stimulus—stimulus relations following the reinforcement of 
responses to some stimulus—stimulus relations. Requires successful performances 
on three types of probe trials—reflexivity, symmetry, and transitivity—in the 
absence of reinforcement. (Also called equivalence-class formation.) 


stimulus fading A method of transferring stimulus control that involves 
highlighting a physical dimension of a stimulus (e.g., size, color, position) to 
increase the likelihood of a correct response and then gradually diminishing the 
exaggerated dimension until the learner is responding correctly to the naturally 
occurring stimulus. 


stimulus generalization When an antecedent stimulus has a history of 
evoking a response that has been reinforced in its presence, the same type of 
behavior tends to be evoked by stimuli that share similar physical properties with 
the controlling antecedent stimulus. 


stimulus generalization gradient A graphic depiction of the extent to 
which behavior that has been reinforced in the presence of a specific stimulus 
condition is emitted in the presence of other stimuli. The gradient shows relative 
degrees of stimulus generalization and stimulus control (or discrimination). 
A flat slope across test stimuli shows a high degree of stimulus generaliza- 
tion and relatively little discrimination between the trained stimulus and other 
stimuli; a slope that drops sharply from its highest point corresponding to the 
trained stimulus indicates a high degree of stimulus control (discrimination) and 
relatively little stimulus generalization. 


stimulus preference assessment A variety of procedures used to deter- 
mine the stimuli that a person prefers, the relative preference values (high versus 
low) of those stimuli, the conditions under which those preference values remain 
in effect, and their presumed value as reinforcers. 


stimulus prompts Prompts that operate directly on the antecedent task stim- 
uli to cue a correct response in conjunction with the critical SP (e.g., changing 
the size, color, or position of a stimulus within an array to make its selection 
more likely). 


stimulus—control topographies Refers to two different forms of stimulus 
control that can result from a match-to-sample procedure involving one sample 
stimulus and two comparison stimuli; when presented with Al as the sample, a 
participant can either select B1 (known as Type S or select responding) or reject 
B2 (known as Type R or reject responding). 


stimulus—stimulus pairing A procedure in which two stimuli are presented 
at the same time, usually repeatedly for a number of trials, which often results 
in one stimulus acquiring the function of the other stimulus. 


successive approximations The sequence of new response classes that 
emerge during the shaping process as the result of differential reinforcement; 
each successive response class is closer in form to the terminal behavior than 
the response class it replaces. 


surrogate conditioned motivating operation (CMO-S) A stimulus 
that acquires its MO effectiveness by being paired with another MO and has 
the same value-altering and behavior-altering effects as the MO with which it 
was paired. 


symmetry A type of stimulus-to-stimulus relationship in which the learner, 
without prior training or reinforcement for doing so, demonstrates the revers- 
ibility of matched sample and comparison stimuli (e.g., if A = B, then B = A). 
Symmetry would be demonstrated in the following matching-to-sample 
procedure: The learner is taught, when presented with the spoken word car 
(sample stimulus A), to select a comparison picture of a car (comparison B). 
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When presented with the picture of a car (sample stimulus B), without additional 
training or reinforcement, the learner selects the comparison spoken word car 
(comparison A). (See also stimulus equivalence; compare to reflexivity and 
transitivity.) 


systematic desensitization A behavior therapy treatment for anxieties, 
fears, and phobias that involves substituting one response, generally muscle 
relaxation, for the unwanted behavior—the fear and anxiety. The client practices 
relaxing while imagining anxiety-producing situations in a sequence from the 
least fearful to the most fearful. 


systematic replication An experiment in which the researcher purposefully 
varies one or more aspects of an earlier experiment. A systematic replication 
that reproduces the results of previous research not only demonstrates the reli- 
ability of the earlier findings but also adds to the external validity of the ear- 
lier findings by showing that the same effect can be obtained under different 
conditions. 


T 


tact An elementary verbal operant involving a response that is evoked by a 
nonverbal discriminative stimulus and followed by generalized conditioned rein- 
forcement. Tacting allows a speaker to identify or describe the features of the 
physical environment. The elements that make up one's physical environment 
are vast; thus, much of language instruction and educational programs focus 
on teaching tacts. 


tact extension Once a tact has been established, the tact response can occur 
under novel stimulus conditions through the process of stimulus generalization. 
Skinner (1957) identifies four different levels of generalization based on the 
degree to which a novel stimulus shares the relevant or defining features of the 
original stimulus. The four types of tact extension are generic, metaphorical, 
metonymical, and solecistic. 


taking dictation An elementary verbal operant involving a spoken verbal 
stimulus that evokes a written, typed, or fingerspelled response that does not 
have formal similarity between the stimulus and the response, but does have 
point-to-point correspondence and a history of generalized reinforcement. 


tandem schedule (tand) A schedule of reinforcement identical to the 
chained schedule except, like the mix schedule, the tandem schedule does not 
use discriminative stimuli with the elements in the chain. (See also chained 
schedule and mixed schedule.) 


target behavior The response class selected for intervention; can be defined 
either functionally or topographically. 


task analysis The process of breaking a complex skill or series of behaviors 
into smaller, teachable units; also refers to the results of this process. 


teach loosely Randomly varying functionally irrelevant stimuli within and 
across teaching sessions; promotes setting/situation generalization by reducing 
the likelihood that (a) a single or small group of noncritical stimuli will acquire 
exclusive control over the target behavior and (2) the learner’s performance of 
the target behavior will be impeded or “thrown off” should he encounter any of 
the “loose” stimuli in the generalization setting. 


teach enough examples A strategy for promoting generalized behavior 
change that consists of teaching the learner to respond to a subset of all the 
relevant stimulus and response examples and then assessing the learner’s perfor- 
mance on untrained examples. (See also multiple exemplar training.) 


temporal extent Refers to the fact that every instance of behavior occurs 
during some amount of time; one of the three dimensional quantities of behavior 
from which all behavioral measurements are derived. (See also repeatability 
and temporal locus.) 


temporal locus Refers to the fact that every instance of behavior occurs at 
a certain point in time with respect to other events (i.e., when in time behav- 
ior occurs can be measured); often measured in terms of response latency and 
interresponse time (IRT); one of the three dimensional quantities of behavior 
from which all behavioral measurements are derived. (See also repeatability 
and temporal extent.) 


temporal relation Responding jointly to two stimuli on the basis of their 
juxtaposition in time. 


terminal behavior The end product of shaping. 


terminate specific reinforcer contact A variation of nonexclusion 
time-out whereby each occurrence of the target behavior immediately stops an 
activity or sensory reinforcer. 


textual An elementary verbal operant involving a response that is evoked by 
a written verbal discriminative stimulus that does not have formal similarity 
between the stimulus and the response, but does have point-to-point correspon- 
dence and a history of generalized reinforcement. 


three-term contingency The basic unit of analysis in the analysis of operant 
behavior; encompasses the temporal and possibly dependent relations among an 
antecedent stimulus, behavior, and consequence. 


time delay A procedure for transferring stimulus control from contrived 
response prompts to naturally existing stimuli that begins with the simultane- 
ous presentation of the natural stimulus and response prompt. After several 
correct responses, a delay is introduced between the stimulus and the response 
prompt until the student emits the unprompted correct response. Time delay is 
considered an “errorless learning” technique as students make few or no errors 
transitioning from the contrived prompt to the instructional stimulus. (See also 
constant time delay and progressive time delay.) 


time sampling A measurement of the presence or absence of behavior within 
specific time intervals. It is most useful with continuous and high-rate behav- 
iors. (See also momentary time sampling, partial-interval recording, and 
whole-interval recording.) 


time-out from positive reinfercement The immediate response- 
contingent withdrawal of the opportunity to earn positive reinforcement or the 
immediate loss of access to positive reinforcers for a specified time; a form of 
negative punishment. (Also called time-out.) 


token An object or symbol that is awarded contingent on appropriate target 
behavior(s) that can be traded for a wide variety of backup reinforcers; tokens 
function as generalized conditioned reinforcers. 


token economy A behavior change system consisting of a list of target 
behaviors, with tokens (points or small objects) participants earn for emitting 
the target behaviors, and a menu of backup reinforcers (i.e., preferred items, 
activities, or privileges) for which participants exchange earned tokens. (Also 
called token reinforcement system.) 


topography The physical form or shape of a behavior. 


topography-based definition Defines instances of the targeted response 
class by the shape or form of the behavior. 


topography-based verbal behavior A category of verbal behavior in 
which the listener is affected by a specific response topography emitted by the 
speaker; includes (e.g., speech, sign language, writing, fingerspelling). (Contrast 
with selection-based verbal behavior) 


total count IOA The simplest indicator of IOA for event recording data; 
based on comparing the total count recorded by each observer per measurement 
period; calculated by dividing the smaller of the two observers’ counts by the 
larger count and multiplying by 100. 


total duration IOA A relevant index of IOA for total duration measurement; 
computed by dividing the shorter of the two durations reported by the observers 
by the longer duration and multiplying by 100. 


total-task chaining A variation of forward chaining in which the learner 
receives training on each behavior in the chain during each session. 


training structure Refers to dimensions of procedural arrangements when 
teaching multiple conditional discriminations. Commonly used training struc- 
tures include one-to-many training or sample-as-node training, many-to-one or 
comparison-as-node structure, and linear series training. 


transcription An elementary verbal operant involving a spoken verbal stim- 
ulus that evokes a written, typed, or finger-spelled response. Like the textual, 
there is point-to-point correspondence between the stimulus and the response 
product, but no formal similarity. 


transfer of function Occurs when teaching a new function for one member 
of an established equivalence class results in the same function holding for all 
members of the class. 


transformation of function Occurs when the behavioral function of one 


stimulus in a stimulus class changes as a predictable function of the behavior 
function of other stimuli in the class. 


transitive conditioned motivating operation (CMO-T) An environ- 
mental variable that, as a result of a learning history, establishes (or abolishes) 
the reinforcing effectiveness of another stimulus and evokes (or abates) the 
behavior that has been reinforced by that other stimulus. 


transitivity Describes derived stimulus—stimulus relations (e.g., A = C) 
that emerge as a product of training two other stimulus—stimulus relations 
(A = B and B = C),. Transitivity would be demonstrated if, after training, the 
following occurs: (1) If A (e.g., spoken word bicycle) = B (e.g., the picture of a 
bicycle) and (2) B (the picture of a bicycle) = C (e.g., the written word bicycle), 
then (3) A (the spoken name, bicycle) = C (the written word bicycle). (See also 
stimulus equivalence; compare to reflexivity and symmetry). 


treatment drift An undesirable situation in which the independent variable 


of an experiment is applied differently during later stages than it was at the 
outset of the study. 


treatment integrity The extent to which the independent variable is applied 
exactly as planned and described and no other unplanned variables are adminis- 


tered inadvertently along with the planned treatment. (Contrast with procedural 
fidelity.) 


treatment package A behavioral intervention consisting of multiple com- 
ponents (e.g., contingent praise, tokens, and extinction). 


trend The overall direction taken by a data path. It is described in terms of 
direction (increasing, decreasing, or zero trend), degree (gradual or steep), 
and the extent of variability of data points around the trend. Trend is used in 
predicting future measures of the behavior under unchanging conditions. 


trial-based functional analysis An analysis in which a series of trials is 
interspersed among classroom activities. Each trial consists of two 1-minute 
components: (a) the establishing operation and contingency for problem 
behavior (test condition), and (b) continuous access to the reinforcer (control 
condition). 


trial-by-trial IOA An IOA index for discrete trial data based on comparing 
the observers’ counts (0 or 1) on a trial-by-trial, or item-by-item, basis; yields 
a more conservative and meaningful index of IOA for discrete trial data than 
does total count IOA. 


true value A measure accepted as a quantitative description of the true state of 
some dimensional quantity of an event as it exists in nature. Obtaining true val- 
ues requires “special or extraordinary precautions to ensure that all possible 
sources of error have been avoided or removed” (Johnston & Pennypacker, 
1993a, p. 136). (Compare to observed value.) 


trials-to-criterion A special form of event recording; a measure of the 
number of responses or practice opportunities needed for a person to achieve a 
pre-established level of accuracy or proficiency. 


Type I error An error that occurs when a researcher concludes that the 
independent variable had an effect on the dependent variable, when no such 
relation exists; a false positive. (Contrast with Type II error.) 


Type If error An error that occurs when a researcher concludes that the 
independent variable had no effect on the dependent variable, when in truth it 
did; a false negative. (Contrast with Type I error.) 


U 


unchaining Occurs when, in the case of a two-step chain, the second behavior 
in the chain (R>) produces reinforcement in the presence of the prior Sa (So), 
but (R>) also produces reinforcement when that SP is not present. Unchaining 
may weaken a chain. 


unconditioned motivating operation (UMO) A motivating operation 
whose value-altering effect does not depend on a learning history. For exam- 
ple, food deprivation increases the reinforcing effectiveness of food without the 
necessity of any learning history. 

unconditioned negative reinforcer A stimulus that functions as a negative 


reinforcer as a result of the evolutionary development of the species (phylog- 
eny); no prior learning is involved (e.g., shock, loud noise, intense light, extreme 
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temperatures, strong pressure against the body). (See also negative reinforcer; 
compare to conditioned negative reinforcer.) 


unconditioned punisher A stimulus change that decreases the frequency 
of any behavior that immediately precedes it irrespective of the organism’ s 
learning history with the stimulus. Unconditioned punishers are products of the 
evolutionary development of the species (phylogeny), meaning that all members 
of a species are more or less susceptible to punishment by the presentation of 
unconditioned punishers. (Also called primary or unlearned punishers; compare 
to conditioned punisher.) 


unconditioned reflex An unlearned stimulus—response functional relation 
consisting of an antecedent stimulus (e.g., food in mouth) that elicits the response 
(e.g., Salivation); a product of the phylogenic evolution of a given species; all 
biologically intact members of a species are born with similar repertoires of 
unconditioned reflexes. (See also conditioned reflex.) 


unconditioned reinforcer A stimulus change that increases the frequency 
of any behavior that immediately precedes it irrespective of the organism’s 
learning history with the stimulus. Unconditioned reinforcers are the product of 
the evolutionary development of the species (phylogeny). (Also called primary 
or unlearned reinforcer; compare to conditioned reinforcer.) 


unconditioned stimulus (US) The stimulus component of an unconditioned 


reflex; a stimulus change that elicits respondent behavior without any prior 
learning. 


unpairing Two kinds: (a) The occurrence alone of a stimulus that acquired 
its function by being paired with an already effective stimulus, or (b) the occur- 
rence of the stimulus in the absence as well as in the presence of the effective 
stimulus. Both kinds of unpairing undo the result of the pairing: the occurrence 
alone of the stimulus that became a conditioned reinforcer; and the occurrence 
of the unconditioned reinforcer in the absence as well as in the presence of the 
conditioned reinforcer. 


unscored-interval IOA An interobserver agreement index based only on 
the intervals in which either observer recorded the nonoccurrence of the behay- 
ior; calculated by dividing the number of intervals in which the two observers 
agreed that the behavior did not occur by the number of intervals in which either 
or both observers recorded the nonoccurrence of the behavior and multiplying 
by 100. Unscored-interval IOA is recommended as a measure of agreement 
for behaviors that occur at high rates because it ignores the intervals in which 
agreement by chance is highly likely. (Compare to interval-by-interval IOA 
and scored-interval IOA.) 


V 


validity (of measurement) The extent to which data obtained from 
measurement are directly relevant to the target behavior of interest and to the 
reason(s) for measuring it. 


value-altering effect (of a motivating operation) Either (a) an increase 
in the reinforcing effectiveness of some stimulus, object, or event, in which 
case the MO is an establishing operation (EO); or (b) a decrease in reinforcing 
effectiveness, in which case the MO is an abolishing operation (AO). For 
example, the reinforcing effectiveness of food is altered as a result of food 
deprivation and food ingestion. 


variability The frequency and extent to which multiple measures of behavior 
yield different outcomes. 


variable baseline Data points that do not consistently fall within a narrow 
range of values and do not suggest any clear trend. 


variable interval (VI) A schedule of reinforcement that provides reinforce- 
ment for the first correct response following the elapse of variable durations 
of time occurring in a random or unpredictable order. The mean duration of 
the intervals is used to describe the schedule (e.g., on a VI 10-min schedule, 
reinforcement is delivered for the first response following an average of 10 
minutes since the last reinforced response, but the time that elapses following 
the last reinforced response might range from 30 seconds or less to 25 minutes 
or more). 


variable ratio (VR) A schedule of reinforcement requiring a varying 
number of responses for reinforcement. The number of responses required 
varies around a random number; the mean number of responses required for 
reinforcement is used to describe the schedule (e.g., on a VR 10 schedule an 
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average of 10 responses must be emitted for reinforcement, but the number of 
responses required following the last reinforced response might range from | 
to 30 or more). 


variable-interval DRO (VI-DRO) A DRO procedure in which 
reinforcement is available at the end of intervals of variable duration and 
delivered contingent on the absence of the problem behavior during the interval. 
(See also differential reinforcement of other behavior [DRO].) 


variable-momentary DRO (VM-DRO) A DRO procedure in which 
reinforcement is available at specific moments of time, which are separated 
by variable amounts of time in random sequence, and delivered if the problem 
is not occurring at those times. (See also differential reinforcement of other 
behavior [DRO].) 


variable-time schedule (VT) A schedule for the delivery of noncon- 
tingent stimuli in which the interval of time from one delivery to the next 
randomly varies around a given time. For example, on a VT 1-min schedule, 
the delivery-to-delivery interval might range from 5 seconds to 2 minutes, but 
the average interval would be | minute. 


verbal behavior Behavior whose reinforcement is mediated by a listener; 
includes both vocal-verbal behavior (e.g., saying “Water, please” to get water) and 
nonvocal-verbal behavior (pointing to a glass of water to get water). Encompasses 
the subject matter usually treated as language and topics such as thinking, gram- 
mar, composition, and understanding. 


verbal conditional discrimination (VC?) A type of convergent multiple 
control involving a verbal stimulus that alters the evocative effects of another 
verbal stimulus in the same antecedent configuration. The conditional discrim- 
ination is between the words in the antecedent event. 


verbal episode An interaction between a speaker and a listener. A speaker 
emits any type of verbal response (e.g., echoic, mand, tact, intraverbal), in any 
form (speech, sign language, icon selection, eye contact), and a listener (1) serves 
as an audience for a speaker, (2) provides reinforcement for a speaker, and 
(3) responds in specific ways to the speaker’s behavior. The roles of speaker and 
listener switch back and forth in an exchange, and usually involve covert speaker 
and listener behavior as well. 


verbal function-altering effect Verbal stimuli can alter the functional 
effects of immediate or future SD s and MOs and, accordingly, change a listener’s 
behavior. For example, being told “The bridge is out, turn left at the 7-Eleven and 
there will be another one in 5 miles” can alter the functional effects of stimuli 
encountered in the future and evoke verbal and nonverbal behavior at that time 
(e.g., tacting the 7-Eleven, turning left). 


verification One of three components of the experimental reasoning, or 
baseline logic, used in single-subject research designs; accomplished by 
demonstrating that the prior level of baseline responding would have remained 
unchanged had the independent variable not been introduced. Verifying the accu- 
racy of the original prediction reduces the probability that some uncontrolled 
(confounding) variable was responsible for the observed change in behavior. 
(See also prediction and replication.) 


video modeling A behavior change strategy in which the participant views a 
video of a model performing the target behavior and then imitates the behavior. 


video self-modeling A behavior change strategy in which the participant 
views a video of himself successfully performing the target behavior and then 
imitates his own model. 


visual analysis A systematic approach for interpreting the results of behav- 
ioral research and treatment programs that entails visual inspection of graphed 
data for level, trend, and variability within and between experimental conditions. 


W 


whole-interval recording A time sampling method for measuring behavior 
in which the observation period is divided into a series of brief time intervals 
(typically from 5 to 15 seconds). At the end of each interval, the observer records 
whether the target behavior occurred throughout the entire interval; tends to 
underestimate the overall percentage of the observation period in which the 
target behavior actually occurred. 


withdrawal design A term used by some authors as a synonym for A-B- 
A-B design; also used to describe experiments in which an effective treatment 
is sequentially or partially withdrawn to promote the maintenance of behavior 
changes. (See also A-B-A-B design and reversal design.) 
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CMOs. See Conditioned motivating operations 
CMO-S. See Surrogate CMO 
CMO-T. See Transitive CMO 
Codics, 459-460, 469, 482 
reinforcement for, 471 
Cognitive and Behavioral Practices, 150 
Cognitive psychology, private events in, 28 
Collaborative learning group (CLG), 128 
Combinatorial entailment, 545 
Commitment response, 737 
Common bidirectional naming (C-BiN), 
467, 476 
Comparative research question, 184 
Comparison 
social validity and, 265 
in three-phase multielement design, 213-215 
in two-phase multielement design, 213-215 
Comparison stimuli, in EBI, 497-498, 507 
Compliance Code, 812-835 
Component analyses, in experiments, 252-254 
Component research question, 186 
Compound schedules of reinforcement, 350-356 
Compound verbal discrimination, 469, 
483-484, 485 
Computer technology 
for graphical displays, 168 
for shaping, 593-594 
Concept, 443-444 
Concept acquisition, 443 
Concept formation, 443 
Conceptually systematic (meaning), for ABA, 32 
Concurrent chains (schedule) design, 
215-216 
social validity and, 264 
Concurrent contingencies, 59 
Concurrent schedule (conc) of reinforcement, 
308-309, 350-351 
Conditional discrimination, 440 
in EBI, 497 
teaching, 442-444 
Conditional probability 
ABC continuous recording for, 687 
correlation for, 19 
Conditional stimulus, in EBI, 497 


Conditional stimulus control, 440 
Condition change lines, 167, 173-175 
Conditioned motivating operations (CMOs), 
422-430 
mands and, 463 
Conditioned negative reinforcers, 325 
Conditioned punishers, 55-56, 367 
MOs and, 420 
Conditioned reflexes, 48 
Conditioned reinforcers, 55-56, 295 
CMO-T and, 427 
Conditioned stimulus (CS), 48 
Condition labels, for line graphs, 152, 167 
Conferences, 822 
Confidentiality, 826-828 
in behavioral assessment, 70 
Conflict of interest, 834 
Confounding variables, 182 
in experiments, 254-260 
functional relation and, 20 
internal validity and, 272-273 
Conjunctive schedule (conj) of reinforcement, 
354 
Consequence 
to behavior, 51 
for behavior-altering effects, 414 
in contingency space analysis, 688-689 
delayed 
for generalized behavior change, 788-791 
from reinforcement, 285-286 
in EBI, 517, 518 
FBA and, 680 
positive reinforcers and, 285 
punishment and, 365 
RC and, 404 
response and, 51-52 
selection by, 49-51 
of self-management, 752-757 
stimulus, 45 
in two-phase multielement design, 213 
Consequences affect, 51 
Constant time delay, 448 
Constructed response requirement, 520 
Consumer opinion, social validity and, 
264-265 
Contacts, 305 
Contextual control, in EBI, 507-510 
Contextual stimuli, 550-552 
Contiguous usage, 467 
Contingency, 57. See also specific types 
for behavior-altering effects, 414 
for positive reinforcement, 315 
for RC, 404-406 
of self-management, 761 
Contingency adduction, 58 
generalization and, 773 
Contingency contracts, 724-731 
Contingency-dependent antecedent interventions, 
662-663 
Contingency-independent antecedent 
interventions, 662 
Contingency reversal, 312, 684, 694 
Contingency-shaped behavior, 58 
Contingency space analysis, 687-689 
Contingent, 57 
Contingent attention, FA and, 681 
Contingent escape, FA and, 681 
Contingent exercise, as positive punishment, 378 
Contingent observation, 394-395 
Contingent tangibles, FA and, 682 


Contingent teacher praise, ABA and, 29 
Continuing education units (CEUs), 822 
Continuous behaviors 

MTS for, 112 

rate in, 99 
Continuous measurement, 126-127 
Continuous reinforcement (CRF), 337 

DRA and, 645 

extinction and, 635 

for generalized behavior change, 787 
Contrived contingency, 776 
Contrived free operant observation, 

304-305 

Contrived mediating stimulus, 795 
Control 

joint, 59 

in science, 19-20 
Controlled response, 735 
Controlling response, 735 
Convergent multiple control, 462 

tacts and, 465 
Copying text, 459, 469 
Correlated usage, 466-467 
Correlation, 182 

in science, 19 
Cost-benefit ratio, ethics and, 816 
Count 

duration and, 100 

repeatability and, 97 
Countercontrol, 817 
Counting time, 97 
CRA. See Criterion-referenced assessment 
CRF. See Continuous reinforcement 
Criterion records, IOA and, 130 
Criterion-referenced assessment (CRA), 72 
CS. See Conditioned stimulus 
Cumulative recorder, 156 
Cumulative records, 156-159 
Curriculum-based assessment (CBA), 72, 76 
Curriculum-based measurement (CBM), 266 
Curriculum design, ABA and, 29 


Data 

graphic display of, 148-175 

for imitation training, 576 
Data paths, for line graphs, 152-153, 168 
Data points 

on graphical displays, 170-171 

on line graphs, 167-168 

for multielement design, 209 

in multiple baseline design, 224 
Default interventions, 662 
Default technologies, FBA and, 680 
Definitional measures, 102-104, 107 
Deictic relations, 557 
Delay discounting, 737 
Delayed consequences 

for generalized behavior change, 

787-791 

from reinforcement, 285-286 
Delayed emergence, 503, 520 
Delayed match-to-sample, 520 
Delayed multiple baseline design, 232-235 
Delay schedule of reinforcement, 349-350 
Demonstration research question, 184 
Dependent group contingency, TalGh 
Dependent variable 

for behavior, 185 

experimental control and, 182 


in experiments, 21 
functional relation and, 20 
for internal validity, 271, 272 
for IOA, 139-140 
on line graph, 150 
in multiple probe design, 230 
trials-to-criteria for, 102 
Deprivation, 54 
UMOs and, 462 
Derivative measures, 101—102, 106-107 
Derived relations, 544—545 
RFT and, 562 
well-being and, 560-566 
Derived stimulus relations, 498 
Descending baseline, 188-189 
Description, in science, 19 
Descriptive autoclitic tacts, 477 
Descriptive functional behavior assessment, 
685-690 
Descriptive praise, 315 
Design analysis, internal validity and, 273 
Determinism, in science, 20 
Differential negative reinforcement (DNR), 
330 
Differential negative reinforcement of alternative 
behavior (DNRA), 643 
Differential negative reinforcement of other 
behavior (DNRO), 239 
Differential positive reinforcement, 330-331 
Differential reinforcement, 642-657 
defined, 642-643 
shaping and, 586, 587 
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(DRA), 293, 312, 643-647, 822 
FCT and, 668 
punishment and, 383 
RC and, 403-404 
Differential reinforcement of diminishing rates 
(DRD), 347-348 
Differential reinforcement of high rates (DRH), 
347, 660 
Differential reinforcement of incompatible 
behavior (DRI), 643 
DRO and, 652 
punishment and, 383 
Differential reinforcement of low rates (DRL), 
347, 653-657 
Differential reinforcement of other behavior 
(DRO), 312, 647-653 
NCR and, 665 
punishment and, 383 
token economy and, 708 
Dignity, 828-830 
Direct assessment, 72, 75, 76 
Direct measurement, 125 
Direct observations, 76 
Direct replication, external validity and, 269 
Discontinuous measurement, 126 
Discrete trials 
in EBI, 496 
event recording for, 104 
rate in, 98-99 
Discriminated avoidance, 323-324, 425 
Discriminated operants, 56-58 
reinforcement and, 289 
Discrimination 
compound verbal, 469, 484, 485 
conditional, 440 
auditory, 483 
in EBI, 497 
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teaching, 442-444 
verbal, 469-471, 484, 485, 486 
in EBI, 514 
for OL, 581 
simple, 522-523 
simple verbal, 469, 483, 485 
stimulus, 437-438 
stimulus training, 441442 
Discriminative stimulus (SP), 57, 437 
antecedent interventions and, 662 
behavior chains and, 620 
CMO-T and, 428-429 
differential reinforcement and, 586 
errorless learning and, 447 
MOs and, 416-418, 441, 462 
multiple control and, 470 
for punishment, 368 
shaping and, 592 
tacts and, 459, 465 
VB and, 468-469 
Distinction relations, 546 
Divergent multiple control, 462 
DNR. See Differential negative reinforcement 
DNRA. See Differential negative reinforcement of 
alternative behavior 
DNRO. See Differential negative reinforcement of 
other behavior 
“Don’t do it” teaching examples, 782—783 
Double-blind control, 257 
DRA. See Differential reinforcement of alternative 
behavior 
DRD. See Differential reinforcement of diminishing 
rates 
DRH. See Differential reinforcement of high rates 
DRL. See Differential reinforcement of incompatible 
behavior 
DRI/DRA reversal technique, 205-206 
Drive, 412 
DRL. See Differential reinforcement of low rates 
DRO. See Differential reinforcement of other 
behavior 
Drop-out component analysis, 253 
DRO reversal technique, 205 
Duplics, 459, 468-469, 482 
reinforcement for, 471 
Duration, 105 
count and, 100 
of response, 99-100 
Duration per occurrence, 99-100, 105 


EAB. See Experimental analysis of behavior 
EBI. See Equivalence-based instruction 
Echoics, 348, 459, 468 
Ecological assessment, 76-717 
Edible reinforcers, 296 
Educational reinforcement, 466 
EE. See Enriched environment 
Effective (meaning), for ABA, 32 
Effectors, 44 

operant behavior and, 50 

reflexes and, 47 
Effect size, internal validity and, 275 
Elementary verbal operants, 458-460 
Elimination phase, of DRA, 646 
Emergent stimulus relations, 498 
Emotions, operant behavior and, 59 
Empiricism, in science, 20 
Empowerment, for ABA, 33 
Enriched environment (EE), 671-672 
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Entailment, 544-545 
Environment, 44-47 
behavior and, 249 
block chains and, 622 
cumulative records and, 157 
in experiments, 255 
independent variable of, 186 
listeners and, 466 
for self-management, 745 
EOs. See Establishing operations 
Equal-interval scale, for line graphs, 151 
Equivalence-based instruction (EBI), 496-537 
applications, 524-537 
class expansion in, 503-505 
class formation in, 502-503, 519-520 
class merger in, 503-505 
class-specific reinforcement in, 521-522 
comparison stimuli in, 497-498, 507 
conditional discrimination in, 497 
conditional stimulus in, 497 
contextual control in, 507-510 
core concepts, 497-510 
designing, 510-524 
discrete trials in, 117 
discrimination in, 514 
exclusion in, 514-515 
four-term contingency in, 497 
matching-to-sample in, 497-498, 517 
naming theory and, 533-534 
reflexivity in, 498, 500 
response in, 512 
RFT and, 533, 535-536 
stimulus-control shaping in, 514-515 
stimulus in, 511-512 
symmetry in, 498-499 
task instructions in, 512-513 
testing protocols for, 517-520 
three-term contingency in, 522-524 
training structure for, 516-517 
transfer of function in, 505-507 


creating culture for, 834-835 
defined, 813-817 


dignity and, 828-830 

informed consent and, 823-826 

of negative reinforcement, 332 

of punishment, 384-386 

of reversal design, 207 

social media and, 827, 830-831 

social validity and, 816 
Event recording, 104, 104, 108 
Evocative effects, 413, 414, 421-422 

UMOs and, 419 
Exact count-per-interval, 135 
Exclusion, in EBI, 514-515 
Exclusion time-out, 395-396 
Experiment 

in ABA, 34, 184-187, 247-277 

component analyses in, 252—254 

confounding variables in, 254—260 

design for, 250-254 

external validity in, 267-271 

internal validity in, 254-260 

procedural fidelity in, 257 

in science, 20-21 

social validity of, 261-267 

treatment integrity in, 257 

treatment package in, 252 
Experimental analysis of behavior (EAB), 22, 25-26 

domains of, 35 

high-tech measurement tools for, 117-118 
Experimental control, 182, 186, 255 
Experimental design, 187 
Experimental question, 184, 246, 512 
Expert evaluation, social validity and, 265 
Explanatory fiction, 27-28, 735 
EXT. See Extinction 
External validity, 182 

in ABA, 276 

in experiments, 267-271 


FBA. See Functional behavior assessment 
FCT. See Functional communication training 
Feature stimulus class, 444 
Feedback 

for DRL, 657 

for modeling, 577 

reinforcement and, 288-289 
FGA. See Fractional graph area 
FI-DRO. See Fixed interval DRO 
Figure captions, for line graphs, 152, 168 
Five-term contingency, 507 
Fixed-interval DRO (FI-DRO), 647-648 
Fixed interval (FI) schedule, of reinforcement, 

343-344 

Fixed-momentary DRO (FM-DRO), 649-650 
Fixed ratio (FR) schedules 

for DRA, 645-646 

for reinforcement, 338 

token economy and, 708 
Fixed-time (FT) schedule, for NCR, 665 
FM-DRO. See Fixed-momentary DRO 
FO. See Free operants 
Formal similarity, 458-460, 469, 572 
Formative assessment, measurement in, 95 
Forward chaining, 610-611 
Four-term contingency, 442 

in EBI, 497 
FR. See Fixed ratio schedules 
Fractional graph area (FGA), 150 
Fraudulent conduct, 828 
Free-operant avoidance, 323-324 
Free operants (FO), 98 

for SPA, 304, 307 
Frequency, 122 
FT. See Fixed-time schedule 
Full-session DRL, 653-654 
Functional analysis (FA), 681-685 

of behavior, 44 

for experiments, 20-21 


Exteroreceptors, 45 
Extinction (EXT), 628-638 
attention, 631 
automatic reinforcement and, 632-633 
of behavior chains, 616 
CRF and, 338 


Functional Analysis Screening Tool (FAST), 691 
Functional Assessment Checklist for Teachers and 
Staff (FACTS), 72-74 
Functional behavior assessment (FBA), 93, 678-702 
behavior rating scales for, 691 
case examples for, 693-702 


transitivity in, 499 

trial types in, 513-514 
Equivalence-class formation, 500 
Equivalence tests, 499 
Errorless learning, 447 


Escape, 53 defined, 628-629 conducting, 691-693 
FA and, 681 effective use of, 636-638 hypothesis testing for, 692 
FBA and, 680 FCT and, 350 indirect, 690-691 
NCR and, 663 functional forms of, 629 methods, 681-691 


punishment and, 375 
tact and, 465 
Escape contingency, 323-324 
Escape extinction, 631-632 
Establishing operations (EOs), 412 
CMO-R and, 426-427 
CMO-T and, 429 
for negative reinforcement, 321, 325, 326, 
328-330, 332 
for positive reinforcement, 290, 307 
and discriminative stimulus (S°), 440 
Establishing stimulus (SE), CMO-T and, 427 
Ethical codes of behavior, 816 


negative reinforcement and, 330, 631-632 
positive reinforcement and, 630-631 
of reinforcement, 54 
respondent, 48-49 
secondary effects of, 633-635 
spontaneous recovery and, 634 
variables for, 635-636 
when not to use, 638 
Extinction burst, 633-634 
Extinction-induced variability, 587, 634 
Extraneous variables, 186 
shaping and, 597 


NCR and, 663 
Functional communication training (FCT) 

for antecedent interventions, 668-67 1 

brief functional analysis and, 693-695 

DRA and, 660 

extinction and, 350 

line graphs for, 152 

punishment and, 383 

social validity and, 264 

systematic replication and, 270 
Functionally equivalent, 692 
Functional relation, 20 
Function-altering effects, 421 


Ethics FA. See Functional analysis behavior-altering effects and, 417 
in ABA, 812-835 Facilitative reinforcement, 471 Function-based definition, for target behaviors, 
advocating for client and, 831-834 FACTS. See Functional Assessment Checklist for 86-87 


in client services, 822-828 

confidentiality and, 69, 826-828 

conflict of interest and, 834 

coordinating with other professionals, 828-830 
cost-benefit ratio and, 816 


Teachers and Staff 
Fading. See Stimulus fading 
FAST. See Functional Analysis Screening Tool 
Fatalism, 20 
Faulty stimulus control, 439-440, 773 


Fundamental measurements, 96-100, 105—106 


GBG. See Good Behavior Game 
General case analysis, 779-781 
Generality (meaning), for ABA, 32-33 


Generalization 
confusing terminology of, 774 
EBI for, 528 
maps, 773 
mediation of, 795-799 
from multielement design, 217 
training for, 799-801 
Generalization across subjects, 773 
Generalization probe, 778 
Generalization setting, 710-771 
instructional setting and, 783-785 
reinforcement in, 785-795 
Generalized behavior change, 768-753 
behavior traps for, 791, 794 
delayed consequences for, 787-791 
generalization mediation for, 795-799 
intermittent reinforcement for, 787 
modifying or terminating interventions for, 
801-803 
planning for, 775-777 
promotion of, 803-806 
reinforcement for, 794-795 
as relative, 772-773 
response for, 777—783 
self-management for, 799 
stimulus for, 777—783, 795-798 
strategies and tactics for, 777-801 
target behaviors for, 775-777 
undesirable possibilities of, 773 
Generalized conditioned punishers, 367 
Generalized identity matching, 498 
Generalized imitation, 574 
Generative learning, 467 
Generic (tact) extension, 465 
Generic positive statements, 88 
GES. See Grade equivalent score 
Good Behavior Game (GBG), 720-722, 723 
Good Student Game, 722, 723 
Grade equivalent score (GES), in norm-referenced 
tests, 75 
Graduated guidance, 447 
Graphical displays 
for ABA, 150-164 
for contingency space analysis, 688 
of data, 148-175 
for internal validity, 271 
interpretation of, 168-175 
for IOA, 139-140 
level of, 171-172 
for punishment, 383-384 
of stimulus generalization gradient, 438 
of treatment integrity, 260 
trends and, 172-173 
variability in, 171 
visual analysis of, 173-175 
Graphs, 148 
Group contingencies, 716-724 
Guardian consent, 824 
Guilt control, 748 


Habilitation, 78-79 

Habit reversal, 758 

Habituation, 48 

Hallway time-out, 395 

Headsprout Comprehension, 266 

Health Insurance Portability and Accountability 
Act (HIPAA), 827 

Hero procedure, 717 

Hierarchical relations, 555-557 


Higher-order conditioning, 49 
Higher-order operant class, 533 
High-preference (HP) stimuli, 305, 308-309 
High-probability instructional sequence, for 
antecedent interventions, 667-668 
HIPAA. See Health Insurance Portability and 
Accountability Act 
History of reinforcement, 60 
Horizontal axis (x-axis) 
for line graphs, 150, 166 
for ratio charts, 159 
HP. See High-preference stimuli 
Hypothetical constructs, in behaviorism, 27 


I-BiN. See Intraverbal bidirectional naming 
ICP. See Interrupted chain procedure 
Identity matching-to-sample, 497, 520 
IISCA. See Interview-informed synthesized 
contingency analysis 
Imitation, 571-576 
motor, 459, 468-469 
for OL, 581 
Imitation training, 573-576 
Imitative prompts 
for shaping, 592 
for VB, 463, 469 
Impulse control, 737 
Independent group contingency, 717 
Independent variable 
baseline and, 189 
confounds, 256-260 
of environment, 186 
experimental control and, 182 
in experiments, 20-21 
functional relation and, 20 
on line graph, 150, 154 
in multiple baseline design, 225-226, 235-237 
replication and, 193 
verification and, 191-192 
Indirect assessment, 70-72 
Indirect functional assessment, 689-691 
Indirect measurement, 125-126 
Indiscriminable contingency, 787-791 
Individual differences, in behavior, 59-60 
Informed consent, 823-826 
Instructional setting, 770 
generalization setting and, 783-785 
INT. See Intermittent reinforcement 
Intelligence, 557-558 
Intelligence quotient (IQ), 265 
Intensity. See Magnitude 
Interdependent group contingency, 717-7122 
Intermittent punishment, 372-373, 382 
Intermittent reinforcement (INT), 54 
extinction and, 635 
for generalized behavior change, 787 
Internal validity, 182 
in ABA, 271-274 
in experiments, 254-255 
Interobserver agreement (IOA), 128, 132-141 
accuracy of, 140-141 
benefits of, 132-133 
calculating, 138-139 
criterion records and, 130 
dependent variable for, 139 
Interval-by-interval, 136 
IRT and, 136 
mean duration-per-occurrence, 135-136 
for measurement by permanent products, 133 
reliability of, 140-141 
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scored-interval, 137 

from time sampling, 136 

total count, 134 

total duration, 135 

trial-by-trial, 135 

unscored-interval, 137 

variables in, 136-137 
Interoceptors, 45 
Interresponse time (IRT), 97-98, 100-101, 106, 

115-116 

DRH/DRL and, 347 

for DRL, 656 

high-tech measurement tools for, 117 

IOA and, 135 

latency and, 109 

spaced-responding DRL and, 654—656 
Interrupted chain procedure (ICP), 429, 619-620 
Intersubject direct replication, 269 
Interval-by-interval IOA, 136 
Interval DRL, 654 
Interval DRO, 647-649 
Intervention Rating Profile (IRP-5), 261 
Interview-informed synthesized contingency 

analysis (IISCA), 684 

Interviews, 70-72 

for indirect functional assessment, 689-690 
In-the-moment reinforcer analysis, 308 
Intrasubject direct replication, 269 
Intraverbal bidirectional naming (I-BiN), 471-473 
Intraverbals, 348, 460, 469, 485-486 

autoclitic frames for, 478 

reinforcement for, 471 
IOA. See Interobserver agreement 
IQ. See Intelligence quotient 
IRA-per-response IOA, 135 
IRP-5. See Intervention Rating Profile 
Irreversibility 

in multielement design, 216 

reversal design and, 206 
IRT. See Interresponse time 
Isolating behavior-altering effects, 413-414 
Isolating value-altering effects, 413-414 


JABA. See Journal of Applied Behavior Analysis 
JEAB. See Journal of the Experimental Analysis 
of Behavior 
Joint control, 59 
listeners and, 467 
Journal of Applied Behavior Analysis (JABA), 276 
on adjunctive behaviors, 356-357 
on functional analysis, 682 
graphical displays in, 150-151 
on replications, 271 
on visual analysis, 275 
Journal of the Experimental Analysis of Behavior 
(JEAB) 
graphical displays in, 150-151 
on visual analysis, 275 


Lag schedule, of reinforcement, 348 
Language 
assessment of, 479-488 
EBI for, 524-525 
form and function of, 456-458 
Latency. See Response latency 
Latency-based functional analysis, 684-685 
case example, 696-697 
Learned helplessness, transformation of function 
and, 548 
Learning, ABA and, 29 
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Least restrictive alternative 
for punishment, 384—385 
RC and, 402 
Least-to-most response prompts, 447-448 
Level, of graphical displays, 171-172 
Level systems, for token economy, 709-710 
Limited hold, for interval reinforcement 
schedules, 345 
Linear charts, 160-161 
Line graphs, 150-155 
constructing, 164-168 
Listener, 473-476, 483-484 
mands and, 460-463 
VB of, 457-458 
Listener discrimination, 460 
Local response rate, 157 
Locus of control, self-management and, 734-735 
Low-preference (LP) stimuli, 305, 308-309 


Magnitude, 104, 107 
Maintenance of behavior, 337-338 
Mands, 459, 477, 479-480, 487 
autoclitic, 477 
CMOs and, 462 
listeners and, 460-463 
MOs and, 460-461 
specific reinforcement and, 462 
UMOs and, 462 
MARS. See Motivation Analysis Rating Scale 
MAS. See Motivation Assessment Scale 
Massed practice, 758-7159 
Mastery level 
in EBI, 516 
task analysis for, 607-608 
Matching law, 351 
Matching-to-sample (MTS), 442-443, 
460, 473 
in EBI, 497-498, 517 
A Matter of Consequences (Skinner, B. F.), 291 
Maturation, in experiments, 255 
Mean count-per-interval IOA, 134-135 
Mean duration-per-occurrence IOA, 135-136 
Mean latency-per-response IOA, 136 
Mean level line, 171-175 
Measurement 
accuracy of, 124, 130-132 
artifacts, 126 
of behavior, 94-119 
for behavior analysis, 94-96, 186 
confounds, 256 
continuous, 126 
defined, 94-95 
definitional, 102-104, 107 
derivative, 101-102, 106-107 
direct, 125 
discontinuous, 126-127 
fundamental, 96-100, 105-106 
high-tech tools for, 117-118 
improving and assessing quality of, 123-141 
indirect, 125-126 
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low-tech tools for, 117 
methods for, 104-113 
selection of, 118-119 
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reliability of, 124, 132 
temporal locus in, 100-101, 116 
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timing of, 109 
validity of, 123-125 
Measurement bias, 124 
Measurement by permanent products, 114-116 
IOA for, 134 
for target behaviors, 116 
Median level line, 172-175 
Mediation, of generalization, 795—801 
MEI. See Multiple exemplar instruction 
Mentalism, 27 
Meta-analysis, for internal validity, 275 
Methodological behaviorism, 28 
Metonymical (tact) extension, 465 
Mindfulness, ACT and, 563-565 
Mixed-schedule, 160 
MLT. See Model-lead-test 
Modeling, 206, 258, 375, 576-579 
Model-lead-test (MLT), for token economy, 714 
Models, for imitation, 571-576 
Momentary DRO, 649-651 
Momentary time sampling (MTS), 113 
MOs. See Motivating operations 
Most-to-least response prompts, 447 
Motivating operations (MOs), 54, 412-431 
antecedent interventions and, 662 
aversive stimulus and, 425 
behavior analysis and, 430-431 
CMOs, 422-430 
mands and, 463 
definition and characteristics, 411-416 
direct and indirect effects of, 414-416 
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FA and, 681-682 
FCT and, 668 
mands and, 460-461, 481 
multiple effects of, 421-422 
for punishment, 420-421 
reinforcement and, 289-290 
SP and, 416-418, 440 
for self-management, 74 
treatment effects of, 430 
UMOs, 418-420 
Aversive stimulus and, 462 
CMO-S and, 423 
CMO-T and, 427, 428 
mands and, 461, 480 
for VB, 460-464, 489 
Motivation Analysis Rating Scale (MARS), 691 
Motivation Assessment Scale (MAS), 691 
Motor imitation, 459, 468-469 
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MSWI. See Multiple stimuli with replacement 
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Multielement design, 207-216 
advantages of, 216-217 
appropriateness of, 217-218 
concurrent chains (schedule) design, 217-218 
generalization from, 217 
irreversibility in, 217 
multiple treatment interference and, 217-218 
variations of, 210-216 
Multiple baseline across behaviors design, 226 
Multiple baseline across settings design, 226-228 
Multiple baseline across subjects design, 228-230 
Multiple baseline design, 223-238 
appropriateness of, 238 
assumptions and guidelines of, 235-237 
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delayed, 232-235 
multiple probe design, 230-232 
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operation and logic of, 224-229 
variations of, 230-235 
Multiple control, 462 
S? and, 470 
tacts and, 481 
Multiple exemplar instruction (MEI), 487, 
552-555, 779 
Multiple-opportunity method, for task analysis, 
608-610 
Multiple probe design, 230-232 
Multiple schedules (multi) of reinforcement, 309, 
351-353 
Multiple stimuli without replacement (MSWO), 307 
Multiple stimuli with replacement (MSWI), 307 
Multiple treatment interference, 217-218 
Multiple treatment reversal design, 203—204 
Mutual entailment, 544-545 


Naive observers, 129 
Naming theory, EBI and, 530-531 
Narrative description, for IOA, 139 
Narrative recording, 687, 689, 697-699 
National Institute of Standards and Technology, 144 
Naturalistic free operant observation, 305 
Naturally existing contingency, 775 
Naturally occurring reinforcement, 338 
NCR. See Noncontingent reinforcement 
Needs assessment survey, 72 
Negative punishment, 54, 392-407 
contingent observation and, 394 
exclusion time-out and, 395-396 
nonexclusion time-out and, 393-395 
partition/select space time-out and, 395 
planned ignoring and, 393 
positive punishment and, 366 
RC and, 401-407 
terminate specific reinforcer contact and, 393-394 
time-out from positive reinforcement and, 392-396 
Negative reinforcement, 53-54, 321-332 
applications of, 327-331 
automatic, 325 
FBA and, 678 
avoidance contingency and, 323-324 
characteristics of, 325 
defined, 321-323 
DNR, 330 
DNRA, 643 
DNRO, 239 
escape contingency and, 323-324 
ethics of, 332 
extinction and, 330, 330-331 
FBA and, 679-680 
positive reinforcement and, 322-323 
punishment and, 323, 376 
self-management and, 754-755 
social, 325 
source of, 325 
Negative reinforcers, 325 
Negligence, 828 
Neutral stimulus (NS), 48 
NI. See Number Index 
Nodal stimulus, in EBI, 516 
Nodes, in EBI, 516 
Noncontingent reinforcement (NCR), 312 
ABA and, 664 
for antecedent interventions, 663—665 
automatic reinforcement for, 663 


escape and, 663 

extinction and, 629 

FBA and, 664 

FT for, 665 

positive reinforcement and, 663 
punishment and, 382 

reversal design, 204-205 
schedule thinning for, 665 

VT for, 665-666 


Noncurrent multiple baseline across participants, 


447 


Nondiscriminated schedules of reinforcement, 
353-354 
Nonequivalence relations, 541-566 


big-picture psychological constructs and, 555-560 


explained, 542-544 

planning interventions for, 550 

stimulus classes and, 550-551 

theoretical foundations of, 550-555 

types of, 546-550 

vocabulary of, 544-546 
Nonexclusion time-out, 393-395 
Nonverbal stimuli, tacts and, 465-466 


Norm-referenced tests, 75 

Novel models, for imitation training, 
575-576 

NS. See Neutral stimulus 

Number Index (NI), for SITS, 266 
Number skills, EBI for, 525-527 


Observational learning (OL), 580-582 
Observed value, 124 
Observer drift, 129-130 
Observer reactivity, 129 
Observers. See also Interobserver agreement 
expectations of, 129 
for measurement, 127-129 
reactivity of, 129-130 
Oddity matching-to-sample, 497 
OL. See Observational learning 
On-task behavior, 82, 258 
contingencies for, 692 
duration of, 99 
measurement period for, 127 
RC for, 214 
reinforcers for, 314 
whole-interval recording for, 109 
Ontogeny, 50 
Open-ended indirect assessment, 70 
Open-ended questions, for SPA, 303 
Operant behavior, 49-50 
antecedent stimulus class and, 440 
emotions and, 59 
Operant chains 
CMO-T and, 428 
MOs and, 421-422 
Operant conditioning, 51 
history of reinforcement for, 59 
Operant learning, 58 
Opposite relations, 548 
Origin of Species (Darwin), 60 
Overall response rate, 157 
Overcorrection, as positive punishment, 
378-380 
Overgeneralization, 773 
Overselective stimulus control, 446 
Overshadowing, 447 
Oxford English Dictionary, 131 


Normalization, of age-appropriate behavior, 82 
Normative comparison, social validity and, 265 


Paired stimuli. See Stimulus-stimulus pairing 
Parametric analysis, 187 

Parametric research question, 185 
Parsimony, 2\—22 

Parsons Language Sample, 478 
Partial-interval recording, 110 
Partition/select space time-out, 395 


Peabody Individual Achievement Test-R/NU, 72 
PEAK. See Promoting the Emergence of Advanced 


Knowledge 

Peer review, 822 
Percentages, 101-102, 106-107 

in discontinuous measurement, 126 

in partial-interval recording, 110-111 
Percentile rank, in norm-referenced tests, 75 
Percentile schedules, for shaping, 593-594 
Performance dimensions, shaping and, 589 
Permanent products. See Measurement by 

permanent products 

Personal restraint, 672 
Perspective shifting, 557 
Philosophic doubt, in science, 22 
Phonemic variability, 348 
Phylogeny, 50 
Physical guidance, 445 

for imitation training, 574, 575 

for shaping, 591 
Pivotal behavior, 81-82 
Placebo control, 256 


Planned activity check (PLACHECK), 111-112 


Planned ignoring, 393 
Planned models, 571-572 
Point-to-point correspondence, 460 
Portable Operant Research and Teaching Lab 
(PORTL), 598 
Positive practice overcorrection, 379-380 
Positive punishment, 54, 364-387 
contingent exercise as, 378 
negative punishment and, 366-367 
overcorrection as, 378-380 
reprimands as, 376 
response blocking as, 376-377 
RIRD as, 378 
Positive reinforcement, 53, 284-316. See also 
Time-out from positive reinforcement 
automatic, 679 
contingency for, 320 
control procedures for, 311-312 
defined, 316-317 
differential, 330-331 
effective use of, 312-316 
extinction and, 630-631 
FBA and, 678 
NCR and, 663 
negative reinforcement and, 322-323 
RC and, 404 
self-management and, 753-754, 755 
in shaping, 588-589 
social, 679 
for target behavior, 314 
time-out from, 392 
Positive reinforcer, 285 
Postassessment, for imitation training, 575 
Postreinforcement pause, 22, 187, 338-347 


PR. See Progressive-ratio schedule of reinforcement 


Practice effects, 190 

Pragmatism, 28 

Preassessment, for imitation training, 575 
Preattending skills, 446 

Precision teaching, 162 
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Precursors, FA of, 685, 697 
Prediction, 189-190 
in changing criterion design, 238 
in multielement design, 208 
in multiple baseline design, 224 
in reversal design, 199-200 
in science, 18-19 
Premack principle, 296 
Prestige, modeling with, 577 
Pretask choice, for SPA, 304 
Pretesting, for imitation training, 575 
Prevention, FBA and, 680-68 1 
Primary-target-only condition, in two-phase 
multielement design, 212 
Principle of behavior, 55 
Private events 
ACT and, 563-564 
in cognitive psychology, 28 
indirect measurement for, 125 
MOs and, 463 
reinforcers and, 560-563 
stimulus control and, 471 
tacts and, 465, 482 
Probe-type trials, in EBI, 498-499, 517-518 
Procedural fidelity, in experiments, 257 
Professional and Ethical Compliance Code for 
Behavior Analysts, of BACB, 816 
Professional competence, 822 
Professional standards, 818-822 
Programmed instruction, ABA and, 29 
Programming common stimuli, 783, 785 
Progressive delay, 737 
Progressive-ratio schedule (PR) of reinforcement, 
309-311, 348-349 
Progressive time delay, 448 
Promoting the Emergence of Advanced 
Knowledge (PEAK), 550 
Proportional measurement, 102 
Proprioceptors, 45 
Protective equipment restraint, 672 
Public, ABA and, 33 
Punishers, 51 
assessments, 381 
conditioned punishers, 55-56, 367 
MOs and, 420 
stimulus and, 54—56 
transformation of function and, 548 
types of, 367-369 
unconditioned, 54, 367, 387 
MOs and, 420 
Punishment, 54. See also Negative punishment; 
Positive punishment 
automatic, 366, 466 
aversive stimulus as, 425 
avoidance and, 375 
behavioral contrast and, 375 
defined, 365-366 
DRO and, 652 
effectiveness of, 370-374 
escape and, 375 
ethics of, 384-386 
guidelines for, 380-381 
history of reinforcement for, 60 
intermittent, 371-372, 382 
least restrictive alternative for, 384-385 
MOs for, 420-421 
negative reinforcement and, 323, 376 
recovery from, 369 
reinforcement and, 373-374 
response to, 373 
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Punishment (continued) 
response-to-punishment delay, 383 
SP for stimulus, 368 
side effects and problems with, 375-376 
Puns, 508 


QABEF. See Questions About Behavioral Function 
Quantifying autoclitic tacts, 477 
Questions About Behavioral Function (QABF), 691 


R. See Response 
Radical behaviorism, 26-29, 34 

domains of, 35 
RAISD. See Reinforcer Assessment for 

Individuals with Severe Disabilities 

Random control trial (RCT), 269 
Range-bound changing criterion design, 240-241 
Ranking matrix, for target behaviors, 85—86 
Rank-ordering, 304 
Rate, 105 

reinforcer, 314 

repeabability and, 97-99 
Ratio charts, 159-162 
Ratio scale, 160 
Ratio schedules, for reinforcement, 338 
Ratio strain, of schedules of reinforcement, 345-346 
RC. See Response cost 
RCT. See Random control trial 
Reactivity 

to behavioral assessment, 76 

of observers, 129 
Real-world tests, social validity and, 267 
Recombinative generalization, 467 
Recovery from punishment, 369-370 
Reflexes, 47, 48-49 
Reflexive CMO (CMO-R), 425-427 
Reflexivity, 545 

in EBI, 498, 499 
Reinforcement, 53-54. See also specific types 

antecedents and, 286, 288-289 

arbitrary behavior and, 292 

circular reasoning and, 286-287 

for FCT, 671 

feedback and, 288-289 

FI of, 343-344 

FR for, 338 

in generalization setting, 785-795 

for generalized behavior change, 794 

history of reinforcement for, 59 

in imitation training, 576 

immediacy of, 285—286 

lag schedule of, 348 

for modeling, 580 

MOs and, 289-90 

phase, of DRA, 645 

PR of, 349 

punishment and, 373 

ratio schedules for, 338 

reserve, RC and, 405 

of self-monitoring, 747-748, 751 

for shaping, 596 

for target behaviors, 78, 85, 373 

traps, 738 

for VB, 471 

VI for, 344 

VR for, 340-343 
Reinforcer-abolishing effects, 419 
Reinforcer assessment, 308-311 
Reinforcer Assessment for Individuals with Severe 

Disabilities (RAISD), 312-313 


Reinforcer-establishing effects, 419 
Reinforcer-probe trials, 521-522 
Reinforcers, 51, 286-287. See also specific types 

classification of, 294-299 

CMO-R and, 425 

for DRA, 644-645 

for high-probability instructional sequence, 667 

identifying, 299-311 

for NCR, 665 

for on-task behavior, 314 

private events and, 560-562 

rate, 313 

stimulus and, 54-56 

transformation of function for, 549-550 

unconditioned, 325 
Relational autoclitic tacts, 477 
Relational frames, 554 
Relational frame theory (RFT), 552-555, 

Sys, oe) 

derived relations and, 562 

EBI and, 533, 535-537 

generalization and, 773 

listeners and, 467 
Relative reinforcement 308-309 
Relevance of behavior rule, 79, 766 
Reliability 

of IOA, 140-141 

of measurement, 124, 132 
Repeatability, of measurement, 97-98 
Repertoire, 44 

complexity of, 59 

history of reinforcement for, 60 

operant behavior and, 50 

response, shaping and, 596 

RFT for, 552-555 

shaping and, 596 
Repertoire-altering effect, 416 
Replication, 192-193 

in changing criterion design, 238 

external validity and, 268-271 

in multielement design, 208 

in multiple baseline design, 226, 235 

in reversal design, 199 

in science, 20 
Reprimands, as positive punishment, 376 
Research question, 184-185 
Resistance to extinction, 635 
Respondent behavior, 47-49 
Respondent conditioning, 48 
Respondent extinction, 48-49 
Response (R), 22. See also specific types 

anxiety and, 324-325 

behavior and, 44 

block chains and, 622 

complexity of, 98 

consequences and, 51-52 

duration of, 99 

in EBI, 512 

for generalized behavior change, 777-783 

mands and, 480-481 

to punishment, 373 

rates of, 97 

repertoire, shaping and, 596-597 
Response blocking 

extinction and, 629-630 

as positive punishment, 376-378 
Response class, 44 

measurement by permanent products for, 114 
Response cost (RC) 

aggression and, 406 


consequences and, 404 
contingency for, 406 
DRA and, 404 
least restrictive alternative and, 402 
MOs and, 420-421 
negative punishment and, 401-407 
positive reinforcement and, 404 
reinforcement reserve and, 405 
social validity of, 404 
in token economy, 713 
treatment integrity of, 407 
Response-deprivation hypothesis, 297 
Response differentiation, 587 
Response generalization, 769-770 
Response interruption and redirection (RIRD), 
as positive punishment, 378 
Response latency, 100, 106 
IRT and, 108 
Response maintenance, 769 
setting/situation generalization and, 769-770 
Response prompts, 314 
for imitation training, 575 
least-to-most, 447-448 
most-to-least, 447 
for self-management, 743-744 
for self-monitoring, 750-751 
for stimulus control, 444-445 
Response topography, 44 
shaping and, 589-590 
Response-to-punishment delay, 382 
Response-to-reinforcement delay, 285—286, 289, 315 
Response variation, 634 
Restitutional overcorrection, 378-379 
Restraint, 672-673 
Resurgence, 634-635 
of DRA, 645 
Reversal design 
A-B-A-B design, 199-201 
appropriateness of, 206-207 
B-A-B-A design, 202—206 
B-A-B design, 202-260 
delayed multiple baseline design and, 233 
DRI/DRA, 205 
DRO, 205 
educational and clinical issues of, 207 
ethics of, 207 
irreversibility and, 206-207 
multiple treatment, 203-204 
NCR, 204-205 
social concerns for, 207 
RFT. See Relational frame theory 
Ripple effects, 773 
RIRD. See Response interruption and redirection 
Risk-benefit analysis, 825 
Robotics engineering, shaping and, 
594-595 
Rule-governed behavior, 58 
reinforcement and, 289 
well-being and, 562 


S. See Stimulus 
Salience, of stimulus, 446 
Satiation, of behavior chains, 616 
SB. See Selection-based verbal behavior 
Scatterplots, 162-164 
recording, 689 
Schedules of reinforcement, 337-357 
in applied settings, 356-357 
for block chains, 629 
delay to, 349-350 


ratio strain of, 345-346 
schedule thinning of, 345-346 
Schedule thinning 
DRO and, 652 
for FCT, 670 
for NCR, 665 
of schedules of reinforcement, 345-346 
School Reinforcement Survey Schedule, 304 
Science, 18—22 
Science and Human Behavior (Skinner, B. F.), 27, 
479, 586, 735 
Scientific generality. See External validity 
Scored-interval IOA, 137 
SP. See Discriminative stimulus 
SE. See Establishing stimulus 
Selection, by consequence, 49-50 
Selection-based (SB) verbal behavior, 458 
Selectionism, 50 
Selection of consequences, 302 
Self-administered consequences, 
752-7157 
Self-contract, 726 
Self-control, self-management and, 
736-737 
Self-control analysis, 735 
Self-editing, 472 
Self-evaluation, 747 
Self-Injury Trauma Scale (SITS), 266 
Self-instruction, 757-158 
Self-management, 734-762 
advantages and benefits of, 739-742 
antecedents of, 742-745 
applications of, 738-739 
behavior chains of, 744 
consequences of, 751—757 
contingency of, 761 
defined, 735-737 
effectiveness of, 760-763 
environment for, 745 
for generalized behavior change, 799 
locus of control and, 734-735 
MOs for, 743 
negative reinforcement and, 754, 755 
positive reinforcement and, 753-754, 755 
response prompts for, 743-744 
self-control and, 736-737 
stimulus and, 744 
of target behaviors, 760 
Self-monitoring, 745-752 
ABA for, 35-36 
of behavior, 760-761 
reinforcement and, 747-748 
response prompts for, 750-751 
of target behaviors, 751 
Self-reinforcement, 752—753 
Self-restraint, 672-673 
Self-rules, 562 
Self-study, 822 
Semantic network, 542-543 
Semilogarithmic charts, 156 
Sensory extinction, 632 
Sensory receptors, tacts and, 464, 481 
Sensory reinforcers, 296 
Sequence effects, 204, Oy 
Setting confounds, 255-256 
Setting events, 413 
Setting/situation generalization, response 
maintenance and, 769-772 
Severity Index (SI), for SITS, 266 
Shaping, 585-599 


computer technology for 593-594 
defined, 599-600 
differential reinforcement and, 586-587 
learning, 598-599 
percentile schedules for, 592 
performance dimensions and, 588-589 
reinforcement for, 597 
response topography and, 589-590 
robotics engineering and, 594-595 
SP and stimulus, 592 
stimulus control, in EBI, 514 
stimulus fading and, 592 
successive approximations and, 587-588 
telemedicine and, 593 
terminal behaviors and, 586, 590-591, 595-596 
SI. See Severity Index 
Simple correction, 379 
Simple discrimination, 522-523 
Simple-to-complex testing, 519 
Simple verbal discriminations, 469-470, 484, 485 
Simultaneous discrimination, in EBI, 514 
Single-case (subject) designs, 185 
Single-phase multielement design, 210-211 
SITS. See Self-Injury Trauma Scale 
Socially mediated contingency, 45, 314 
to stimulus, 47 
Socially significant behaviors, 34, 68 
behavioral assessment for, 80-83 
target behaviors for, 80-83 
Social media 
breach of confidentiality and, 827 
ethics and, 827, 830-831 
Social negative reinforcement, 325 
Social positive reinforcement, FBA and, 
678-679 
Social reinforcers, 298-299 
Social validity, 89 
for ABA, 274-275 
ethics and, 817 
of experiments, 261-267 
of RC, 404 
Solecistic (tact) extension, 465 
SPA. See Stimulus preference assessment 
Spaced-responding DRL, 654 
Spatial relations, 547-548 
Speaker 
VB of, 457 
verbal episodes and, 475-476 
Specific reinforcement, mands and, 461 
Spillover effects, 773 
Split-middle line of progress, 172-173 
Spontaneous recovery, extinction and, 634 
S-R. See Stimulus-response psychology 
Stable baseline, 188 
Standard Celeration Chart, 99, 161, 162 
Standardized tests, 72 
social validity and, 265-267 
Steady state responding, 187 
verification and, 191-192 
Stimulus (S), 22, 48. See also specific types 
alternatives, 828 
block chains and, 622 
CMO-R and, 425 
context, 779 
distinction relations and, 546-547 
in EBI, 511-512 
environment and, 44—45 
formal dimensions of, 45 
for generalized behavior change, 777-183, 
795-798 
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multiple functions to changes of, 45-47 
negative reinforcement and, 325 
positive reinforcers and, 285 
punishers and, 55-56 
reinforcers and, 55-56 
salience of, 446 
self-management and, 744 
temporal loci of, 45 
transformation of function and, 545 
Stimulus blocking (masking), 446-447 
Stimulus classes, 45 
incoherent, 550 
transformation of function and, 560-562 
Stimulus control, 57, 437-451 
conditional, 440 
faulty, 439 
overselective, 446 
private events and, 471 
reinforcement and, 289 
response prompts for, 444-445 
shaping, in EBI, 514 
stimulus fading for, 448, 451 
stimulus prompts for, 445 
time delay in, 448 
transferring, 447-451 
Stimulus-control topographies, 511 
Stimulus delta, 289, 330, 437 
errorless learning and, 447 
Stimulus discrimination, 438 
Stimulus discrimination training, 441-442 
Stimulus equivalence, 467, 773 
Stimulus fading 
DRA and, 646 
DRO and, 653 
shaping and, 592 
for stimulus control, 448, 450 
Stimulus generalization gradients, 438-439 
concepts and, 443-444 
Stimulus preference assessment (SPA), 300-308 
for EE, 671-672 
Stimulus prompts, 445 
Stimulus-response (S-R) psychology, 25 
Stimulus-stimulus pairing, 49, 293 
for SPA, 308 
Subject confounds, 255 
Successive approximations, shaping and, 
587-588 
Successive discrimination, in EBI, 514 
Surrogate CMO (CMO-S), 423-425 
Surrogate consent, 823-824 
Symbolic matching-to-sample, 497 
Symmetry, in EBI, 498-499 
Synthesized functional analyses, 684 
Systematic desensitization, 758 


Systematic replication, in experiments, 269-271 


Tables, for IOA, 139, 140 
Tact, 459, 463-464, 477, 481-482, 487 
abstraction, 465 
autoclitic, 477-478 
control, 465 
Tact extension, 465 
Tactile prompting devices, for whole-interval 
recording, 109 
Taking dictation, 460, 468 
Tandem schedule (tand) of reinforcement, 354 
Tangible reinforcers, 296 
FA and, 681-682 
for imitation training, 576 
TARE. See Treatment Acceptability Rating Form 
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Target behaviors, 69-89 
in ABA, 185 
access behaviors as, 80 
adaptive behaviors as, 82 
changing criterion design for, 242 
criteria for change in, 89 
defining, 86-89 
event recording for, 109 
for extinction, 638 
for generalized behavior change, 775-777 
IOA for, 137 
measurement by permanent products for, 116 
measurement of wrong dimension for, 125 
positive reinforcement for, 315 
prioritizing, 84-86 
ranking matrix for, 85-86 
reinforcement of, 79, 373 
for self-management, 736 
self-management of, 761 
self-monitoring of, 751 
for socially significant behavior, 80-83 
for token economy, 710 
TARPA. See Training and Assessment of 
Relational Precursors and Abilities 
Task analysis, for behavior chains, 605-608, 621 
Task records, in contingency contracts, 725 
TB. See Topography-based 
TBFA. See Trial-based functional analysis 
Teach enough examples, 778, 785 
Teach enough response examples, 779 
Teach enough stimulus examples, 778-779 
Teach loosely, 784-785 
Technological (meaning), for ABA, 32 
Telemedicine, shaping and, 595 
Temporal contiguity, 57 
Temporal discounting, 737 
Temporal extent, of measurement, 99-100 
Temporal locus, in measurement, 100-104, 116 
Temporal relations, 547 
between behavior and consequence, 52-53 
for imitation, 571 
Terminal behaviors, shaping and, 587, 590-591, 
595-596 
Terminate specific reinforcer contact, 394-395 
Textual relations, 459, 467 
Thematic matching, in EBI, 514 
Three-phase multielement design, 213-214 
Three-term contingency, 56-58, 441 
in EBI, 522-524 
Time delay 
in multiple probe design, 231 
in stimulus control, 447-448 
Time-out from positive reinforcement, 392-401 
MOs and, 420-421 
Time-out room, 395-396 
Time sampling, 109-113 
IOA from, 136 
Token economy, 708-716 
ABA and, 29 
evaluating, 715-716 
level systems for, 715-716 
RC in, 712 
target behaviors for, 711 
withdrawing, 715 
Tokens, 708-709 
exchange ratio for, 712 
selection of, 711-712 
Topography, 102, 107 
FBA and, 679, 700-702 


of language, 456-457 
response, 44 
shaping and, 589-590 
Topography-based definitions, for target 
behaviors, 87—88 
Topography-based (TB) verbal behavior, 458 
Total count IOA, 134 
Total duration, 99, 105 
Total duration IOA, 135 
Total-task chaining, 611-613 
Training and Assessment of Relational Precursors 
and Abilities (TARPA), 550 
Training structure, for EBI, 516-517 
Transfer of function, in EBI, 505-507 
Transformation of function, 545 
learned helplessness and, 548 
for reinforcers, 549-550 
stimulus classes and, 560-561 
Transitive class containment, 556 
Transitive CMO (CMO-T), 427-430 
mands and, 463, 481 
Transitivity, 544-545 
in EBI, 499 
Translational research, 500 
Treatment Acceptability Rating Form (TARF), 
261, 262 
Treatment drift, 257 
Treatment integrity 
in experiments, 257—260 
of RC, 406-407 
Treatment package, 252-253 
Treatment relapse, with DRA, 645-646 
Treatment withdrawal, in multielement design, 216 
Trends, graphical displays and, 172-173 
Trial-and-error, for task analysis, 606 
Trial-based functional analysis (TBFA), 684 
case example, 695-696 
Trial-based methods, for SPA, 305-307 
Trial-by-trial IOA, 135 
Trials-to-criterion, 102, 107 
True language, tacts and, 482 
True value, 124, 131 
t-score, in norm-referenced tests, 75 
Two-phase multielement design, 211-213 
Type I error, 273 
Type II error, 273 


UMOs. See Unconditioned motivating operations 
Unchaining, 617-618 
Unconditioned motivating operations (UMOs), 
418-420 
Aversive stimulus and, 462 
CMO-S and, 423 
CMO-T and, 427, 428 
mands and, 461, 481 
Unconditioned negative reinforcers, 325 
Unconditioned punishers, 54, 367, 387 
MOs and, 420 
Unconditioned reinforcers, 54, 294-295 
Unconditioned response, reflexes and, 47 
Unconditioned stimulus (US), 48 
reflexes and, 47 
Undifferentiated response, 682 
Unilateral property induction, 556 
Unpairing 
of MOs, 424 
unchaining and, 617 
Unplanned models, 582 


Unscored-interval IOA, 137 
US. See Unconditioned stimulus 


Validity. See also External validity; Internal 
validity; Social validity 
of measurement, 123-124 
Value-altering effect, 413 
direct and indirect effects of, 414-416 
isolating, 413-414 
UMOs and, 419 
Values clarification, 574 
Variability 
in behavior, 183 
in graphical displays, 171 
in multielement design, 209 
in multiple baseline design, 235 
Variable baseline, 189 
Variable-interval DRO (VI-DRO), 648-649 
Variable interval (VI) schedule 
for reinforcement, 344 
stimulus generalization and, 440 
token economy and, 708 
Variable-momentary DRO (VR-DRO), 651 
Variable ratio (VR) schedule, of reinforcement, 
340-343 
Variables. See also Confounding variables; 
Dependent variable; Independent variable 
for behavior-altering effects, 414 
complexity of, 58-59 
cumulative records and, 157 
for extinction, 635-636 
extraneous, 186 
shaping and, 597 
in FA, 682 
in IOA, 137 
Variable-time (VT) schedule, for NCR, 665 
VB. See Verbal behavior 
Verbal Behavior (Skinner, B. F.), 456, 472, 474 
Verbal behavior (VB), 58, 456-489 
applications for, 478 
autoclitic relations in, 477 
behavioral cusps for, 458 
defined, 457 
elementary verbal elements for, 458—460 
of listener, 457 
loss of, 489 
reinforcement for, 471 
RFT for, 552 
SB, 457 
SP and, 468-469 
of speaker, 457 
TB, 458 
Verbal conditional discrimination, 469-470, 481, 
484, 485 
Verbal episodes, 457 
listeners and, 473 
speakers and, 475-476 
Verbal function-altering, 470, 484, 485 
Verbal prompts 
for FCT, 670 
shaping and, 590 
Verification, 191-192 
in changing criterion design, 238 
in multielement design, 208 
in multiple baseline design, 224, 235 
in reversal design, 199-200 
Vertical axis (y-axis), for bar graphs, 156 
VI. See Variable interval schedule 
Vicarious reinforcement, 773 


Video modeling (VM), 576-577 
Video self-modeling (VSM), 577 
VI-DRO. See Variable-interval DRO 
Visual analysis 

of behavior analysis, 186 

of graphical displays, 173-175 

for internal validity, 273-275 
VM. See Video modeling 
VR. See Variable ratio schedule 
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VR-DRO. See Variable-momentary DRO 
VSM. See Video self-modeling 
VT. See Variable-time schedule 


WRAT-S. See Wide Range Achievement Test-5 


X-axis (horizontal axis), for line graphs, 150, 


: 159, 166 

Walden Two (Skinner, B. F.), 26 

Well-being, derived relations and, 560-566 ; ; aes 

Whole-interval recording, 109-110 Y-axis (vertical axis), for bar graphs, 156 


Wide Range Achievement Test-5 (WRAT-5), 72 


Withdrawal design, 222 z-score, in norm-referenced tests, 75 
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Applied Behavior Analysis equips students with a sound repertoire of fundamental 
knowledge about this field that serves as the foundation for more advanced 
study. The third edition of this pioneering text continues to illustrate behavior 
analysis principles and procedures with actual applications and real data, helping 
students acquire the conceptual and technical skills necessary to foster socially 
adaptive behavior in diverse individuals. The book consists of seven chapters 
authored by prominent scholars who first reported some of the most signifi- 
cant advances in behavior analysis in publications such as the Journal of Applied 
Behavior Analysis, The Analysis of Verbal Behavior, and the Journal of the Experimental 
Analysis of Behavior. 

In addition to updated content and the latest research, this edition includes 
the following: 


* Two new chapters, by noted scholars, on equivalence-based instruction and non- 
equivalence relations (Chapters 19 and 20) build upon ground-breaking research in 
the field of applied behavior analysis. 


¢ Multiple-choice and essay-type questions at the end of each chapter test students’ 
mastery of the concepts they have learned. 


* Over 2,700 citations and extensive use of quotations and extracts from key publica- 
tions increase students’ exposure to and appreciation for the field’s rich primary- 
source literature. 


* More than 150 graphs display original data from peer-reviewed research, often ac- 
companied by detailed descriptions of the procedures used to collect the data rep- 
resented, helping students develop and refine higher-level visual analysis skills. 


* Over 500 technical terms and concepts in the glossary help students finesse the 
specialized vocabulary of applied behavior analysis. 
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